[Training] Error building gradient graph for bert models for on-device training #22465
Labels
contributions welcome
lower priority issues for the core ORT teams
training
issues related to ONNX Runtime training; typically submitted using template
Describe the issue
Hello,
see also this discussion. I'm opening this one as I think it's an issue as sifting through previous issues training should work for bert models.
I am trying to generate artifacts for distilbert like so:
The exported onnx model works perfectly for inference, but artifact generation throws up:
Seems to have issues building the gradient graph as it gets out of bounds on OutputDefs.
To reproduce
See the code provided above.
Urgency
It's blocking the development of go bindings to onnx training which we want to use in our product.
ONNX Runtime Installation
Released Package
ONNX Runtime Version or Commit ID
1.19.2
PyTorch Version
2.4.1+cu121
Execution Provider
Default CPU
Execution Provider Library Version
No response
The text was updated successfully, but these errors were encountered: