-
Notifications
You must be signed in to change notification settings - Fork 207
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fails to load saved model : Trying to set a tensor of shape torch.Size([1376, 4096]) in "qweight" (which has shape torch.Size([4096, 1376])), this look incorrect. #1407
Comments
|
Thank you for the response. using I get following error
can you share an example how to save quantized model ( |
It looks like load/save mismatch, can you try to use latest commit instead of g494a5712fa2 and set use_neural_speed=False? |
Hi, Thank you. Saving works, however loading the saved model leads to following error
following is the code snippet
|
@kranipa , This issue is caused by mismatch the version of ITREX and neural-compressor. You can use neural-compressor version 2.5.1 and try it again. ITREX 1.4 is released now, Please try it. thanks very much. |
okay , thank you. |
@kranipa Did you get it to run? I'm having the same problem. |
@PhzCode , could you post your code and let me try to reproduce it. thanks very much. |
Loading saved model runs into following error
It also takes a very long time to run and save quantized models.
Tried following example.
The text was updated successfully, but these errors were encountered: