Intel® Extension for Transformers v1.2.1 Release

Released by @kevinintel on 08 Nov 09:14 · 933 commits to main since this release
  • Examples
  • Bug Fixing & Improvements

Examples

  • Add Docker support for code generation (dd3829)
  • Enable Qwen-7B-Chat for NeuralChat (698e58)
  • Enable Baichuan & Baichuan2 CPP inference (98e5f9)
  • Add side-by-side UI for NeuralChat (dbbcc2)
  • Support Falcon-180B CPP inference (900ebf)
  • Support StarCoder finetuning example (073bdd)
  • Enable text generation using Qwen (8f41d4)
  • Add Docker support for NeuralChat (a17d952)

Bug Fixing & Improvements

  • Fix WOQ with AWQ bug where calib_iters was not set when calib_dataloader is not None (565ab4)
  • Fix init issue of LangChain Chroma (fdefe2)
  • Fix NeuralChat StarCoder MHA fusion issue (ce3d24)
  • Fix setuptools version limitation for build (2cae32)
  • Fix post-processing with top-k/top-p in the Python API (7b4730)
  • Fix MSVC compile issues (87b00d)
  • Refine notebooks and fix RESTful API issues (d8cc11)
  • Upgrade QBits backend (45e03b)
  • Fix StarCoder issues for IPEX INT8 and weight-only INT4 (e88c7b)
  • Fix ChatGLM2 model loading issue (4f2169)
  • Remove oneDNN Graph env setting for BF16 inference (59ab03)
  • Improve database security by escaping SQL strings (be6790)
  • Fix QBits backend getting wrong workspace malloc size (6dbd0b)

Validated Configurations

  • Python 3.9, 3.10
  • CentOS 8.4 & Ubuntu 22.04
  • Intel® Extension for TensorFlow 2.13.0
  • PyTorch 2.1.0+cpu
  • Intel® Extension for PyTorch 2.1.0+cpu
  • Transformers 4.34.1
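
A CPU environment matching the validated versions above could be set up along these lines. This is a sketch, not an official install recipe from this release: the package index URL and the assumption that the matching intel-extension-for-transformers wheel is 1.2.1 are not stated in these notes.

```shell
# Sketch of a pip environment for the validated CPU configuration above.
# The CPU wheel index URL and the 1.2.1 wheel version are assumptions.
python -m pip install torch==2.1.0 --index-url https://download.pytorch.org/whl/cpu
python -m pip install intel-extension-for-pytorch==2.1.0
python -m pip install transformers==4.34.1
python -m pip install intel-extension-for-transformers==1.2.1
```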