Closed
Description
Hello - I'm running ipex: '1.10.0+cpu'
and torch version: '1.10.0+cu113'
when running this:
conf = ipex.QuantConf(dtype=torch.int8)
I'm getting this error:
AttributeError: module 'intel_extension_for_pytorch' has no attribute 'QuantConf'
Please help!
Also, what sort of performance gain should I expect when using int8 quantization with ipex vs the standard qint8 quantization straight from pytorch:
torch.quantization.quantize_dynamic(model=self._model, qconfig_spec={torch.nn.Linear, torch.nn.Embedding}, dtype=torch.qint8)
thanks!
Metadata
Metadata
Assignees
Labels
No labels