
Does ONNX support this model for inference with ONNX Runtime? #3

Open
dragen1860 opened this issue Jun 7, 2024 · 2 comments

@dragen1860

Hi, dear author:
The memory reduction is very attractive and will benefit practical applications. I wonder whether ONNX currently supports the techniques you proposed, and whether inference can be run with the ONNX Runtime framework?

@A-suozhang
Member

Thank you for your interest in our work! We haven't tried ONNX Runtime yet, but we believe it is applicable: MixDQ adopts a standard, deployment-friendly quantization scheme, and we have already tested MixDQ with the pytorch_quantization deployment tool.
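
For reference, a minimal sketch of what the ONNX Runtime path could look like (this is not the MixDQ pipeline itself; the tiny `torch.nn.Linear` module and the `"input"`/`"output"` tensor names are placeholders for illustration):

```python
# Sketch: export a PyTorch module to ONNX and run it with ONNX Runtime.
# The model and tensor names below are placeholders, not MixDQ components.
import numpy as np
import torch
import onnxruntime as ort

model = torch.nn.Linear(16, 4).eval()   # placeholder for the (quantized) network
dummy = torch.randn(1, 16)               # placeholder input

# Export to ONNX
torch.onnx.export(
    model, dummy, "model.onnx",
    input_names=["input"], output_names=["output"],
    opset_version=17,
)

# Run inference with ONNX Runtime
sess = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])
out = sess.run(None, {"input": dummy.numpy().astype(np.float32)})[0]
print(out.shape)
```

How well the quantized operators map onto ONNX opsets would still need to be verified case by case.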

@A-suozhang
Member

If you are interested in deploying MixDQ with ONNX Runtime or other tools, we are also open to discussion and support. PRs are welcome!
