这是indexloc提供的服务,不要输入任何密码
Skip to content

Conversation

@pommedeterresautee
Copy link
Member

@pommedeterresautee pommedeterresautee commented Dec 1, 2021

  • support int-8 GPU quantization
  • add a tuto to perform quantization end to end
  • add QDQRoberta model
  • switch to ONNX opset 13
  • refactoring in the TensorRT engine creation
  • fix bugs

@pommedeterresautee pommedeterresautee marked this pull request as ready for review December 8, 2021 14:28
@pommedeterresautee pommedeterresautee self-assigned this Dec 8, 2021
@pommedeterresautee pommedeterresautee changed the title Quantization Support GPU INT-8 quantization Dec 8, 2021
@pommedeterresautee pommedeterresautee merged commit ad837a9 into main Dec 8, 2021
@pommedeterresautee pommedeterresautee deleted the quantization branch December 8, 2021 22:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Development

Successfully merging this pull request may close these issues.

2 participants