### Has this been supported or requested before? - [x] I have checked [the GitHub README](https://github.com/QwenLM/Qwen3). - [x] I have checked [the Qwen documentation](https://qwen.readthedocs.io). - [x] I have checked the documentation of the related framework. - [x] I have searched [the issues](https://github.com/QwenLM/Qwen3/issues?q=is%3Aissue) and there is not a similar one. ### What is this feature about? speed of responses ### Proposal work on the speed of response generation. ### Contributions are welcomed - [ ] I am willing to help implement this feature.