Tool calling issues #712

Of the models in [this list](https://github.com/mlc-ai/web-llm/blob/d8b25fed8e81d6f6b27cdc07e839c1c09cfaa43d/src/config.ts#L297C1-L303C3) of tool-calling models, only `Hermes-2-Pro-Mistral-7B-q4f16_1-MLC` actually outputs proper JSON.
The other models always output an empty array, `[]`. The same happens in your [function calling example](https://github.com/mlc-ai/web-llm/tree/main/examples/function-calling/function-calling-openai).

However, the "working" model ALWAYS outputs structured JSON (that is, it tries to call a tool), even if the user message doesn't explicitly prompt it to.
It also ignores `tool_choice`: even when it is set to `none`, the model still outputs structured data.

To reproduce this, run the `function-calling-openai` example. The model specified there will always output `[]`. Change it to `Hermes-2-Pro-Mistral-7B-q4f16_1-MLC` and it will always output structured data, even if the user message is changed to something like `Hey` and `tool_choice` is set to `none`.
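To make the two failure modes easier to check, here is a minimal sketch of the request I'm sending and a small classifier for the reply. The types are declared locally and only approximate the OpenAI-style shapes the example uses; the `get_current_weather` tool, `buildRequest`, and `classifyReply` are hypothetical names for illustration, not part of web-llm.

```typescript
// Local, simplified stand-ins for the OpenAI-style request/response shapes.
// They are assumptions for this sketch, not the library's own typings.
type ReproToolChoice = "none" | "auto";

interface AssistantMessage {
  content: string | null;
  tool_calls?: { function: { name: string; arguments: string } }[];
}

interface ChatRequest {
  messages: { role: "user" | "system"; content: string }[];
  tools: unknown[];
  tool_choice: ReproToolChoice;
}

// Hypothetical tool definition, used only for illustration.
const getWeatherTool = {
  type: "function",
  function: {
    name: "get_current_weather",
    description: "Get the current weather in a given location",
    parameters: {
      type: "object",
      properties: { location: { type: "string" } },
      required: ["location"],
    },
  },
};

function buildRequest(userText: string, toolChoice: ReproToolChoice): ChatRequest {
  return {
    messages: [{ role: "user", content: userText }],
    tools: [getWeatherTool],
    tool_choice: toolChoice,
  };
}

// True when the text parses as a non-empty JSON array, i.e. it looks like
// a serialized list of tool calls rather than a normal chat reply.
function looksLikeToolCallJson(text: string): boolean {
  try {
    const parsed = JSON.parse(text);
    return Array.isArray(parsed) && parsed.length > 0;
  } catch {
    return false;
  }
}

type Verdict = "ok" | "empty-array" | "ignored-tool-choice";

function classifyReply(toolChoice: ReproToolChoice, reply: AssistantMessage): Verdict {
  const text = (reply.content ?? "").trim();
  const calledTool = (reply.tool_calls?.length ?? 0) > 0;
  if (toolChoice === "none" && (calledTool || looksLikeToolCallJson(text))) {
    return "ignored-tool-choice";
  }
  if (!calledTool && text === "[]") {
    return "empty-array";
  }
  return "ok";
}
```

With an already-loaded engine, the check would be roughly `classifyReply("none", (await engine.chat.completions.create(buildRequest("Hey", "none"))).choices[0].message)`. From what I see, the Hermes model lands in `ignored-tool-choice` and the other listed models land in `empty-array`.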


Maybe we could remove the [manual checks](https://github.com/mlc-ai/web-llm/blob/d8b25fed8e81d6f6b27cdc07e839c1c09cfaa43d/src/openai_api_protocols/chat_completion.ts#L518C2-L568) and just allow tool calling for all models?
For instance, reasoning models (such as Qwen3) are particularly good at executing tool calls, and it's a shame that we don't allow using those models for it.
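As a rough idea of what a model-agnostic check could look like, the sketch below validates only the shape of the tool fields and never looks at the model id. It is not a copy or a patch of the linked code: `ToolRequest`, `validateToolFields`, and the specific rules are hypothetical, chosen only to illustrate the suggestion.

```typescript
// Simplified, locally declared shapes; assumptions for this sketch only.
interface ToolDef {
  type: string;
  function: { name: string };
}

type ToolChoiceOption =
  | "none"
  | "auto"
  | { type: "function"; function: { name: string } };

interface ToolRequest {
  model: string;
  tools?: ToolDef[];
  tool_choice?: ToolChoiceOption;
}

// Hypothetical replacement for a per-model allow-list: returns a list of
// problems with the request itself, regardless of which model is selected.
function validateToolFields(req: ToolRequest): string[] {
  const problems: string[] = [];
  const tools = req.tools ?? [];
  const choice = req.tool_choice;

  // Asking for a tool call without providing any tools cannot succeed.
  if (choice !== undefined && choice !== "none" && tools.length === 0) {
    problems.push("tool_choice requests a tool but no tools were provided");
  }

  for (const tool of tools) {
    if (tool.type !== "function") {
      problems.push(`unsupported tool type: ${tool.type}`);
    }
    if (tool.function.name.trim() === "") {
      problems.push("tool function name must not be empty");
    }
  }

  // A named tool_choice must refer to one of the provided tools.
  if (typeof choice === "object" && tools.length > 0) {
    const known = tools.some((t) => t.function.name === choice.function.name);
    if (!known) {
      problems.push(`tool_choice names unknown function: ${choice.function.name}`);
    }
  }

  return problems;
}
```

An empty result would mean the request may proceed with any model; whether a given model then produces usable tool calls is left to the caller to judge.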

