Closed
Labels: core-team-only, enhancement (New feature or request), possible bug (Bug was reported but is not confirmed or is unable to be replicated)
Description
How are you running AnythingLLM?
All versions
What happened?
The TPS measurement for Gemini models is incorrect. This is likely because we count tokens manually per request, which has always been a rough estimate. If possible, this value should come directly from the API's streamed chunks.
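For reference, a minimal sketch of reading the API-reported token counts from the stream rather than estimating them, assuming the `@google/generative-ai` Node SDK; the helper name, model choice, and TPS math are illustrative and not AnythingLLM's actual connector code:

```ts
// Sketch: take token counts from Gemini's streamed chunks instead of
// estimating them locally. Assumes the @google/generative-ai Node SDK.
import { GoogleGenerativeAI } from "@google/generative-ai";

async function streamWithUsage(apiKey: string, prompt: string): Promise<void> {
  const genAI = new GoogleGenerativeAI(apiKey);
  const model = genAI.getGenerativeModel({ model: "gemini-1.5-flash" });

  const start = Date.now();
  const result = await model.generateContentStream(prompt);

  let completionTokens = 0;
  for await (const chunk of result.stream) {
    // Streamed chunks carry usageMetadata with API-reported counts;
    // later chunks hold the running/final completion token total.
    if (chunk.usageMetadata?.candidatesTokenCount) {
      completionTokens = chunk.usageMetadata.candidatesTokenCount;
    }
    process.stdout.write(chunk.text());
  }

  const seconds = (Date.now() - start) / 1000;
  console.log(`\nTPS: ${(completionTokens / seconds).toFixed(1)}`);
}
```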
Are there known steps to reproduce?
1. Use Google Gemini as the LLM provider (any model)
2. In a chat, send a simple prompt like `Tell me a short story`
3. Observe that the reported metrics are clearly wrong (~3-8 TPS when it should be 100+)
More Context:
#4459 (review)