Closed
Labels
investigating: Core team or maintainer will or is currently looking into this issue
possible bug: Bug was reported but is not confirmed or is unable to be replicated.
Description
How are you running AnythingLLM?
AnythingLLM desktop app
What happened?
I'm attempting to use AnythingLLM with KoboldCPP as the provider. No matter what I try, the maximum response length is always 512 tokens, regardless of the context size.
This, combined with the lack of a "Continue" button, makes certain tasks hard, annoying, or borderline impossible.
Are there known steps to reproduce?
Connect KoboldCPP, start any kind of chat, and look at the KoboldCPP console to confirm that the requested response is capped at 512 tokens.
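To help isolate whether the 512-token cap comes from the request AnythingLLM sends or from KoboldCPP itself, here is a minimal sketch (not AnythingLLM code) that calls KoboldCPP's OpenAI-compatible endpoint directly with an explicit max_tokens. The port, model name, and prompt are assumptions based on KoboldCPP's defaults; adjust them to your setup. If this request produces a response longer than 512 tokens while AnythingLLM's chats do not, the cap is on the client side.

```python
# Minimal sketch: query KoboldCPP's OpenAI-compatible endpoint directly with an
# explicit max_tokens well above 512, then compare the KoboldCPP console output
# against what appears there during an AnythingLLM chat.
# Assumption: KoboldCPP is running locally on its default port (5001).
import requests

resp = requests.post(
    "http://localhost:5001/v1/chat/completions",
    json={
        "model": "koboldcpp",  # placeholder name; only the loaded model is served
        "messages": [{"role": "user", "content": "Write a long story."}],
        "max_tokens": 2048,    # explicit response limit well above 512
    },
    timeout=600,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```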