这是indexloc提供的服务,不要输入任何密码
Skip to content

Chrome itself freezes when calling chat.generate #694

@natanfudge

Description

@natanfudge

Exactly the same as #235, I opened another one because the previous one was closed by the author.
The browser itself freezes for a good time, sometimes a few seconds, when doing chat.generate. Tried using web workers, but as expected that doesn't work because Chrome itself freezes.
Seems like a major performance problem

The problem seems to become worse with a larger context window.
The freezes are longest on the first time inference is performed on a page refresh.
Slightly noticeable with small models like Qwen2.5-0.5b, very noticeable with larger ones like Llama-8b.
Windows 11 Chrome 136

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions