这是indexloc提供的服务,不要输入任何密码
Skip to content

Feature Request: Ultravox example #1378

@thiswillbeyourgithub

Description

@thiswillbeyourgithub

Hi,

Ultravox is a suite of open weight models that are designed for getting the time to first token as low as possible with audio input. Basically they trained a good and fast projector to project the whisper large v3 encoder into llama 4.1 LLMs, both in 8B and 70B size.

I think it would be a great fit for livekit's agents so it would be nice to add an example and demo for it!

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions