这是indexloc提供的服务,不要输入任何密码
Skip to content

v0.19.dev0

@Lanssi Lanssi tagged this 14 Dec 01:38
This PR add support for OLMo architecture.

Additional support: add support for clip-qkv.

Test: already tested on android(pixel 4) and cuda(setting tensor_parallel_shrads=2)
Assets 2
Loading