Stars
AudioBench: A Universal Benchmark for Audio Large Language Models
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Implementation of the paper: WeaverBird: Empowering Financial Decision-Making with Large Language Model, Knowledge Base, and Search Engine
Provides a common interface to many IR ranking datasets.
CLIReval is an open-source toolkit that evaluates the quality of MT outputs in the context of a CLIR system, without the need for any actual CLIR dataset.