Stars
Models and examples built with TensorFlow
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
Code and documentation to train Stanford's Alpaca models, and generate the data.
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用
100+ Chinese Word Vectors 上百种预训练中文词向量
The official GitHub page for the survey paper "A Survey of Large Language Models".
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
Fast and accurate AI powered file content types detection
Deep learning library featuring a higher-level API for TensorFlow.
Deep Learning and Reinforcement Learning Library for Scientists and Engineers
Large World Model -- Modeling Text and Video with Millions Context
pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
The official PyTorch implementation of Google's Gemma models
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
DeepSeek-VL: Towards Real-World Vision-Language Understanding
MTEB: Massive Text Embedding Benchmark
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
Static memory-efficient Trie-like structures for Python based on marisa-trie C++ library.
Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)
Mirror of the mapnik stylesheets formerly used on OpenStreetMap.org
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization
lianjia / beike estate crawler/analysis 2024