Starred repositories
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
Cosmos is an operating system "construction kit". Build your own OS using managed languages such as C#, VB.NET, and more!
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
Lumina-T2X is a unified framework for Text to Any Modality Generation
A Comprehensive Toolkit for High-Quality PDF Content Extraction
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Robust Speech Recognition via Large-Scale Weak Supervision
littlemosquito123 / openface
Forked from cmusatyalab/openfaceFace recognition with deep neural networks.
Trained model files for dlib example programs.
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Retinaface get 80.99% in widerface hard val using mobilenet0.25.
littlemosquito123 / opencv_zoo
Forked from opencv/opencv_zooModel Zoo For OpenCV DNN and Benchmarks.
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
Ultra Fast Structure-aware Deep Lane Detection (ECCV 2020)
Unofficial implemention of lanenet model for real time lane detection
中文nlp解决方案(大模型、数据、模型、训练、推理)
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
🔮 SuperDuperDB: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scal…
Build ChatGPT over your data, all with natural language
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs