

Vector (and Scalar) Quantization, in Pytorch

Python 3,611 294 Updated Aug 29, 2025

Semantic IDs: How to train an LLM-Recommender Hybrid with steerability and reasoning on recommendations.

Jupyter Notebook 47 13 Updated Sep 15, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 15,914 1,193 Updated Oct 11, 2025
Python 34 11 Updated Sep 3, 2025

Frontier Open-Source Text-to-Speech

9,578 1,187 Updated Sep 5, 2025

The first challenge on short-form video quality assessment

Python 93 3 Updated Dec 1, 2024

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 7,687 488 Updated Oct 13, 2025

Official implementation of the paper "Watermark Anything with Localized Messages"

Jupyter Notebook 1,066 48 Updated Jun 20, 2025

video-SALMONN 2 is a powerful audio-visual large language model (LLM) that generates high-quality audio-visual video captions, which is developed by the Department of Electronic Engineering at Tsin…

Python 91 4 Updated Sep 28, 2025

Awesome papers & datasets specifically focused on long-term videos.

318 14 Updated Oct 9, 2025

Demonstrate all the questions on LeetCode in the form of animation. (Presents the thinking behind solving LeetCode problems as animations.)

Java 76,479 14,021 Updated Aug 14, 2023
JavaScript 56 5 Updated Jun 11, 2025

[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python 1,312 77 Updated Sep 12, 2025

🔥 [ICCV 2025 Highlight] Official open-source repo for LVFace: Progressive Cluster Optimization for Large Vision Models in Face Recognition

Python 33 10 Updated Aug 21, 2025

A generalized information-seeking agent system with Large Language Models (LLMs).

Python 1,189 116 Updated Jun 19, 2024

A curated list of awesome LLM agents frameworks.

Python 1,127 111 Updated Oct 12, 2025

Multilingual Document Layout Parsing in a Single Vision-Language Model

Python 5,007 510 Updated Oct 11, 2025

Retrieval and Retrieval-augmented LLMs

Python 10,669 797 Updated Oct 10, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 18,810 1,843 Updated Oct 6, 2025

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 22,343 2,507 Updated Oct 8, 2025

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,916 559 Updated Feb 26, 2025

The official PyTorch Implementation of Charm: The Missing Piece in ViT fine-tuning for Image Aesthetic Assessment

Python 37 1 Updated Aug 12, 2025

[CVPR2025] KVQ: Boosting Video Quality Assessment via Saliency-guided Local Perception

20 Updated Jun 6, 2025

🎓 Automatically Update Recommendation Papers Daily using GitHub Actions (Updates Every 12 Hours)

Python 82 7 Updated Oct 14, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,080 254 Updated Oct 14, 2025

This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025

Python 1,258 102 Updated Oct 9, 2025

[NeurIPS 2025 Spotlight] Q-Insight: Understanding Image Quality via Visual Reinforcement Learning

Python 172 4 Updated Oct 10, 2025

Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, i…

Python 32,075 4,950 Updated Sep 26, 2025

🎓 Path to a free self-taught education in Computer Science!

HTML 195,345 24,354 Updated Aug 23, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,459 58 Updated Jun 14, 2025