- Shanghai Jiao Tong University
- Shanghai, China
- https://scholar.google.com/citations?user=0Q6lKJ8AAAAJ&hl=zh-CN
Stars
🌐 WebAgent for Information Seeking built by Tongyi Lab: WebWalker & WebDancer & WebSailor & WebShaper https://arxiv.org/abs/2507.15061 https://arxiv.org/pdf/2507.02592
Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
verl: Volcano Engine Reinforcement Learning for LLMs
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
A lecture note for understanding deep learning
[EMNLP 2022] Official implementation of Transnormer in our EMNLP 2022 paper - The Devil in Linear Transformer
The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”
Official Code for DragGAN (SIGGRAPH 2023)
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Code for visualizing the loss landscape of neural nets
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…