- Department of Computer Science, UCLA
- http://web.cs.ucla.edu/~qgu
Stars
Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)
[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)
The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models
The official implementation of Self-Play Fine-Tuning (SPIN)
The official implementation of Cross-Task Experience Sharing (COPS)
[ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197)
Official Implementation of ConfDiff (ICML'24) - Protein Conformation Generation via Force-Guided SE(3) Diffusion Models
The Family of Diffusion Protein Language Models (DPLM)
The official implementation of Self-Play Preference Optimization (SPPO)
Official repo of Rephrase-and-Respond: data, code, and evaluation
Leveraging Structural Priors and Constraints for Cryo-EM Heterogeneous Reconstruction
Official repo of Progressive Data Expansion: data, code and evaluation
Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks" (accepted by IJCAI 2020)