Google at ICML 2025
Google at ICML 2025
The 42nd International Conference on Machine Learning (ICML 2025) is being held Sunday, July 13th through Saturday July 19th in Vancouver, Canada. Google is proud to be a Diamond Sponsor of ICML 2025, where researchers from Google Research, Google Deepmind and more will be contributing at all levels. This year we are presenting over 140 papers and are actively involved in a number of different events, including an Invited talk, an Expo talk, 24 workshops, 7 orals, and several in-booth demo sessions.
Attending ICML 2025 in person? Stop by the Google booth to learn more about how we’re actively exploring the latest machine learning techniques for application across the fields of computer vision and machine perception. Visit the @GoogleResearch X and Google Research LinkedIn accounts for announcements about Google booth activities (e.g., demos and Q&A sessions, which are also listed below).
Continue below to learn more about how Google researchers are engaged at ICML 2025 (Google affiliations highlighted in bold).
All session times are provided in PDT.
*Date, time and session location may be subject to change.
Expo & oral talks
Orals
-
Oral: Tue, Jul 15 | 10:00AM — 11:00AM, West Ballroom D Poster: Tue, Jul 15 | 11:00AM — 1:30PM, East Exhibition Hall A-B #E-3510
Scaling Collapse Reveals Universal Dynamics in Compute-Optimally Trained Neural NetworksShikai Qiu*, Lechao Xiao, Andrew Gordon Wilson, Jeffrey Pennington, Atish Agarwala
-
Oral: Tue, Tue, Jul 15 | 10:00AM — 11:00AM, West Ballroom C Poster: Wed, Jul 16 | 11:00AM — 1:30PM, East Exhibition Hall A-B #E-3409
Strategy Coopetition Explains the Emergence and Transience of In-Context LearningAaditya K Singh, Ted Moskovitz, Sara Dragutinović, Felix Hill, Stephanie C.Y. Chan, Andrew M Saxe
-
Oral: Wed, Jul 16 | 10:00AM — 11:00AM, West Exhibition Hall C Poster: Wed, Jul 16 | 11:00AM — 1:30PM, East Exhibition Hall A-B #E-2505
Roll the Dice & Look Before You Leap: Going Beyond the Creative Limits of Next-Token PredictionVaishnavh Nagarajan, Chen Henry Wu, Charles Ding, Aditi Raghunathan
-
Oral: Wed, Jul 16 | 3:30PM — 4:30PM, West Ballroom C Poster: Wed, Jul 16 | 4:30PM — 7:00PM, West Exhibition Hall B2-B3 #W-417
Long-Form Speech Generation with Spoken Language ModelsSe Jin Park*, Julian Salazar, Aren Jansen, Keisuke Kinoshita, Yong Man Ro, RJ Skerry-Ryan
-
Oral: Wed, Jul 16 | 3:30PM — 4:30PM, West Ballroom A Poster: Wed, Jul 16 | 4:30PM — 7:00PM, East Exhibition Hall A-B #E-602
Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI EvaluationD. Sculley*, Will Cukierski, Phil Culliton, Sohier Dane, Maggie Demkin, Ryan Holbrook,
Addison Howard, Paul Mooney, Walter Reade, Megan Risdal, Nate Keating
-
Oral: Thu, Jul 17 | 10:00AM — 11:00AM, West Exhibition Hall C Poster: Thu, Jul 17 | 11:00AM — 1:30PM, East Exhibition Hall A-B #E-601
AutoAdvExBench: Benchmarking Autonomous Exploitation of Adversarial Example DefensesNicholas Carlini, Edoardo Debenedetti, Javier Rando, Milad Nasr, Florian Tramèr
-
Oral: Thu, Jul 17 | 10:00AM — 11:00AM, West Exhibition Hall C Poster: Thu, Jul 17 | 11:00AM — 1:30PM, East Exhibition Hall A-B #E-701
Exploring and Mitigating Adversarial Manipulation of Voting-Based LeaderboardsYangsibo Huang, Milad Nasr, Anastasios Nikolas Angelopoulos, Nicholas Carlini, Wei-Lin Chiang, Christopher A. Choquette-Choo, Daphne Ippolito, Matthew Jagielski, Katherine Lee, Ken Liu, Ion Stoica, Florian Tramèr, Chiyuan Zhang
Spotlights
-
Tue, Jul 15 | 11:00AM — 1:30PM, West Exhibition Hall B2-B3 #W-900 (Poster Session 1 West)
The Role of Randomness in StabilityMax Hopkins, Shay Moran
-
Tue, Jul 15 | 4:30PM — 7:00PM, West Exhibition Hall B2-B3 #W-408 (Poster Session 2 West)
Discovering Symbolic Cognitive Models from Human and Animal BehaviorPablo Samuel Castro, Nenad Tomasev, Ankit Anand, Navodita Sharma, Rishika Mohanta, Aparna Dev, Kuba Perlin, Siddhant Jain, Kyle Levin, Noémi Éltető, Will Dabney, Alexander Novikov, Glenn C Turner, Maria K Eckstein, Nathaniel D Daw, Kevin J Miller, Kimberly L Stachenfeld
-
Tue, Jul 15 | 4:30PM — 7:00PM, East Exhibition Hall A-B #E-1812 (Poster Session 2 East)
Model Steering: Learning with a Reference Model Improves Generalization Bounds and Scaling LawsXiyuan Wei, Ming Lin, Fanjiang Ye, Fengguang Song, Liangliang Cao, My T. Thai, Tianbao Yang
-
Wed, Jul 16 | 11:00AM — 1:30PM, East Exhibition Hall A-B #E-2508 (Poster Session 3 East)
Mastering Board Games by External and Internal Planning with Language ModelsJohn Schultz, Jakub Adamek, Matej Jusup*, Marc Lanctot, Michael Kaisers, Sarah Perrin, Daniel Hennes, Jeremy Shar, Cannada A. Lewis, Anian Ruoss, Tom Zahavy, Petar Veličković, Laurel Prince, Satinder Singh, Eric Malmi, Nenad Tomašev
-
Wed, Jul 16 | 4:30PM — 7:00PM, East Exhibition Hall A-B #E-3109 (Poster Session 4 East)
Feynman-Kac Correctors in Diffusion: Annealing, Guidance, and Product of ExpertsMarta Skreta, Tara Akhound-Sadegh, Viktor Ohanesian, Roberto Bondesan, Alan Aspuru-Guzik, Arnaud Doucet, Rob Brekelmans, Alexander Tong, Kirill Neklyudov
-
Wed, Jul 16 | 4:30PM — 7:00PM, East Exhibition Hall A-B #E-1301 (Poster Session 4 East)
New Bounds for Sparse Variational Gaussian ProcessesMichalis Titsias
-
Wed, Jul 16 | 4:30PM — 7:00PM, East Exhibition Hall A-B #E-500 (Poster Session 4 East)
Position: We Can’t Understand AI Using Our Existing VocabularyJohn Hewitt, Robert Geirhos, Been Kim
-
Thu, Jul 17 | 11:00AM — 1:30PM, West Exhibition Hall B2-B3 #W-919 (Poster Session 5 West)
Catoni Contextual Bandits are Robust to Heavy-Tailed RewardsChenlu Ye, Yujia Jin*, Alekh Agarwal, Tong Zhang
-
Thu, Jul 17 | 11:00AM — 1:30PM, East Exhibition Hall A-B #E-2810 (Poster Session 5 East)
Language Models May Verbatim Complete Text They Were Not Explicitly Trained OnKen Liu*, Christopher A. Choquette-Choo, Matthew Jagielski, Peter Kairouz, Sanmi Koyejo, Percy Liang, Nicolas Papernot
-
Thu, Jul 17 | 11:00AM — 1:30PM, East Exhibition Hall A-B #E-3500 (Poster Session 5 East)
Learning the RoPEs: Better 2D and 3D Position Encodings with STRINGConnor Schenck, Isaac Reid, Mithun George Jacob, Alex Bewley, Joshua Ainslie, David Rendleman, Deepali Jain, Mohit Sharma, Avinava Dubey, Ayzaan Wahid, Sumeet Singh, Rene Wagner, Tianli Ding, Chuyuan Fu, Arunkumar Byravan, Jake Varley, Alexey Gritsenko, Matthias Minderer, Dmitry Kalashnikov, Jonathan Tompson, Vikas Sindhwani, Krzysztof Choromanski
-
Thu, Jul 17 | 11:00AM — 1:30PM, East Exhibition Hall A-B #E-2605 (Poster Session 5 East)
Mechanistic Unlearning: Robust Knowledge Unlearning and Editing via Mechanistic LocalizationPhillip Huang Guo, Aaquib Syed, Abhay Sheshadri, Aidan Ewart, Gintare Karolina Dziugaite
-
Thu, Jul 17 | 11:00AM — 1:30PM, West Exhibition Hall B2-B3 #W-810 (Poster Session 5 West)
Rapid Overfitting of Multi-Pass SGD in Stochastic Convex OptimizationShira Vansover-Hager, Tomer Koren, Roi Livni
-
Thu, Jul 17 | 11:00AM — 1:30PM, East Exhibition Hall A-B #E-2701 (Poster Session 5 East)
Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent SystemsShaokun Zhang, Ming Yin, Jieyu Zhang, Jiale Liu, Zhiguang Han, Jingyang Zhang, Beibin Li, Chi Wang, Huazheng Wang, Yiran Chen, Qingyun Wu
-
Thu, Jul 17 | 4:30PM — 7:00PM, East Exhibition Hall A-B #E-2209 (Poster Session 6 East)
ActionPiece: Contextually Tokenizing Action Sequences for Generative RecommendationYupeng Hou*, Jianmo Ni, Zhankui He, Noveen Sachdeva, Wang-Cheng Kang, Ed H. Chi, Julian McAuley, Derek Zhiyuan Cheng
-
Thu, Jul 17 | 4:30PM — 7:00PM, West Exhibition Hall B2-B3 #W-912 (Poster Session 6 West)
Procurement Auctions via Approximately Optimal Submodular OptimizationYuan Deng, Amin Karbasi, Vahab Mirrokni, Renato Paes Leme, Grigoris Velegkas*, Song Zuo
Accepted Papers
Avoiding Spurious Sharpness Minimization Broadens Applicability of SAM
Sidak Pal Singh, Hossein Mobahi, Atish Agarwala, Yann Nicolas Dauphin
Can RLHF Be More Efficient with Imperfect Reward Models? A Policy Coverage Perspective
Jiawei Huang, Bingcong Li, Christoph Dann, Niao He
Competing Bandits in Matching Markets via Super Stability
Soumya Basu
Does Learning the Right Latent Variables Necessarily Improve In-Context Learning?
Sarthak Mittal, Eric Elmoznino, Léo Gagnon, Sangnie Bhardwaj, Guillaume Lajoie, Dhanya Sridhar
GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance
Jinuk Kim*, Marwa El Halabi, Wonpyo Park, Clemens JS Schaefer, Deokjae Lee, Yeonhong Park, Jae W. Lee, Hyun Oh Song
The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Zhuowei Li, Haizhou Shi, Yunhe Gao, Di Liu, Zhenting Wang, Yuxiao Chen, Ting Liu, Long Zhao, Hao Wang, Dimitris N. Metaxas
The Impact of On-Policy Parallelized Data Collection on Deep Reinforcement Learning Networks
Walter Mayor, Johan Obando-Ceron, Aaron Courville, Pablo Samuel Castro
Improving the Variance of Differentially Private Randomized Experiments Through Clustering
Adel Javanmard, Vahab Mirrokni, Jean Pouget-Abadie
It's My Data Too: Private ML for Datasets with Multi-User Training Examples
Arun Ganesh, Ryan McKenna, Brendan McMahan, Adam Smith, Fan Wu*
LAION-C: An Out-of-Distribution Benchmark for Web-Scale Vision Models
Fanfei Li, Thomas Klein, Wieland Brendel, Robert Geirhos, Roland S. Zimmermann
Learning-Order Autoregressive Models with Application to Molecular Graph Generation
Zhe Wang, Jiaxin Shi, Nicolas Heess, Arthur Gretton, Michalis Tistias
Learning to Keep a Promise: Scaling Language Model Decoding Parallelism with Learned Asynchronous Decoding
Tian Jin*, Ellie Cheng, Zachary Ankner, Nikunj Saunshi, Blake Elias, Amir Yazdanbakhsh, Jonathan Ragan-Kelley, Suvinay Subramanian, Michael Carbin
LLM Alignment as Retriever Optimization: An Information Retrieval Perspective
Bowen Jin, Jinsung Yoon, Zhen Qin, Ziqi Wang, Wei Xiong, Yu Meng, Jiawei Han, Sercan Ö. Arik
LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations
Anian Ruoss, Fabio Pardo, Harris Chan, Bonnie Li, Vlad Mnih, Tim Genewin
Masked Generative Nested Transformers with Decode Time Scaling
Sahil Goyal, Debapriya Tula*, Gagan Jain, Pradeep Shenoy, Prateek Jain, Sujoy Paul
On the Learnability of Distribution Classes with Adaptive Adversaries
Tosca Lechner, Alex Bie, Gautam Kamath
Randomized Dimensionality Reduction for Euclidean Maximization and Diversity Measures
Jie Gao, Rajesh Jayaram, Benedikt Kolbe, Shay Sapir, Chris Schwiegelshohn, Sandeep Silwal, Erik Waingarten
Relative Error Fair Clustering in the Weak-Strong Oracle Model
Vladimir Braverman, Prathamesh Dharangutte, Shaofeng Jiang, Hoai-An Nguyen, Chen Wang, Yubo Zhang, Samson Zhou
Solving Zero-Sum Convex Markov Games
Fivos Kalogiannis, Emmanouil-Vasileios Vlatakis-Gkaragkounis, Ian Gemp, Georgios Piliouras
Synthetic Text Generation for Training Large Language Models via Gradient Matching
Dang Nguyen, Zeman Li, MohammadHossein Bateni, Vahab Mirrokni, Meisam Razaviyayn, Baharan Mirzasoleiman
A Bayesian Model Selection Criterion for Selecting Pretraining Checkpoints
Michael Munn, Susan Wei
Accelerated Diffusion Models via Speculative Sampling
Valentin De Bortoli, Alexandre Galashov, Arthur Gretton, Arnaud Doucet
Adapting to Evolving Adversaries with Regularized Continual Robust Training
Sihui Dai, Christian Cianfarani, Arjun Bhagoji, Vikash Sehwag, Prateek Mittal
An Analysis of Quantile Temporal-Difference Learning
Mark Rowland, Rémi Munos, Mohammad Gheshlaghi Azar, Yunhao Tang, Georg Ostrovski, Anna Harutyunyan, Karl Tuyls, Marc G. Bellemare, Will Dabney
Approximate Differential Privacy of the L2 Mechanism
Matthew Joseph, Alex Kulesza, Alexander Yu
Balancing the Scales: A Theoretical and Algorithmic Framework for Learning from Imbalanced Data
Corinna Cortes, Anqi Mao, Mehryar Mohri, Yutao Zhong
Benefits of Early Stopping in Gradient Descent for Overparameterized Logistic Regression
Jingfeng Wu, Peter L. Bartlett, Matus Telgarsky, Bin Yu
Compression via Pre-trained Transformers: A Study on Byte-Level Multimodal Data
David Heurtel-Depeiges*, Anian Ruoss, Joel Veness, Tim Genewein
Convex Markov Games: A New Frontier for Multi-Agent Reinforcement Learning
Ian Gemp, Andreas Haupt, Luke Marris, Siqi Liu, Georgios Piliouras
Design Considerations in Offline Preference-Based RL
Alekh Agarwal, Christoph Dann, Teodor Marinov
Empirical Privacy Variance
Yuzheng Hu, Fan Wu, Ruicheng Xian, Yuhang Liu, Lydia Zakynthinou, Pritish Kamath, Chiyuan Zhang, David Forsyth
EvoControl: Multi-Frequency Bi-Level Control for High-Frequency Continuous Control
Samuel Holt*, Dhruva Tirumala, Todor Davchev, Ben Moran, Atil Iscen, Antoine Laurens, Yixin Lin, Erik Frey, Francesco Romano, Markus Wulfmeier, Nicolas Heess
Improving Transformer World Models for Data-Efficient RL
Antoine Dedieu, Joseph Ortiz, Xinghua Lou, Carter Wendelken, Wolfgang Lehrach, J. Swaroop Guntupalli, Miguel Lazaro-Gredilla, Kevin Murphy
Position: Machine Learning Models Have a Supply Chain Problem
Sarah Meiklejohn, Hayden Blauzvern, Mihai Maruseac, Spencer Schrock, Laurent Simon, Ilia Shumailov
Retraining with Predicted Hard Labels Provably Increases Model Accuracy
Rudrajit Das*, Inderjit S Dhillon, Alessandro Epasto, Adel Javanmard, Jieming Mao, Vahab Mirrokni, Sujay Sanghavi, Peilin Zhong
Scalable Private Partition Selection via Adaptive Weighting
Justin Y. Chen, Vincent Cohen-Addad, Alessandro Epasto, Morteza Zadimoghaddam
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-Training
Tianzhe Chu, Yuexiang Zhai, Jihan Yang, Shengbang Tong, Saining Xie, Dale Schuurmans, Quoc V. Le, Sergey Levine, Yi Ma
Stochastic Deep Restoration Priors for Imaging Inverse Problems
Yuyang Hu, Albert Peng, Weijie Gan, Peyman Milanfar, Mauricio Delbracio, Ulugbek Kamilov
A Tale of Two Structures: Do LLMs Capture the Fractal Complexity of Language?
Ibrahim Alabdulmohsin, Andreas Steiner
Wasserstein Policy Optimization
David Pfau, Ian Davies, Diana Borsa, João G. M. Araújo, Brendan Tracey, Hado van Hasselt
Can Transformers Reason Logically? A Study in SAT Solving
Leyan Pan, Vijay Ganesh, Jacob Abernethy, Chris Esposo, Wenke Lee
DeepCrossAttention: Supercharging Transformer Residual Connections
Mike Heddes, Adel Javanmard, Kyriakos Axiotis, Thomas Fu, MohammadHossein Bateni, Vahab Mirrokni
Diffusion Instruction Tuning
Chen Jin, Ryutaro Tanno, Amrutha Saseendran, Tom Diethe, Philip Teare
Distributional Diffusion Models with Scoring Rules
Valentin De Bortoli, Alexandre Galashov, J. Swaroop Guntupalli, Guangyao Zhou, Kevin Murphy, Arthur Gretton, Arnaud Doucet
From Jack of All Trades to Master of One: Specializing LLM-Based Autoraters to a Test Set
Mara Finkelstein, Daniel Deutsch, Parker Riley, Juraj Juraska, Geza Kovacs, Markus Freitag
Integer Programming for Generalized Causal Bootstrap Designs
Jennifer Brennan, Sébastien Lahaie, Adel Javanmard, Nick Doudchenko*, Jean Pouget-Abadie
MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities Against Hard Perturbations
Kaixuan Huang, Jiacheng Guo, Zihao Li, Xiang Ji, Jiawei Ge, Wenzhe Li, Yingqing Guo, Tianle Cai, Hui Yuan, Runzhe Wang, Yue Wu, Ming Yin, Shange Tang, Yangsibo Huang, Chi Jin, Xinyun Chen, Chiyuan Zhang, Mengdi Wang
Matryoshka Quantization
Pranav Nair, Puranjay Datta, Jeff Dean, Prateek Jain, Aditya Kusupati
Near-Optimal Regret Using Policy Optimization in Online MDPs with Aggregate Bandit Feedback
Tal Lancewicki, Yishay Mansour
Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination’s Impact on Machine Translation
Muhammed Yusuf Kocyigit*, Eleftheria Briakou, Daniel Deutsch, Jiaming Luo, Colin Cherry, Markus Freitag
Principled Algorithms for Optimizing Generalized Metrics in Binary Classification
Anqi Mao, Mehryar Mohri, Yutao Zhong
QuEst: Enhancing Estimates of Quantile-Based Distributional Measures Using Model Predictions
Zhun Deng, Tom Zollo, Benjamin Eyre, Amogh Inamdar, David Madras, Richard Zemel
Structured Preconditioners in Adaptive Optimization: A Unified Analysis
Shuo Xie, Tianhao Wang, Sashank Reddi, Sanjiv Kumar, Zhiyuan Li
Theoretical Guarantees on the Best-of-n Alignment Policy
Ahmad Beirami, Alekh Agarwal, Jonathan Berant, Alexander D'Amour, Jacob Eisenstein, Chirag Nagpal, Ananda Theertha Suresh
Three-Dimensional Trajectory Prediction with 3DMoTraj Dataset
Hao Zhou, Xu Yang, Mingyu Fan, Lu Qi, Ming-Hsuan Yang, Fei Luo
What Makes a Good Feedforward Computational Graph?
Alex Vitvitskyi, João G. M. Araújo, Marc Lackenby, Petar Veličković
Almost Optimal Fully Dynamic k-Center Clustering with Recourse
Sayan Bhattacharya, Martín Costa, Ermiya Farokhnejad, Silvio Lattanzi, Nikos Parotsidis
AuPair: Golden Example Pairs for Code Repair
Aditi Mavalankar, Hassan Mansoor, Zita Marinho, Masha Samsikova, Tom Schaul
Best of Both Worlds: Advantages of Hybrid Graph Sequence Models
Ali Behrouz, Ali Parviz, Mahdi Karami, Clayton Sanford, Bryan Perozzi, Vahab Mirrokni
Bipartite Ranking from Multiple Labels: On Loss Versus Label Aggregation
Michal Lukasik, Lin Chen, Harikrishna Narasimhan, Aditya Krishna Menon, Wittawat Jitkrittum, Felix X. Yu, Sashank J. Reddi, Gang Fu, Mohammadhossein Bateni, Sanjiv Kumar
Fast Tensor Completion via Approximate Richardson Iteration
Mehrdad Ghadiri, Matthew Fahrbach, Yunbum Kook, Ali Jadbabaie
Feynman-Kac Correctors in Diffusion: Annealing, Guidance, and Product of Experts
Marta Skreta, Tara Akhound-Sadegh, Viktor Ohanesian, Roberto Bondesan, Alan Aspuru-Guzik, Arnaud Doucet, Rob Brekelmans, Alexander Tong, Kirill Neklyudov
FunBO: Discovering Acquisition Functions for Bayesian Optimization with FunSearch
Virginia Aglietti, Ira Ktena, Jessica Schrouff*, Eleni Sgouritsa, Francisco J. R. Ruiz, Alan Malek, Alexis Bellot, Silvia Chiappa
The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence
Tom Wollschläger, Jannes Elstner, Simon Markus Geisler, Vincent Cohen-Addad, Stephan Günnemann, Johannes Gasteiger
Hardware and Software Platform Inference
Cheng Zhang, Hanna Foerster, Robert D. Mullins, Yiren Zhao, Ilia Shumailov
How Expressive Are Knowledge Graph Foundation Models?
Xingyue Huang, Pablo Barceló, Michael Bronstein, İsmail Ceylan, Mikhail Galkin, Juan Reutter, Miguel Romero Orth
In-Context Fine-Tuning for Time-Series Foundation Models
Matthew Faw*, Rajat Sen, Yichen Zhou, Abhimanyu Das
Learning From Others' Mistakes: Finetuning Machine Translation Models with Span-level Error Annotations
Lily H. Zhang, Hamid Dadkhahi, Mara Finkelstein, Firas Trabelsi, Jiaming Luo, Markus Freitag
The Limits of Predicting Agents from Behaviour
Alexis Bellot, Jonathan Richens, Tom Everitt
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
Marwa Abdulhai, Isadora White, Charlie Snell, Charles Sun, Joey Hong, Yuexiang Zhai, Kelvin Xu, Sergey Levine
Mastering Multiple-Expert Routing: Realizable H-Consistency and Strong Guarantees for Learning to Defer
Anqi Mao, Mehryar Mohri, Yutao Zhong
On Mitigating Affinity Bias Through Bandits with Evolving Biased Feedback
Matthew Faw, Constantine Caramanis, Jessica Hoffmann
Position: Iterative Online-Offline Joint Optimization Is Needed to Manage Complex LLM Copyright Risks
Yanzhou Pan, Jiayi Chen, Jiamin Chen, Zhaozhuo Xu, Denghui Zhang
Preference Adaptive and Sequential Text-to-Image Generation
Ofir Nabati, Guy Tennenholtz, ChihWei Hsu, Moonkyung Ryu, Deepak Ramachandran, Yinlam Chow, Xiang Li, Craig Boutilier
Provable Length Generalization in Sequence Prediction via Spectral Filtering
Annie Marsden, Evan Dogariu, Naman Agarwal, Xinyi Chen, Daniel Suo, Elad Hazan
Regression for the Mean: Auto-Evaluation and Inference with Few Labels Through Post-Hoc Regression
Benjamin Eyre*, David Madras
REINFORCE Adversarial Attacks on Large Language Models: An Adaptive, Distributional, and Semantic Objective
Simon Geisler, Tom Wollschläger, M. Hesham Abdalla, Vincent Cohen-Addad, Johannes Gasteiger, Stephan Günnemann
Reward-Guided Prompt Evolving in Reinforcement Learning for LLMs
Ziyu Ye, Rishabh Agarwal, Tianqi Liu, Rishabh Joshi, Sarmishta Velury, Quoc V. Le, Qijun Tan, Yuan Liu
Scalable Meta-Learning via Mixed-Mode Differentiation
Iurii Kemaev, Dan A. Calian, Luisa M. Zintgraf, Gregory Farquhar, Hado van Hasselt
SLiM: One-Shot Quantization and Sparsity with Low-Rank Approximation for LLM Weight Compression
Mohammad Mozaffari, Amir Yazdanbakhsh, Maryam Mehri Dehnavi
Synthesizing Privacy-Preserving Text Data via Finetuning Without Finetuning Billion-Scale LLMs
Bowen Tan*, Zheng Xu, Eric Xing, Zhiting Hu, Shanshan Wu
Algorithms and Hardness for Active Learning on Graphs
Vincent Cohen-Addad, Silvio Lattanzi, Simon Meierhans
Best of Both Worlds: Regret Minimization Versus Minimax Play
Adrian Müller, Jon Schneider, Stratis Skoulakis, Luca Viano, Volkan Cevher
The Brain's Bitter Lesson: Scaling Speech Decoding with Self-Supervised Learning
Dulhan Jayalath, Gilad Landau, Brendan Shillingford, Mark Woolrich, ʻŌiwi Parker Jones
Breaking the Quadratic Barrier: Robust Cardinality Sketches for Adaptive Queries
Edith Cohen, Mihir Singhal, Uri Stemmer
Categorical Distributional Reinforcement Learning with Kullback-Leibler Divergence: Convergence and Asymptotics
Tyler Kastner, Mark Rowland, Yunhao Tang*, Murat Erdogdu, Amir-massoud Farahmand
Catoni Contextual Bandits Are Robust to Heavy-Tailed Rewards
Chenlu Ye, Yujia Jin*, Alekh Agarwal, Tong Zhang
Correlation Clustering Beyond the Pivot Algorithm
Soheil Behnezhad, Moses Charikar, Vincent Cohen-Addad, Alma Ghafari, Weiyun Ma
Deliberation in Latent Space via Differentiable Cache Augmentation
Luyang Liu, Jonas Pfeiffer, Jiaxing Wu, Jun Xie, Arthur Szlam
Differentially Private Space-Efficient Algorithms for Counting Distinct Elements in the Turnstile Model
Rachel Cummings, Alessandro Epasto, Jieming Mao, Tamalika Mukherjee, Tingting Ou, Peilin Zhong
Direct Motion Models for Assessing Generated Videos
Kelsey Allen, Carl Doersch, Guangyao Zhou, Mohammed Suhail, Danny Driess, Ignacio Rocco, Yulia Rubanova, Thomas Kipf, Mehdi Sajjadi, Kevin Murphy, João Carreira, Sjoerd van Steenkiste
Epsilon-VAE: Denoising as Visual Decoding
Long Zhao, Sanghyun Woo, Ziyu Wan*, Yandong Li, Han Zhang, Boqing Gong, Hartwig Adam, Xuhui Jia, Ting Liu
Exact Risk Curves of SignSGD in High-Dimensions: Quantifying Preconditioning and Noise-Compression Effects
Kevin Xiao, Noah Marshall, Atish Agarwala, Elliot Paquette
General Agents Need World Models
Jonathan Richens, David Abel, Tom Everitt
Gradient Descent Converges Arbitrarily Fast for Logistic Regression via Large and Adaptive Stepsizes
Ruiqi Zhang, Jingfeng Wu, Peter L. Bartlett
InfAlign: Inference-Aware Language Model Alignment
Ananth Balashankar, Ziteng Sun, Jonathan Berant, Jacob Eisenstein, Michael Collins, Adrian Hutter, Jong Lee, Chirag Nagpal, Flavien Prost, Aradhana Sinha, Ananda Suresh, Ahmad Beirami
Joint Learning of Energy-based Models and Their Partition Function
Michaël E. Sander, Vincent Roulet, Tianlin Liu, Mathieu Blondel
Loss Functions and Operators Generated by f-Divergences
Vincent Roulet, Tianlin Liu, Nino Vieillard, Michaël E. Sander, Mathieu Blondel
LAuReL: Learned Augmented Residual Layer
Gaurav Menghani, Ravi Kumar, Sanjiv Kumar
Leveraging Per-Instance Privacy for Machine Unlearning
Naz Sepahvand, Anvith Thudi, Berivan Isik, Ashmita Bhattacharyya, Nicolas Papernot, Eleni Triantafillou, Daniel Roy, Gintare Karolina Dziugaite
Nearly Optimal Sample Complexity for Learning with Label Proportions
Robert Busa-Fekete, Travis Dick, Claudio Gentile, Haim Kaplan, Tomer Koren, Uri Stemmer
Positional Attention: Expressivity and Learnability of Algorithmic Computation
Artur Back de Luca, George Giapitzakis, Shenghao Yang, Petar Veličković, Kimon Fountoulakis
Position: Graph Learning Will Lose Relevance Due to Poor Benchmarks
Maya Bechler-Speicher, Ben Finkelshtein, Fabrizio Frasca, Luis Müller, Jan M Tönshoff, Antoine Siraudin, Viktor Zaverkin, Michael Bronstein, Mathias Niepert, Bryan Perozzi, Mikhail Galkin, Christopher Morris
Position: Stop Treating ‘AGI’ as the North-Star Goal of AI Research
Borhane Blili-Hamelin, Chris Graziul, Leif Hancox-Li, Hananel Hazan, El-Mahdi El-Mhamdi, Avijit Ghosh, Katherine Heller, Jacob Metcalf, Fabricio Murai, Eryk Salvaggio, Andrew Smart, Todd Snider, Mariame Tighanimine, Talia Ringer, Margaret Mitchell, Shiri Dori-Hacohen
Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty
Meera Hahn, Wenjun Zeng, Nithish Kannen, Rich Galt, Kartikeya Badola, Been Kim, Zi Wang
Retrieval Augmented Time Series Forecasting
Sungwon Han, Seungeon Lee, Meeyoung Cha, Sercan Ö. Arik, Jinsung Yoon
Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification
Eric Zhao, Pranjal Awasthi, Sreenivas Gollapudi
Softmax Is Not Enough (for Sharp Size Generalisation)
Petar Veličković, Christos Perivolaropoulos, Federico Barbero*, Razvan Pascanu
SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity
Samir Khaki, Xiuyu Li, Junxian Guo, Ligeng Zhu, Konstantinos N. Plataniotis, Amir Yazdanbakhsh, Kurt Keutzer, Song Han, Zhijian Liu
Concurrent Reinforcement Learning with Aggregated States via Randomized Least Squares Value Iteration
Yan Chen, Jerry Bai, Yiteng Zhang, Maria Dimakopoulou, Shi Dong, Qi Sun, Zhengyuan Zhou
Convergence of Policy Mirror Descent Beyond Compatible Function Approximation
Uri Sherman, Tomer Koren, Yishay Mansour
Efficient and Privacy-Preserving Soft Prompt Transfer for LLMs
Xun Wang, Jing Xu, Franziska Boenisch, Michael Backes, Christopher A. Choquette-Choo, Adam Dziedzic
EVOLvE: Evaluating and Optimizing LLMs for In-Context Exploration
Allen Nie, Yi Su, Bo Chang, Jonathan N. Lee, Ed H. Chi, Quoc V. Le, Minmin Chen
Geometric Median (GM) Matching for Robust k-Subset Selection from Noisy Data
Anish Acharya, Sujay Sanghavi, Alex Dimakis, Inderjit Dhillon
Implicit Bias of Gradient Descent for Non-Homogeneous Deep Networks
Yuhang Cai, Kangjie Zhou, Jingfeng Wu, Song Mei, Michael Lindsey, Peter L. Bartlett
Interpreting the Repeated Token Phenomenon in Large Language Models
Itay Yona, Jamie Hayes, Ilia Shumailov, Yossi Gandelsman
Maximum Coverage in Turnstile Streams with Applications to Fingerprinting Measures
Alina Ene, Alessandro Epasto, Vahab Mirrokni, Hoai-An Nguyen, Huy Nguyen, David P. Woodruff, Peilin Zhong
Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence
Shangbin Feng, Zifeng Wang, Yike Wang, Sayna Ebrahimi, Hamid Palangi, Lesly Miculicich, Achin Kulshrestha, Nathalie Rauschmayr, Yejin Choi, Yulia Tsvetkov, Chen-Yu Lee, Tomas Pfister
MONA: Myopic Optimization with Non-Myopic Approval Can Mitigate Multi-Step Reward Hacking
Sebastian Farquhar, Vikrant Varma, David Lindner, David Elson, Caleb Biddulph, Ian Goodfellow, Rohin Shah
On Teacher Hacking in Language Model Distillation
Daniil Tiapkin*, Daniele Calandriello, Johan Ferret, Sarah Perrin, Nino Vieillard, Alexandre Ramé, Mathieu Blondel
Rejecting Hallucinated State Targets During Planning
Mingde Zhao, Tristan Sylvain, Romain Laroche, Doina Precup, Yoshua Bengio
Scaling Laws for Differentially Private Language Models
Ryan McKenna, Yangsibo Huang, Amer Sinha, Borja Balle, Zachary Charles, Christopher A. Choquette-Choo, Badih Ghazi, Georgios Kaissis, Ravi Kumar, Ruibo Liu, Da Yu, Chiyuan Zhang
Towards Flexible Perception with Visual Memory
Robert Geirhos, Priyank Jaini, Austin Stone, Sourabh Medapati, Xi Yi, George Toderici, Abhijit Ogale, Jonathon Shlens
Workshops
-
Fri, Jul 18 | 8:30AM — 6:00PM, West Ballroom C
AI for MathOrganizer: Isabelle Guyon
Speaker: Swarat Chaudhuri -
Fri, Jul 18 | 8:45AM — 5:15PM, West Ballroom B
Assessing World Models: Methods and Metrics for Evaluating UnderstandingOrganizer: Michael Lepori
-
Fri, Jul 18 l 9:00AM — 6:00PM, West Meeting Room 211-214
CODEML: Championing Open-source DEvelopment in Machine LearningSpeaker: Matthew Johnson
-
Fri, Jul 18 | 8:30AM — 6:00PM, East Ballroom A
Computational Optimization of Buildings (CO-BUILD)Organizers: Judah Goldfeder, John Sipple
Panelist: John Sipple -
Fri, Jul 18 | 8:30AM — 6:00PM, East Exhibition Hall A
Generative AI for BiologyOrganizers: Minkai Xu, Zhenqiao Song
-
Fri, Jul 18 l 9:00AM — 5:30PM, West Meeting Room 118-120
High-dimensional Learning Dynamics (HiLD)Organizer: Atish Agarwala
Speaker: Yasaman Bahri -
Fri, Jul 18 | 9:00AM — 5:00PM, West Meeting Room 202-204
Machine Unlearning for Generative AIOrganizers: Vaidehi Patil, Katherine Lee
Speaker: Eleni Triantafillou -
Fri, Jul 18 | 9:00AM — 5:00PM, West Ballroom A
Models of human Feedback for AI Alignment (MoFA)Organizers: Maria Teresa Parreira, Anca Dragan
Speaker: Natasha Jaques
Panelist: Natasha Jaques -
Fri, Jul 18 | 9:00AM — 5:15PM, West Meeting Room 109-110
Multi-Agent Systems in the Era of Foundation Models: Opportunities, Challenges and FuturesSpeakers: Natasha Jaques, Yilun Du
-
Fri, Jul 18 | 8:20AM — 5:30PM, West Meeting Room 301-305
Programmatic Representations for Agent LearningOrganizers: Xinyun Chen, Ching-An Cheng, Kuang-Huei Lee
Speakers: Dale Schuurmans, Wenhao Yu -
Fri, Jul 18 | 9:00AM — 5:20PM, West Meeting Room 223-224
Scaling Up Intervention ModelsSpeaker: Amir Feder
-
Fri, Jul 18 | 8:30AM — 5:15PM, West Meeting Room 215-216
Tiny Titans: The Next Wave of On-Device Learning for Foundation Models (TTODLer-FM)Organizers: Berivan Isik, Peter Kairouz
Speaker: Morgane Rivière -
Fri, Jul 18 | 8:30AM — 6:00PM, West Meeting Room 111-112
Tokenization Workshop (TokShop)Organizer: Elizabeth Salesky
-
Sat, Jul 19 | 8:30AM — 6:00PM,, East Ballroom A
Actionable InterpretabilityOrganizers: Ian Tenney, Mor Geva
Speaker: Been Kim -
Sat, Jul 19 | 9:00AM — 5:00PM, West Meeting Room 118-120
AI Heard That! ICML 2025 Workshop on Machine Learning for AudioOrganizers: Sander Dieleman, Chris Donahue, Shrikanth Narayanan
Speaker: Daniel PW Ellis -
Sat, Jul 19 8:50AM — 5:45PM, West Ballroom D
Building Physically Plausible World ModelsOrganizers: Yilun Du, Hiroki Furuta, Ruiqi Gao, Sean Kirmani, Kuang-Huei Lee, Wenhao Yu
Speakers: Sherry Yang, Agrim Gupta -
Sat, Jul 19 | 8:30AM — 5:15PM, West Meeting Room 215-216
Collaborative and Federated Agentic Workflow (CFAgentic @ ICML'25)Speakers: Aviral Kumar, Chi Wang
-
Sat, Jul 19 | 8:45AM — 6:30PM, West Meeting Room 211-214
Computer Use AgentsOrganizer: Doina Precup
Speaker: Sercan Ö. Arik -
Sat, Jul 19 | 8:30AM — 6:00PM, East Exhibition Hall A
ES-FoMo III: Efficient Systems for Foundation ModelsOrganizer: Azalia Mirhoseini
-
Sat, Jul 19 | 8:30AM — 5:15PM, West Meeting Room 205-207
Exploration in AI Today (EXAIT)Advisor: Olivier Bachem
Speaker: Natasha Jaques -
Sat, Jul 19 | 8:50AM — 6:00PM, West Ballroom C
Reliable and Responsible Foundation ModelsOrganizer: David Madras
-
Sat, Jul 19 | 8:25AM — 4:50PM, West Meeting Room 223-224
The Impact of Memorization on Trustworthy Foundation ModelsSpeaker: Reza Shokri
Demos and Q&A at the Google Booth
*Dates and times may be subject to change. Stop by the Google booth (#225) for more details.
-
Tuesday, July 15 | 9:30 AM - 10:00 AM
Taming the Wild West of ML: From Model Signing to Trust in ML SystemsPresenter: Hayden Blauzvern, Mihai Maruseac, Sarah Meiklejohn
-
Tuesday, July 15 | 11:30 AM - 12:00 PM
Project AstraPresenter: Tony Nguyễn, Lei Shu
-
Tuesday, July 15 | 12:30 PM - 1:00 PM
Graph Foundation Models for Relational DataPresenters: Michael Galkin, Bryan Perozzi
-
Tuesday, July 15 | 1:30 PM - 2:00 PM
Google Cloud TPUs: Specialized Power for AIPresenter: Ran Ran
-
Tuesday, July 15 | 3:00 PM - 3:30 PM
SequenceLayers: Sequence Processing and Streaming Neural Networks Made EasyPresenters: Julian Salazar
-
Tuesday, July 15 | 4:30 PM - 5:00 PM
Introduction to the Gemini APIs and AI StudioPresenter: Paige Bailey
-
Wednesday, July 16 | 9:30 AM — 10:00 AM
State of the Art Cyclone Track and Intensity Forecasting with Functional Generative NetworksPresenter: Dominic Masters
-
Wednesday, July 16 | 11:30 AM — 12:00 PM
AI-First Drug Design at Isomorphic Labs: Towards a System for Rational Drug Design with AIPresenters: Jack Lynch, Benedek Rózemberczki
-
Wednesday, July 16 | 12:30 PM — 1:00 PM
AI Agent Training with Accelerated Physical Simulation for Building Operations OptimizationPresenters: Judah Goldfeder, John Sipple
-
Wednesday, July 16 | 1:30 PM — 2:30 PM
3D Robotics Transformers: Learning the RoPEs with STRINGPresenters: Alex Bewley, Krzysztof Choromanski, Mithun Jacob, Deepali Jain, David Rendleman, Connor Schenck
-
Wednesday, July 16 | 3:00 PM — 3:30 PM
Hex-LLM: High-Efficiency Large Language Model Serving on TPUs in Vertex AI Model GardenPresenter: Pooya Moradi
-
Wednesday, July 16 | 4:30 PM — 5:00 PM
Introduction to Vertex Model Development ServicePresenter: Mohammadreza Mohseni
-
Thursday, July 17 | 9:30 AM — 10:00 AM
Proactive Agents for Multi-Turn Text-to-Image Generation Under UncertaintyPresenters: Meera Hahn, Zi Wang
-
Thursday, July 17 | 11:30 AM — 12:00 PM
JAX and Its EcosystemPresenter: Chris Achard
Board & Organizing Committee
-
Corinna Cortes
- Board Member
-
Katherine Heller
- Board Member
-
Courtney Paquette
- Workshop Chair
-
Alexander D'Amour
- Accessibility and Diversity & Inclusion Chair
*Work done while at Google
Socials
Mon, Jul 14 | 8:00AM — 5:30PM, West Meeting Room 118-120
Navigating Generative AI and LLMs Across LanguagesOrganizers: Nikka Mofid, Kristina Nasr, Ching-Ching Liu, Christian Latunos, Patrick Tan, Austin Li