Anirudh Satheesh

Undergraduate Student, University of Maryland, College Park

anirudhs [AT] terpmail.umd.edu

About

Hello! I'm an undergraduate student in the Computer Science Department at University of Maryland, College Park. My research interest focuses on developing robust agents and agentic systems. I'm currently working on scaling robust reinforcement learning to foundation models and developing theoretical guarantees for such algorithms.

I've been fortunate to work with and be advised by the following professors: Furong Huang on LLM safety and reasoning, Radu Balan on physics-informed machine learning, Hua Wei on multi-agent reinforcement learning, and Vaneet Aggarwal on robust reinforcement learning.

Publications

Most recent publications on Google Scholar.
indicates equal contribution.

PICore: Physics-Informed Unsupervised Coreset Selection for Data Efficient Neural Operator Training

Anirudh Satheesh , Anant Khandelwal, Mucong Ding, Radu Balan

TMLR 2025

cMALC-D: Contextual Multi-Agent LLM-Guided Curriculum Learning with Diversity-Based Context Blending

Anirudh Satheesh , Keenan Powell, Hua Wei

CIKM 2025

Primal-Only Actor Critic Algorithm for Robust Constrained Average Cost MDPs

Anirudh Satheesh , Sooraj Sathish , Swetha Ganesh, Keenan Powell, Vaneet Aggarwal

Under Submission to AISTATS 2026

Distributionally Robust Self Paced Curriculum Reinforcement Learning

Anirudh Satheesh , Keenan Powell, Vaneet Aggarwal

Under Submission to AAMAS 2026

A Constrained Multi-Agent Reinforcement Learning Approach to Autonomous Traffic Signal Control

Anirudh Satheesh , Keenan Powell

ACM Journal of Autonomous Transportation Systems 2025

PICore: Physics-Informed Unsupervised Coreset Selection for Data Efficient Neural Operator Training

Anirudh Satheesh , Anant Khandelwal, Mucong Ding, Radu Balan

TMLR 2025

cMALC-D: Contextual Multi-Agent LLM-Guided Curriculum Learning with Diversity-Based Context Blending

Anirudh Satheesh , Keenan Powell, Hua Wei

CIKM 2025

Primal-Only Actor Critic Algorithm for Robust Constrained Average Cost MDPs

Anirudh Satheesh , Sooraj Sathish , Swetha Ganesh, Keenan Powell, Vaneet Aggarwal

Under Submission to AISTATS 2026

Distributionally Robust Self Paced Curriculum Reinforcement Learning

Anirudh Satheesh , Keenan Powell, Vaneet Aggarwal

Under Submission to AAMAS 2026

A Constrained Multi-Agent Reinforcement Learning Approach to Autonomous Traffic Signal Control

Anirudh Satheesh , Keenan Powell

ACM Journal of Autonomous Transportation Systems 2025

EnsemW2S: Can an Ensemble of LLMs be Leveraged to Obtain a Stronger LLM?

Aakriti Agrawal, Mucong Ding, Zora Che, Chenghao Deng, " Anirudh Satheesh , John Langford, Furong Huang"

NeurIPS 2024 Safe Generative AI Workshop

Model Tampering Attacks Enable More Rigorous Evaluations of LLM Capabilities

Zora Che, Stephen Casper, Robert Kirk, Anirudh Satheesh , Stewart Slocum, Lev E McKinney, Rohit Gandikota, Aidan Ewart, Domenic Rosati, Zichu Wu, Zikui Cai, Bilal Chughtai, Yarin Gal, Furong Huang, Dylan Hadfield-Menell"

TMLR 2025

MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

Zikui Cai, Andrew Wang, <> Anirudh Satheesh , Ankit Nakhawa, Hyunwoo Jae, Keenan Powell, Minghui Liu, Neel Jay, Sungbin Oh, Xiyao Wang, Yongyuan Liang, Tom Goldstein, Furong Huang

Under Submission to ICLR 2026

SAFLEX: Self-Adaptive Augmentation via Feature Label Extrapolation

Mucong Ding, Bang An, Yuancheng Xu, , Anirudh Satheesh , Furong Huang

ICLR 2024

Uncertainty-Aware Answer Selection for Improved Reasoning in Multi-LLM Systems

Aakriti Agrawal, Rohith Aralikatti, Anirudh Satheesh , Souradip Chakraborty, Amrit Singh Bedi, Furong Huang

EMNLP 2025 Findings

A Technical Report on 'Erasing the Invisible': The 2024 NeurIPS Competition on Stress Testing Image Watermarks

Mucong Ding, Bang An, Tahseen Rabbani, Chenghao Deng, Anirudh Satheesh , Souradip Chakraborty, Mehrdad Saberi, Yuxin Wen, Kyle Rui Sang, Aakriti Agrawal, Xuandong Zhao, Mo Zhou, Mary-Anne Hartley, Lei Li, Yu-Xiang Wang, Vishal M. Patel, Soheil Feizi, Tom Goldstein, Furong Huang

NeurIPS D&B 2025

CV

Full Resume in PDF.

Acknowledgements

This website is made using a template by Martin Saveski.