Stable Offline Value Function Learning with Bisimulation-based Representations
[arxiv]Brahma S. Pavse, Yudong Chen, Qiaomin Xie, Josiah P. Hanna
Proceedings of the 42nd International Conference on Machine Learning (ICML), July 2025.  
I am a fourth-year Computer Science PhD candidate at the University of Wisconsin-Madison, where I am advised by Josiah Hanna. My research is supported by the Cisco Systems Distinguished Graduate Fellowship. During Summer 2025, I will be a machine learning intern at Netflix Research. I have also worked as an AI research intern at Sony AI.
I am broadly interested in representation learning and abstractions for reinforcement learning. Poorly learned representations can lead to data-inefficient learning, instability, and high variance. My work studies how RL agents can learn appropriate representations to make reliable predictions about their environment for validation and control.
Previously, I completed my BS and MS in Computer Science from the University of Texas at Austin, where I was fortunate to be advised by Peter Stone. I also worked as a software engineer at Salesforce and SAS Institute.
Feel free to shoot me an email if you want to chat!