Brahma S. Pavse

Conference Papers

2025

Stable Offline Value Function Learning with Bisimulation-based Representations

[arxiv]
Brahma S. Pavse, Yudong Chen, Qiaomin Xie, Josiah P. Hanna
Proceedings of the 42nd International Conference on Machine Learning (ICML), July 2025.

2024

Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces

[arxiv] [code]
Brahma S. Pavse, Matthew Zurek, Yudong Chen, Qiaomin Xie, Josiah P. Hanna
Proceedings of the 41st International Conference on Machine Learning (ICML), July 2024.

2023

State-Action Similarity-Based Representations for Off-Policy Evaluation

[arxiv] [bibtex] [code]
Brahma S. Pavse, Josiah P. Hanna
Proceedings of the 36th Neural Information Processing Systems (NeurIPS), December 2023.

Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction (Oral Presentation)

[pdf] [bibtex]
Brahma S. Pavse, Josiah P. Hanna
Proceedings of the 37th Association for the Advancement of Artificial Intelligence (AAAI), February 2023.
An earlier version appeared at the Offline RL Workshop: Offline RL as a "Launchpad" at NeurIPS 2022.

2020

Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration

[pdf] [bibtex]
Brahma S. Pavse*, Faraz Torabi*, Josiah Hanna, Garrett Warnell, Peter Stone
*Equal contribution.
Contains material from my undergraduate honors thesis.
IEEE Robotics and Automation Letters, July 2020.
Proceedings of the 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2020), October 2020.
An earlier version appeared in the Imitation, Intent, and Interaction (I3) workshop at ICML 2019.

Reducing Sampling Error in Batch Temporal Difference Learning

[pdf] [bibtex]
Brahma S. Pavse, Ishan Durugkar, Josiah Hanna, Peter Stone
Master's thesis.
Proceedings of the 37th International Conference on Machine Learning (ICML 2020), July 2020.

Journal Articles

2020

Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration

Theses

Reducing Sampling Error in Batch Temporal Difference Learning

[pdf] [bibtex]
Brahma S. Pavse, advised by Peter Stone and Josiah Hanna
MS Thesis, University of Texas at Austin, 2020.

Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration

[pdf] [bibtex]
Brahma S. Pavse, advised by Peter Stone
BS Honors Thesis, University of Texas at Austin, 2019.

Brahma S. Pavse

Welcome!

News

Publications

Conference Papers

2025

Stable Offline Value Function Learning with Bisimulation-based Representations

2024

Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces

2023

State-Action Similarity-Based Representations for Off-Policy Evaluation

Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction (Oral Presentation)

2020

Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration

Reducing Sampling Error in Batch Temporal Difference Learning

Journal Articles

2020

Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration

Theses

Reducing Sampling Error in Batch Temporal Difference Learning

Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration