Publications
- HAZARD Challenge: Embodied Decision Making in Dynamically Changing Environments. ICLR 2024, 2023
- Iteratively Learn Diverse Strategies with State Distance Information. NeurIPS 2023, 2023
- Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble. NAACL 2024 Submission, 2023
- SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge. CVPR 2024 Submission, 2023