Publications
- HAZARD Challenge: Embodied Decision Making in Dynamically Changing Environments. ICLR 2024, 2023
- Iteratively Learn Diverse Strategies with State Distance Information. NeurIPS 2023, 2023
- Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble. NAACL 2024 Submission, 2023
- SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge. CVPR 2024 Submission, 2023