You are viewing in chronological order, alternatively, you can check out my Google Scholar.

“SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation”
[Arxiv 2025]
Authors: Haoquan Fang, Markus Grotz, Wilbert Pumacay, Yi Ru Wang, Dieter Fox*, Ranjay Krishna*, Jiafei Duan*

“AHA: A Vision-Language-Model for Detecting and Reasoning over Failures in Robotic Manipulation”
[ICLR 2025]
Authors: Jiafei Duan, Wilbert Pumacay, Nishanth Kumar, Yi Ru Wang, Shulin Tian, Wentao Yuan, Ranjay Krishna, Dieter Fox, Ajay Mandlekar*, Yijie Guo*

“SAT: Spatial Aptitude Training for Multimodal Language Models”
[Arxiv 2024]
Authors: Arijit Ray, Jiafei Duan, Reuben Tan, Dina Bashkirova, Ross Hendrix, Kiana Ehsani, Aniruddha Kembhavi, Bryan A. Plummer, Ranjay Krishna*, Kuo-Hao Zeng*, Kate Saenko*

“Manipulate-Anything:
Automating Real-World Robots using Vision-Language Models”
[CoRL 2024]
Authors: Jiafei Duan*, Wentao Yuan*, Wilbert Pumacay, Yi Ru Wang, Kiana Ehsani, Dieter Fox, Ranjay Krishna

“RoboPoint: A Vision-Language Model for
Spatial Affordance Prediction for Robotic”
[CoRL 2024]
Authors: Wentao Yuan, Jiafei Duan, Valts Blukis, Wilbert Pumacay, Ranjay Krishna Adithyavairavan Murali,Arsalan Mousavian, Dieter Fox
[Project page][Paper][Demo][Checkpoint][Code]

“EVE: Enabling Anyone to Train Robot using Augmented Reality”
[UIST 2024]
(24% Acceptance)
Authors: Jun Wang, Chun-Cheng Chang*, Jiafei Duan*, Dieter Fox, Ranjay Krishna

“Octopi: Object Property Reasoning with Large Tactile-Language Models”
[RSS 2024, Oral Presentation (28%)]
Authors: Samson Yu, Kelvin Lin, Anxing Xiao, Jiafei Duan, Harold Soh

“THE COLOSSEUM: A Benchmark for Evaluating Generalization for Robotic Manipulation”
[RSS 2024, Oral Presentation (28%)]
Authors: Jiafei Duan*, Ishika Singh*, Wibert Pumacay*, Ranjay Krishna, Jesse Thomason, Dieter Fox
[Paper][Project Page][Code][Real-world setup]

“Selective Visual Representations Improve Convergence and Generalization for Embodied-AI”
[ICLR 2024, Spotlight (5%) ]
Authors: Ainaz Eftekhar*, Kuo-Hao Zeng*, Jiafei Duan, Ali Farhadi, Ani Kembhavi , Ranjay Krishna

“NEWTON: Are Language models Capable of Physical Reasoning”
[EMNLP Findings 2023]
Authors: Yi Ru Wang, Jiafei Duan, Dieter Fox, Siddhartha Srinivasa
[Paper][Project Page][Code][Dataset]

“AR2-D2:Training a Robot Without a Robot”
[CoRL 2023]
(Poster)
Authors: Jiafei Duan, Yi Ru Wang, Mohit Shridhar, Dieter Fox, Ranjay Krishna
[Paper][Project Page][My Talk][Dieter Fox’s Talk][Code]

“BOSS: A Benchmark for Human Belief Prediction in Object-context Scenarios”
ArXiv Preprint
Authors: Jiafei Duan*, Samson Yu*, Nicholas Tan, Li Yi, Cheston Tan

“Good Time to Ask: A Learning Framework for Asking for Help in Embodied Visual Navigation”
Ubiquitous Robots 2023 (Best Paper Award)
Authors: Jenny Zhang, Samson Yu, Jiafei Duan, Cheston Tan

“A Benchmark for Modeling Violation-of-Expectation in Physical Reasoning Across Event Categories”
[CogSci 2023]
(Poster)
Authors: Arijit Dasgupta, Jiafei Duan, Marcelo Ang, Yi Lin, Su-Hua Wang, Renée Baillargeon, Cheston Tan

“ABCDE: An Agent-Based Cognitive Development Environment”
[CVPR 2022]
(Embodied AI Workshop)
Authors: Jieyi Ye, Jiafei Duan, Samson Yu, Bihan Wen, Cheston Tan

“A Survey on Machine Learning Approaches for Modelling Intuitive Physics ”
[IJCAI-ECCAI 2022]
(Survey Track (Oral) , 18% Acceptance Rate)
Authors: Jiafei Duan*, Arijit Dasgupta*, Jason Fischer, Cheston Tan
[Paper] [Project Page] [Video]
“PIP: Physical Interaction Prediction via Mental Simulation with Span Selection”
[ECCV 2022]
(Poster Presentation, 28% Acceptance Rate)
Authors: Jiafei Duan*, Samson Yu*, Soujanya Poria, Bihan Wen, Cheston Tan

“AVoE: A Synthetic 3D Dataset on Understanding Violation of Expectation for Artificial Cognition ”
[NeurIPS 2021]
(Physical Reasoning and Inductive Bias Workshop)
Authors: Arijit Dasgupta, Jiafei Duan, Marcelo H.Ang Jr, Cheston Tan

“SPACE: A Simulator for Physical Interactions and Causal Learning in 3D Environments ”
[ICCV2021]
(Simulation Technology for Embodied AI Workshop)
Authors: Jiafei Duan, Samson Yu, Cheston Tan
[Paper] [Code] [Video] [Project Page]

“A Survey of Embodied AI: From Simulators to Research Tasks.”
[IEEE Transactions on Emerging Topics in Computational Intelligence]
(CIS Journal Featured Publication)
Authors: Jiafei Duan, Samson Yu, Tan Hui Li, Hongyuan Zhu, Cheston Tan

“ActioNet: An Interactive End-to-End Platform for Tasked-Based Data Collection and Augmentation in 3D Environment.”
[ICIP 2020]
(Poster Presentation, 47.9% Acceptance Rate)
Authors: Jiafei Duan, Samson Yu, Tan Hui Li, Cheston Tan