Paper Notes on Robot-Related Physical AI
Focused on work from Jiajun Wu and Yilun Du
- Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas
- Spatial Mental Modeling from Limited Views
- Autoregressive Flow Matching for Motion Prediction
- Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow
- Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
- Training Diffusion Models with Reinforcement Learning
- Improving Factuality and Reasoning in Language Models through Multiagent Debate
- Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making