Publications

2025

LATTE-MV: Learning to Anticipate Table Tennis Hits from Monocular Videos

LATTE-MV: Learning to Anticipate Table Tennis Hits from Monocular Videos

Daniel Etaat, Dvij Kalaria, Nima Rahmanian, S. Shankar Sastry

CVPR Conference on Computer Vision and Pattern Recognition , 2025

TL;DR: LATTE-MV is a scalable system that teaches robots to play table tennis by reconstructing 800+ hours of YouTube videos and using anticipatory control to improve performance against high-speed shots

Full-Order Sampling-Based MPC for Torque-Level Locomotion Control via Diffusion-Style Annealing

Full-Order Sampling-Based MPC for Torque-Level Locomotion Control via Diffusion-Style Annealing

Haoru Xue*, Chaoyi Pan*, Zeji Yi, Guannan Qu, Guanya Shi (* equal contribution)

ICRA International Conference on Robotics and Automation , 2025

TL;DR: DIAL-MPC is the first training-free method achieving real-time whole-body torque control using full-order dynamics.

AnyCar to Anywhere: Learning Universal Dynamics Model for Agile and Adaptive Mobility

AnyCar to Anywhere: Learning Universal Dynamics Model for Agile and Adaptive Mobility

Wenli Xiao*, Haoru Xue*, Tony Tao, Dvij Kalaria, John M. Dolan, Guanya Shi (* equal contribution)

ICRA International Conference on Robotics and Automation , 2025

TL;DR: AnyCar 🏎️ 🚗 🚙 🛻 🚚: a generalist dynamics model 🌎 built with transformer + large-scale sim pre-training + small-scale real fine-tuning. Achieves agile and adaptive control on a family of wheeled embodiments (few or zero shot) and outperforms specialist policies.