Real-time action chunking with large models

fennecbutt · 2025-06-17T23:13:23 1750202003

Alright, I'm building the robot project I was putting off. This is so fucking cool.

Excellent work!

jauntywundrkind · 2025-06-17T20:42:50 1750192970

Anyone have good intro recommendations for VLAs?

lachyg · 2025-06-18T16:43:44 1750265024

(I work at Pi.)

We open-sourced Pi0 (referenced in this post): https://github.com/Physical-Intelligence/openpi

UltraSane · 2025-06-18T05:41:16 1750225276

I love the implications of a robot that can plug in Ethernet cables.

lysp · 2025-06-19T03:10:20 1750302620

Just need one that can plug in USB-A cables on the first attempt (I average 3 attempts).

meepmorp · 2025-06-18T19:11:13 1750273873

“Soon, a robot will fix the cables in the server room for me!”

LoganDark · 2025-06-19T01:07:00 1750295220

New job title: Spaghetti Organizer

b0a04gl · 2025-06-18T16:40:16 1750264816

rtc handling 300ms+ delay and still pulling off tasks like plugging in ethernet is kinda nuts. what i'm not getting is how it's keeping the control loop stable without retraining. some sort of latent plan caching?

kvablack · 2025-06-18T17:42:40 1750268560

It uses an inpainting algorithm (adapted from image generation literature) to produce future actions that are consistent with the current trajectory. It's sort of like warm-starting from a cached plan, although the plan isn't latent, it's directly in action space. Hopefully that answers your question -- there are many more details in the blog post and paper :)
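
For readers who want a concrete picture of the "warm-starting in action space" idea, here is a toy sketch in plain Python. It is not the method from the blog post or paper: it swaps in simple replacement-style blending, and every name in it (soft_mask, inpaint_chunk, toy_denoise_step, the linear weight decay, scalar actions) is a hypothetical chosen for illustration. The assumption is that the first `delay` actions of the new chunk will already have been executed from the old chunk by the time inference finishes, so they are pinned to the old plan, while later actions are pulled toward it less and less.

```python
import random


def soft_mask(horizon, delay, overlap):
    """Per-step weight on the old plan (hypothetical linear schedule).

    1.0 for the first `delay` steps (they run before inference finishes),
    decaying linearly over the rest of the overlap, 0.0 past the overlap.
    """
    weights = []
    for i in range(horizon):
        if i < delay:
            weights.append(1.0)
        elif i < overlap:
            weights.append((overlap - i) / (overlap - delay + 1))
        else:
            weights.append(0.0)
    return weights


def toy_denoise_step(chunk, k, steps):
    """Stand-in for a learned model: moves every action toward 0.5."""
    alpha = 1.0 / (steps - k)
    return [(1.0 - alpha) * a + alpha * 0.5 for a in chunk]


def inpaint_chunk(prev_chunk, executed, delay, denoise_step, steps=10, seed=0):
    """Generate a new chunk that stays consistent with the old plan.

    prev_chunk: scalar actions from the previous chunk.
    executed:   how many of them had run when inference started.
    delay:      how many more will run before the new chunk is ready.
    """
    remaining = prev_chunk[executed:]  # part of the old plan still ahead
    horizon, overlap = len(prev_chunk), len(remaining)
    if delay > overlap:
        raise ValueError("delay exceeds the actions left in the old chunk")
    weights = soft_mask(horizon, delay, overlap)

    rng = random.Random(seed)
    chunk = [rng.gauss(0.0, 1.0) for _ in range(horizon)]  # start from noise
    for k in range(steps):
        chunk = denoise_step(chunk, k, steps)
        # After each step, blend the overlapping actions toward the old plan.
        for i in range(overlap):
            w = weights[i]
            chunk[i] = w * remaining[i] + (1.0 - w) * chunk[i]
    return chunk


prev_chunk = [i / 10 for i in range(8)]
new_chunk = inpaint_chunk(prev_chunk, executed=3, delay=2,
                          denoise_step=toy_denoise_step)
```

In this toy run the first two actions of new_chunk are copied from the old plan, the next three are a blend, and the last three come entirely from the stand-in model. A real system would replace toy_denoise_step with the policy's own sampler and work on action vectors rather than scalars.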