SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data
Hugging Face releases SmolVLA, an efficient Vision-Language-Action model trained using LeRobot community data. This model aims to democratize robotic control by providing a lightweight, open-source VLA for embodied AI tasks.


