Welcome to SmolVLA! This repository contains the SmolVLA (Small Vision Language Action) implementation.
For detailed information about the lerobot framework and SmolVLA implementation, please see the lerobot README.md.
SmolVLA is a vision-language-action model that combines multimodal understanding with robotic control capabilities.