RT-2: New model translates vision and language into action

Robotic Transformer 2 (RT-2) is a novel vision-language-action (VLA) model that learns from both web and robotics data, and translates this knowledge into generalised instructions for robotic control.

Fonte: https://deepmind.google/discover/blog/rt-2-new-model-translates-vision-and-language-into-action/

Leave a Reply

Your email address will not be published. Required fields are marked *