Flexible Picking by Voice Command

Sereact
Image: Sereact GmbH

With PickGPT, Sereact has combined LLMs and VLMs to create a Vision Language Action Model (VLAM). Thanks to zero-shot planning, the model is able to solve tasks without special training and to adapt dynamically to new situations. A special feature is the ability to respond flexibly to voice commands.