
With PickGPT, Sereact has combined LLMs and VLMs to create a Vision Language Action Model (VLAM). Thanks to zero-shot planning, the model is able to solve tasks without special training and to adapt dynamically to new situations. A special feature is the ability to respond flexibly to voice commands.

















