ETH Zurich students demo a voice-controlled robotic arm using frozen vision-language models for zero-shot object manipulation · Digg