Claude Opus 4.7 completes complex robotics tasks without human assistance 37 times faster than human teams did a year earlier, and in most cases the code it writes works correctly on the first attempt.
Video generation models produce visually convincing movements, but visual quality does not correlate with whether a robot can actually execute those movements, a criterion that standard metrics overlook.