IFPT News

πŸ€– Everyone's talking about humanoid robots taking over factories – what does the literature say?

In our new preprint, we systematically reviewed the state of Embodied AI (EAI) in industrial operations β€” screening 800+ papers down to 42 through a PRISMA-guided process.

A few findings stood out:
πŸ“ Current research is dominated by assembly-targeted use cases β€” mostly on stationary cobot arms. More complex morphologies (humanoids, mobile manipulators) remain rare.
πŸ“ Vision and language dominate the inputs, while tactile and force feedback β€” arguably essential for, e.g., real-world manufacturing use cases β€” are almost absent (just 4 of 42 studies).
πŸ“ "Foundation model paradox": LLMs and VLMs are increasingly used as modular, swappable planners. This is practical, but it pulls systems away from the tightly coupled sensorimotor intelligence and closer to "Physical AI."
πŸ“ True Vision-Language-Action (VLA) models stay rare, bottlenecked by the scarcity of domain-specific kinematic training data.
Takeaway: Industrial EAI is still early. Much of the work realizes the label but not the embodied principle β€” intelligence emerging from the coupling of body, learning, and environment. The near-term path forward is likely to be task-specific, human-centered systems, richer multimodal feedback, and real industrial benchmarks, not fully autonomous general-purpose robots just yet.
#EmbodiedAI #Robotics #Manufacturing #AI #FoundationModels #Humanoids