Jiqizhixin (机器之心)→ original

Glass Ceiling Shattered: Ant Group Taught Robots to See the Invisible

Imagine you bought a robotic assistant for a couple of tens of thousands of dollars, and on the very first day it crashed into a panoramic window or tried to…

AI-processed from Jiqizhixin (机器之心); edited by Hamidun News
Glass Ceiling Shattered: Ant Group Taught Robots to See the Invisible
Source: Jiqizhixin (机器之心). Collage: Hamidun News.
◐ Listen to article

Imagine you bought a robotic assistant for a couple of tens of thousands of dollars, and on the very first day it crashed into a panoramic window or tried to grab thin air instead of a glass of expensive Chardonnay. Funny? Only if it's not happening at your place. The problem of transparent objects has for years remained the "Achilles heel" of robotics. While we argue about whether AI will gain consciousness, engineers are struggling to make it stop acting stupid in front of an ordinary glass door. Recently, Chinese giant Ant Group decided it was time to close this gestalt gap and presented its first serious research in the field of Embodied AI.

Why is this even a problem? Most modern robots rely on LIDARs and RGB-D cameras. LIDARs work on the principle of laser beam reflection: if the beam passes through glass or refracts chaotically, the robot thinks there's nothing in front of it. Depth cameras also go crazy trying to calculate the distance to a transparent surface. As a result, we get advanced machines that behave in a normal living room like disoriented kittens.

Ant Group proposed a framework that combines visual data and deep learning to correctly restore the shape and position of transparent objects in space. Ant Group's entry into this niche looks like a logical continuation of their strategy. After regulators cooled the ardor of their fintech direction, the company began actively diversifying its portfolio toward fundamental technologies.

Research in the field of Embodied AI is not just an attempt to create a robot waiter. It's a fight for leadership in the next technological paradigm, where AI goes beyond chatbots and begins to manipulate physical objects. The fact that they immediately released their work as open-source speaks to a desire to quickly build a community of developers around themselves and set industry standards.

Analyzing the proposed solution, you understand that the focus has shifted from simple pattern recognition to understanding the physics of light. Ant Group's algorithms teach neural networks to predict how light will refract through a transparent medium. This allows the robot not only to "see" glass but to understand its volume and boundaries. For the industry, this means a qualitative leap: from warehouse robots working in sterile conditions with cardboard boxes, we're moving to household assistants capable of adequately interacting with the fragile and complex world of human habitation.

What does this mean for us? Most likely, in the next couple of years we'll see a wave of new startups using this open code to create smarter vacuum cleaners, manipulators, and even unmanned carts in shopping malls. China once again proves that their approach to innovation is not only about copying but also about solving fundamental engineering problems that Western companies sometimes sidestep, preferring to focus on software. Ant Group is clearly making it clear: they're here seriously and for the long haul, and their ambitions extend far beyond digital wallets.

Bottom line: Ant Group successfully entered the territory of physical AI, solving the "invisibility" problem of glass. The open-source research will accelerate the emergence of robots that will finally stop being a threat to your windows and dishes. Will this be the beginning of the end for specialized sensors in favor of pure machine vision?

ZK
Hamidun News
AI news without noise. Daily editorial selection from 400+ sources. A product by Zhemal Khamidun, Head of AI at Alpina Digital.

Want to stop reading about AI and start using it?

AI News is a curated feed of AI/tech news. Hamidun Academy teaches you to use AI systematically in your work.

What do you think?
Loading comments…