Current location: Home> Ai News

Microsoft's open source multimodal AI Agent Magma: Reshaping the interactive experience of shopping and robots

Author: LoRA Time: 26 Feb 2025 829

Microsoft officially released the multimodal AI Agent basic model "Magma" on its official website and has conducted open source. Compared with traditional smart assistants, this emerging technology has shown more powerful multimodal capabilities, which can process various data forms such as images, videos, texts, etc., breaking the barriers between the digital and physical worlds.

Magma can not only help users automatically place orders and check daily affairs such as weather on e-commerce platforms, but also collaborate with physical robots to perform more complex operations. For example, when playing real chess, Magma can provide users with real-time strategic advice, greatly enhancing the gaming experience. At the same time, it has psychological prediction functions, which can infer the future behavior of characters or objects in the video, allowing virtual assistants or robots to better understand the surrounding dynamic environment and respond accordingly.

image.png

According to official introduction, Magma has a wide range of application scenarios. Not only does it help the home robot learn how to organize items it has never seen before, it also generates step-by-step user interface navigation instructions for unfamiliar tasks for the virtual assistant. Such functions allow users to receive more accurate help and guidance when facing new environments or new tasks.

image.png

Magma is part of the basic model of Visual Language Action (VLA) and can be learned through massive public visual and linguistic data. This capability allows Magma to effectively integrate language, space and time intelligence to provide solutions to users’ complex tasks in the digital and physical worlds.

Magma's open source provides developers and researchers with a powerful tool that facilitates the further development of smart assistants and home robots. In the future, with the continuous improvement of this technology, we may be able to see more innovative applications based on Magma in our daily lives.

Project address: https://microsoft.github.io/Magma/