A new open-source AI model called Molmo could help advance the development of AI agents. Developed by the Allen Institute for AI (Ai2), the model can interpret images and communicate via a chat interface. According to Wired’s Will Knight, this enables AI agents to perform tasks such as web browsing or document creation.
In some benchmarks, it outperforms leading proprietary models such as OpenAI’s GPT-4o, Anthropic’s Claude, and Google’s Gemini. This was reported by Carl Franzen at VentureBeat. According to Ai2, Molmo uses a thousand times less data than the competition. Ai2 demonstrated Molmo’s capabilities in a video in which the system analyzes and interprets photos.
Molmo is completely open source and available in different sizes, including a version for mobile devices.