OpenAI unveils new developer tools for building AI agents

OpenAI has released a new suite of tools designed to help developers build AI agents similar to the company’s own Deep Research and Operator. The new offerings include the Responses API and the open-source Agents SDK, which provide developers with the building blocks to create AI applications that can search the web, analyze files, and …

New AI techniques promise huge cost savings and improved performance for enterprises

Recent research has unveiled two promising approaches that could dramatically reduce the costs of running large language models (LLMs) while simultaneously improving their performance on complex reasoning tasks. These innovations come at a critical time as enterprises increasingly deploy AI solutions but struggle with computational expenses.
Chain of draft: Less is more
Researchers at Zoom …

Anthropic launches collaborative AI platform for cross-functional teams

Anthropic has unveiled a major update to its developer platform, introducing collaborative features that extend AI capabilities beyond technical teams. According to Michael Nuñez reporting for VentureBeat, the upgraded Anthropic Console allows team members from different departments to work together on AI prompts. The platform now supports Anthropic’s Claude 3.7 Sonnet model with new “extended …

Mistral launches new OCR API to convert complex documents for AI processing

Mistral AI has introduced Mistral OCR, a new optical character recognition API designed to transform complex PDF documents into AI-ready Markdown files. According to TechCrunch’s Romain Dillet, the French large language model developer launched this tool as a solution for organizations struggling to make their document repositories accessible to AI systems. Unlike conventional OCR tools, …

Alibaba launches QwQ-32B, a powerful reasoning model that rivals larger competitors

Alibaba’s Qwen Team has introduced QwQ-32B, a new open-source language model that matches the performance of much larger models like DeepSeek-R1 despite having significantly fewer parameters. The 32-billion-parameter model, released under the Apache 2.0 license, leverages reinforcement learning (RL) to enhance reasoning capabilities for complex problem-solving tasks.
Key features and capabilities
QwQ-32B demonstrates impressive performance …

Salesforce introduces autonomous AI system for enterprise workflow automation

Salesforce has unveiled Agentforce 2dx, a platform that enables AI agents to work autonomously across enterprise systems without constant human supervision. The system represents a shift from reactive AI interactions to proactive agents that monitor systems and initiate processes independently. Michael Nuñez of VentureBeat reports that this release marks a significant evolution from previous approaches …

Contextual AI’s new model outperforms GPT-4o in factual accuracy

Contextual AI has released a new grounded language model (GLM) that reportedly achieves 88% factual accuracy on the FACTS benchmark, outperforming offerings from Google, Anthropic, and OpenAI. According to Michael Nuñez from VentureBeat, the startup was founded by pioneers of retrieval-augmented generation (RAG) technology. The new model specifically targets enterprise applications where factual precision is …

Salesforce launches AgentExchange marketplace for enterprise AI agents

Salesforce has introduced AgentExchange, a marketplace for AI agents designed to automate business tasks. According to Michael Nuñez of VentureBeat, the platform launches with over 200 partners including Google Cloud, DocuSign, Box, and Workday. Salesforce positions this as the first trusted marketplace for enterprise AI agents, targeting what it estimates as a $6 trillion “digital …

These diffusion-based language models run 10 times faster than current LLMs

Inception Labs has unveiled Mercury, a new family of diffusion-based large language models (dLLMs) that can generate text up to 10 times faster than conventional autoregressive LLMs. According to the company, Mercury models can process over 1,000 tokens per second on NVIDIA H100 GPUs, speeds previously achievable only with specialized hardware. The company’s first publicly …

You.com’s AI research tool processes 400+ sources simultaneously

You.com has unveiled a new AI research tool called Advanced Research & Insights agent (ARI) that can analyze more than 400 sources at once. According to CEO Richard Socher, interviewed by Michael Nuñez for VentureBeat, the tool aims to transform market research by producing comprehensive reports in minutes instead of weeks. ARI features direct source …