
OpenAI Swarm is a framework for AI agents

February 5, 2025 | October 14, 2024

OpenAI has unveiled “Swarm,” an experimental framework for networks of AI agents, according to VentureBeat. Swarm allows developers to create interconnected AI networks that can communicate and solve tasks autonomously. The framework has potential applications in automating various business functions, from market analysis to customer support. However, it also raises ethical concerns regarding security, bias, …

DeepMind’s Michelangelo tests reasoning in long context windows

February 5, 2025 | October 11, 2024

DeepMind has introduced the Michelangelo benchmark to evaluate the long-context reasoning capabilities of large language models (LLMs), Ben Dickson reports for VentureBeat. While LLMs can manage extensive context windows, research indicates they struggle with reasoning over complex data structures. Current benchmarks often focus on retrieval tasks, which do not adequately assess a model’s reasoning abilities. …

Palmyra X 004 is the David of AI models

February 5, 2025 | October 10, 2024

Writer has launched its new AI language model, Palmyra X 004, which seemingly excels in function calling and workflow execution for businesses. Michael Nuñez reports for VentureBeat that it outperforms models from competitors such as OpenAI, Anthropic, Google, and Meta by nearly 20% on Berkeley’s Tool Calling Leaderboard, achieving a score of 78.76%. Palmyra X 004 achieves …

Braintrust evaluates and monitors AI products

February 5, 2025 | October 10, 2024

Braintrust, a startup specializing in evaluating AI products, has secured $36 million in Series A funding. The company, founded by Ankur Goyal, helps businesses improve the accuracy of their AI tools. According to Forbes contributor Alex Konrad, Braintrust already boasts clients such as Airtable, Brex, Instacart, and Stripe. Braintrust’s software evaluates and monitors the performance …

Anthropic lowers costs for batch processing

February 5, 2025 | October 10, 2024

Anthropic has launched a new, more affordable batch processing API for businesses. According to a VentureBeat report by Michael Nuñez, the new Message Batches API allows companies to process up to 10,000 queries asynchronously within a 24-hour window, at half the cost of standard API calls. Both input and output tokens are 50% cheaper compared …
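The figures quoted above (up to 10,000 queries per batch, 50% off input and output tokens) can be turned into a rough planning sketch. This is illustrative arithmetic only and does not call Anthropic's API: the function names and the per-million-token prices are hypothetical placeholders, not published rates.

```python
from typing import List, Sequence

MAX_BATCH_SIZE = 10_000  # per-batch query limit quoted in the report
BATCH_DISCOUNT = 0.5     # 50% off input and output tokens, per the report


def split_into_batches(queries: Sequence[str],
                       size: int = MAX_BATCH_SIZE) -> List[List[str]]:
    """Chunk queries so that no batch exceeds the quoted limit."""
    return [list(queries[i:i + size]) for i in range(0, len(queries), size)]


def batch_cost(input_tokens: int, output_tokens: int,
               input_price: float, output_price: float) -> float:
    """Discounted cost in dollars; prices are per million tokens."""
    standard = (input_tokens * input_price
                + output_tokens * output_price) / 1_000_000
    return standard * BATCH_DISCOUNT


if __name__ == "__main__":
    # 25,000 queries need three batches under a 10,000-query limit.
    batches = split_into_batches([f"query {i}" for i in range(25_000)])
    print([len(b) for b in batches])  # [10000, 10000, 5000]

    # Hypothetical prices (3.0 and 15.0 dollars per million tokens),
    # chosen only to make the arithmetic concrete.
    print(batch_cost(2_000_000, 500_000, input_price=3.0, output_price=15.0))  # 6.75
```

With these placeholder prices, the same workload would cost 13.50 dollars at the standard rate and 6.75 dollars as a batch.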

AI Gateway wants to make Enterprise AI safe and efficient

February 5, 2025 | October 9, 2024

Vera AI has launched a new platform called “AI Gateway” to help companies deploy artificial intelligence safely and efficiently, reports Michael Nuñez for VentureBeat. The platform offers customizable safeguards and model routing capabilities. “We’ve focused ourselves squarely on the last mile problems, which (…) are actually quite hard,” explains Liz O’Sullivan, CEO and co-founder of …

OpenAI adds Canvas to ChatGPT

February 5, 2025 | October 6, 2024

OpenAI has introduced a new feature called “Canvas” for ChatGPT. The new interface allows users to edit text and code directly next to the chat window. Canvas is initially available to ChatGPT Plus and Team users, with Enterprise and Edu customers to follow. The feature aims to simplify writing and programming with AI assistance. With Canvas, …

Apple’s Depth Pro generates 3D depth maps from images

February 5, 2025 | October 6, 2024

Apple has developed a new AI model called Depth Pro that could revolutionize how machines perceive depth. Michael Nuñez reports for VentureBeat that Depth Pro can generate detailed 3D depth maps from single 2D images in a fraction of a second, without relying on traditional camera data. The system outperforms previous models in speed …

Black Forest Labs releases Flux 1.1 Pro

February 5, 2025 | October 6, 2024

Black Forest Labs has released a new, faster text-to-image model called Flux 1.1 Pro, reports Carl Franzen for VentureBeat. According to an independent benchmark, the model outperforms other AI image generators in terms of visual quality and speed. It generates images six times faster than its predecessor, with improved image quality and accuracy. Flux 1.1 …

DeepMind’s SCoRe makes AI models more reliable

February 5, 2025 | October 2, 2024

DeepMind has developed a new technique called SCoRe that significantly improves the self-correction abilities of large language models (LLMs), Ben Dickson reports for VentureBeat. SCoRe uses self-generated data and enables LLMs to use their internal knowledge to identify and correct errors. In tests, SCoRe significantly outperformed other self-correction methods. The technique …