OpenAI brings image generation to a new level

OpenAI has launched native image generation capabilities directly within ChatGPT, powered by its multimodal model GPT-4o. This new feature, called “Images in ChatGPT,” is now available to users across Plus, Pro, Team, and Free subscription tiers, with Enterprise, Edu, and API access coming soon. Unlike the previous DALL-E 3 image generator, which was a separate …

Read more

Google introduces Gemini 2.5 Pro with built-in reasoning capabilities

Google has launched Gemini 2.5 Pro, describing it as their “most intelligent AI model” to date. The new model represents a significant advancement in Google’s AI capabilities, with a particular focus on reasoning abilities that are now built directly into the system. According to Google’s announcement, Gemini 2.5 models are “thinking models” that can reason …

Read more

Reve Image 1.0 is a promising new AI image generator

Reve AI has released Reve Image 1.0, a new text-to-image generation model that currently ranks first in image generation quality according to third-party evaluator Artificial Analysis. As reported by Carl Franzen in VentureBeat, the model excels at prompt adherence, aesthetics, and typography – outperforming competitors like Midjourney v6.1 and Google’s Imagen 3. The Palo Alto-based …

Read more

OpenAI improves ChatGPT’s voice assistant to reduce interruptions

OpenAI has updated its Advanced Voice Mode to make conversations with ChatGPT more natural by reducing interruptions when users pause. According to Maxwell Zeff at TechCrunch, the improvements were announced by OpenAI researcher Manuka Stratta. Free users now have access to a version that allows pauses without interruption, while paying subscribers receive additional benefits including …

Read more

Tencent launches Hunyuan T1 AI model to compete with DeepSeek’s R1

Tencent Holdings has introduced its Hunyuan T1 AI reasoning model that matches DeepSeek’s R1 in both performance and pricing. According to Coco Feng of the South China Morning Post, the model scored 87.2 points on the MMLU Pro benchmark, surpassing DeepSeek-R1’s 84 points but falling short of OpenAI’s o1 at 89.3 points. Pricing is competitive …

Read more

OpenAI launches improved AI models for voice and transcription

OpenAI has introduced three new AI models designed to enhance speech-to-text and text-to-speech capabilities. The models gpt-4o-transcribe, gpt-4o-mini-transcribe, and gpt-4o-mini-tts offer improved accuracy and customization options for developers building voice applications. According to OpenAI, the new transcription models significantly outperform their predecessor, Whisper, particularly in noisy environments and with various accents. The company’s internal benchmarks …

Read more

Claude gets web search capability, allowing access to real-time information

Anthropic has officially launched a web search feature for its AI chatbot Claude, enabling the assistant to access and process real-time information from the internet. The new capability, which addresses one of the most requested features from users, is currently available in preview for paid Claude users in the United States, with plans to expand …

Read more

Google shares six effective strategies for using Gemini Deep Research

Google has released new advice on how to effectively use its Deep Research tool, which is now available to all users. According to a post on the official Google blog, Deep Research helps users explore complex topics through comprehensive reports. Aarush Selvan, Gemini Senior Product Manager, and Mukund Sridhar developed the tool to reduce research …

Read more

OpenAI launches o1-pro, its most expensive AI model to date

OpenAI has released o1-pro, an upgraded version of its o1 reasoning AI model, available to select developers who have spent at least $5 on OpenAI API services. As reported by Kyle Wiggers, o1-pro costs $150 per million tokens for input and $600 per million tokens for output, making it twice as expensive as GPT-4.5 for …

Read more

Operative Games launches AI platform for interactive storytelling

Operative Games has unveiled its AI-driven interactive storytelling platform after emerging from stealth with backing from 1AM Gaming, Samsung Next, and LongJourney.vc. As reported by Dean Takahashi, the company was founded by Jon Snoddy, former head of Walt Disney R&D, Jon Kraft, founding CEO of Pandora Media, and game industry veteran Pegi Bryant. Their proprietary …

Read more