Tested: DeepSeek-V3 matches top AI models at lower cost

A detailed analysis published by Sunil Kumar Dash reveals that DeepSeek’s latest AI model achieves performance comparable to leading closed-source models while offering significant cost advantages. The model outperforms existing open-source alternatives in mathematics and reasoning tasks, according to extensive benchmark testing. The analysis demonstrates that DeepSeek-V3 surpasses GPT-4 and Claude 3.5 Sonnet in mathematical …

Read more

Nvidia acquires Run:ai and makes software open source

Nvidia has completed its acquisition of Run:ai, a software company specializing in GPU cloud management for artificial intelligence, as reported by Dean Takahashi. While the purchase price wasn’t officially disclosed, earlier reports valued the deal at $700 million. The company announced plans to make Run:ai’s software platform open source, potentially allowing it to support GPUs …

Read more

OpenAI delays copyright opt-out tool Media Manager

OpenAI has failed to deliver its promised Media Manager tool, which was intended to help creators control the use of their work in AI training data. According to reporting by Kyle Wiggers for TechCrunch, the tool was announced in May but has shown no signs of development. Former OpenAI employees told TechCrunch that the project …

Read more

Meta introduces new AI reasoning method “Coconut”

Meta AI researchers have developed a new method called Coconut (Chain of Continuous Thought) that allows large language models to reason in continuous latent space rather than only through words. The research presents an alternative to traditional Chain-of-Thought (CoT) reasoning methods. The new approach enables AI models to process information in a more abstract way, …

Read more

Analysis: Strengths and weaknesses of OpenAI o3

OpenAI’s latest AI model o3 features significant advancements in AI capabilities. According to Matt Marshall’s report in VentureBeat, the model introduces five major innovations: However, the model faces a significant challenge: its high computational requirements. The system consumes millions of tokens per task, raising concerns about economic feasibility. To address this, OpenAI plans to release …

Read more

Open model DeepSeek-V3 performs similar to closed competition

Chinese AI startup DeepSeek has launched DeepSeek-V3, a powerful new AI model that outperforms existing open-source alternatives. According to reporting by Shubham Sharma at VentureBeat, the model features 671 billion parameters but activates only 37 billion for each task through its mixture-of-experts architecture. The model was trained on 14.8 trillion diverse tokens and demonstrates superior …

Read more

Alibaba releases new visual AI model QVQ for enhanced reasoning capabilities

Alibaba’s Qwen team has released QVQ-72B-Preview, a new experimental visual AI model designed to enhance visual reasoning capabilities. Built upon their Qwen2-VL-72B architecture, the model aims to combine language and vision processing to tackle complex analytical tasks. According to company statements, QVQ achieved a score of 70.3 on the MMMU benchmark, marking an improvement over …

Read more

AI assistant Claude drives major changes in software development

Anthropic’s AI assistant Claude has become a significant force in the global software development market, with coding-related revenue increasing by 1,000% in three months. According to an article by Michael Nuñez in VentureBeat, software development now represents more than 10% of all Claude interactions. The AI tool can analyze up to 200,000 tokens of context …

Read more

OpenAI introduces new safety system for o1 and o3

OpenAI has developed a new approach called “deliberative alignment” to make its AI models safer and more aligned with human values. According to Maxwell Zeff’s article, the company implemented this system in its latest AI reasoning models, o1 and o3. The new method enables the models to consider OpenAI’s safety policy during the inference phase …

Read more

OpenAI’s GPT-5 development faces significant delays and cost issues

OpenAI’s next major project, GPT-5 (code-named Orion), is experiencing substantial setbacks and escalating costs, according to a Wall Street Journal report by Deepa Seetharaman. The project, which has been in development for over 18 months, has encountered multiple challenges during training runs, each costing approximately half a billion dollars in computing expenses alone. The company’s …

Read more

×