New AI evaluation model Glider matches GPT-4’s performance with fewer resources

Startup Patronus AI has developed a breakthrough AI evaluation model that achieves results comparable to those of much larger systems while using significantly fewer computational resources. As reported by Michael Nuñez for VentureBeat, the new open-source model, named Glider, has only 3.8 billion parameters yet matches or exceeds the performance of GPT-4 on key benchmarks. The model …

TuSimple rebrands as CreateAI to focus on gaming technology

Chinese autonomous trucking company TuSimple has transformed into CreateAI, shifting its focus to video game development and animation. According to Evelyn Cheng’s report for CNBC, the company plans to leverage its artificial intelligence expertise to reduce triple-A game production costs by 70% over the next five to six years. CEO Cheng Lu expects the company …

Salesforce launches AI reasoning platform for enterprise tasks

Salesforce has introduced Agentforce 2.0, a significant upgrade to its artificial intelligence platform that enables AI agents to perform complex reasoning and autonomous actions in enterprise environments. As reported by Michael Nuñez for VentureBeat, the new system represents a major shift from traditional chatbots to more sophisticated AI assistants. The platform’s core innovation is the …

OpenAI launches telephone and WhatsApp access to ChatGPT

OpenAI has introduced two new ways to interact with ChatGPT: a toll-free phone number (1-800-CHATGPT) and a WhatsApp messaging service. According to the company’s official explanation, users in the United States and Canada can now call 1-800-242-8478 to speak with ChatGPT, while international users can access the service via WhatsApp. The service is available without …

Google launches new benchmark to test AI models’ factual accuracy

Google has introduced FACTS Grounding, a new benchmark system to evaluate how accurately large language models (LLMs) use source material in their responses. The benchmark comprises 1,719 examples across various domains including finance, technology, and medicine. The FACTS team at Google DeepMind and Google Research developed the system, which uses three frontier LLM judges – …

IBM launches improved Granite 3.1 language models

IBM has released a new version of its open-source large language models, Granite 3.1, featuring significant improvements in performance and capabilities. According to reporting by Sean Michael Kerner for VentureBeat, the new models offer extended context length and integrated hallucination detection. The Granite 8B Instruct model reportedly outperforms similar-sized competitors including Meta Llama 3.1 and …

OpenAI releases o1 model for developer access

OpenAI has made its advanced o1 artificial intelligence model available to third-party developers through its API. According to an article by Carl Franzen in VentureBeat, this release represents a significant advancement in making sophisticated AI technology accessible to developers. The o1 model, first announced in September 2024, differs from traditional large language models by incorporating …

UK creative industry opposes AI copyright exemption plan

British creative organizations have united against a government proposal that would allow AI companies to use copyrighted works without explicit permission. According to Robert Booth’s report in The Guardian, the Creative Rights in AI Coalition, representing thousands of artists, writers, and media companies, rejected the plan announced by technology minister Chris Bryant. The proposal would …

Runway launches AI filmmaker talent network

Runway, a New York-based AI company, has introduced a new online platform connecting AI filmmakers with potential employers. According to an article by Shubham Sharma for VentureBeat, the network aims to help brands, agencies, and studios find specialized AI video talent. The platform already features dozens of independent artists and production houses, including members of …

Perplexity raises $500 million at $9 billion valuation

Perplexity AI Inc., a company developing an AI-powered search engine to compete with Google, has secured $500 million in new funding. According to reporting by Shirin Ghaffary for Bloomberg, the investment was led by Institutional Venture Partners and values the company at $9 billion. The valuation represents a dramatic increase from the company’s $3 billion …
