Stable Diffusion 3.5 launches on Amazon’s enterprise AI platform

Stability AI has released its latest text-to-image generation model, Stable Diffusion 3.5 Large, on Amazon’s Bedrock service. As reported by Sean Michael Kerner for VentureBeat, this marks AWS as the only public cloud service offering Stability AI’s flagship models. The integration allows enterprises to access multiple AI models through a single unified API, meeting the … Read more

New AI evaluation model Glider matches GPT-4’s performance with fewer resources

Startup Patronus AI has developed a breakthrough AI evaluation model that achieves comparable results to much larger systems while using significantly fewer computational resources. As reported by Michael Nuñez for VentureBeat, the new open-source model named Glider uses only 3.8 billion parameters yet matches or exceeds the performance of GPT-4 on key benchmarks. The model … Read more

IBM launches improved Granite 3.1 language models

IBM has released a new version of its open-source large language models, Granite 3.1, featuring significant improvements in performance and capabilities. According to reporting by Sean Michael Kerner for VentureBeat, the new models offer extended context length and integrated hallucination detection. The Granite 8B Instruct model reportedly outperforms similar-sized competitors including Meta Llama 3.1 and … Read more

Nvidia and DataStax launch storage-efficient AI retrieval system

Nvidia and DataStax have introduced a new AI technology that reduces data storage requirements by 35 times for generative AI systems. As reported by Michael Nuñez for VentureBeat, the Nvidia NeMo Retriever microservices, integrated with DataStax’s AI platform, enables faster and more accurate information retrieval across multiple languages. The technology has already shown impressive results … Read more

Slack integrates Salesforce AI agents for workplace automation

Slack is implementing Salesforce’s Agentforce AI agents into its collaboration platform to enhance workplace productivity. According to Michael Nuñez at VentureBeat, the integration will give AI agents access to organizational conversations and data within Slack channels. Rob Seaman, Slack’s chief product officer, emphasized that the system will provide AI agents with contextual knowledge, reasoning capabilities, … Read more

Cohere launches new compact AI language model Command R7B

AI company Cohere has introduced Command R7B, a new compact language model designed for enterprise applications. According to VentureBeat reporter Taryn Plumb, the model supports 23 languages and specializes in retrieval-augmented generation (RAG). Command R7B outperforms similar-sized models from competitors like Google, Meta, and Mistral in mathematics and coding tasks. The model features a 128K … Read more

Sakana AI develops new memory optimization for language models

Tokyo-based startup Sakana AI has created a breakthrough technique that reduces memory usage in large language models by up to 75%. As reported by Ben Dickson, the system called “universal transformer memory” uses neural attention memory modules (NAMMs) to efficiently manage information processing. These modules analyze the model’s attention layers to determine which information to … Read more

Lambda launches new AI inference service with competitive pricing

Lambda, a San Francisco-based technology company, has introduced a new AI inference API service that promises the lowest costs in the industry. According to VentureBeat reporter Carl Franzen, the service allows enterprises to deploy AI models without managing computing infrastructure. The API supports various advanced models including Meta’s Llama 3.3 and Alibaba’s Qwen 2.5, with … Read more

Writer launches Palmyra Creative to diversify AI-generated content

Writer, an enterprise AI startup valued at $1.9 billion, has introduced a new AI model designed to overcome the uniformity often found in AI-generated content. As reported by Michael Nuñez for VentureBeat, Palmyra Creative uses innovative techniques to produce more varied and original outputs. The model employs merging techniques and adaptive model layering, departing from … Read more

Google Cloud predicts AI agents and multimodal systems to reshape enterprise computing in 2025

According to a new Google Cloud trends report, enterprises will significantly scale their AI implementations in 2025, with a focus on AI agents and multimodal systems. As reported by Taryn Plumb in VentureBeat, companies are expected to move beyond current experimentation phases toward production-scale deployments. The report identifies six types of AI agents, from customer … Read more