Amphion: open-source toolkit for audio, music and speech generation

Amphion is an open-source toolkit designed to support research and development in audio, music and speech generation. According to the project’s GitHub site, it offers unique visualizations of classic models and architectures to help junior researchers and engineers better understand them. The toolkit supports various individual generation tasks such as text-to-speech (TTS), singing voice synthesis …

Read more

Cerebras Inference achieves breakthrough performance for Llama 3.1-70B

Cerebras has announced a major update to its Cerebras Inference platform, which now runs the Llama 3.1-70B language model at an impressive 2,100 tokens per second – a threefold performance increase compared to the previous release. According to James Wang from the official Cerebras blog, this performance is 16 times faster than the fastest GPU …

Read more

Hugging Face helps companies develop AI

New York-based AI startup Hugging Face is teaming up with Amazon and Google to launch new open-source software aimed at lowering the cost of developing chatbots and other AI systems, Stephen Nellis reports for Reuters. The offering, called “HUGS” (Hugging Face for Generative AI Services), automates the implementation of AI models and will be available …

Read more

Cohere’s Embed 3 now searches for images

AI company Cohere has added multimodal capabilities to its Embed 3 embedding model, allowing images to be included in RAG-based company searches. This is reported by Emilia David for VentureBeat. The new version can create embeddings for both images and text, with both formats stored in a unified database. According to Cohere, this allows companies …

Read more

“Computer Use”: Anthropic’s Claude can now control your PC

Anthropic has unveiled an updated version of its AI model Claude 3.5 Sonnet. According to the company, the model can now control desktop applications and perform PC tasks. It uses a new “Computer Use” feature, which is in public beta. Anthropic emphasizes that the technology is still error-prone and recommends developers initially test it only …

Read more

IBM launches Granite 3.0 models for enterprise

IBM has launched its Granite 3.0 large language models (LLMs), expanding its enterprise AI offerings, Sean Michael Kerner reports for VentureBeat. The new open-source models, available under the Apache 2.0 license, are designed for various enterprise applications, including customer service, IT automation, and cybersecurity. IBM claims the models outperform competitors like Google and Anthropic, having …

Read more

Sana is a small and extremely fast AI image generator

A new text-to-image framework called Sana can efficiently and quickly generate high-resolution images up to 4096 x 4096 pixels. The system uses a deep compression autoencoder, linear attention, and a decoder-based text encoder to optimize performance. According to the developers, Sana-0.6B can compete with state-of-the-art large diffusion models, but is 20 times smaller and over …

Read more

Endor Labs scores open source AI models

Endor Labs has launched a new platform to score over 900,000 open-source AI models available on Hugging Face, focusing on security, activity, quality, and popularity. This initiative aims to address concerns regarding the trustworthiness and security of AI models, which often have complex dependencies and vulnerabilities, reports VentureBeat. Developers can query the platform about model …

Read more

Galileo evaluates AI models for business use

Galileo, an AI startup led by Vikram Chatterji, has raised $45 million in a Series B funding round, totaling $68 million since its inception three years ago. The company focuses on evaluating AI models to ensure they function effectively and do not generate incorrect information or leak sensitive data, reports Forbes. Its product suite includes …

Read more

DataStax and Nvidia accelerate AI development for companies

DataStax has unveiled a new AI platform in collaboration with Nvidia, aimed at assisting enterprises with AI development. As reported by Sean Michael Kerner for VentureBeat, the platform combines DataStax’s database technology and visual AI orchestration tool Langflow with Nvidia’s enterprise AI components. According to DataStax, the new solution can reduce AI development time by …

Read more