Hugging Face LightEval helps evaluate LLMs

Hugging Face has introduced LightEval, a new open source solution for evaluating large language models (LLMs). This was reported by Michael Nuñez on VentureBeat. The lightweight suite allows companies and researchers to evaluate AI models in a precise and customizable way. LightEval integrates seamlessly with existing Hugging Face tools and supports multiple devices including CPUs, …

Read more

Google Prompt Poet helps with complex prompts

Google has introduced Prompt Poet, an innovative tool for advanced prompt engineering. As Michael Trestman reports for VentureBeat, the system developed by Character.ai simplifies the creation of complex prompts for large language models through a user-friendly template system. Prompt Poet seamlessly integrates external data to enable contextual AI responses. The special feature of Prompt Poet …

Read more

Replit Agent helps to code complete applications

Replit has introduced an AI agent that can build complete applications from scratch. As Chris McKay reports in an article for Maginative, this is much more than just a coding assistant. The agent can develop software on its own, from project planning and code writing to debugging and deployment. According to Replit CEO Amjad Masad, …

Read more

Infinity creates talking characters

A new AI model called Infinity generates realistic talking characters. It is based on a video diffusion transformer that has been trained with audio input. According to the developers, this is the first model of its kind. Users can enter their scripts and get videos of animated characters speaking the text. It can process different …

Read more

Salesforce xGen-Sales and xLAM introduced

Salesforce is introducing two new AI models to automate sales processes. The company unveiled xGen-Sales and xLAM, as Michael Nuñez reports in an article for VentureBeat. xGen-Sales is designed to perform complex sales tasks accurately and quickly, while xLAM is designed to trigger actions in software systems. Adam Evans, senior vice president of product for …

Read more

Reflection 70B corrects its own errors

A new open source AI model called Reflection 70B has been introduced by Matt Shumer, co-founder of AI startup HyperWrite. As Shumer announced on the platform X (formerly Twitter), the model outperforms leading commercial systems in benchmarks. Reflection 70B is based on Meta’s Llama 3.1-70B Instruct and uses a new technique for self-correcting errors: the …

Read more

Google launches “Ask Photos” in the US

Google is launching its AI-powered “Ask Photos” feature. The new feature, based on the Gemini AI model, allows users to use complex natural language queries to find photos. Sarah Perez reports for TechCrunch that the feature will initially be available to select customers in the US. With “Ask Photos,” users can search for the best …

Read more

New AI model from developer search engine Phind

Phind is introducing a new AI model called Phind-405B, which specializes in technical tasks. The company announced it in a blog post. Phind is an intelligent search engine for developers that uses generative AI to solve problems and turn ideas into working products. The new model is based on Meta Llama 3.1 405B and can …

Read more

AnythingLLM: Chat with documents

A new AI application called AnythingLLM allows users to chat with any document. The software supports several AI language models and vector databases. According to the developers, users can use it to create a private ChatGPT-like application that can be hosted locally or remotely. AnythingLLM offers features such as multimodality, multi-user support and embedded chat …

Read more

Maitai monitors and optimizes LLM outputs

A new AI startup called Maitai has developed a platform to optimize the use of Large Language Models (LLMs) in practice. As founders Christian and Ian tell Hacker News, Maitai acts as a proxy to analyze the traffic between the application and the LLM. The system automatically detects errors, corrects them in real time, and …

Read more