Hugging Face LightEval helps evaluate LLMs

Hugging Face has introduced LightEval, a new open source solution for evaluating large language models (LLMs). This was reported by Michael Nuñez on VentureBeat. The lightweight suite allows companies and researchers to evaluate AI models in a precise and customizable way. LightEval integrates seamlessly with existing Hugging Face tools and supports multiple devices including CPUs, … Read more

Google Prompt Poet helps with complex prompts

Google has introduced Prompt Poet, an innovative tool for advanced prompt engineering. As Michael Trestman reports for VentureBeat, the system developed by Character.ai simplifies the creation of complex prompts for large language models through a user-friendly template system. Prompt Poet seamlessly integrates external data to enable contextual AI responses. The special feature of Prompt Poet … Read more

Replit Agent helps to code complete applications

Replit has introduced an AI agent that can build complete applications from scratch. As Chris McKay reports in an article for Maginative, this is much more than just a coding assistant. The agent can develop software on its own, from project planning and code writing to debugging and deployment. According to Replit CEO Amjad Masad, … Read more

New AI model from developer search engine Phind

Phind is introducing a new AI model called Phind-405B, which specializes in technical tasks. The company announced it in a blog post. Phind is an intelligent search engine for developers that uses generative AI to solve problems and turn ideas into working products. The new model is based on Meta Llama 3.1 405B and can … Read more

AnythingLLM: Chat with documents

A new AI application called AnythingLLM allows users to chat with any document. The software supports several AI language models and vector databases. According to the developers, users can use it to create a private ChatGPT-like application that can be hosted locally or remotely. AnythingLLM offers features such as multimodality, multi-user support and embedded chat … Read more

Maitai monitors and optimizes LLM outputs

A new AI startup called Maitai has developed a platform to optimize the use of Large Language Models (LLMs) in practice. As founders Christian and Ian tell Hacker News, Maitai acts as a proxy to analyze the traffic between the application and the LLM. The system automatically detects errors, corrects them in real time, and … Read more

Anthropic helps to craft better prompts

Anthropic has released new features for its Claude language model to help developers create better prompts to optimize the performance of their AI applications.

LiveBench is a new benchmark for LLMs

LiveBench is a new benchmark for large language models developed by a team of scientists. Unlike existing benchmarks, it uses constantly updated questions from current sources and automatically scores the answers based on objective criteria. The team has taken special care to avoid the risk of “contamination”, where the training data of a language model … Read more

Kong’s AI Gateway is a new platform for enterprise AI

Kong Inc. has released its “AI Gateway”, a platform designed to make it easier for enterprises to control and securely deploy generative AI in various cloud environments. According to Kong, the gateway enables the integration and management of various AI technologies through a single interface and provides security features to prevent misuse by manipulating the … Read more

Nvidia shows new model for synthetic data

According to Nvidia, their new Nemotron-4 340B open language model will revolutionize the generation of synthetic data and enable companies to develop custom AI models.