Meta’s Llama 3.3 70B model runs GPT-4-level AI on high-end laptops

Meta has released Llama 3.3 70B, a new large language model that achieves GPT-4-level performance while running on high-end consumer laptops. Developer Simon Willison documented the result after testing the model on a 64 GB M2 MacBook Pro, where it demonstrated capabilities comparable to much larger models such as Meta’s own Llama 3.1 405B. The model …

Meta uses OpenAI’s GPT-4 alongside Llama in internal coding tool

Meta’s internal coding assistant Metamate combines both OpenAI’s GPT-4 and Meta’s own Llama AI model to support developers, according to a report by Kali Hays in Fortune. The tool has been using GPT-4 since early 2024, despite CEO Mark Zuckerberg’s public promotion of Llama as a leading AI model. Current and former Meta employees, speaking …

Meta rebuilds company strategy around open-source AI model Llama

Meta has fundamentally transformed its business strategy by focusing on Llama, its open-source artificial intelligence model. According to Sharon Goldman’s detailed report in Fortune, CEO Mark Zuckerberg made the pivotal decision to release Llama 2 as open-source in July 2023, despite internal concerns about monetization and security risks. The model has since been downloaded over …

AnyChat unifies access to multiple AI language models

AnyChat, a new development tool, enables seamless integration of multiple large language models (LLMs) through a single interface. Developer Ahsen Khaliq, machine learning growth lead at Gradio, created the platform to allow users to switch between models like ChatGPT, Google’s Gemini, Perplexity, Claude, and Meta’s LLaMA without being restricted to one provider, as reported by …

Cerebras Inference achieves breakthrough performance for Llama 3.1-70B

Cerebras has announced a major update to its Cerebras Inference platform, which now runs the Llama 3.1-70B language model at 2,100 tokens per second, a threefold performance increase over the previous release. According to James Wang, writing on the official Cerebras blog, this is 16 times faster than the fastest GPU …

Meta releases AI models for mobile devices

Meta Platforms has released quantized versions of its Llama 3.2 1B and 3B models, which the company says reduce memory requirements and speed up on-device inference while maintaining accuracy and portability. The models were developed in close collaboration with Qualcomm and MediaTek and are available on SoCs with Arm CPUs. According to Meta, the average model size has …

Nvidia releases powerful and open AI model

Nvidia has introduced a new AI model, Llama-3.1-Nemotron-70B-Instruct, which outperforms existing models from OpenAI and others, continuing a significant shift in its AI strategy. The model, available on Hugging Face, achieved impressive benchmark scores, positioning Nvidia as a competitive player in AI language understanding and generation. This development showcases Nvidia’s transition from a GPU manufacturer …

INTELLECT-1 undergoes decentralized training

Decentralized training of a 10-billion-parameter model called INTELLECT-1 has begun. Anyone can contribute computing power and participate. INTELLECT-1 is based on the Llama-3 architecture and is trained on a high-quality open-source dataset built around Hugging Face’s FineWeb-Edu. The dataset contains over six trillion tokens and consists of FineWeb-Edu (55%), DCLM (20%), Stack v2 …

How expensive is your own conversational AI?

Companies can achieve significant cost savings by building their own conversational AI based on the open-source model Llama 3. This is the conclusion of an analysis by Sam Oliver, founder of OpenFi, published on VentureBeat. According to Oliver, an average conversation with Llama 3 costs about $0.08, while the same conversation with OpenAI’s GPT-4 …

Reflection 70B corrects its own errors

A new open-source AI model called Reflection 70B has been introduced by Matt Shumer, co-founder of AI startup HyperWrite. As Shumer announced on X (formerly Twitter), the model outperforms leading commercial systems in benchmarks. Reflection 70B is based on Meta’s Llama 3.1-70B Instruct and uses a new technique for self-correcting errors: the …