Nvidia has released a powerful open-source AI model that rivals proprietary systems from industry leaders like OpenAI and Google. The model, called NVLM 1.0, performs strongly on vision-language tasks while also improving its text-only capabilities. Michael Nuñez reports on this development for VentureBeat.
The main model, NVLM-D-72B, has 72 billion parameters and can handle complex visual and textual tasks, such as interpreting memes and solving mathematical problems step by step. Nvidia is making the model weights publicly available and has promised to release the training code. This decision grants researchers and developers unprecedented access to cutting-edge AI technology.