Nvidia makes a bombshell: its new AI model is open, large-scale, and ready to rival GPT-4

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI. learn more

Nvidia has released a powerful open source artificial intelligence model that competes with proprietary systems from industry leaders like OpenAI and Google.

The company’s new NVLM 1.0 family of large-scale multimodal language models is led by the 72 billion parameter NVLM-D-72B, which delivers superior performance across visual and language tasks while also providing enhanced text-only capabilities. .

“We are introducing NVLM 1.0, a family of frontier-class multimodal large-scale language models that achieve state-of-the-art results in visual language tasks. comparable to open access models,” the researchers explain in their paper.

By disclosing model weights and promising to make the training code publicly available, Nvidia breaks away from its tendency to keep advanced AI systems closed. This decision will give researchers and developers unprecedented access to cutting-edge technology.

Benchmark results comparing NVIDIA’s NVLM-D model with AI giants such as GPT-4, Claude 3.5, and Llama 3-V demonstrate NVLM-D’s competitive performance across a variety of visual and language tasks. Masu. (Credit: arxiv.org)

NVLM-D-72B: Versatile performer of visual and textual tasks

The NVLM-D-72B model shows great adaptability in processing complex visual and textual inputs. The researchers provided examples highlighting the model’s ability to interpret memes, analyze images, and solve mathematical problems step-by-step.

In particular, NVLM-D-72B improves performance on text-only tasks after multimodal training. While many similar models suffer from poor text performance, NVLM-D-72B improved accuracy by an average of 4.3 points across key text benchmarks.

“Our NVLM-D-1.0-72B shows significant improvements over text backbones on text-only math and coding benchmarks,” the researchers said, highlighting key advantages of their approach. I am.

NVIDIA’s new AI model analyzes memes comparing academic summaries to full papers, demonstrating its ability to interpret visual humor and academic concepts. (Credit: arxiv.org)

AI researchers react to Nvidia’s open source initiative

The AI community responded positively to this release. One AI researcher commented on social media: Nvidia has just released a 72B model that is equivalent to the llama 3.1 405B in math and coding assessments, and it also has vision. ”

Nvidia’s decision to make such a powerful model openly available could accelerate AI research and development across the field. By providing access to models that rival the proprietary systems of deep-funded tech companies, Nvidia could enable smaller organizations and independent researchers to make greater contributions to advances in AI.

The NVLM project also introduces innovative architectural designs, including a hybrid approach that combines various multimodal processing techniques. This development may shape the direction of future research in this field.

NVLM 1.0: A new chapter in open source AI development

Nvidia’s release of NVLM 1.0 marks a pivotal moment in AI development. By open sourcing a model that rivals proprietary giants, Nvidia is not just sharing code, it’s challenging the very structure of the AI industry.

This move could cause a chain reaction. Other technology leaders may feel pressure to publish their research, potentially accelerating advances in AI overall. It also levels the playing field, allowing smaller teams and researchers to innovate with tools once reserved for the tech giants.

However, the release of NVLM 1.0 is not without risks. As powerful AI becomes more accessible, concerns about abuse and ethical implications are likely to increase. The AI community currently faces the complex challenge of fostering innovation while establishing guardrails for responsible use.

Nvidia’s decision also raises questions about the future of AI business models. As cutting-edge models become freely available, companies may need to rethink how they create value with AI and stay competitive.

The true impact of NVLM 1.0 will become apparent in the coming months and years. It could usher in an era of unprecedented collaboration and innovation in AI. Or perhaps we need to take into account the unintended consequences of widely available advanced AI.

One thing is for sure: Nvidia has struck across the AI industry’s bow. The question now is not whether things will change, but how dramatically they will change, and who will adapt fast enough to thrive in this new world of open AI. .

VB Daily

Be sure to know! Get the latest news in your inbox every day

By subscribing, you agree to VentureBeat’s Terms of Use.

Thank you for subscribing. Check out other VB newsletters here.

An error has occurred.

What's Hot

Nvidia’s leader says it’s a “challenging time” for young programmers

Onondaga Co. reveals project plans for semiconductor plants

GFN Thursday: “Monster Hunter Wild”

Nvidia’s leader says it’s a “challenging time” for young programmers

GFN Thursday: “Monster Hunter Wild”

Nvidia CEO Jensen Huang and industry visionaries announce the following with AI at GTC 2025

Nvidia’s big day is here: What to expect when the AI giant reports after the bell

Can Nvidia’s bull market continue? Timothy Arcuri predicts

Nvidia shows off progress on Blackwell server installation — AI and datacenter roadmap sees Blackwell Ultra coming next year, Vera CPUs and Rubin GPUs coming in 2026

Most Popular

Nvidia’s big day is here: What to expect when the AI giant reports after the bell

Can Nvidia’s bull market continue? Timothy Arcuri predicts

Nvidia shows off progress on Blackwell server installation — AI and datacenter roadmap sees Blackwell Ultra coming next year, Vera CPUs and Rubin GPUs coming in 2026

Our Picks

Pharma’s AI bright future

US AI Action Plan\Humanity’s recommendations for the OSTP of humanity

Huff discusses AI at CAA meeting – News

Subscribe to Updates