* . *
  • About
  • Advertise
  • Privacy & Policy
  • Contact
Thursday, November 6, 2025
Earth-News
  • Home
  • Business
  • Entertainment
    Trixie Mattel to share journey in entertainment, advocacy at UW–Madison – WKOW

    Trixie Mattel to Share Her Inspiring Journey in Entertainment and Advocacy at UW-Madison

    Cleveland State to Broadcast Six Basketball Games on Rock Entertainment Sports Network – csuvikings.com

    Cleveland State to Broadcast Six Basketball Games on Rock Entertainment Sports Network – csuvikings.com

    Can Caesars Entertainment’s (CZR) Investment in Digital Offset Las Vegas Weakness? – simplywall.st

    How do you spell success? ‘Spelling Bee’ lands at Surfside Playhouse – Florida Today

    How Do You Spell Success? Catch ‘Spelling Bee’ Live at Surfside Playhouse!

    Belmont Names Debbie Carroll Head of New Center for Mental Health in Entertainment – Billboard

    Debbie Carroll Named Leader of Groundbreaking New Center for Mental Health in Entertainment

    Call of Duty Movie’s Plot Setting Revealed in New Rumor – Yahoo

    Exciting New Rumor Reveals the Plot Setting of the Call of Duty Movie!

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    How We Lost Ourselves to Technology—and How We Can Come Back – The Free Press

    How Technology Took Over Our Lives-and How We Can Take Back Control

    Sleeper Picks: World Wide Technology Championship – PGA Tour

    Discover the Ultimate Sleeper Picks for the World Wide Technology Championship

    Rowland.ai Named Disruptive Technology of the Year by The Energy Council – GlobeNewswire

    Rowland.ai Named Disruptive Technology of the Year by Industry Leaders

    Peraton Honored As Silver Stevie® Award Winner in 2025 Stevie Awards for Technology Excellence – The AI Journal

    Peraton Honored As Silver Stevie® Award Winner in 2025 Stevie Awards for Technology Excellence – The AI Journal

    [News] China Makes Breakthrough in Chip Technology, Paving the Way for Lithography Advancements – TrendForce

    [News] China Makes Breakthrough in Chip Technology, Paving the Way for Lithography Advancements – TrendForce

    Can RFID technology solve the global medicine shortage crisis? – World Health Expo

    Can RFID technology solve the global medicine shortage crisis? – World Health Expo

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
  • Home
  • Business
  • Entertainment
    Trixie Mattel to share journey in entertainment, advocacy at UW–Madison – WKOW

    Trixie Mattel to Share Her Inspiring Journey in Entertainment and Advocacy at UW-Madison

    Cleveland State to Broadcast Six Basketball Games on Rock Entertainment Sports Network – csuvikings.com

    Cleveland State to Broadcast Six Basketball Games on Rock Entertainment Sports Network – csuvikings.com

    Can Caesars Entertainment’s (CZR) Investment in Digital Offset Las Vegas Weakness? – simplywall.st

    How do you spell success? ‘Spelling Bee’ lands at Surfside Playhouse – Florida Today

    How Do You Spell Success? Catch ‘Spelling Bee’ Live at Surfside Playhouse!

    Belmont Names Debbie Carroll Head of New Center for Mental Health in Entertainment – Billboard

    Debbie Carroll Named Leader of Groundbreaking New Center for Mental Health in Entertainment

    Call of Duty Movie’s Plot Setting Revealed in New Rumor – Yahoo

    Exciting New Rumor Reveals the Plot Setting of the Call of Duty Movie!

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    How We Lost Ourselves to Technology—and How We Can Come Back – The Free Press

    How Technology Took Over Our Lives-and How We Can Take Back Control

    Sleeper Picks: World Wide Technology Championship – PGA Tour

    Discover the Ultimate Sleeper Picks for the World Wide Technology Championship

    Rowland.ai Named Disruptive Technology of the Year by The Energy Council – GlobeNewswire

    Rowland.ai Named Disruptive Technology of the Year by Industry Leaders

    Peraton Honored As Silver Stevie® Award Winner in 2025 Stevie Awards for Technology Excellence – The AI Journal

    Peraton Honored As Silver Stevie® Award Winner in 2025 Stevie Awards for Technology Excellence – The AI Journal

    [News] China Makes Breakthrough in Chip Technology, Paving the Way for Lithography Advancements – TrendForce

    [News] China Makes Breakthrough in Chip Technology, Paving the Way for Lithography Advancements – TrendForce

    Can RFID technology solve the global medicine shortage crisis? – World Health Expo

    Can RFID technology solve the global medicine shortage crisis? – World Health Expo

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
Earth-News
No Result
View All Result
Home Business

Nvidia AI Foundry And NIMs: A Huge Competitive Advantage

July 24, 2024
in Business
Nvidia AI Foundry And NIMs: A Huge Competitive Advantage
Share on FacebookShare on Twitter

Nvidia has fleshed out a complete software stack to ease custom model development and deployment for enterprises. Is this AI Nervana? And can AMD and Intel compete with this?

In order for enterprises to adopt AI, its got to become a lot easier and more affordable. Nvidia has (re-) launched AI Foundry to help Enterprises adapt and adopt AI to meet their business needs without having to start from scratch. And without having to spend gazillions of dollars.

The timing is spot-on as investors grow nervous that it may be hard for enterprises to make a good return on their AI investments. And without Enterprise adoption, AI will fail and we will be back to an AI Winter. To counter that narrative, Nvidia is expected to share Enterprise ROI stories during its next earnings call. And the new AI Foundry coupled with NIMs could become the standard path forward for most companies. While many components of this story are indeed open source, they only run on Nvidia GPUs. I know of no other chip company with anything even close to NIMs or the AI Foundry.

What is the AI Foundry?

The Nvidia AI Foundry is a combination of software, models, and expert services to help Enterprises not only get started, but complete their AI journey. Will this put Nvidia on a collision course with its ecosystem consulting partners such as IBM and Accenture? Accenture has been using the Nvidia AI Foundry to revamp its internal enterprise functions, and has now taken what they have learned and created the Accenture AI Refinery to help its clients do the same. Deloitte is on a similar path.

The custom model creation workflow.

NVidia

According to Nvidia’s blog on the Foundry, “Just as TSMC manufactures chips designed by other companies, NVIDIA AI Foundry provides the infrastructure and tools for other companies to develop and customize AI models — using DGX Cloud, foundation models, NVIDIA NeMo software, NVIDIA expertise, as well as ecosystem tools and support.”

When initially rolled out back in late 2023, Nvidia Foundry was focussed on Microsoft Azure hosted AI. Since then, Nvidia has recruited dozens of partners to help deliver the platform, including AWS, Google Cloud, and Oracle Cloud as well as scores of generative AI companies, model builders, integrators and OEMs.

The ecosystem for Nvidia AI Foundry has exploded with new partners,

Nvidia

The NVIDIA AI foundry service pulls together three elements needed to customize a model for a specific data set or company — a collection of NVIDIA AI Foundation Models, NVIDIA NeMo framework and tools, and NVIDIA DGX Cloud AI supercomputing services — giving enterprises an end-to-end solution for creating custom generative AI models.

But you thought thats what RAG was for, right? Yes, Retrieval Augmented Generation can do a great job of adding company-specific data to an LLM. But Nvidia said that the Foundry can produce a customized model that is fully ten points more accurate than a simple RAG augmentation. Ten points can make the difference between a great model and one that may be thrown on the trash heap.

And NIMs

NIMs provide the building blocks needed to greatly simplify and expand the domains that the Foundry can build on. Nvidia shared over 50 NIMs they have already created for various domains. Recall that a NIM is a containerized inference processing micro-service that the Nvidia NIM Factory has built, and that an Enterprise AI License provides access to the ever-growing NIM Library on ai.nvidia.com.

Nvidia NIMs are multiplying rapidly and cover most of the major modes of data and AI.

Nvidia

The Foundry launch was timed to coincide with Meta’s release of Llama 3.1 405B, which is the first open model that can rival the top AI models from OpenAI, Google, and others, when it comes to state-of-the-art capabilities in general knowledge, steerability, math, tool use, and now with multilingual translation. Meta believes the latest generation of Llama will ignite new applications and modeling paradigms, including synthetic data generation to enable the improvement and training of smaller models, as well as model distillation. Nvidia Foundry also supports the NVIDIA Nemotron, CodeGemma by Google DeepMind, CodeLlama, Gemma by Google DeepMind, Mistral, Mixtral, Phi-3, StarCoder2 and others.

And true to form, Nvidia shows that it can increased performance of models like Llama 3.1 with optimized NIMs. Inferencing solutions like NVIDIA TensorRT-LLM improve efficiency for Llama 3.1 models to minimize latency and maximize throughput, enabling enterprises to generate tokens faster while reducing total cost of running the models in production.

For Llama 3.1 from Meta, NIMs deliver higher performance on the same hardware,

Nvidia

Nvidia also released today four new NeMo Retriever NIM microservices to enable enterprises to scale to “agentic AI” workflows — where AI applications operate accurately with minimal intervention or supervision — while delivering the highest accuracy retrieval-augmented generation, or RAG. These new NeMo Retriever embedding and reranking NIM microservices are now generally available:

NV-EmbedQA-E5-v5, a popular community base embedding model optimized for text question-answering retrieval
NV-EmbedQA-Mistral7B-v2, a popular multilingual community base model fine-tuned for text embedding for high-accuracy question answering
Snowflake-Arctic-Embed-L, an optimized community model, and
NV-RerankQA-Mistral4B-v3, a popular community base model fine-tuned for text reranking for high-accuracy question answering.

“NeMo Retriever provides the best of both worlds. By casting a wide net of data to be retrieved with an embedding NIM, then using a reranking NIM to trim the results for relevancy, developers tapping NeMo Retriever can build a pipeline that ensures the most helpful, accurate results for their enterprise,” Nvidia explained in their blog.

A NIM Example: A Healthcare Chatbot

Perhaps an example would help. Suppose you want to build a digital assistant to help patients with personalized information. Nvidia showed how they can combine 3 agents and 9 NIMs to build an assistant application. This is pretty close to Nervana and way beyond anything that the competition can offer.

A collection of NIMs can be used to create a healthcare digital assistant.

Nvidia

Conclusions

While the competition continues to improve the performance and connectivity of their accelerators, Nvidia is building the software that enables AI adoption. I know of no competitor to NIMs, nor a competitor to Foundry. And of course, nobody has introduced a competitor to Transformer Engine nor TensorRT-LLM, both of which can deliver 2-4 times the performance of a GPU without these features.

As enterprises work to adapt and adopt custom models for their business and applications, Nvidia is providing an easy on ramp to become an AI-enabled enterprise.

As for pricing, while NIM is included in the Enterprise AI license for each GPU, Foundry is priced based on a specific customer situation and is not included in Enterprise AI.

Here’s more detail on the Foundry:

NVIDIA BlogHow NVIDIA AI Foundry Lets Enterprises Forge Custom Generative AI Models

>>> Read full article>>>
Copyright for syndicated content belongs to the linked Source : Forbes – https://www.forbes.com/sites/karlfreund/2024/07/23/nvidia-ai-foundry-and-nims-a-huge-competitive-advantage

Tags: businessFoundryNVIDIA
Previous Post

Apple iPhone 16 Pro: New Leak Reveals Powerful Design Upgrade Incoming, Report Says

Next Post

Donald Trump Says He Will Debate Kamala Harris

Dynamic and dangerous vs. Dortmund, Foden must be part of England’s World Cup squad – ESPN

Dynamic and Dangerous vs. Dortmund: Why Foden Must Be in England’s World Cup Squad

November 6, 2025
Democrats tap anxiety over Trump’s economy in victories that signal midterm strategy – USA Today

Democrats Leverage Economic Worries Over Trump to Secure Crucial Midterm Victories

November 6, 2025
Trixie Mattel to share journey in entertainment, advocacy at UW–Madison – WKOW

Trixie Mattel to Share Her Inspiring Journey in Entertainment and Advocacy at UW-Madison

November 6, 2025
Iowa seeks federal funding to support rural health care, Gov. Kim Reynolds announces – Iowa Capital Dispatch

Iowa Launches Bold Effort to Secure Federal Funds for Boosting Rural Health Care, Governor Kim Reynolds Reveals

November 6, 2025
Federal judge warns Justice Department it may be veering close to mishandling evidence in Comey case – CNN

Federal judge warns Justice Department it may be veering close to mishandling evidence in Comey case – CNN

November 6, 2025
Deep Dive Into Shark Ecology Provides Path to Conservation – Georgia Institute of Technology

Unlocking Shark Secrets: Exploring Their Ecology to Drive Conservation Efforts

November 5, 2025
Science diplomacy in small states: a case study of global players’ engagement in Slovakia – Nature

How Small States Like Slovakia Master the Art of Global Science Diplomacy

November 5, 2025
Academics welcome ‘change of tone’ on Serbia but fear sanctions – Science|Business

Academics Praise New Approach to Serbia but Express Ongoing Concerns Over Sanctions

November 5, 2025
The $1.25 Dollar Tree Pantry Staple I Buy Every Time I Go – Yahoo

The $1.25 Dollar Tree Pantry Staple I Buy Every Time I Go – Yahoo

November 5, 2025
How We Lost Ourselves to Technology—and How We Can Come Back – The Free Press

How Technology Took Over Our Lives-and How We Can Take Back Control

November 5, 2025

Categories

Archives

November 2025
M T W T F S S
 12
3456789
10111213141516
17181920212223
24252627282930
« Oct    
Earth-News.info

The Earth News is an independent English-language daily published Website from all around the World News

Browse by Category

  • Business (20,132)
  • Ecology (904)
  • Economy (926)
  • Entertainment (21,798)
  • General (18,015)
  • Health (9,967)
  • Lifestyle (938)
  • News (22,149)
  • People (927)
  • Politics (937)
  • Science (16,137)
  • Sports (21,426)
  • Technology (15,906)
  • World (910)

Recent News

Dynamic and dangerous vs. Dortmund, Foden must be part of England’s World Cup squad – ESPN

Dynamic and Dangerous vs. Dortmund: Why Foden Must Be in England’s World Cup Squad

November 6, 2025
Democrats tap anxiety over Trump’s economy in victories that signal midterm strategy – USA Today

Democrats Leverage Economic Worries Over Trump to Secure Crucial Midterm Victories

November 6, 2025
  • About
  • Advertise
  • Privacy & Policy
  • Contact

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

Go to mobile version