NVDA NVIDIA Corporation

NVIDIA Launches Generative AI Microservices for Developers to Create and Deploy Generative AI Copilots Across NVIDIA CUDA GPU Installed Base

NVIDIA Launches Generative AI Microservices for Developers to Create and Deploy Generative AI Copilots Across NVIDIA CUDA GPU Installed Base

  • New Catalog of GPU-Accelerated NVIDIA NIM Microservices and Cloud Endpoints for Pretrained AI Models Optimized to Run on Hundreds of Millions of CUDA-Enabled GPUs Across Clouds, Data Centers, Workstations and PCs

  • Enterprises Can Use Microservices to Accelerate Data Processing, LLM Customization, Inference, Retrieval-Augmented Generation and Guardrails
  • Adopted by Broad AI Ecosystem, Including Leading Application Platform Providers Cadence, CrowdStrike, SAP, ServiceNow and More

SAN JOSE, Calif., March 18, 2024 (GLOBE NEWSWIRE) -- NVIDIA today launched dozens of enterprise-grade generative AI microservices that businesses can use to create and deploy custom applications on their own platforms while retaining full ownership and control of their intellectual property.

Built on top of the ® platform, the catalog of cloud-native microservices includes ™ microservices for optimized inference on more than two dozen popular AI models from NVIDIA and its partner ecosystem. In addition, NVIDIA accelerated software development kits, libraries and tools can now be accessed as ™ microservices for retrieval-augmented generation (RAG), guardrails, data processing, HPC and more. NVIDIA also separately announced over two dozen .

The curated selection of microservices adds a new layer to NVIDIA’s full-stack computing platform. This layer connects the AI ecosystem of model developers, platform providers and enterprises with a standardized path to run custom AI models optimized for NVIDIA’s CUDA installed base of hundreds of millions of GPUs across clouds, data centers, workstations and PCs.

Among the first to access the new NVIDIA generative AI microservices available in are leading application, data and cybersecurity platform providers including , , , Getty Images, , , and Shutterstock.

“Established enterprise platforms are sitting on a goldmine of data that can be transformed into generative AI copilots,” said Jensen Huang, founder and CEO of NVIDIA. “Created with our partner ecosystem, these containerized AI microservices are the building blocks for enterprises in every industry to become AI companies.”

NIM Inference Microservices Speed Deployments From Weeks to Minutes

NIM microservices provide pre-built containers powered by NVIDIA inference software — including Triton Inference Server™ and TensorRT™-LLM — which enable developers to reduce deployment times from weeks to minutes.

They provide industry-standard APIs for domains such as language, speech and drug discovery to enable developers to quickly build AI applications using their proprietary data hosted securely in their own infrastructure. These applications can scale on demand, providing flexibility and performance for running generative AI in production on NVIDIA-accelerated computing platforms.

NIM microservices provide the fastest and highest-performing production AI container for deploying models from NVIDIA, , Adept, , Getty Images, and Shutterstock as well as open models from Google, , Meta, Microsoft, Mistral AI and Stability AI.

today announced that it is using NIM to develop and deploy new domain-specific copilots and other generative AI applications faster and more cost effectively.

Customers will be able to access NIM microservices from , and , and integrate with popular AI frameworks like , and .

CUDA-X Microservices for RAG, Data Processing, Guardrails, HPC

provide end-to-end building blocks for data preparation, customization and training to speed production AI development across industries.

To accelerate AI adoption, enterprises may use CUDA-X microservices including for customizable speech and translation AI, ™ for routing optimization, as well as for high resolution climate and weather simulations.

™ microservices let developers link their AI applications to their business data — including text, images and visualizations such as bar graphs, line plots and pie charts — to generate highly accurate, contextually relevant responses. With these RAG capabilities, enterprises can offer more data to copilots, chatbots and generative AI productivity tools to elevate accuracy and insight.

Additional are coming soon for custom model development. These include NVIDIA NeMo Curator for building clean datasets for training and retrieval, NVIDIA NeMo Customizer for fine-tuning LLMs with domain-specific data, NVIDIA NeMo Evaluator for analyzing AI model performance, as well as for LLMs.

Ecosystem Supercharges Enterprise Platforms With Generative AI Microservices

In addition to leading application providers, data, infrastructure and compute platform providers across the NVIDIA ecosystem are working with NVIDIA microservices to bring generative AI to enterprises.

Top data platform providers including , Cloudera, Cohesity, , Dropbox and are working with NVIDIA microservices to help customers optimize their RAG pipelines and integrate their proprietary data into generative AI applications. leverages NeMo Retriever to harness enterprise data for building AI applications.

Enterprises can deploy NVIDIA microservices included with NVIDIA AI Enterprise 5.0 across the infrastructure of their choice, such as leading clouds , , and .

NVIDIA microservices are also supported on over 400 NVIDIA-Certified Systems™, including servers and workstations from Cisco, , , HP, and Supermicro. Separately today, HPE announced availability of HPE’s enterprise computing solution for generative AI, with planned integration of NIM and into HPE’s AI software.

NVIDIA AI Enterprise microservices are coming to infrastructure software platforms including with NVIDIA. OpenShift supports NVIDIA NIM microservices to help enterprises more easily integrate generative AI capabilities into their applications with optimized capabilities for security, compliance and controls. is adding Charmed Kubernetes support for NVIDIA microservices through NVIDIA AI Enterprise.

NVIDIA’s ecosystem of hundreds of AI and MLOps partners, including Abridge, Anyscale, Dataiku, , , H2O.ai, , , and , are adding support for NVIDIA microservices through NVIDIA AI Enterprise.

Apache Lucene, , Faiss, Kinetica, Milvus, Redis, and Weaviate are among the vector search providers working with NVIDIA NeMo Retriever microservices to power responsive RAG capabilities for enterprises.

Availability

Developers can experiment with NVIDIA microservices at at no charge. Enterprises can deploy production-grade NIM microservices with NVIDIA AI Enterprise 5.0 running on NVIDIA-Certified Systems and leading cloud platforms.

For more information, watch the replay of and visit the NVIDIA booth at GTC, held at the San Jose Convention Center through March 21.

About NVIDIA

Since its founding in 1993, (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling industrial digitalization across markets. NVIDIA is now a full-stack computing infrastructure company with data-center-scale offerings that are reshaping industry. More information at .

For further information, contact:

Anna Kiachian

Senior PR Manager

NVIDIA Corporation



Certain statements in this press release including, but not limited to, statements as to: the benefits, impact, performance, features, and availability of NVIDIA’s products and technologies, including NVIDIA CUDA platform, NVIDIA NIM microservices, NVIDIA CUDA-X microservices, NVIDIA AI Enterprise 5.0, NVIDIA inference software including Triton Inference Server and TensorRT-LLM, NVIDIA Riva, NVIDIA cuOpt, NVIDIA Earth-2, NeMo Retriever, NVIDIA NeMo Curator, NVIDIA NeMo Customizer, NVIDIA NeMo Evaluator, NVIDIA NeMo Guardrails, NVIDIA AI Foundation models and NVIDIA AI Enterprise microservices; and established enterprise platforms sitting on a goldmine of data that can be transformed into generative AI copilots are forward-looking statements that are subject to risks and uncertainties that could cause results to be materially different than expectations. Important factors that could cause actual results to differ materially include: global economic conditions; our reliance on third parties to manufacture, assemble, package and test our products; the impact of technological development and competition; development of new products and technologies or enhancements to our existing product and technologies; market acceptance of our products or our partners' products; design, manufacturing or software defects; changes in consumer preferences or demands; changes in industry standards and interfaces; unexpected loss of performance of our products or technologies when integrated into systems; as well as other factors detailed from time to time in the most recent reports NVIDIA files with the Securities and Exchange Commission, or SEC, including, but not limited to, its annual report on Form 10-K and quarterly reports on Form 10-Q. Copies of reports filed with the SEC are posted on the company's website and are available from NVIDIA without charge. These forward-looking statements are not guarantees of future performance and speak only as of the date hereof, and, except as required by law, NVIDIA disclaims any obligation to update these forward-looking statements to reflect future events or circumstances.

Many of the products and features described herein remain in various stages and will be offered on a when-and-if-available basis. The statements above are not intended to be, and should not be interpreted as a commitment, promise, or legal obligation, and the development, release, and timing of any features or functionalities described for our products is subject to change and remains at the sole discretion of NVIDIA. NVIDIA will have no liability for failure to deliver or delay in the delivery of any of the products, features or functions set forth herein.

© 2024 NVIDIA Corporation. All rights reserved. NVIDIA, the NVIDIA logo, CUDA, CUDA-X, NVIDIA NeMo, NVIDIA NeMo Retriever, NVIDIA NIM, NVIDIA Triton Inference Server, NVIDIA-Certified Systems, and TensorRT are trademarks and/or registered trademarks of NVIDIA Corporation in the U.S. and other countries. Other company and product names may be trademarks of the respective companies with which they are associated. Features, pricing, availability and specifications are subject to change without notice.

A photo accompanying this announcement is available at



EN
18/03/2024

Underlying

To request access to management, click here to engage with our
partner Phoenix-IR's CorporateAccessNetwork.com

Reports on NVIDIA Corporation

Wedbush Research
  • Wedbush Research
UBI UBISOFT ENTERTAINMENT SA
XNCR XENCOR INC.
XENE XENON PHARMACEUTICALS INC
WDC WESTERN DIGITAL CORPORATION
VYGR VOYAGER THERAPEUTICS INC.
TTWO TAKE-TWO INTERACTIVE SOFTWARE INC.
SIMON SILICON MOTION TECHNOLOGY CORPORATION SPONSORED ADR
RARE ULTRAGENYX PHARMACEUTICAL INC.
PSTG PURE STORAGE INC. CLASS A
NVDA NVIDIA CORPORATION
NVCR NOVOCURE LTD.
NTAP NETAPP INC.
NKTR NEKTAR THERAPEUTICS
NBIX NEUROCRINE BIOSCIENCES INC.
MU MICRON TECHNOLOGY INC.
KURA KURA ONCOLOGY INC.
INTC INTEL CORPORATION
GOOGL ALPHABET INC. CLASS A
GERN GERON CORP.
FB FACEBOOK INC. CLASS A
FATE FATE THERAPEUTICS INC
CTMX CYTOMX THERAPEUTICS INC.
BMRN BIOMARIN PHARMACEUTICAL INC.
BIIB BIOGEN INC.
BCRX BIOCRYST PHARMACEUTICALS INC.
AXTI AXT INC.
ASND ASCENDIS PHARMA A/S SPONSORED ADR
ARDX ARDELYX INC
ACRS ACLARIS THERAPEUTICS INC.
CDR CD PROJEKT S.A.
FRX_CN FENNEC PHARMACEUTICALS
7974 NINTENDO CO. LTD.
2303 UNITED MICROELECTRONICS CORP.
EA ELECTRONIC ARTS INC.
ANAB ANAPTYSBIO INC.
ARGX ARGENX ADS
OVID OVID THERAPEUTICS
ISR ISORAY
AMZN AMAZON.COM INC.
AMD ADVANCED MICRO DEVICES INC.
AAPL APPLE INC.
MSFT MICROSOFT CORPORATION
APLS APELLIS PHARMACEUTICALS
DNLI DENALI THERAPEUTICS INC
RCUS ARCUS BIOSCIENCES
SRRK SCHOLAR ROCK HOLDING CORPORATION
KNSA KINIKSA PHARMACEUTICALS
REPL REPLIMUNE GROUP
ARVN ARVINAS HOLDING
STRO SUTRO BIOPHARMA
UBER UBER TECHNOLOGIES INC.
STOK STOKE THERAPEUTICS
SMCI SUPER MICRO COMPUTER
IDYA INC.
PASG IDEAYA BIOSCIENCES
BDTX PASSAGE BIO
ACET BLACK DIAMOND THERAPEUTICS
KROS ADICET BIO INC
ORIC KEROS THERAPEUTICS
ZNTL ORIC PHARMACEUTICALS
BEAM ZENTALIS PHARMACEUTICALS
LRMR BEAM THERAPEUTICS
PRAX LARIMAR THERAPEUTICS INC
U PRAXIS PRECISION MEDICINES
TVTX UNITY SOFTWARE
VRDN TRAVERE THERAPEUTICS INC
SANA VIRIDIAN THERAPEUTICS INC
PLTK SANA BIOTECHNOLOGY
RBLX PLAYTIKA HOLDING
DAWN ROBLOX
STX DAY ONE BIOPHARMACEUTICALS INC
APP SEAGATE TECHNOLOGY HLDGS PLC
GFS APPLOVIN CORP
GMTX GLOBALFOUNDRIES INC
TYRA GEMINI THERAPEUTICS INC
PEPG TYRA BIOSCIENCES INC
EWTX PEPGEN INC
SLDB EDGEWISE THERAPEUTICS INC
DDI SOLID BIOSCIENCES INC
APGE DOUBLEDOWN INTERACTIVE CO LTD
EMBRAC B APOGEE THERAPEUTICS INC
DNTH EMBRACER GROUP AB
ORKA DIANTHUS THERAPEUTICS INC
BCAX ORUKA THERAPEUTICS INC
ZBIO BICARA THERAPEUTICS INC.
TLX ZENAS BIOPHARMA INC.
MAZE TELIX PHARMACEUTICALS LIMITED
JBIO MAZE THERAPEUTICS INC
IMA JADE BIOSCIENCES INC
IMAGENEBIO INC.
 PRESS RELEASE

NVIDIA AI Ecosystem Expands as Marvell Joins Forces Through NVLink Fus...

NVIDIA AI Ecosystem Expands as Marvell Joins Forces Through NVLink Fusion Collaboration Delivers Greater Choice and Flexibility for Customers and Fully Compatible With NVIDIA AI Infrastructure SANTA CLARA, Calif., March 31, 2026 (GLOBE NEWSWIRE) -- NVIDIA and Marvell Technology, Inc. (NASDAQ: MRVL) today announced a strategic partnership to connect Marvell to the NVIDIA AI factory and AI-RAN ecosystem through NVIDIA NVLink Fusion™, offering customers building on NVIDIA architectures greater choice and flexibility in developing next-generation infrastructure. The companies will also coll...

Pierre FerraguÊ
  • Pierre FerraguÊ

The Tech Infrastructure Quarterly Bible 4Q25

​​​​We publish today our comprehensive quarterly bible: 243 pages of detailed analyses on what happened in the last 3 months, and how we interpret it, in light of our current convictions. The first section acts as a PM summary, outlining our key findings, and latest thoughts on the semi cycle, in 6 slides:

ResearchPool Subscriptions

Get the most out of your insights

Get in touch