-
Computer Vision / Video AnalyticsAI Uncovers Potentially Hazardous, Forgotten Oil and Gas Wells
-
Generative AIHow to Safeguard AI Agents for Customer Service with NVIDIA NeMo Guardrails
-
Simulation / Modeling / DesignStrengthening Climate Resilience with AI-Powered Flood Modeling and 3D Visualizations
-
Data Center / CloudNVIDIA Project DIGITS, A Grace Blackwell AI Supercomputer On Your Desk
-
Top StoriesAdvancing Physical AI with NVIDIA Cosmos World Foundation Model Platform
Recent
Jan 16, 2025
Introducing New KV Cache Reuse Optimizations in NVIDIA TensorRT-LLM
Language models generate text by predicting the next token, given all the previous tokens including the input text tokens. Key and value elements of the...
7 MIN READ
Jan 16, 2025
NVIDIA JetPack 6.2 Brings Super Mode to NVIDIA Jetson Orin Nano and Jetson Orin NX Modules
The introduction of the NVIDIA Jetson Orin Nano Super Developer Kit sparked a new age of generative AI for small edge devices. The new Super Mode delivered an...
12 MIN READ
Jan 16, 2025
AI Uncovers Potentially Hazardous, Forgotten Oil and Gas Wells
With as many as 800,000 forgotten oil and gas wells scattered across the US, researchers from Lawrence Berkeley National Laboratory (LBNL), have developed an AI...
5 MIN READ
Jan 16, 2025
Accelerating Time Series Forecasting with RAPIDS cuML
Time series forecasting is a powerful data science technique used to predict future values based on data points from the past Open source Python libraries like...
4 MIN READ
Jan 16, 2025
How to Safeguard AI Agents for Customer Service with NVIDIA NeMo Guardrails
AI agents present a significant opportunity for businesses to scale and elevate customer service and support interactions. By automating routine inquiries and...
15 MIN READ
Jan 16, 2025
Continued Pretraining of State-of-the-Art LLMs for Sovereign AI and Regulated Industries with iGenius and NVIDIA DGX Cloud
In recent years, large language models (LLMs) have achieved extraordinary progress in areas such as reasoning, code generation, machine translation, and...
17 MIN READ
Jan 15, 2025
Strengthening Climate Resilience with AI-Powered Flood Modeling and 3D Visualizations
AI-driven flood modeling and 3D visualization tools are transforming how communities prepare for and respond to climate risks. In this NVIDIA GTC 2024 session,...
3 MIN READ
Jan 15, 2025
GPU Memory Essentials for AI Performance
Generative AI has revolutionized how people bring ideas to life, and agentic AI represents the next leap forward in this technological evolution. By leveraging...
6 MIN READ
Jan 14, 2025
Upcoming Event: CUDA Developer Meet Up in Silicon Valley
Whether you’re just starting your GPU programming journey or you’re a CUDA ninja looking to share advanced techniques, join us in San Jose on 1/30/25.
1 MIN READ
Jan 14, 2025
Transforming Data Centers into AI Factories for the 5th Industrial Revolution
In a recent DC Anti-Conference Live presentation, Wade Vinson, chief data center distinguished engineer at NVIDIA, shared insights based upon work by NVIDIA...
2 MIN READ
Jan 13, 2025
Just Released: Learn OpenUSD with New Applied Concepts Courses
Take the three self-paced courses at no cost through the NVIDIA Deep Learning Institute (DLI).
1 MIN READ
Jan 13, 2025
Powering the Next Wave of DPU-Accelerated Cloud Infrastructures with NVIDIA DOCA Platform Framework
Organizations are increasingly turning to accelerated computing to meet the demands of generative AI, 5G telecommunications, and sovereign clouds. NVIDIA has...
9 MIN READ
Inference Performance
Dec 18, 2024
NVIDIA TensorRT-LLM Now Supports Recurrent Drafting for Optimizing LLM Inference
Recurrent drafting (referred as ReDrafter) is a novel speculative decoding technique developed and open-sourced by Apple for large language model (LLM)...
6 MIN READ
Dec 17, 2024
Boost Llama 3.3 70B Inference Throughput 3x with NVIDIA TensorRT-LLM Speculative Decoding
Meta's Llama collection of open large language models (LLMs) continues to grow with the recent addition of Llama 3.3 70B, a text-only...
8 MIN READ
Dec 05, 2024
Spotlight: Perplexity AI Serves 400 Million Search Queries a Month Using NVIDIA Inference Stack
The demand for AI-enabled services continues to grow rapidly, placing increasing pressure on IT and infrastructure teams. These teams are tasked with...
7 MIN READ
Dec 02, 2024
TensorRT-LLM Speculative Decoding Boosts Inference Throughput by up to 3.6x
NVIDIA TensorRT-LLM support for speculative decoding now provides over 3x the speedup in total token throughput. TensorRT-LLM is an open-source library that...
9 MIN READ
Nov 21, 2024
NVIDIA TensorRT-LLM Multiblock Attention Boosts Throughput by More Than 3x for Long Sequence Lengths on NVIDIA HGX H200
Generative AI models are advancing rapidly. Every generation of models comes with a larger number of parameters and longer context windows. The Llama 2 series...
5 MIN READ
Nov 19, 2024
Llama 3.2 Full-Stack Optimizations Unlock High Performance on NVIDIA GPUs
Meta recently released its Llama 3.2 series of vision language models (VLMs), which come in 11B parameter and 90B parameter variants. These models are...
6 MIN READ
Nov 15, 2024
Streamlining AI Inference Performance and Deployment with NVIDIA TensorRT-LLM Chunked Prefill
In this blog post, we take a closer look at chunked prefill, a feature of NVIDIA TensorRT-LLM that increases GPU utilization and simplifies the deployment...
4 MIN READ
Nov 08, 2024
5x Faster Time to First Token with NVIDIA TensorRT-LLM KV Cache Early Reuse
In our previous blog post, we demonstrated how reusing the key-value (KV) cache by offloading it to CPU memory can accelerate time to first token (TTFT) by up...
5 MIN READ
Nov 01, 2024
3x Faster AllReduce with NVSwitch and TensorRT-LLM MultiShot
Deploying generative AI workloads in production environments where user numbers can fluctuate from hundreds to hundreds of thousands – and where input...
5 MIN READ
Oct 28, 2024
NVIDIA GH200 Superchip Accelerates Inference by 2x in Multiturn Interactions with Llama Models
Deploying large language models (LLMs) in production environments often requires making hard trade-offs between enhancing user interactivity and increasing...
7 MIN READ
Oct 09, 2024
NVIDIA Grace CPU Delivers World-Class Data Center Performance and Breakthrough Energy Efficiency
NVIDIA designed the NVIDIA Grace CPU to be a new kind of high-performance, data center CPU—one built to deliver breakthrough energy efficiency and optimized...
8 MIN READ
Oct 09, 2024
Boosting Llama 3.1 405B Throughput by Another 1.5x on NVIDIA H200 Tensor Core GPUs and NVLink Switch
The continued growth of LLMs capability, fueled by increasing parameter counts and support for longer contexts, has led to their usage in a wide variety of...
8 MIN READ
Generative AI
Jan 16, 2025
Introducing New KV Cache Reuse Optimizations in NVIDIA TensorRT-LLM
Language models generate text by predicting the next token, given all the previous tokens including the input text tokens. Key and value elements of the...
7 MIN READ
Jan 16, 2025
NVIDIA JetPack 6.2 Brings Super Mode to NVIDIA Jetson Orin Nano and Jetson Orin NX Modules
The introduction of the NVIDIA Jetson Orin Nano Super Developer Kit sparked a new age of generative AI for small edge devices. The new Super Mode delivered an...
12 MIN READ
Jan 16, 2025
How to Safeguard AI Agents for Customer Service with NVIDIA NeMo Guardrails
AI agents present a significant opportunity for businesses to scale and elevate customer service and support interactions. By automating routine inquiries and...
15 MIN READ
Jan 16, 2025
Continued Pretraining of State-of-the-Art LLMs for Sovereign AI and Regulated Industries with iGenius and NVIDIA DGX Cloud
In recent years, large language models (LLMs) have achieved extraordinary progress in areas such as reasoning, code generation, machine translation, and...
17 MIN READ
Jan 15, 2025
GPU Memory Essentials for AI Performance
Generative AI has revolutionized how people bring ideas to life, and agentic AI represents the next leap forward in this technological evolution. By leveraging...
6 MIN READ
Jan 14, 2025
Transforming Data Centers into AI Factories for the 5th Industrial Revolution
In a recent DC Anti-Conference Live presentation, Wade Vinson, chief data center distinguished engineer at NVIDIA, shared insights based upon work by NVIDIA...
2 MIN READ
Jan 13, 2025
Enhancing Generative AI Model Accuracy with NVIDIA NeMo Curator
In the rapidly evolving landscape of artificial intelligence, the quality of the data used for training models is paramount. High-quality data ensures that...
5 MIN READ
Jan 13, 2025
Evaluating GenMol as a Generalist Foundation Model for Molecular Generation
Traditional computational drug discovery relies almost exclusively on highly task-specific computational models for hit identification and lead optimization....
8 MIN READ
Jan 13, 2025
Accelerate Protein Engineering with the NVIDIA BioNeMo Blueprint for Generative Protein Binder Design
Designing a therapeutic protein that specifically binds its target in drug discovery is a staggering challenge. Traditional workflows are often a painstaking...
4 MIN READ
Jan 09, 2025
Announcing Nemotron-CC: A Trillion-Token English Language Dataset for LLM Pretraining
NVIDIA is excited to announce the release of Nemotron-CC, a 6.3-trillion-token English language Common Crawl dataset for pretraining highly accurate large...
4 MIN READ
Jan 09, 2025
NVIDIA Project DIGITS, A Grace Blackwell AI Supercomputer On Your Desk
Powered by the new GB10 Grace Blackwell Superchip, Project DIGITS can tackle large generative AI models of up to 200B parameters.
1 MIN READ
Jan 09, 2025
Advancing Physical AI with NVIDIA Cosmos World Foundation Model Platform
As robotics and autonomous vehicles advance, accelerating development of physical AI—which enables autonomous machines to perceive, understand, and perform...
14 MIN READ
Data Science
Jan 16, 2025
AI Uncovers Potentially Hazardous, Forgotten Oil and Gas Wells
With as many as 800,000 forgotten oil and gas wells scattered across the US, researchers from Lawrence Berkeley National Laboratory (LBNL), have developed an AI...
5 MIN READ
Jan 16, 2025
Accelerating Time Series Forecasting with RAPIDS cuML
Time series forecasting is a powerful data science technique used to predict future values based on data points from the past Open source Python libraries like...
4 MIN READ
Jan 13, 2025
Upcoming Webinar: Inside the RAPIDS-Accelerated Polars GPU Engine
In the webinar on January 28th, you'll get an inside look of the new GPU engine to learn how Polars' declarative API and query optimizer enable seamless GPU...
1 MIN READ
Jan 13, 2025
Enhancing Generative AI Model Accuracy with NVIDIA NeMo Curator
In the rapidly evolving landscape of artificial intelligence, the quality of the data used for training models is paramount. High-quality data ensures that...
5 MIN READ
Jan 13, 2025
Evaluating GenMol as a Generalist Foundation Model for Molecular Generation
Traditional computational drug discovery relies almost exclusively on highly task-specific computational models for hit identification and lead optimization....
8 MIN READ
Jan 13, 2025
Accelerate Protein Engineering with the NVIDIA BioNeMo Blueprint for Generative Protein Binder Design
Designing a therapeutic protein that specifically binds its target in drug discovery is a staggering challenge. Traditional workflows are often a painstaking...
4 MIN READ
Dec 20, 2024
Accelerating GPU Analytics Using RAPIDS and Ray
RAPIDS is a suite of open-source GPU-accelerated data science and AI libraries that are well supported for scale-out with distributed engines like Spark and...
4 MIN READ
Dec 20, 2024
NVIDIA Hackathon Winners Share Strategies for RAPIDS-Accelerated ML Workflows
Approximately 220 teams gathered at the Open Data Science Conference (ODSC) West this year to compete in the NVIDIA hackathon, a 24-hour machine learning (ML)...
8 MIN READ
Dec 19, 2024
Enhance Your Training Data with New NVIDIA NeMo Curator Classifier Models
Classifier models are specialized in categorizing data into predefined groups or classes, playing a crucial role in optimizing data processing pipelines for...
11 MIN READ
Dec 19, 2024
RAPIDS 24.12 Introduces cuDF on PyPI, CUDA Unified Memory for Polars, and Faster GNNs
RAPIDS 24.12 introduces cuDF packages to PyPI, speeds up groupby aggregations and reading files from AWS S3, enables larger-than-GPU memory queries in the...
8 MIN READ
Dec 18, 2024
Security for Data Privacy in Federated Learning with CUDA-Accelerated Homomorphic Encryption in XGBoost
XGBoost is a machine learning algorithm widely used for tabular data modeling. To expand the XGBoost model from single-site learning to multisite collaborative...
10 MIN READ
Dec 16, 2024
Top Posts of 2024 Highlight NVIDIA NIM, LLM Breakthroughs, and Data Science Optimization
2024 was another landmark year for developers, researchers, and innovators working with NVIDIA technologies. From groundbreaking developments in AI inference to...
4 MIN READ
Robotics
Jan 16, 2025
NVIDIA JetPack 6.2 Brings Super Mode to NVIDIA Jetson Orin Nano and Jetson Orin NX Modules
The introduction of the NVIDIA Jetson Orin Nano Super Developer Kit sparked a new age of generative AI for small edge devices. The new Super Mode delivered an...
12 MIN READ
Jan 09, 2025
Advancing Physical AI with NVIDIA Cosmos World Foundation Model Platform
As robotics and autonomous vehicles advance, accelerating development of physical AI—which enables autonomous machines to perceive, understand, and perform...
14 MIN READ
Jan 09, 2025
Upcoming Livestream: NVIDIA Developer Highlights from CES 2025
Tune in January 16th at 9:00 AM PT for a live recap, followed by a Q&A of the latest developer announcements at CES 2025.
1 MIN READ
Jan 07, 2025
Accelerate Custom Video Foundation Model Pipelines with New NVIDIA NeMo Framework Capabilities
Generative AI has evolved from text-based models to multimodal models, with a recent expansion into video, opening up new potential uses across various...
10 MIN READ
Jan 06, 2025
Just Released: Omniverse Kit SDK 106.5
Kit 106.5 now supports USDz exports, improved new project flow, and preview of new RTX real-time mode.
1 MIN READ
Jan 06, 2025
Advancing Robot Learning, Perception, and Manipulation with Latest NVIDIA Isaac Release
At CES 2025, NVIDIA announced key updates to NVIDIA Isaac, a platform of accelerated libraries, application frameworks, and AI models that accelerate the...
9 MIN READ
Jan 06, 2025
Build a Video Search and Summarization Agent with NVIDIA AI Blueprint
This post was originally published July 29, 2024 but has been extensively revised with NVIDIA AI Blueprint information. Traditional video analytics applications...
11 MIN READ
Jan 06, 2025
Building a Synthetic Motion Generation Pipeline for Humanoid Robot Learning
General-purpose humanoid robots are built to adapt quickly to existing human-centric urban and industrial work spaces, tackling tedious, repetitive, or...
6 MIN READ
Dec 17, 2024
NVIDIA Jetson Orin Nano Developer Kit Gets a “Super” Boost
The generative AI landscape is rapidly evolving, with new large language models (LLMs), visual language models (VLMs), and vision language action (VLA) models...
11 MIN READ
Dec 14, 2024
Introducing Tile-Based Programming in Warp 1.5.0
With the latest release of Warp 1.5.0, developers now have access to new tile-based programming primitives in Python. Leveraging cuBLASDx and cuFFTDx, these new...
14 MIN READ
Dec 10, 2024
New AI Research Foreshadows Autonomous Robotic Surgery
A robot commonly used and manually manipulated by surgeons for routine operations can now autonomously perform key surgical tasks as precisely as humans....
4 MIN READ
Dec 03, 2024
Scaling Action Recognition Models with Synthetic Data
Action recognition models such as PoseClassificationNet have been around for some time, helping systems identify and classify human actions like walking,...
11 MIN READ
Simulation / Modeling / Design
Jan 15, 2025
Strengthening Climate Resilience with AI-Powered Flood Modeling and 3D Visualizations
AI-driven flood modeling and 3D visualization tools are transforming how communities prepare for and respond to climate risks. In this NVIDIA GTC 2024 session,...
3 MIN READ
Jan 14, 2025
Upcoming Event: CUDA Developer Meet Up in Silicon Valley
Whether you’re just starting your GPU programming journey or you’re a CUDA ninja looking to share advanced techniques, join us in San Jose on 1/30/25.
1 MIN READ
Jan 13, 2025
Just Released: Learn OpenUSD with New Applied Concepts Courses
Take the three self-paced courses at no cost through the NVIDIA Deep Learning Institute (DLI).
1 MIN READ
Jan 09, 2025
Advancing Physical AI with NVIDIA Cosmos World Foundation Model Platform
As robotics and autonomous vehicles advance, accelerating development of physical AI—which enables autonomous machines to perceive, understand, and perform...
14 MIN READ
Jan 09, 2025
Upcoming Livestream: NVIDIA Developer Highlights from CES 2025
Tune in January 16th at 9:00 AM PT for a live recap, followed by a Q&A of the latest developer announcements at CES 2025.
1 MIN READ
Jan 06, 2025
Just Released: Omniverse Kit SDK 106.5
Kit 106.5 now supports USDz exports, improved new project flow, and preview of new RTX real-time mode.
1 MIN READ
Jan 06, 2025
Advancing Robot Learning, Perception, and Manipulation with Latest NVIDIA Isaac Release
At CES 2025, NVIDIA announced key updates to NVIDIA Isaac, a platform of accelerated libraries, application frameworks, and AI models that accelerate the...
9 MIN READ
Jan 06, 2025
Experience Digital Twins in XR with NVIDIA Omniverse Spatial Streaming
Spatial computing experiences are transforming how we interact with data, connecting the physical and digital worlds through technologies like extended reality...
5 MIN READ
Jan 06, 2025
Building a Synthetic Motion Generation Pipeline for Humanoid Robot Learning
General-purpose humanoid robots are built to adapt quickly to existing human-centric urban and industrial work spaces, tackling tedious, repetitive, or...
6 MIN READ
Jan 06, 2025
How to Build a Generative AI-Enabled Synthetic Data Pipeline for Perception-Based Physical AI
Training physical AI models used to power autonomous machines, such as robots and autonomous vehicles, requires huge amounts of data. Acquiring large sets of...
7 MIN READ
Jan 06, 2025
NVIDIA RTX Neural Rendering Introduces Next Era of AI-Powered Graphics Innovation
NVIDIA today unveiled next-generation hardware for gamers, creators, and developers—the GeForce RTX 50 Series desktop and laptop GPUs. Alongside these GPUs,...
12 MIN READ
Dec 20, 2024
Taking Computational Fluid Dynamics to the Next Level with the NVIDIA H200 Tensor Core GPU
Computational fluid dynamics (CFD) is used in industry and academia to address a wide range of use cases, including external aerodynamics, internal flows, heat...
5 MIN READ
Computer Vision / Video Analytics
Jan 16, 2025
AI Uncovers Potentially Hazardous, Forgotten Oil and Gas Wells
With as many as 800,000 forgotten oil and gas wells scattered across the US, researchers from Lawrence Berkeley National Laboratory (LBNL), have developed an AI...
5 MIN READ
Jan 06, 2025
Build a Video Search and Summarization Agent with NVIDIA AI Blueprint
This post was originally published July 29, 2024 but has been extensively revised with NVIDIA AI Blueprint information. Traditional video analytics applications...
11 MIN READ
Dec 19, 2024
AI Vision Helps Green Recycling Plants
Each year, the world recycles only around 13% of its two billion-plus tons of municipal waste. By 2050, the world's annual municipal waste will reach 3.88B...
4 MIN READ
Dec 12, 2024
Time-Lapse AI Model Enhances IVF Embryo Selection
Researchers from Weill Cornell Medicine have developed an AI-powered model that could help couples undergoing in vitro fertilization (IVF) and guide...
3 MIN READ
Dec 09, 2024
Just Released: NVIDIA VILA VLM
Now available in preview, NVIDIA VILA is an advanced multimodal VLM that provides visual understanding of multi-images and video.
1 MIN READ
Dec 05, 2024
Celebrating Open Science and Enterprise AI Innovation on MONAI’s 5th Anniversary
As MONAI celebrates its fifth anniversary, we're witnessing the convergence of our vision for open medical AI with production-ready enterprise solutions. ...
7 MIN READ
Dec 03, 2024
Scaling Action Recognition Models with Synthetic Data
Action recognition models such as PoseClassificationNet have been around for some time, helping systems identify and classify human actions like walking,...
11 MIN READ
Dec 03, 2024
Build an Agentic Video Workflow with Video Search and Summarization
Building a question-answering chatbot with large language models (LLMs) is now a common workflow for text-based interactions. What about creating an AI system...
11 MIN READ
Nov 25, 2024
Just Released: NVIDIA DeepStream 7.1
The new release introduces Python support in Service Maker to accelerate real-time multimedia and AI inference applications with a powerful GStreamer...
1 MIN READ
Nov 21, 2024
NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM
NVIDIA JetPack has continuously evolved to offer cutting-edge software tailored to the growing needs of edge AI and robotic developers. With each release,...
8 MIN READ
Nov 21, 2024
AI Unlocks Early Clues to Alzheimer’s Through Retinal Scans
Your eyes could hold the key to unlocking early detection of Alzheimer’s and dementia, with a groundbreaking AI study. Called Eye-AD, the deep learning...
3 MIN READ
Oct 31, 2024
Deep Learning AI Model Identifies Breast Cancer Spread without Surgery
A new deep learning model could reduce the need for surgery when diagnosing whether cancer cells are spreading, including to nearby lymph nodes—also known as...
4 MIN READ
Content Creation / Rendering
Jan 13, 2025
Just Released: Learn OpenUSD with New Applied Concepts Courses
Take the three self-paced courses at no cost through the NVIDIA Deep Learning Institute (DLI).
1 MIN READ
Jan 09, 2025
Upcoming Livestream: NVIDIA Developer Highlights from CES 2025
Tune in January 16th at 9:00 AM PT for a live recap, followed by a Q&A of the latest developer announcements at CES 2025.
1 MIN READ
Jan 06, 2025
NVIDIA RTX Neural Rendering Introduces Next Era of AI-Powered Graphics Innovation
NVIDIA today unveiled next-generation hardware for gamers, creators, and developers—the GeForce RTX 50 Series desktop and laptop GPUs. Alongside these GPUs,...
12 MIN READ
Dec 20, 2024
Just Released: GPU Zen 3: Advanced Rendering Techniques
Grab your copy of GPU Zen 3 to learn about the latest in real-time rendering.
1 MIN READ
Dec 19, 2024
Accelerating Film Production with Dell AI Factory and NVIDIA
Filmmaking is an intricate and complex process that involves a diverse team of artists, writers, visual effects professionals, technicians, and countless other...
5 MIN READ
Dec 17, 2024
Efficient Ray Tracing with NVIDIA OptiX Shader Binding Table Optimization
NVIDIA OptiX is the API for GPU-accelerated ray tracing with CUDA, and is often used to render scenes containing a wide variety of objects and materials. During...
11 MIN READ
Dec 17, 2024
Deploy Agents, Assistants, and Avatars on NVIDIA RTX AI PCs with New Small Language Models
NVIDIA just announced a series of small language models (SLMs) that increase the amount and type of information digital humans can use to augment their...
4 MIN READ
Dec 13, 2024
High-Fidelity 3D Mesh Generation at Scale with Meshtron
Meshes are one of the most important and widely used representations of 3D assets. They are the default standard in the film, design, and gaming industries and...
7 MIN READ
Dec 05, 2024
Optimize GPU Workloads for Graphics Applications with NVIDIA Nsight Graphics
One of the great pastimes of graphics developers and enthusiasts is comparing specifications of GPUs and marveling at the ever-increasing counts of shader...
11 MIN READ
Nov 21, 2024
Powering AI-Augmented Workloads with NVIDIA and Windows 365
We are entering a new era of AI-powered digital workflow, where Windows 365 Cloud PCs are dynamic platforms that host AI technologies and reshape traditional...
7 MIN READ
Oct 07, 2024
Producing Cinematic Content at Scale with a Generative AI-Enabled OpenUSD Pipeline
Producing commercials is resource-intensive, requiring physical locations and various props and setups to display products in different settings and...
7 MIN READ
Oct 02, 2024
Accelerating LLMs with llama.cpp on NVIDIA RTX Systems
The NVIDIA RTX AI for Windows PCs platform offers a thriving ecosystem of thousands of open-source models for application developers to leverage and integrate...
5 MIN READ
Conversational AI
Jan 09, 2025
Announcing Nemotron-CC: A Trillion-Token English Language Dataset for LLM Pretraining
NVIDIA is excited to announce the release of Nemotron-CC, a 6.3-trillion-token English language Common Crawl dataset for pretraining highly accurate large...
4 MIN READ
Jan 09, 2025
Upcoming Livestream: NVIDIA Developer Highlights from CES 2025
Tune in January 16th at 9:00 AM PT for a live recap, followed by a Q&A of the latest developer announcements at CES 2025.
1 MIN READ
Dec 20, 2024
Build a Generative AI Medical Device Training Assistant with NVIDIA NIM Microservices
Innovation in medical devices continues to accelerate, with a record number authorized by the FDA every year. When these new or updated devices are introduced...
5 MIN READ
Dec 16, 2024
Sandboxing Agentic AI Workflows with WebAssembly
Agentic AI workflows often involve the execution of large language model (LLM)-generated code to perform tasks like creating data visualizations. However, this...
7 MIN READ
Dec 11, 2024
Three Building Blocks for Creating AI Virtual Assistants for Customer Service with an NVIDIA AI Blueprint
In today's fast-paced business environment, providing exceptional customer service is no longer just a nice-to-have—it's a necessity. Whether addressing...
10 MIN READ
Nov 22, 2024
Hymba Hybrid-Head Architecture Boosts Small Language Model Performance
Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,...
12 MIN READ
Nov 19, 2024
Create a Custom Slackbot LLM Agent with NVIDIA NIM and LangChain
In the dynamic world of modern business, where communication and efficient workflows are crucial for success, AI-powered solutions have become a competitive...
9 MIN READ
Oct 28, 2024
Creating RAG-Based Question-and-Answer LLM Workflows at NVIDIA
The rapid development of solutions using retrieval augmented generation (RAG) for question-and-answer LLM workflows has led to new types of system...
11 MIN READ
Oct 22, 2024
Scaling LLMs with NVIDIA Triton and NVIDIA TensorRT-LLM Using Kubernetes
Large language models (LLMs) have been widely used for chatbots, content generation, summarization, classification, translation, and more. State-of-the-art LLMs...
16 MIN READ
Oct 21, 2024
IBM’s New Granite 3.0 Generative AI Models Are Small, Yet Highly Accurate and Efficient
Today, IBM released the third generation of IBM Granite, a collection of open language models and complementary tools. Prior generations of Granite focused on...
5 MIN READ
Oct 16, 2024
Simplify AI Application Development with NVIDIA Cloud Native Stack
In the rapidly evolving landscape of AI and data science, the demand for scalable, efficient, and flexible infrastructure has never been higher. Traditional...
5 MIN READ
Oct 01, 2024
Evaluating Medical RAG with NVIDIA AI Endpoints and Ragas
In the rapidly evolving field of medicine, the integration of cutting-edge technologies is crucial for enhancing patient care and advancing research. One such...
11 MIN READ
Edge Computing
Jan 16, 2025
NVIDIA JetPack 6.2 Brings Super Mode to NVIDIA Jetson Orin Nano and Jetson Orin NX Modules
The introduction of the NVIDIA Jetson Orin Nano Super Developer Kit sparked a new age of generative AI for small edge devices. The new Super Mode delivered an...
12 MIN READ
Jan 06, 2025
Advancing Robot Learning, Perception, and Manipulation with Latest NVIDIA Isaac Release
At CES 2025, NVIDIA announced key updates to NVIDIA Isaac, a platform of accelerated libraries, application frameworks, and AI models that accelerate the...
9 MIN READ
Dec 19, 2024
AI Vision Helps Green Recycling Plants
Each year, the world recycles only around 13% of its two billion-plus tons of municipal waste. By 2050, the world's annual municipal waste will reach 3.88B...
4 MIN READ
Dec 18, 2024
Five Takeaways from NVIDIA 6G Developer Day 2024
NVIDIA 6G Developer Day 2024 brought together members of the 6G research and development community to share insights and learn new ways of engaging with NVIDIA...
10 MIN READ
Dec 17, 2024
NVIDIA Jetson Orin Nano Developer Kit Gets a “Super” Boost
The generative AI landscape is rapidly evolving, with new large language models (LLMs), visual language models (VLMs), and vision language action (VLA) models...
11 MIN READ
Nov 25, 2024
Just Released: NVIDIA DeepStream 7.1
The new release introduces Python support in Service Maker to accelerate real-time multimedia and AI inference applications with a powerful GStreamer...
1 MIN READ
Nov 22, 2024
Hymba Hybrid-Head Architecture Boosts Small Language Model Performance
Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,...
12 MIN READ
Nov 21, 2024
NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM
NVIDIA JetPack has continuously evolved to offer cutting-edge software tailored to the growing needs of edge AI and robotic developers. With each release,...
8 MIN READ
Nov 14, 2024
NVIDIA DOCA 2.9 Enhances AI and Cloud Computing Infrastructure with New Performance and Security Features
NVIDIA DOCA enhances the capabilities of NVIDIA networking platforms by providing a comprehensive software framework for developers to leverage hardware...
9 MIN READ
Oct 29, 2024
AI-Powered Devices Track Howls to Save Wolves
A new cell-phone-sized device—which can be deployed in vast, remote areas—is using AI to identify and geolocate wildlife to help conservationists track...
5 MIN READ
Oct 24, 2024
Powering the Next Wave of AI Robotics with Three Computers
NVIDIA has built three computers and accelerated development platforms to enable developers to create physical AI.
1 MIN READ
Oct 21, 2024
AI Accurately Forecasts Extreme Weather Up to 23 Days Ahead
New research from the University of Washington is refining AI weather models using deep learning for more accurate predictions and longer-term forecasts. The...
3 MIN READ
Data Center / Cloud
Jan 16, 2025
Continued Pretraining of State-of-the-Art LLMs for Sovereign AI and Regulated Industries with iGenius and NVIDIA DGX Cloud
In recent years, large language models (LLMs) have achieved extraordinary progress in areas such as reasoning, code generation, machine translation, and...
17 MIN READ
Jan 14, 2025
Transforming Data Centers into AI Factories for the 5th Industrial Revolution
In a recent DC Anti-Conference Live presentation, Wade Vinson, chief data center distinguished engineer at NVIDIA, shared insights based upon work by NVIDIA...
2 MIN READ
Jan 13, 2025
Powering the Next Wave of DPU-Accelerated Cloud Infrastructures with NVIDIA DOCA Platform Framework
Organizations are increasingly turning to accelerated computing to meet the demands of generative AI, 5G telecommunications, and sovereign clouds. NVIDIA has...
9 MIN READ
Jan 09, 2025
NVIDIA Project DIGITS, A Grace Blackwell AI Supercomputer On Your Desk
Powered by the new GB10 Grace Blackwell Superchip, Project DIGITS can tackle large generative AI models of up to 200B parameters.
1 MIN READ
Dec 19, 2024
New Whitepaper: NVIDIA AI Enterprise Security
This white paper details our commitment to securing the NVIDIA AI Enterprise software stack. It outlines the processes and measures NVIDIA takes to ensure...
1 MIN READ
Dec 19, 2024
Spotlight: Stone Ridge Technology Accelerates Reservoir Simulation Workflows with NVIDIA Modulus on AWS
Risk and uncertainty inherent in energy exploration include unknown geological parameters, variations in fluid and rock properties, boundary conditions, and...
8 MIN READ
Dec 18, 2024
Five Takeaways from NVIDIA 6G Developer Day 2024
NVIDIA 6G Developer Day 2024 brought together members of the 6G research and development community to share insights and learn new ways of engaging with NVIDIA...
10 MIN READ
Dec 16, 2024
Top Posts of 2024 Highlight NVIDIA NIM, LLM Breakthroughs, and Data Science Optimization
2024 was another landmark year for developers, researchers, and innovators working with NVIDIA technologies. From groundbreaking developments in AI inference to...
4 MIN READ
Dec 12, 2024
An Introduction to NVIDIA Air
The advent of AI has introduced a new type of data center, the AI factory, purpose-built from the ground up to handle AI workloads. AI workloads can...
6 MIN READ
Dec 12, 2024
Advancing Solar Irradiance Prediction with NVIDIA Earth-2
As global electricity demand continues to rise, traditional sources of energy are increasingly unsustainable. Energy providers are facing pressure to reduce...
9 MIN READ
Dec 12, 2024
Integration of NVIDIA BlueField DPUs with WEKA Client Boosts AI Workload Efficiency
WEKA, a pioneer in scalable software-defined data platforms, and NVIDIA are collaborating to unite WEKA's state-of-the-art data platform solutions with powerful...
5 MIN READ
Dec 11, 2024
Deploying NVIDIA H200 NVL at Scale with New Enterprise Reference Architecture
Last month at the Supercomputing 2024 conference, NVIDIA announced the availability of NVIDIA H200 NVL, the latest NVIDIA Hopper platform. Optimized for...
8 MIN READ