Generative AI

An illustration showing molecules and a brain.

May 09, 2025

Applying Specialized LLMs with Reasoning Capabilities to Accelerate Battery Research

Scientific research in complex fields like battery innovation is often slowed by manual evaluation of materials, limiting progress to just dozens of candidates...

11 MIN READ

May 08, 2025

Extending the NVIDIA Agent Intelligence Toolkit to Support New Agentic Frameworks

NVIDIA Agent Intelligence toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents. It focuses on enabling developers to...

12 MIN READ

May 08, 2025

Revolutionizing Neural Reconstruction and Rendering in gsplat with 3DGUT

Realistic 3D simulation is becoming a cornerstone of modern AI and graphics, from training autonomous vehicles (AV) to powering robotics and digital twins....

5 MIN READ

May 07, 2025

Concept‑Driven AI Teaching Assistant Guides Students to Deeper Insights

In today's educational landscape, generative AI tools have become both a blessing and a challenge. While these tools offer unprecedented access to information,...

8 MIN READ

May 07, 2025

Building Nemotron-CC, A High-Quality Trillion Token Dataset for LLM Pretraining from Common Crawl Using NVIDIA NeMo Curator

Curating high-quality pretraining datasets is critical for enterprise developers aiming to train state-of-the-art large language models (LLMs). To enable...

7 MIN READ

Decorative image of a datacenter with floating icons overlaid.

May 06, 2025

LLM Inference Benchmarking Guide: NVIDIA GenAI-Perf and NIM

This is the second post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM. ...

11 MIN READ

May 02, 2025

Integrate and Deploy Tongyi Qwen3 Models into Production Applications with NVIDIA

Alibaba recently released Tongyi Qwen3, a family of open-source hybrid-reasoning large language models (LLMs). The Qwen3 family consists of two MoE models,...

7 MIN READ

May 02, 2025

HackAI Challenge Winners Announced

Explore the groundbreaking projects and real-world impacts of the HackAI Challenge powered by NVIDIA AI Workbench and Dell Precision.

1 MIN READ

An image representing matrix multiplication.

May 01, 2025

Boosting Matrix Multiplication Speed and Flexibility with NVIDIA cuBLAS 12.9

The NVIDIA CUDA-X math libraries empower developers to build accelerated applications for AI, scientific computing, data processing, and more. Two...

8 MIN READ

Apr 29, 2025

Spotlight: Personal AI Brings AI Receptionists to Small Business Owners with NVIDIA Riva

It's 10 p.m. on a Tuesday when the phone rings at the Sapochnick Law Firm, a specialized law practice in San Diego, California. The caller, a client of the...

6 MIN READ

Apr 29, 2025

Structuring Applications to Secure the KV Cache

When interacting with transformer-based models like large language models (LLMs) and vision-language models (VLMs), the structure of the input shapes the...

11 MIN READ

An illustration representing generative AI.

Apr 29, 2025

Choosing Your First Local AI Project

AI is rapidly moving beyond centralized cloud and data centers, becoming a powerful tool deployable directly on professional workstations. Thanks to advanced...

7 MIN READ

Apr 29, 2025

NVIDIA NIM Operator 2.0 Boosts AI Deployment with NVIDIA NeMo Microservices Support

The first release of NVIDIA NIM Operator simplified the deployment and lifecycle management of inference pipelines for NVIDIA NIM microservices, reducing the...

5 MIN READ

Apr 28, 2025

Advancing Cybersecurity Operations with Agentic AI Systems

The age of passive AI is over. A new era is beginning, where AI doesn’t just respond—it thinks, plans, and acts. The rapid advancement of large language...

15 MIN READ

Apr 24, 2025

Benchmarking Agentic LLM and VLM Reasoning for Gaming with NVIDIA NIM

This is the first post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM. ...

7 MIN READ

Apr 23, 2025

Spotlight: Qodo Innovates Efficient Code Search with NVIDIA DGX

Large language models (LLMs) have enabled AI tools that help you write more code faster, but as we ask these tools to take on more and more complex tasks, there...

8 MIN READ