Generative AI

May 09, 2025
Applying Specialized LLMs with Reasoning Capabilities to Accelerate Battery Research
Scientific research in complex fields like battery innovation is often slowed by manual evaluation of materials, limiting progress to just dozens of candidates...
11 MIN READ

May 08, 2025
Extending the NVIDIA Agent Intelligence Toolkit to Support New Agentic Frameworks
NVIDIA Agent Intelligence toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents. It focuses on enabling developers to...
12 MIN READ

May 08, 2025
Revolutionizing Neural Reconstruction and Rendering in gsplat with 3DGUT
Realistic 3D simulation is becoming a cornerstone of modern AI and graphics, from training autonomous vehicles (AV) to powering robotics and digital twins....
5 MIN READ

May 07, 2025
Concept‑Driven AI Teaching Assistant Guides Students to Deeper Insights
In today's educational landscape, generative AI tools have become both a blessing and a challenge. While these tools offer unprecedented access to information,...
8 MIN READ

May 07, 2025
Building Nemotron-CC, A High-Quality Trillion Token Dataset for LLM Pretraining from Common Crawl Using NVIDIA NeMo Curator
Curating high-quality pretraining datasets is critical for enterprise developers aiming to train state-of-the-art large language models (LLMs). To enable...
7 MIN READ

May 06, 2025
LLM Inference Benchmarking Guide: NVIDIA GenAI-Perf and NIM
This is the second post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM. ...
11 MIN READ

May 02, 2025
Integrate and Deploy Tongyi Qwen3 Models into Production Applications with NVIDIA
Alibaba recently released Tongyi Qwen3, a family of open-source hybrid-reasoning large language models (LLMs). The Qwen3 family consists of two MoE models,...
7 MIN READ

May 02, 2025
HackAI Challenge Winners Announced
Explore the groundbreaking projects and real-world impacts of the HackAI Challenge powered by NVIDIA AI Workbench and Dell Precision.
1 MIN READ

May 01, 2025
Boosting Matrix Multiplication Speed and Flexibility with NVIDIA cuBLAS 12.9
The NVIDIA CUDA-X math libraries empower developers to build accelerated applications for AI, scientific computing, data processing, and more. Two...
8 MIN READ

Apr 29, 2025
Spotlight: Personal AI Brings AI Receptionists to Small Business Owners with NVIDIA Riva
It's 10 p.m. on a Tuesday when the phone rings at the Sapochnick Law Firm, a specialized law practice in San Diego, California. The caller, a client of the...
6 MIN READ

Apr 29, 2025
Structuring Applications to Secure the KV Cache
When interacting with transformer-based models like large language models (LLMs) and vision-language models (VLMs), the structure of the input shapes the...
11 MIN READ

Apr 29, 2025
Choosing Your First Local AI Project
AI is rapidly moving beyond centralized cloud and data centers, becoming a powerful tool deployable directly on professional workstations. Thanks to advanced...
7 MIN READ

Apr 29, 2025
NVIDIA NIM Operator 2.0 Boosts AI Deployment with NVIDIA NeMo Microservices Support
The first release of NVIDIA NIM Operator simplified the deployment and lifecycle management of inference pipelines for NVIDIA NIM microservices, reducing the...
5 MIN READ

Apr 28, 2025
Advancing Cybersecurity Operations with Agentic AI Systems
The age of passive AI is over. A new era is beginning, where AI doesn’t just respond—it thinks, plans, and acts. The rapid advancement of large language...
15 MIN READ

Apr 24, 2025
Benchmarking Agentic LLM and VLM Reasoning for Gaming with NVIDIA NIM
This is the first post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM. ...
7 MIN READ

Apr 23, 2025
Spotlight: Qodo Innovates Efficient Code Search with NVIDIA DGX
Large language models (LLMs) have enabled AI tools that help you write more code faster, but as we ask these tools to take on more and more complex tasks, there...
8 MIN READ