I applied through an employee referral. I interviewed at NVIDIA (Santa Clara, CA) in Dec 2025
Interview
Round 1: LeetCode style question focusing on OS Schedulers based on Heaps, Intervals, Sorting, Binary Search, etc + Cloud Computing Basics
Round 2: Project Deep Dive (NLP, RAG, AI Agents, Secure Agents, Context Engineering, LLM Performance Evaluation, GPU deployment, Distributed ML)
Interview questions [1]
Question 1
Round 1:
What is Load Balancer?
What is a Pubsub?
What is the debugging process for PubSub?
Talk about your Production experience?
Round 2:
Types of Tokenization Strategies?
Scenario based Tokenization choice and reasoning?
How to ensure LLM generated code is not harmful?
Project Deep Dive and technical followup questions