AI Engineering Interview Questions — Practice Library

Quantization — Hitting a 50ms p99 Without Wrecking Accuracy

Reported at NvidiaML EngineeringHard

Theory

Handling Class Imbalance and Preventing Overfitting in Machine Learning

Reported at AmazonML EngineeringMedium

Theory

How do you monitor performance drift and hallucinations in production LLMs?

ML EngineeringHard

System Design

Average Product Rating Calculation and Top Five Identification

Reported at LinkedInML EngineeringEasy

Coding

Drift Detection — Covariate Shift vs Concept Drift

Reported at StripeML EngineeringMedium

Theory

Model Monitoring — Diagnosing Drift in a Credit Scoring Model

ML EngineeringHard

Theory

Elbow Method and Silhouette Score for Optimal Cluster Selection

Reported at MicrosoftML EngineeringEasy

Theory

Distributed Training — Fitting and Scaling a 70B Model

Reported at MetaML EngineeringHard

System Design

CI/CD for LLM workflows — what is different from traditional ML?

ML EngineeringHard

System Design

Feature Store — Backfilling a New Feature Without Downtime

Reported at UberML EngineeringMedium

System Design

Handling Multicollinearity: Select Features From Highly Correlated Sets

Reported at OpenAIML EngineeringMedium

Theory

Model Compression — Hitting a 50ms Latency Target

ML EngineeringHard

Theory

Model Benchmarking: Metrics, Traffic Split, and Rollout Strategy

Reported at NetflixML EngineeringMedium

Theory

Data Cleaning Strategies for Handling Messy and Incomplete Datasets

Reported at AmazonML EngineeringMedium

Theory

Load balancing for distributed AI model serving and inference requests

ML EngineeringMedium

Theory

Distributed Search System Architecture with LLM Inference at Scale

Reported at AnthropicML EngineeringHard

System Design

ML Experiment Tracking System with Metrics Analysis

Reported at Google DeepMindML EngineeringHard

System Design

Prevent Re-identification in Anonymized Data Through Differential Privacy

ML EngineeringMedium

Theory

ML Model Deployment Monitoring Metrics and Performance Indicators

Reported at NetflixML EngineeringMedium

Theory

Quantization — PTQ vs QAT and Calibration Pitfalls

Reported at NvidiaML EngineeringMedium

Theory

Walk me through debugging a session with incorrect LLM outputs

ML EngineeringMedium

Theory

Medical LLM Governance: Audit Infrastructure at Meta Scale

Reported at MetaML EngineeringHard

Theory

Housing Price Outlier Detection and Visualization System

Reported at LinkedInML EngineeringEasy

Coding

ML Engineer vs Data Scientist on a Fraud Detection System

ML EngineeringEasy

Theory