48 questions · Free · No signup

AI Interview Question Bank

Curated questions on system design, prompt engineering, RAG, LLM evaluation, and AI agents — sourced from real interviews at Google, Meta, Amazon, and more. With walkthroughs, follow-ups, and the kind of detail that actually helps you prep.

Try: "rag", "prompt", "agent memory"

Browse by Category

Browse by Company

View all →

All Questions(26 of 48)

AI AgentsIntermediate
GoogleMetaMicrosoft+1

How Would You Implement Memory for a Long-Running AI Agent?

Design a memory system for a long-running AI agent — covering in-context working memory, episodic recall, semantic knowledge, and retrieval strategies.

Read question
AI AgentsIntermediate
GoogleMetaMicrosoft+1

How Do You Decide What Tools to Give an AI Agent?

A framework for deciding which tools to give an AI agent — covering granularity, safety boundaries, observability, and the principle of minimal tool sets.

Read question
AI AgentsIntermediate
GoogleMetaMicrosoft+1

What Is the Plan-and-Execute Agent Pattern, and When Should You Use It Over ReAct?

Plan-and-Execute separates planning from execution in AI agents. Walk through how it works, how it compares to ReAct, and the tradeoffs in multi-step task completion.

Read question
AI AgentsIntermediate
OpenAIGoogleMicrosoft+1

What's the Difference Between OpenAI Function Calling and LangChain Agents?

OpenAI function calling and LangChain agents both let LLMs use tools, but they operate at different abstraction levels. Walk through how each works and when to use each.

Read question
AI System DesignIntermediate
GoogleMicrosoftAmazon+1

Design a Conversational AI Customer Support System

Design an AI-powered customer support system that handles common queries automatically while escalating complex issues to human agents.

Read question
AI System DesignIntermediate
GoogleMicrosoftAmazon

Design a Document Q&A System for a Large Corpus

Design an AI system that answers natural language questions over a large collection of documents, with accurate citations and low hallucination rates.

Read question
AI System DesignIntermediate
GoogleMetaMicrosoft+2

How Do You Estimate the Cost of Running a Production LLM System?

Walk through how to estimate and model the cost of running an LLM system in production — covering API token costs, open source GPU infra, and key levers for optimization.

Read question
analytics-case-studiesIntermediate
Meta+1

Analyze the Impact of a New Notification Strategy

The growth team doubled notification frequency last quarter and DAU went up 8%. Leadership wants to know if this was a good decision. What's your analysis?

Read question
analytics-case-studiesIntermediate
MetaGoogle+1

Diagnose a 20% Drop in Daily Active Users

You wake up to an alert: DAU is down 20% compared to the same day last week. Walk me through how you'd investigate this.

Read question
data-modelingIntermediate
Amazon+1

Design an Analytics Schema for an E-Commerce Platform

Design a data warehouse schema for an e-commerce platform that needs to answer questions about revenue by product, customer lifetime value, and cohort retention.

Read question
data-pipeline-designIntermediate

Design a Batch ETL Platform for a Data Warehouse

Design a batch ETL system that ingests data from 50+ source systems nightly, transforms it into a clean analytics layer, and surfaces data quality issues before dashboards are updated.

Read question
data-pipeline-designIntermediate
+1

Design a Real-Time Event Ingestion Pipeline

Design a pipeline to ingest millions of user events per second in real time, make them available for analytics within 30 seconds, and guarantee no data loss.

Read question
LLM Eval & OpsIntermediate
GoogleMetaMicrosoft+2

How Do You Build an Eval Suite for an LLM-Powered Feature?

Walk through building a systematic evaluation suite for an LLM feature — from test case design to automated metrics and regression tracking.

Read question
LLM Eval & OpsIntermediate
GoogleMetaMicrosoft+2

How Do You Evaluate a RAG System End-to-End?

RAG evaluation is distinct from general LLM evaluation — it requires measuring both retrieval quality and generation quality independently and together. Walk through the key metrics and frameworks.

Read question
metrics-and-measurementIntermediate
Meta+1

Define Success Metrics for a Social Feed Product

You're the data analyst for a social feed product. Define the success metrics for a new algorithmic ranking change shipping next month.

Read question
metrics-and-measurementIntermediate
Amazon+1

Design an A/B Test for a Checkout Flow Change

The product team wants to test a redesigned checkout flow they expect will increase conversion rate by 5%. Design the experiment.

Read question
Prompt EngineeringIntermediate
GoogleMetaMicrosoft+2

What Is Prompt Injection, and How Do You Defend Against It?

Prompt injection is one of the most significant security risks in LLM-powered applications. Walk through the attack types and the layered defenses used in production.

Read question
Prompt EngineeringIntermediate
GoogleMetaMicrosoft+2

What Strategies Do You Use to Reduce Hallucinations?

Walk through a layered approach to reducing LLM hallucinations — from prompt-level techniques to retrieval grounding and output validation.

Read question
Prompt EngineeringIntermediate
GoogleMetaMicrosoft+1

How Would You Design a Prompt for Structured Data Extraction?

Design a prompt that reliably extracts structured data (JSON, tables) from unstructured text — handling missing fields, ambiguity, and format errors.

Read question
RAG & RetrievalIntermediate
GoogleMetaMicrosoft+1

How Do You Handle Chunking Strategies for Different Document Types?

Compare chunking strategies for different document types — PDFs, code, HTML, and tables — and learn when each approach works best.

Read question
RAG & RetrievalIntermediate
GoogleMetaMicrosoft+1

How Do You Handle Tables, Charts, and Complex Documents in a RAG Pipeline?

Real-world documents contain tables, charts, and complex layouts that naive text extraction mangles. Walk through how to build a robust document processing pipeline for structured and visual content.

Read question
RAG & RetrievalIntermediate
GoogleMetaMicrosoft+2

Design a RAG Pipeline from Scratch

Walk through designing a production-ready RAG system covering document ingestion, chunking strategies, embedding models, vector search, and LLM generation.

Read question
RAG & RetrievalIntermediate
GoogleMetaMicrosoft+1

How Would You Evaluate Retrieval Quality in a RAG System?

Walk through metrics and methods for evaluating retrieval quality in a RAG pipeline — from offline metrics to end-to-end answer quality.

Read question
RAG & RetrievalIntermediate
GoogleMetaMicrosoft+1

How Do Vector Embeddings Work, and How Do You Choose the Right Embedding Model?

Explain what vector embeddings are, how embedding models convert text to vectors, and how you'd benchmark and improve retrieval accuracy for a production RAG system.

Read question
sql-and-query-reasoningIntermediate
+1

Design a SQL Query for User Funnel Analysis

Design a SQL query that computes the conversion funnel from signup to first purchase, broken down by acquisition channel and cohort week.

Read question
sql-and-query-reasoningIntermediate
MetaGoogleAmazon+1

Optimize a Slow Reporting Query on a 10-Billion-Row Table

A dashboard query scanning a 10-billion-row events table takes 8 minutes to run. Walk through how you'd diagnose and fix it.

Read question

Prep for the full interview loop

Know the concepts. Now prove it. Practice GenAI, Coding, System Design, and AI/ML Design interviews with an AI that tells you exactly where you fell short.

Start a mock interview