39 questions · Free · No signup

AI Interview Question Bank

Curated questions on system design, prompt engineering, RAG, LLM evaluation, and AI agents — sourced from real interviews at Google, Meta, Amazon, and more. With walkthroughs, follow-ups, and the kind of detail that actually helps you prep.

Try: "rag", "prompt", "agent memory"

Browse by Category

Browse by Company

View all →

All Questions(17 of 39)

AI AgentsIntermediate
GoogleMetaMicrosoft+1

How Would You Implement Memory for a Long-Running AI Agent?

Design a memory system for a long-running AI agent — covering in-context working memory, episodic recall, semantic knowledge, and retrieval strategies.

Read question
AI AgentsIntermediate
GoogleMetaMicrosoft+1

How Do You Decide What Tools to Give an AI Agent?

A framework for deciding which tools to give an AI agent — covering granularity, safety boundaries, observability, and the principle of minimal tool sets.

Read question
AI AgentsIntermediate
GoogleMetaMicrosoft+1

What Is the Plan-and-Execute Agent Pattern, and When Should You Use It Over ReAct?

Plan-and-Execute separates planning from execution in AI agents. Walk through how it works, how it compares to ReAct, and the tradeoffs in multi-step task completion.

Read question
AI AgentsIntermediate
OpenAIGoogleMicrosoft+1

What's the Difference Between OpenAI Function Calling and LangChain Agents?

OpenAI function calling and LangChain agents both let LLMs use tools, but they operate at different abstraction levels. Walk through how each works and when to use each.

Read question
AI System DesignIntermediate
GoogleMicrosoftAmazon+1

Design a Conversational AI Customer Support System

Design an AI-powered customer support system that handles common queries automatically while escalating complex issues to human agents.

Read question
AI System DesignIntermediate
GoogleMicrosoftAmazon

Design a Document Q&A System for a Large Corpus

Design an AI system that answers natural language questions over a large collection of documents, with accurate citations and low hallucination rates.

Read question
AI System DesignIntermediate
GoogleMetaMicrosoft+2

How Do You Estimate the Cost of Running a Production LLM System?

Walk through how to estimate and model the cost of running an LLM system in production — covering API token costs, open source GPU infra, and key levers for optimization.

Read question
LLM Eval & OpsIntermediate
GoogleMetaMicrosoft+2

How Do You Build an Eval Suite for an LLM-Powered Feature?

Walk through building a systematic evaluation suite for an LLM feature — from test case design to automated metrics and regression tracking.

Read question
LLM Eval & OpsIntermediate
GoogleMetaMicrosoft+2

How Do You Evaluate a RAG System End-to-End?

RAG evaluation is distinct from general LLM evaluation — it requires measuring both retrieval quality and generation quality independently and together. Walk through the key metrics and frameworks.

Read question
Prompt EngineeringIntermediate
GoogleMetaMicrosoft+2

What Is Prompt Injection, and How Do You Defend Against It?

Prompt injection is one of the most significant security risks in LLM-powered applications. Walk through the attack types and the layered defenses used in production.

Read question
Prompt EngineeringIntermediate
GoogleMetaMicrosoft+2

What Strategies Do You Use to Reduce Hallucinations?

Walk through a layered approach to reducing LLM hallucinations — from prompt-level techniques to retrieval grounding and output validation.

Read question
Prompt EngineeringIntermediate
GoogleMetaMicrosoft+1

How Would You Design a Prompt for Structured Data Extraction?

Design a prompt that reliably extracts structured data (JSON, tables) from unstructured text — handling missing fields, ambiguity, and format errors.

Read question
RAG & RetrievalIntermediate
GoogleMetaMicrosoft+1

How Do You Handle Chunking Strategies for Different Document Types?

Compare chunking strategies for different document types — PDFs, code, HTML, and tables — and learn when each approach works best.

Read question
RAG & RetrievalIntermediate
GoogleMetaMicrosoft+1

How Do You Handle Tables, Charts, and Complex Documents in a RAG Pipeline?

Real-world documents contain tables, charts, and complex layouts that naive text extraction mangles. Walk through how to build a robust document processing pipeline for structured and visual content.

Read question
RAG & RetrievalIntermediate
GoogleMetaMicrosoft+2

Design a RAG Pipeline from Scratch

Walk through designing a production-ready RAG system covering document ingestion, chunking strategies, embedding models, vector search, and LLM generation.

Read question
RAG & RetrievalIntermediate
GoogleMetaMicrosoft+1

How Would You Evaluate Retrieval Quality in a RAG System?

Walk through metrics and methods for evaluating retrieval quality in a RAG pipeline — from offline metrics to end-to-end answer quality.

Read question
RAG & RetrievalIntermediate
GoogleMetaMicrosoft+1

How Do Vector Embeddings Work, and How Do You Choose the Right Embedding Model?

Explain what vector embeddings are, how embedding models convert text to vectors, and how you'd benchmark and improve retrieval accuracy for a production RAG system.

Read question

Prep for the full interview loop

Know the concepts. Now prove it. Practice GenAI, Coding, System Design, and AI/ML Design interviews with an AI that tells you exactly where you fell short.

Start a mock interview