Capstone Projects
Thirty plus real world, AI heavy project ideas across every track. Each idea has a problem statement, a suggested stack, milestones, a difficulty, and the skills it proves. Build a few of these to turn a resume into a portfolio.
Overview
Projects are the strongest signal in a placement process. This catalog gives buildable ideas that map to the other tracks. Pick by your target role. Each project page in the app tracks status (planned, building, shipped) and links a document and a drawing. Aim to ship at least one starter, one intermediate, and one advanced project per target role.
How to read each entry: Problem, Stack, Milestones, Difficulty, Proves.
AI and agentic engineering
Problem: answer questions over a set of PDFs with citations.
Stack: Python, FastAPI, an embedding model, Qdrant, an LLM.
Milestones: ingest and chunk, embed and store, retrieve and rerank, answer with citations, evaluate with RAGAS.
Difficulty: intermediate.
Proves: retrieval, chunking, evaluation.
Problem: research a topic, browse, synthesize, and write a report.
Stack: LangGraph, a web search tool, an LLM, FastAPI.
Milestones: planner, search tool, writer, editor, human approval gate, tracing.
Difficulty: advanced.
Proves: agent orchestration, tool use, human in the loop.
Problem: review a pull request and suggest improvements.
Stack: LangGraph, the git platform API, an LLM.
Milestones: fetch diff, analyze, comment, summarize, guardrails.
Difficulty: advanced.
Proves: multi step agents, structured output.
Problem: pull typed fields from invoices and receipts.
Stack: Python, an LLM with structured output, pydantic.
Milestones: schema, extraction, validation, batch mode, accuracy report.
Difficulty: intermediate.
Proves: function calling, validation.
Problem: search your own notes by meaning.
Stack: sentence transformers, FAISS, a small web UI.
Milestones: index, search, hybrid keyword and vector, ranking.
Difficulty: starter.
Proves: embeddings, hybrid search.
Problem: answer support questions and escalate hard ones.
Stack: RAG, an LLM, a ticket system, FastAPI.
Milestones: knowledge base, retrieval, confidence threshold, escalation, analytics.
Difficulty: intermediate.
Proves: RAG, routing, guardrails.
1. Document question and answer with RAG
2. Multi agent research assistant
3. Code review agent
4. Structured data extractor
5. Semantic search over personal notes
6. Customer support bot with escalation
Data engineering
Problem: ingest financial events and write clean tables.
Stack: PySpark Structured Streaming, Delta Lake, object storage.
Milestones: source, transform, watermarks, Delta write, dashboard.
Difficulty: advanced.
Proves: streaming, lakehouse.
Problem: bronze, silver, and gold layers over raw data.
Stack: object storage, Auto Loader, PySpark, a catalog.
Milestones: ingest, clean, model, quality checks, governance.
Difficulty: advanced.
Proves: lakehouse architecture, data quality.
Problem: schedule and monitor a daily ETL.
Stack: Airflow or Prefect, Python, a warehouse.
Milestones: extract, validate, transform, load, alerting.
Difficulty: intermediate.
Proves: orchestration, reliability.
Problem: turn raw tables into tested marts.
Stack: dbt, DuckDB or a warehouse.
Milestones: staging, marts, tests, docs, a dashboard.
Difficulty: intermediate.
Proves: dimensional modeling, testing.
7. Streaming events pipeline
8. Medallion data lake
9. Batch ETL with orchestration
10. Analytics warehouse with dbt
MLOps and machine learning
Problem: train, register, serve, and monitor a model.
Stack: MLflow, FastAPI, Docker, Evidently.
Milestones: pipeline, registry, serving, drift monitoring, retrain trigger.
Difficulty: advanced.
Proves: the full MLOps lifecycle.
Problem: predict customer churn and serve scores.
Stack: scikit-learn or xgboost, FastAPI, a feature store.
Milestones: features, model, evaluation, serving, monitoring.
Difficulty: intermediate.
Proves: tabular ML, serving.
Problem: specialize a small model on domain data.
Stack: transformers, PEFT and QLoRA, a cloud GPU.
Milestones: dataset, train with LoRA, evaluate, serve, model card.
Difficulty: advanced.
Proves: fine tuning, evaluation.
Problem: regression test prompts and models.
Stack: promptfoo or deepeval, a dataset, CI.
Milestones: golden set, metrics, CI gate, cost report.
Difficulty: intermediate.
Proves: evaluation, cost control.
11. End to end ML pipeline
12. Churn prediction service
13. Fine tuned small language model
14. LLM evaluation harness
Backend and system design
Problem: shorten and redirect with analytics.
Stack: FastAPI, PostgreSQL, Redis.
Milestones: hashing, redirect, analytics, caching, rate limit.
Difficulty: starter.
Proves: API design, caching.
Problem: a reusable distributed rate limiter.
Stack: Redis, FastAPI.
Milestones: token bucket, sliding window, distributed state, tests.
Difficulty: intermediate.
Proves: algorithms, distributed state.
Problem: run background jobs reliably.
Stack: Redis or a broker, workers, FastAPI.
Milestones: enqueue, workers, retries, dead letter, dashboard.
Difficulty: intermediate.
Proves: queues, reliability.
Problem: live messaging with presence.
Stack: WebSockets, Redis pub and sub, PostgreSQL.
Milestones: sockets, rooms, presence, history, fanout.
Difficulty: intermediate.
Proves: real time, pub and sub.
Problem: multi channel delivery with retries.
Stack: a queue, FastAPI, email and push providers.
Milestones: channels, priority, retry, deduplication, templates.
Difficulty: intermediate.
Proves: messaging, idempotency.
Problem: process orders with events and sagas.
Stack: Kafka or Redis Streams, FastAPI, PostgreSQL.
Milestones: events, CQRS, saga, outbox, idempotent consumers.
Difficulty: advanced.
Proves: event driven architecture.
Problem: a small replicated key value store.
Stack: Go or Python, a consensus library.
Milestones: storage engine, replication, consistent hashing, leader election.
Difficulty: advanced.
Proves: distributed systems fundamentals.
15. URL shortener at scale
16. Rate limiter service
17. Job queue and worker system
18. Real time chat
19. Notification service
20. Event driven order system
21. Distributed key value store
Frontend and full stack
Problem: the very app described in the docs folder.
Stack: Next.js, Supabase, BlockNote, Monaco, Excalidraw, React Flow.
Milestones: see docs/04_build_phases.md.
Difficulty: advanced.
Proves: full stack, product thinking.
Problem: a fast notes app with offline support.
Stack: Next.js, a block editor, a database.
Milestones: editor, folders, search, offline, sync.
Difficulty: intermediate.
Proves: editors, local first.
Problem: drag charts onto a dashboard.
Stack: Next.js, a charting library, a database.
Milestones: data sources, chart config, layout, sharing.
Difficulty: intermediate.
Proves: data UI, state.
Problem: a shared drawing canvas.
Stack: Next.js, a canvas library, real time sync.
Milestones: canvas, shapes, real time, export.
Difficulty: advanced.
Proves: canvas, real time.
22. This placement dashboard
23. Markdown notes app with sync
24. Analytics dashboard builder
25. Collaborative whiteboard
DevOps and platform
Problem: a full pipeline for a sample app.
Stack: GitHub Actions, Docker, Kubernetes, Argo CD.
Milestones: build, test, image, GitOps deploy, rollback.
Difficulty: advanced.
Proves: delivery automation.
Problem: full metrics, logs, and traces for a service.
Stack: Prometheus, Grafana, Loki, OpenTelemetry.
Milestones: instrument, dashboards, alerts, traces.
Difficulty: intermediate.
Proves: observability.
Problem: reusable Terraform modules for a stack.
Stack: Terraform, a cloud.
Milestones: modules, remote state, CI validation, docs.
Difficulty: intermediate.
Proves: infrastructure as code.
26. CI and CD platform demo
27. Observability stack
28. Infrastructure as code module library
Data science and analytics
Problem: upload results and get a significance verdict.
Stack: Python, statsmodels, a simple UI.
Milestones: input, tests, power, report.
Difficulty: starter.
Proves: experiment analysis.
Problem: recommend items from interactions.
Stack: implicit or surprise, FastAPI.
Milestones: data, model, evaluation, serving.
Difficulty: intermediate.
Proves: recommendation, evaluation.
Problem: forecast a metric and serve predictions.
Stack: prophet or statsmodels, FastAPI.
Milestones: features, model, backtest, serve.
Difficulty: intermediate.
Proves: forecasting.
Problem: from raw data to a decision dashboard.
Stack: dbt, a warehouse, Metabase.
Milestones: model, test, dashboard, write up.
Difficulty: intermediate.
Proves: analytics engineering, storytelling.
29. A and B test analysis tool
30. Recommendation system
31. Time series forecasting service
32. Sales analytics end to end
How to choose
Targeting AI engineering: build 1, 2, and 13.
Targeting backend: build 15, 20, and 21.
Targeting data engineering: build 7, 8, and 10.
Targeting full stack: build 22 and 25.
Targeting MLOps: build 11 and 14.
Ship fewer projects with depth rather than many shallow ones. For each shipped project, write a one page summary covering the problem, the approach, the trade offs, and the results, and record a short demo.