Movie Finder — Backend

AI-powered movie discovery and Q&A. Describe a film you half-remember; the system searches a Qdrant-backed movie corpus, enriches the candidates with live IMDb data, and answers follow-up questions once you confirm the right match.

This repo is the backend integration root — it owns the FastAPI app and standardizes a Docker-only local development workflow for the entire backend stack.

How it works

User describes a movie
        │
        ▼
  [RAG Search] ───── Qdrant Cloud (external vector store)
        │
        ▼
  [IMDb Enrichment] ── live ratings, credits, metadata via imdbapi.dev
        │
        ▼
  [Validation] ── deduplicate, filter low-confidence candidates
        │
        ▼
  [Presentation] ── ranked candidate list → user
        │
   ┌────┴─────────────────┐
   │ user confirms        │ user says "none match"
   ▼                      ▼
[Q&A Agent]         [Refinement] ── rebuild query → RAG Search
(answers anything          │
 about the film)     max 3 cycles → dead end
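
The control flow in the diagram can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the chain's LangGraph implementation: `search`, `confirm`, and `refine` are hypothetical callables standing in for RAG search, user confirmation, and query refinement; the 0.5 confidence threshold is an assumed value; IMDb enrichment is omitted. The cap of three comes from the diagram, read here as one initial search plus up to three refinements.

```python
from dataclasses import dataclass
from typing import Callable, Optional

MAX_REFINEMENT_CYCLES = 3   # from the diagram: "max 3 cycles -> dead end"
MIN_CONFIDENCE = 0.5        # assumed threshold, not taken from the repo


@dataclass(frozen=True)
class Candidate:
    imdb_id: str
    title: str
    score: float


def validate(candidates: list[Candidate]) -> list[Candidate]:
    """Deduplicate by IMDb id, drop low-confidence hits, rank by score."""
    best: dict[str, Candidate] = {}
    for c in candidates:
        if c.score < MIN_CONFIDENCE:
            continue
        if c.imdb_id not in best or c.score > best[c.imdb_id].score:
            best[c.imdb_id] = c
    return sorted(best.values(), key=lambda c: c.score, reverse=True)


def find_movie(
    query: str,
    search: Callable[[str], list[Candidate]],
    confirm: Callable[[list[Candidate]], Optional[Candidate]],
    refine: Callable[[str], str],
) -> Optional[Candidate]:
    """Return the confirmed film, or None when the loop dead-ends."""
    for cycle in range(MAX_REFINEMENT_CYCLES + 1):  # initial search + refinements
        ranked = validate(search(query))
        chosen = confirm(ranked)
        if chosen is not None:
            return chosen            # hand off to the Q&A agent
        if cycle < MAX_REFINEMENT_CYCLES:
            query = refine(query)    # user said "none match"
    return None
```

Returning `None` models the dead end; a caller would then ask the user to start over with a fresh description.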

Repository structure

This repo is a multi-repo integration root built from git submodules. It integrates three independent child repos, but this iteration standardizes only the backend app workflow from the root.

Repo	Path	Team	Description
movie-finder-backend	`/`	App / Backend	FastAPI app, backend Docker contract, Jenkins pipeline
movie-finder-chain	`chain/`	AI Engineering	LangGraph multi-agent pipeline imported by the app
imdbapi-client	`chain/imdbapi/`	IMDb API	Async IMDb REST client imported by chain (nested inside chain/)
movie-finder-rag	`rag_ingestion/`	AI / Data Engineering	Offline ingestion pipeline that writes to Qdrant

backend/
├── app/                  FastAPI application (auth, chat, sessions)
│   ├── pyproject.toml
│   └── src/app/
│       ├── main.py           FastAPI lifespan + health endpoints
│       ├── config.py         Pydantic settings
│       ├── dependencies.py   Shared FastAPI dependencies
│       ├── routers/          HTTP API surface
│       ├── auth/             JWT middleware + models
│       └── session/          PostgreSQL session store
├── chain/                git submodule: movie-finder-chain
│   └── imdbapi/          git submodule (nested): imdbapi-client
├── rag_ingestion/        git submodule: movie-finder-rag
├── docker/
│   └── postgres/init/    bootstrap SQL for the local test database
├── scripts/
│   ├── setup.sh          Docker-only backend bootstrap helper
│   └── migrate_sqlite_to_postgres.py
├── Dockerfile            dev + runtime images
├── docker-compose.yml    backend app local stack (postgres + backend)
├── Makefile              Docker-only developer entrypoint
├── Jenkinsfile           backend CI/CD pipeline
├── INTEGRATION.md        cross-repo workflow, secret sharing, FAQ
└── CONTRIBUTING.md       branching, PRs, code standards

Prerequisites

Tool	Version	Install
Docker	24+	docs.docker.com
git	2.20+	system package manager
make	recent	build tools / Xcode CLT / GNU Make package

You do not need a host .venv, and you never run uv sync or fastapi dev on the host.

Quick start (backend app stack)

# 1. Clone with all child repos
git clone --recurse-submodules https://github.com/aharbii/movie-finder-backend.git
cd movie-finder-backend

# 2. Build the dev image (also creates .env from template and installs git hook)
make init

# 3. Edit .env with your API keys
$EDITOR .env

# 4. Start the local stack
make up

# 5. Inspect the running stack
make logs

The backend is available at:

API: http://localhost:8000
OpenAPI docs: http://localhost:8000/docs

If the parent movie-finder/ stack is already using the default ports, change POSTGRES_HOST_PORT and BACKEND_HOST_PORT in .env before make up.

Minimum values you must fill in before startup:

APP_SECRET_KEY
QDRANT_URL
QDRANT_API_KEY_RO
VECTOR_COLLECTION_PREFIX
provider keys for the selected providers, for example ANTHROPIC_API_KEY and OPENAI_API_KEY when using the default cloud profile
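
A preflight check for the list above could look like the following sketch. It is not part of the repo: `missing_settings` is a hypothetical helper, and the provider-to-key mapping is an assumption inferred from the variable names in this README (providers without an entry, such as `ollama`, are assumed to need no API key).

```python
import os
from typing import Mapping

# Variables this README lists as the minimum for startup.
REQUIRED_VARS = (
    "APP_SECRET_KEY",
    "QDRANT_URL",
    "QDRANT_API_KEY_RO",
    "VECTOR_COLLECTION_PREFIX",
)

# Assumed mapping from provider name to the variable holding its key.
PROVIDER_KEY_VARS = {
    "anthropic": "ANTHROPIC_API_KEY",
    "openai": "OPENAI_API_KEY",
    "groq": "GROQ_API_KEY",
    "together": "TOGETHER_API_KEY",
    "google": "GOOGLE_API_KEY",
}
PROVIDER_SELECTORS = ("CLASSIFIER_PROVIDER", "REASONING_PROVIDER", "EMBEDDING_PROVIDER")


def missing_settings(env: Mapping[str, str]) -> list[str]:
    """Return the names of required settings that are absent or blank."""
    required = list(REQUIRED_VARS)
    for selector in PROVIDER_SELECTORS:
        key_var = PROVIDER_KEY_VARS.get(env.get(selector, ""))
        if key_var and key_var not in required:
            required.append(key_var)
    return [name for name in required if not env.get(name, "").strip()]


if __name__ == "__main__":
    for name in missing_settings(os.environ):
        print(f"missing: {name}")
```

Taking a mapping instead of reading `os.environ` directly keeps the check easy to exercise with a plain dict.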

Use make down when you are done.

Database migrations run automatically during backend container startup via alembic upgrade head. Manual migration operations are Docker-backed through make targets only.

The backend app also owns the production LangGraph checkpointer lifecycle. During startup it creates one shared saver from DATABASE_URL, injects it into the singleton graph, and closes it on shutdown. In deployed environments, DATABASE_URL is therefore part of both the session-store and persistent checkpoint contract.
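
The lifecycle described above follows the usual async context manager shape. The sketch below uses only the standard library so it runs on its own: `FakeSaver` is a hypothetical stand-in for whatever PostgreSQL-backed LangGraph saver the app actually creates, and the `SimpleNamespace` object stands in for the FastAPI app. In a FastAPI application, a function of this shape is what gets passed as `FastAPI(lifespan=...)`.

```python
import asyncio
import os
from contextlib import asynccontextmanager
from types import SimpleNamespace
from typing import AsyncIterator


class FakeSaver:
    """Hypothetical stand-in for a PostgreSQL-backed checkpointer."""

    def __init__(self, dsn: str) -> None:
        self.dsn = dsn
        self.closed = False

    async def aclose(self) -> None:
        self.closed = True


@asynccontextmanager
async def lifespan(app: SimpleNamespace) -> AsyncIterator[None]:
    # Startup: build one saver for the whole process from DATABASE_URL.
    saver = FakeSaver(os.environ["DATABASE_URL"])
    app.state.checkpointer = saver   # the singleton graph would receive this saver
    try:
        yield                        # the app serves requests here
    finally:
        await saver.aclose()         # shutdown: close the saver exactly once


async def demo() -> FakeSaver:
    app = SimpleNamespace(state=SimpleNamespace())
    async with lifespan(app):
        assert not app.state.checkpointer.closed
    return app.state.checkpointer


if __name__ == "__main__":
    os.environ.setdefault("DATABASE_URL", "postgresql://example/db")
    print(asyncio.run(demo()).closed)
```

Reading `DATABASE_URL` with a plain key lookup makes a missing value fail at startup rather than on the first checkpoint write.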

Common development commands

make init           # build backend dev image, create .env from template, install git hook
make up             # start postgres + backend in the background
make down           # stop and remove local stack
make ci-down        # full cleanup for CI (volumes + images)
make logs           # follow backend + postgres logs
make shell          # open bash shell in the running backend container

make editor-up      # start only backend for editing/linting
make editor-down    # stop the editor container

make lint           # ruff check for app/ (report only)
make fix            # ruff check --fix + ruff format for app/ (auto-apply)
make format         # ruff format for app/
make typecheck      # mypy --strict for app/
make test           # pytest app/tests/
make test-coverage  # pytest + coverage XML/HTML + JUnit for app/
make pre-commit     # full hook suite (also enforced on git commit)
make check          # lint + typecheck + test-coverage (CI gate)

make db-upgrade                     # alembic upgrade head inside Docker
make db-downgrade DB_REVISION=-1    # alembic downgrade inside Docker
make db-current                     # show current alembic revision
make db-history                     # show migration history
make db-revision MESSAGE=add_index  # create a new empty revision
make lock                           # refresh uv.lock inside Docker

All supported developer workflows go through Docker. To run quality checks for a child repo (chain, imdbapi, rag_ingestion), use make from within that directory.

When you add a new migration, create it with make db-revision MESSAGE=..., edit the generated file under alembic/versions, then apply it with make db-upgrade.

VS Code

The committed .vscode/ config is split intentionally:

tasks.json runs make ... targets from the host workspace
settings.json assumes an attached backend container interpreter at /opt/venv/bin/python
launch.json provides backend app and app-test debug profiles
extensions.json recommends Dev Containers, Pylance, Ruff, and Coverage Gutters

Recommended workflow:

Run make editor-up (or make up) from the host workspace.
In VS Code, use Dev Containers: Attach to Running Container....
Attach to the backend service container (it will be running after make editor-up or make up).
Use the committed tasks for lifecycle and quality commands.
Use the Python Test Explorer for app/tests/ only in this iteration.

Editor intelligence is configured to resolve:

app/src
chain/src
chain/imdbapi/src

That gives backend developers navigation across the imported libraries without taking over the child repos' own debug/task surfaces prematurely.

For coverage visualization:

run make test-coverage or the backend: test with coverage VS Code task
open the generated app-coverage.xml with Coverage Gutters

Environment variables

The authoritative runtime contract lives in ../infrastructure/docs/provider-runtime-contract.md.

Variable	Used by	Secret source / notes
`WITH_PROVIDERS`	Docker build	Chain optional dependency bundle: `default-cloud`, `ollama-qdrant`, etc.
`APP_SECRET_KEY`	backend app	Azure Key Vault: `app-secret-key`
`DATABASE_URL`	backend app, chain saver	Azure Key Vault: `postgres-url`; required for persistent LangGraph checkpoints
`CLASSIFIER_PROVIDER`	app via chain	`anthropic`, `openai`, `groq`, `together`, `ollama`, or `google`
`CLASSIFIER_MODEL`	app via chain	Lightweight classifier/confirmation model
`REASONING_PROVIDER`	app via chain	`anthropic`, `openai`, `groq`, `together`, `ollama`, or `google`
`REASONING_MODEL`	app via chain	Reasoning and Q&A model
`EMBEDDING_PROVIDER`	app via chain	`openai`, `ollama`, `sentence-transformers`, or `huggingface`
`EMBEDDING_MODEL`	app via chain	Must match the model used by RAG ingestion
`EMBEDDING_DIMENSION`	app via chain	Must match the vector dimension written by RAG ingestion
`VECTOR_STORE`	app via chain	`qdrant`, `chromadb`, `pinecone`, or `pgvector`
`VECTOR_COLLECTION_PREFIX`	app via chain, RAG	Final target is `{prefix}_{sanitized_model}_{dimension}`
`QDRANT_URL`	app via chain	Azure Key Vault / Jenkins: `qdrant-url`
`QDRANT_API_KEY_RO`	app via chain	Azure Key Vault / Jenkins: `qdrant-api-key-ro`
`ANTHROPIC_API_KEY`	selected provider	Azure Key Vault: `anthropic-api-key`
`OPENAI_API_KEY`	selected provider	Azure Key Vault: `openai-api-key`
`GROQ_API_KEY`	selected provider	Azure Key Vault: `groq-api-key`
`TOGETHER_API_KEY`	selected provider	Azure Key Vault: `together-api-key`
`GOOGLE_API_KEY`	selected provider	Azure Key Vault: `google-api-key`
`OLLAMA_BASE_URL`	selected provider	Docker-reachable Ollama endpoint
`LANGSMITH_API_KEY`	optional tracing	Azure Key Vault: `langsmith-api-key`
`CORS_ORIGINS`	backend app	JSON array of allowed browser origins; never use `"*"` in production
`GLOBAL_RATE_LIMIT`	backend app	SlowAPI fallback limit for all routes
`AUTH_RATE_LIMIT`	backend app	SlowAPI limit for login/token route
`CHAT_RATE_LIMIT`	backend app	SlowAPI limit for `/chat` requests
`MAX_MESSAGE_LENGTH`	backend app	Max accepted user message length before FastAPI returns 422
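
The `VECTOR_COLLECTION_PREFIX` row gives the target collection as `{prefix}_{sanitized_model}_{dimension}`. A minimal sketch of that naming scheme follows. The sanitization rule is an assumption (the table does not define it), and `collection_name` is a hypothetical helper, so check the chain and RAG ingestion code before relying on the exact output.

```python
import re


def collection_name(prefix: str, model: str, dimension: int) -> str:
    """Build `{prefix}_{sanitized_model}_{dimension}`.

    Assumed sanitization: lowercase the model name and collapse every run of
    non-alphanumeric characters into a single underscore.
    """
    sanitized = re.sub(r"[^a-z0-9]+", "_", model.lower()).strip("_")
    return f"{prefix}_{sanitized}_{dimension}"


if __name__ == "__main__":
    # "movies" is an illustrative prefix, not a value taken from the repo.
    print(collection_name("movies", "text-embedding-3-large", 3072))
```

Because the name embeds both the model and the dimension, the app and RAG ingestion only resolve to the same collection when `EMBEDDING_MODEL` and `EMBEDDING_DIMENSION` match on both sides.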

The backend runtime image installs only the provider SDK bundle selected by WITH_PROVIDERS at build time. Runtime env vars must select providers covered by that bundle; otherwise the chain fails fast with a missing optional dependency error.
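
The build-time and runtime halves of that contract can be compared before startup. The sketch below is illustrative only: the bundle contents are assumptions (this README names the `default-cloud` and `ollama-qdrant` bundles but does not list what each installs), and `uncovered_providers` is a hypothetical helper.

```python
from typing import Mapping

# Assumed bundle contents; the real mapping is defined by the chain's
# optional dependency groups.
BUNDLE_PROVIDERS = {
    "default-cloud": {"anthropic", "openai"},
    "ollama-qdrant": {"ollama"},
}
SELECTORS = ("CLASSIFIER_PROVIDER", "REASONING_PROVIDER", "EMBEDDING_PROVIDER")


def uncovered_providers(bundle: str, env: Mapping[str, str]) -> dict[str, str]:
    """Return selector -> provider for every selection the bundle lacks."""
    covered = BUNDLE_PROVIDERS.get(bundle, set())
    return {
        selector: env[selector]
        for selector in SELECTORS
        if selector in env and env[selector] not in covered
    }
```

An empty result means every selected provider is covered by the image; anything else names the variable that would trigger the missing-dependency failure.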

Databases and health

The backend uses two kinds of data store:

Store	Technology	Purpose
Vector store	selected by `VECTOR_STORE`	semantic movie search
Relational DB	PostgreSQL 16	users, sessions, chat history

The backend image contains only the provider extras selected by WITH_PROVIDERS. The deployed environment must provide the matching vector-store settings from the canonical runtime contract.

The backend app exposes:

Path	Purpose
`/health/live`	container liveness probe
`/health/ready`	database readiness probe
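
The split between the two probes can be sketched as two small functions. This is not the app's handler code: the status codes, the response payloads, and the `ping_database` callable are assumptions chosen to illustrate a liveness check that touches no dependency and a readiness check that depends on the database.

```python
from typing import Callable


def live() -> tuple[int, dict[str, str]]:
    """Liveness: the process is up; no dependency is touched."""
    return 200, {"status": "ok"}


def ready(ping_database: Callable[[], None]) -> tuple[int, dict[str, str]]:
    """Readiness: report 503 until the database answers a trivial query."""
    try:
        ping_database()
    except Exception as exc:  # any driver error means "not ready"
        return 503, {"status": "unavailable", "reason": type(exc).__name__}
    return 200, {"status": "ok"}
```

Keeping the database out of the liveness path means an outage in PostgreSQL takes the container out of rotation without triggering a restart loop.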

Local tests use a dedicated movie_finder_test database created by docker/postgres/init/01-create-test-database.sql.

Working on a specific subproject

Area	Start here
App / backend integration	app/README.md → CONTRIBUTING.md
LangGraph chain	chain/README.md → chain/CONTRIBUTING.md
IMDb API client	chain/imdbapi/README.md → chain/imdbapi/CONTRIBUTING.md
RAG ingestion	rag_ingestion/README.md → rag_ingestion/CONTRIBUTING.md

CI/CD

The backend pipeline lives in Jenkinsfile.

Pipeline stages:

Check out the backend workspace with submodules
Lint and type-check the backend app slice
Run backend app tests against a local PostgreSQL sidecar, enforcing 100% line and branch coverage
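
A coverage gate like the one in the last stage can be checked from the XML report. The sketch below assumes the report is the Cobertura-style XML that coverage.py writes, where the root element carries `line-rate` and `branch-rate` attributes. How the Jenkinsfile actually enforces the threshold is not shown in this README, and `meets_gate` is a hypothetical helper.

```python
import xml.etree.ElementTree as ET


def coverage_rates(xml_text: str) -> tuple[float, float]:
    """Return (line_rate, branch_rate) from a Cobertura-style report."""
    root = ET.fromstring(xml_text)
    return float(root.get("line-rate", "0")), float(root.get("branch-rate", "0"))


def meets_gate(xml_text: str, required: float = 1.0) -> bool:
    """True when both line and branch coverage reach the required rate."""
    line_rate, branch_rate = coverage_rates(xml_text)
    return line_rate >= required and branch_rate >= required


if __name__ == "__main__":
    # app-coverage.xml is the report name used by make test-coverage.
    with open("app-coverage.xml", encoding="utf-8") as handle:
        print(meets_gate(handle.read()))
```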

The parent movie-finder pipeline owns image build and Azure deployment. See INTEGRATION.md for the cross-repo runtime contract.