8.0 Learning Checklist: LLM Application Development and RAG

Use this page as a printable checklist. If you need the full explanation, return to the Chapter 8 entry page.

RAG portfolio evidence pack

Two-Hour First Pass

Time box	Do this	Stop when you can say
20 min	Read the RAG application loop on the entry page	”A RAG answer should be tied to retrieved evidence.”
25 min	Run the tiny RAG script	”I can inspect top-k chunks before trusting the answer.”
25 min	Skim 8.1 RAG basics and document processing	”Chunk size, overlap, and metadata affect retrieval and citations.”
25 min	Skim 8.3 API practice and tool/function calling	”An LLM app needs request, response, error, and retry paths.”
25 min	Read the debugging ladder	”I can separate document, retrieval, generation, citation, and ops failures.”

Required Evidence

Evidence	Minimum version
`chunks.jsonl`	5-10 chunks with `id`, `source`, `text`, and `version`
`retrieval_logs.jsonl`	query, top-k chunk IDs, score, and source for each test question
`eval_questions.csv`	at least 10 fixed questions with expected source or answer points
`failure_cases.md`	at least three failures labeled as document, chunking, retrieval, generation, citation, or deploy
`rag_config.md`	chunk size, overlap, top-k, rerank choice, prompt version
`context_strategy.md`	long context, RAG, memory, or hybrid decision with one rejected alternative
`rag_app_workshop_output.txt`	output from 8.5.6 Hands-on: Full Chapter 8 RAG App Workshop
`README.md`	run command, sample question, cited answer, evaluation result, next fix

Evidence to Keep

Keep this page’s proof of learning as a small evidence card:

Rag Eval Set: fixed questions with expected evidence
Retrieval Trace: query, chunks, scores, selected evidence
Answer Trace: cited answer and unsupported claim check
App Trace: request, response, validation, logs
Project Readme: run command, metrics, failures, next action

Quality Gates

Gate	Pass condition
Citation	Every factual answer cites a chunk, source, and version.
Empty retrieval	System refuses to answer when evidence is missing.
Regression eval	Same questions run before and after each chunking, retrieval, reranking, or prompt change.
Operations	Logs include query, top-k, prompt version, latency, token cost, and failure label.

Expected result: your Chapter 8 project folder contains chunks, retrieval logs, fixed eval questions, cited answers, failure labels, app logs, and a README that explains the next retrieval or generation fix.

Exit Questions

Can you explain why RAG is different from asking a longer Prompt?
Can you show which document chunks were retrieved for a question?
Can you explain why chunk metadata is necessary for citations and debugging?
Can you handle empty retrieval with a no-answer response instead of a guess?
Can you compare two RAG versions using the same evaluation questions?
Can you explain when long context is simpler than RAG, and when RAG or memory is still required?

Check reasoning and explanation

RAG is different because it retrieves evidence before answering. A longer Prompt still depends on what the model already knows or guesses, while RAG can pull fresh, private, or document-based facts into the answer.
Show the query, top-k chunks, scores, source, and version so someone else can check the retrieval path.
Metadata keeps the source, version, and location attached to each chunk, which makes citation, debugging, and regression analysis possible.
When retrieval is empty, the safest response is a no-answer or needs-more-information reply, not a guess.
Use the same evaluation questions before and after retrieval, chunking, or reranking changes so the comparison stays fair.

If the answer is yes, move to Chapter 9. Chapter 9 will upgrade the system from answer generation to Agents that can plan, call tools, and recover from failures.

Evidence to Keep

Keep this page’s proof of learning as a small evidence card:

Rag Eval Set: fixed questions with expected evidence
Retrieval Trace: query, chunks, scores, selected evidence
Answer Trace: cited answer and unsupported claim check
App Trace: request, response, validation, logs
Project Readme: run command, metrics, failures, next action