Skip to content

7.0 Learning Checklist: LLM Principles, Prompt, and Fine-tuning

Use this page as a printable checklist. If you need the full explanation, return to the Chapter 7 entry page.

LLM study guide evolution path

Time boxDo thisStop when you can say
20 minRead the token-to-answer picture on the entry page”Text becomes tokens, vectors, context, then next-token prediction.”
25 minSkim 7.1 and run one tokenizer example”Token count affects cost and context limits.”
25 minSkim 7.2 and the LLM history page”Scale, data, Transformer, and alignment changed what models can do.”
30 minRun the prompt testing script from the entry page”I can compare prompt versions with fixed cases.”
20 minRead the solution-choice table”I should not fine-tune before checking Prompt, RAG, tools, and validation.”
EvidenceMinimum version
prompts/Three prompt versions for one task
prompt_eval_cases.csvAt least five fixed inputs and a simple score column
structured_output_schema.jsonRequired fields and allowed value types
failure_cases.mdAt least three failed outputs and the likely cause
gpu_train_log.txtdevice: cuda training log from 7.4.5 Rent a GPU and Train a Hand-Built GPT-2
llm_stage_workshop_output.txtOutput from 7.8.4 Hands-on: Full Chapter 7 Workshop
README.mdHow to run, what passed, what failed, what to try next

Keep this page’s proof of learning as a small evidence card:

Prompt Versions
at least three versions for one task
Eval Cases
fixed inputs with scores and failure notes
Schema Check
structured output is parsed and validated
Method Choice
Prompt/RAG/fine-tuning/tools decision is written down
Gpt2 Record
mini GPT-2 GPU training log, environment info, and sample output
Exit Proof
workshop output plus README notes
GatePass condition
Prompt comparisonSame cases, one changed variable, saved outputs and scores.
Structured outputParser rejects missing fields or wrong types.
Failure analysisEach failure has a likely cause: instruction, input, schema, missing knowledge, or safety.
Method choiceDecision table explains why Prompt, RAG, fine-tuning, tools, or Agent comes first.
Hand-built GPT-2Run mini GPT-2 on a CUDA GPU and identify embedding, attention, loss, and generate in the code.

Expected result: your Chapter 7 folder contains prompt versions, fixed eval cases, parser/schema checks, failure notes, a device: cuda mini GPT-2 training log, workshop output, and a README that explains the method choice.

  • Can you explain token, embedding, attention, context window, pretraining, Prompt, fine-tuning, and alignment without copying definitions?
  • Can you change one prompt variable at a time and compare results with the same input cases?
  • Can you validate JSON output instead of trusting text that only looks like JSON?
  • Can you explain when missing information calls for RAG instead of a longer Prompt?
  • Can you explain when repeated behavior adaptation might justify fine-tuning?
  • Can you open or rent a GPU notebook, run mini GPT-2 with device: cuda, and save loss plus generated text?
Check reasoning and explanation
  1. Treat each term as part of one flow: token and embedding are the representation layer, attention routes context, the context window limits what can be seen at once, pretraining builds the base model, Prompt steers the run, fine-tuning changes behavior with data, and alignment keeps outputs useful and safe.
  2. Keep the same cases, change only one prompt variable, and save both the outputs and the score so the comparison is reproducible instead of anecdotal.
  3. Use a schema or parser to validate structure, required fields, and types. If parsing fails, reject the output instead of reading it as if it were correct.
  4. Use RAG when the answer depends on fresh, private, or citable facts from documents rather than what the model may remember.
  5. Fine-tuning becomes worth considering when the same behavior keeps showing up across many high-quality examples and Prompt plus validation still is not enough.

If the answer is yes, move to Chapter 8. Chapter 8 will connect these ideas to real LLM applications and RAG systems.

Keep this page’s proof of learning as a small evidence card:

Prompt Versions
at least three versions for one task
Eval Cases
fixed inputs with scores and failure notes
Schema Check
structured output is parsed and validated
Method Choice
Prompt/RAG/fine-tuning/tools decision is written down
Gpt2 Record
mini GPT-2 GPU training log, environment info, and sample output
Exit Proof
workshop output plus README notes