Periagoge
Concept
3 min readself knowledge

Fine-tuning vs. In-Context Learning for Military Service Record AI

Fine-tuning rewrites an AI's core understanding through intensive training on VA data, while in-context learning feeds the AI your specific documents and asks it to reason about them right now. For military records, in-context learning is often safer because it works only with what you provide, reducing the risk of the AI inserting false information it thinks it learned elsewhere.

Hypatia
Why It Matters

Fine-tuning and in-context learning are two fundamentally different ways to teach an AI model new information. They represent a trade-off between permanence and flexibility, cost and iteration speed—and the right choice depends entirely on your scale and use case.

In-Context Learning (The Default Approach)

In-context learning means providing examples, instructions, and reference documents within a single conversation with the AI—what most veterans already do when they paste a VA form into ChatGPT. The AI learns from the context you provide in that moment only. Once the conversation ends, it forgets. Each new conversation starts fresh.

For military families, this is the practical standard. You upload your decision letter, ask the AI to explain it, and it learns your case in that context. You close the conversation; the AI retains nothing. This has major advantages: (1) Privacy—your documents never train the model permanently; (2) Flexibility—you can change approach between conversations; (3) Zero setup cost—no training required, just chat; (4) Immediate iteration—refine your prompts and see results in seconds.

The downside: you must provide context every time. If you're working with a VSO analyzing 50 cases, explaining "I want appeals formatted using OSCINT structure with emphasis on medical nexus arguments" fifty times is inefficient. Each conversation starts without institutional knowledge.

Fine-Tuning (The Permanent Approach)

Fine-tuning modifies the underlying model weights—essentially retraining it on your specific data. You provide 100–1,000 examples of (input, desired output) pairs. The model learns to recognize patterns in your data and adjusts its internal parameters accordingly. Those changes persist. Every future conversation uses your tuned model.

For military contexts, fine-tuning makes sense if you: (1) Manage a large organization processing dozens of cases with consistent formatting and legal strategy; (2) Have domain expertise you want baked in permanently (a VSO might fine-tune on their successful appeal precedents); (3) Need faster inference—fine-tuned models often respond quicker because they've internalized your domain, requiring less prompting.

The trade-offs are substantial. Fine-tuning requires a significant upfront data preparation effort. You need 100+ high-quality examples of appeals you've written, formatted consistently. You need to define exactly what success looks like. You pay for the fine-tuning job itself (typically $25–100+ depending on data size and model). If your understanding of VA procedures evolves, you must retune. And there's knowledge drift risk: a fine-tuned model locked in on yesterday's best practices doesn't automatically benefit from new VA guidance released next month.

Hybrid Approaches for Military Organizations

Sophisticated VSO workflows combine both. They fine-tune a Claude or GPT model on 200 successful precedent appeals, establishing baseline organizational style and legal reasoning. But for individual case analysis, they use in-context learning—uploading the specific veteran's documents and asking the tuned model to apply its training to the unique facts. This balances institutional consistency with case-specific flexibility.

Another hybrid: use in-context learning for analysis and drafting, but systematize your best prompts into a fine-tuning dataset over time. Each veteran case that succeeds becomes a training example. After 50 successful cases, you have enough data to fine-tune with confidence—your model now embeds winning patterns rather than requiring them as prompts.

The Token Economy Intersection

Fine-tuning also reduces token costs for heavy users. If you're loading the same 10,000-token appeals guide into every conversation with ChatGPT, fine-tuning it saves those tokens across hundreds of conversations. For individual veterans? Not worth it. For a national VSO processing 1,000+ appeals annually? The ROI is clear.

Try this: For the next week, use in-context learning. Paste your VA decision guidance and appeal strategy into each ChatGPT conversation as context. Track how often you're repeating the same instructions. If you notice you're repeating the same system prompt five+ times, you've found your fine-tuning candidate. Save those prompts in a document—if you scale operations, that becomes your training data.

Helpful guides
Hypatia
Daily Life & Decisions
Related Concepts
Peri
Questions about Fine-tuning vs. In-Context Learning for Military Service Record AI?

Peri can explain this concept, give practical examples, help you decide whether it applies to your situation, or recommend a journey if appropriate.

Ready to work on Fine-tuning vs. In-Context Learning for Military Service Record AI?

Explore related journeys or tell Peri what you're working through.