When you need AI to know specific patient information reliably, you have two paths: fine-tuning (training the model on your data) or retrieval (storing information separately and feeding it in as needed), and each has real tradeoffs in cost, flexibility, and accuracy. Fine-tuning works well for stable, private patient knowledge; retrieval works better when information changes often or you need to update it without retraining.
You have two main strategies for making an AI "know" about your specific patient: fine-tuning (training the model on your patient's data) and retrieval (storing patient data separately and feeding it to the AI when needed). They're fundamentally different architectures with different costs, speeds, and flexibility profiles.
Fine-tuning creates a new version of an AI model customized to your patient's clinical patterns. You provide hundreds of past interactions—appointment notes, responses to treatments, symptom logs—and the model learns the statistical patterns embedded in that history. After training, the model "knows" your patient's baseline, typical responses to interventions, and contextual details without you having to feed them in every time.
Advantages: Once trained, queries are fast and cheap—the model has internalized the knowledge. No need to load context every request. Useful if you're making dozens of queries about the same patient and want consistency. The model develops a coherent understanding of cause-and-effect patterns in that patient's medical history.
Disadvantages: Expensive upfront ($100-500 per fine-tuned model depending on data volume and provider). Inflexible—if patient data changes (new medications, new diagnoses), you need to retrain. Fine-tuning works best for stable, historical patterns, not dynamic real-time information. There's also a real risk: the model can overfit, learning quirks in your specific data that don't generalize. Medical decisions based on overfitted patterns are dangerous.
Retrieval stores patient data in a searchable database (vector database, semantic search, or simple keyword indexing). When you ask the AI a question, it first retrieves the relevant documents, then answers based on what it found. It's like giving the AI access to a library but asking it to look up the right books for each question.
Advantages: Flexible and cheap to maintain. New patient data flows in automatically—no retraining. You can update patient information in real-time and the AI sees it immediately. Safer for medical decisions because the model isn't making inferences from statistical patterns; it's citing actual documents. Scales easily to multiple patients. Cost is low per query.
Disadvantages: Retrieval quality depends on how well your search function works. If your database returns irrelevant documents, the AI's answer suffers. Requires good documentation structure—if your notes are messy or unsearchable, retrieval fails. Not ideal if you need the AI to synthesize complex patterns across hundreds of subtle data points; retrieval tends to surface recent/obvious matches.
Use fine-tuning if: You're building a specialized clinical decision-support tool for one patient with years of stable data, you have resources for retraining when major changes occur, and inference speed is critical (you're querying dozens of times per day). Example: a caregiver managing a chronically ill parent's medication regimen over five years, querying daily about symptom correlations.
Use retrieval if: You're coordinating care across multiple patients or caregivers, data changes frequently, you need to cite sources (the AI should show you which documents it's basing answers on), or you want a lightweight, maintainable system. Most caregiving scenarios fall here. Example: a care coordinator pulling relevant notes before each team meeting, or a family managing a newly diagnosed illness where treatment protocols evolve weekly.
Many production systems use both: fine-tune a model on high-confidence stable patterns (e.g., "this patient always tolerates this medication well"), then use retrieval for current data (recent labs, current symptoms). The fine-tuned model provides intuition; retrieval provides facts.
Try this: Map your caregiving workflow. Are you querying the same patient repeatedly with consistent data? Consider fine-tuning. Are you juggling multiple patients, updating info constantly, or need the AI to cite its sources? Use retrieval. Start with retrieval—it's simpler, safer, and more maintainable. Only invest in fine-tuning if you've identified a specific bottleneck that repeated querying can't solve.
Peri can explain this concept, give practical examples, help you decide whether it applies to your situation, or recommend a journey if appropriate.
Explore related journeys or tell Peri what you're working through.