Keyword search requires you to guess the exact words a database used; semantic search understands meaning, so searching for "difficulty breathing" finds documents about dyspnea or respiratory distress even if those exact words weren't in your query.
When you search a medical database for "joint pain," you get articles specifically mentioning those words. When a semantic search system processes "joint pain," it finds articles about arthritis, osteoarthritis, rheumatoid conditions, and inflammatory joint disease—even if those exact words don't appear. This difference fundamentally changes research quality.
Semantic search works by converting text into meaning-space rather than word-space. Your query and each document get encoded as vectors (lists of numbers) that capture semantic meaning. Documents "close" to your query in meaning-space get retrieved, regardless of surface-level word overlap. This is why Perplexity and Consensus often find more relevant studies than traditional PubMed keyword searches.
Modern semantic search uses embedding models—neural networks trained to convert text into vectors where similar concepts are numerically close. An embedding model might place vectors for "myocardial infarction," "heart attack," and "acute coronary syndrome" very near each other, even though they're different phrases. A keyword search would treat them as distinct.
The system computes similarity as cosine distance between vectors. Your query vector is compared to thousands or millions of document vectors. Documents with highest similarity scores bubble to the top of results. This happens across entire databases in seconds.
Medical terminology is fragmented. Doctors use "MI" or "acute myocardial infarction" or "heart attack." Different fields use different names for the same condition (psychiatry calls it "major depressive disorder"; primary care says "depression"). Semantic search connects these linguistic variations to the underlying concept. A keyword search for "depression" misses papers using "depressive syndrome" or specific diagnostic codes. Semantic search captures them.
This is especially valuable for rare conditions with inconsistent naming, emerging conditions still being named, or treatments referenced by brand name, generic name, and colloquial terms. If you're researching a rare autoimmune condition, semantic search finds papers discussing it under different nomenclature systems.
Semantic search also handles concept combinations better than keyword matching. Searching for "kidney disease in diabetics" as separate keywords gives you papers about kidney disease and papers about diabetes—not specifically the interaction. Semantic search understands you're asking about that specific intersection.
Semantic search requires good embedding models. Models trained on general text don't capture domain-specific medical concepts as precisely as models trained on medical literature. Some medical-specific embedding models perform better than general ones, but not all RAG systems use them.
Semantic search can also miss precision. If you need research on a very specific genotype or rare variant, semantic search might return papers on the general gene without the specific mutation you're researching. Keyword search, ironically, would handle the specific variant name better.
There's also a cold-start problem with very recent discoveries. If a paper was published yesterday, it hasn't been embedded and indexed in semantic search databases yet. Keyword search on cutting-edge databases might catch it faster.
Use semantic search (Consensus, Perplexity) for broad concept exploration, learning what research exists about a condition, and discovering related conditions or treatments you hadn't considered. Use keyword search (direct PubMed access) when you have specific, precise search terms and want high recall (finding everything) rather than high precision (finding most relevant).
The best approach chains both: start with semantic search to understand the research landscape and identify key terms, then use keyword search with those precise terms to ensure you haven't missed specific subpopulations or recent variants.
Try this: Search Consensus (semantic) and PubMed (keyword) for the same medical question—try something like "autoimmune thyroid disease and pregnancy" or "statin side effects in women." Notice how Consensus pulls more varied terminology and conceptually related papers, while PubMed returns articles with those exact phrase combinations. Save useful papers from both and notice where each excels.
Peri can explain this concept, give practical examples, help you decide whether it applies to your situation, or recommend a journey if appropriate.
Explore related journeys or tell Peri what you're working through.