OCR confidence scores tell you how reliably software converted scanned text from old documents, and understanding which words the software struggled with helps you catch transcription errors before they corrupt your research. Low confidence scores on names, dates, or locations warrant double-checking against the original image.
When AI transcribes historical documents, it assigns a confidence score to each word or character, indicating how certain it is about the reading based on image quality, handwriting clarity, and training data. Low confidence scores flag words the AI is guessing at, which in genealogy often correspond to critical details like surnames, place names, and dates.
Understanding confidence scores helps family historians know exactly where to focus their manual review efforts, preventing errors from silently entering their research and ensuring that uncertain transcriptions do not get mistaken for confirmed facts in the family record.
Peri can explain this concept, give practical examples, help you decide whether it applies to your situation, or recommend a journey if appropriate.
Explore related journeys or tell Peri what you're working through.