Family trees on different websites often contain the same people, copied from one tree to another, without your realizing it. That creates false confidence that you've found independent sources when you're really just seeing one researcher's work repeated. Deduplication helps ensure that your evidence actually comes from different places.
Cross-repository deduplication is the AI-driven process of identifying when the same ancestor appears as separate, duplicate entries across multiple genealogy databases, family trees, or document collections, then merging or flagging those records for researcher review. Without this process, the same great-grandmother can exist dozens of times with slightly different names, dates, and details that contradict each other.
AI makes deduplication scalable by using probabilistic matching to catch duplicates even when names are spelled differently or dates are off by a year, giving researchers a cleaner, more reliable foundation for building accurate family histories.
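To make the idea of probabilistic matching concrete, here is a minimal sketch in Python. It is not how any particular genealogy site works: the `PersonRecord` fields, the 0.7/0.3 weights, the one-year tolerance, and the 0.8 threshold are all illustrative assumptions, and the standard library's `difflib.SequenceMatcher` stands in for whatever name-matching model a real system would use.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass(frozen=True)
class PersonRecord:
    """Hypothetical record shape; real databases carry many more fields."""
    source: str       # which tree or database the entry came from
    name: str
    birth_year: int


def name_similarity(a: str, b: str) -> float:
    """Return a 0..1 similarity so spelling variants still score high."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def year_similarity(a: int, b: int, tolerance: int = 1) -> float:
    """Exact year scores 1.0, within tolerance scores 0.8, otherwise 0.0."""
    gap = abs(a - b)
    if gap == 0:
        return 1.0
    if gap <= tolerance:
        return 0.8  # assumed partial credit for an off-by-one year
    return 0.0


def match_score(left: PersonRecord, right: PersonRecord) -> float:
    """Combine the evidence into one score (weights are assumptions)."""
    return (0.7 * name_similarity(left.name, right.name)
            + 0.3 * year_similarity(left.birth_year, right.birth_year))


def find_candidate_duplicates(records, threshold: float = 0.8):
    """Return cross-source pairs that score at or above the threshold."""
    candidates = []
    for i, left in enumerate(records):
        for right in records[i + 1:]:
            if left.source == right.source:
                continue  # only compare entries from different repositories
            score = match_score(left, right)
            if score >= threshold:
                candidates.append((left, right, score))
    return candidates


if __name__ == "__main__":
    records = [
        PersonRecord("tree_a", "Catherine O'Brien", 1872),
        PersonRecord("tree_b", "Katherine OBrien", 1873),
        PersonRecord("tree_b", "Margaret Doyle", 1872),
    ]
    for left, right, score in find_candidate_duplicates(records):
        print(f"{left.name} ({left.source}) ~ {right.name} ({right.source}): {score:.2f}")
```

In this sketch the two O'Brien entries are flagged as a likely duplicate despite the different spelling and the one-year gap, while Margaret Doyle is not, even though her birth year matches exactly. The pairs are returned for review rather than merged automatically, which mirrors the "merging or flagging" choice described above.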