Periagoge

Cross-Repository Deduplication in Family Trees

Family trees on different websites often contain the same people, copied from one tree to another without you realizing it. This creates false confidence: you think you've found independent sources when you're really seeing one researcher's work copied twice. Deduplication helps ensure your evidence actually comes from different places.

Hypatia
Why It Matters

Cross-repository deduplication is the AI-driven process of identifying when the same ancestor appears as separate entries across multiple genealogy databases, family trees, or document collections, then merging those records or flagging them for researcher review. Without this process, the same great-grandmother can exist dozens of times, with names, dates, and details that differ slightly and sometimes contradict each other.

AI makes deduplication scalable by using probabilistic matching, which catches duplicates even when names are spelled differently or dates are off by a year. This gives researchers a cleaner, more reliable foundation for building accurate family histories.
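
To make the idea of probabilistic matching concrete, here is a minimal sketch in Python. It is a simplified stand-in, not the method any particular genealogy site uses: the `PersonRecord` type, the `match_score` and `flag_duplicates` functions, the 0.7/0.3 weights, the 0.85 threshold, and the sample source names are all assumptions made up for illustration. Name similarity comes from the standard library's `difflib.SequenceMatcher`, and birth years count as agreeing when they are within one year of each other.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import List, Tuple


@dataclass(frozen=True)
class PersonRecord:
    """One person entry from one source (hypothetical structure)."""
    source: str
    name: str
    birth_year: int


def name_similarity(a: str, b: str) -> float:
    """Return a 0..1 similarity between two names, ignoring case."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def match_score(a: PersonRecord, b: PersonRecord, year_tolerance: int = 1) -> float:
    """Combine name and birth-year agreement into one score.

    The weights are illustrative assumptions, not tuned values.
    """
    name_score = name_similarity(a.name, b.name)
    year_gap = abs(a.birth_year - b.birth_year)
    year_score = 1.0 if year_gap <= year_tolerance else 0.0
    return 0.7 * name_score + 0.3 * year_score


def flag_duplicates(
    records: List[PersonRecord], threshold: float = 0.85
) -> List[Tuple[PersonRecord, PersonRecord]]:
    """Return cross-source pairs that look like the same person.

    Pairs are flagged for review, not merged automatically.
    """
    flagged = []
    for i, a in enumerate(records):
        for b in records[i + 1:]:
            # Only compare entries that come from different sources.
            if a.source != b.source and match_score(a, b) >= threshold:
                flagged.append((a, b))
    return flagged


if __name__ == "__main__":
    sample = [
        PersonRecord("TreeSiteA", "Catherine O'Brien", 1872),
        PersonRecord("TreeSiteB", "Katherine OBrien", 1873),
        PersonRecord("TreeSiteB", "Thomas Walsh", 1901),
    ]
    for a, b in flag_duplicates(sample):
        print(f"Possible duplicate: {a.name} ({a.source}) / {b.name} ({b.source})")
```

With these weights, a pair whose birth years disagree by more than the tolerance can never reach the threshold, so the year acts as a hard filter. Comparing every pair is also only practical for small collections. A real system would likely weigh more fields, such as places, parents, and spouses, and would narrow down candidate pairs before scoring them.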
