Friday, February 18, 2011

Committed to seamless genealogy file sharing

RootsTech 2011 provided a great deal of answers, but opened up many more questions. Sharing genealogy data is a top concern for Ol' Myrt, and it looks like FamilySearch is considering updating its generic genealogy file system currently known as GEDCOM.
BetterGEDCOM (a grassroots effort) and OpenGen (a top-down developer model with a genealogy database website component) are two entities also in the conversation about improving file sharing. For a listing of GEDCOM Alternatives see the comprehensive list by Tamura Jones.

Hopefully, conversations will continue when it comes to the concept of sharing genealogy database files between researchers, much like this model:

End-user to end-user
Where Researcher A exports a GEDCOM file from his genealogy management program and then directly shares that file with Researcher B via flash drive, CD or some other means. Researcher B opens his genealogy management program and then imports the GEDCOM file.

At  RootsTech, FamilySearch mentioned SoRD as a method of file sharing, but if Ol' Myrt's understanding is correct, that will involve a third party, much like this model:
End-User to website to End-User
Where Researcher A uploads his GEDCOM file to a genealogy website, where Researcher B  downloads from website and imports the GEDCOM file.

Call me crazy, but Ol' Myrt here doesn't like the idea of a website being part of the equation unless it is a disinterested third party like Dropbox, where two researchers agree to share a folder (instead of snail mailing a flash drive) with a generic file that can read by either researcher's genealogy management program seamlessly.

Allowing big-guy genealogy websites like or even to broker data file transfers between individual researchers can prove problematic on at least these levels:
  • How can the end-user determine his data isn't altered perhaps by masks and filters by the website?
  • How can the end user determine his data isn't linked and/or merged by the website to other user-contributed databases?
  • How can the end user ensure that his data isn't retained by the website, when he chooses to take down the file? One might want to do this and upload a more up-to-date file, or completely remove the file with no further interaction with the website.

Generic genealogy database files are currently called GEDCOM files. In his posting File Sharing? GEDCOM or what? James Tanner explains "It seems like whomever you need to share information with never seems to have the same program you use. There was a time when Personal Ancestral File (PAF) was pervasive. But with the de-emphasis on the program, there are dozens of different options. Your first door of opportunity in sharing files is to see what formats are supported by your own program."

Unfortunately each genealogy program out there has decided to pick and choose which GEDCOM file parameters it will to embrace. The less-than-marvelous results have been described by genealogy bloggers. It's particularly disheartening that of Legacy Family Tree, Family Tree Maker and RootsMagic only the later can seamlessly export and import it's own GEDCOM file without data loss. See:

While The LDS Church owns the "GEDCOM" name, it hasn't supported the GEDCOM file during the last 14 years. See BetterGEDCOM Wikispaces' History of GEDCOM. According to other RootsTech attendees, FamilySearch says it's GEDCOM revision will begin with the GenTech data model. BetterGEDCOM developers have considered this and a variety of other data models on the BetterGEDCOM Wiki, including:

And the trick is?
Convincing genealogy software producers to adhere to a single "new" file sharing format will be problematic, particularly since they don't currently agree by using all existing GEDCOM specifications. IF they would work this out together, there would be no need for reforms such being explored by BetterGEDCOM. IF end users bring pressure to bear once a good GEDCOM alternative has been developed, THEN perhaps software producers will adopt the plan.


Some folks are pleased FamilySearch is weighing in on improving file sharing, thinking that high-profile organization will galvanize support from software producers. Others question FamilySearch commitment based on its history of dropping PAF and ignoring GEDCOM for 14 years. (I don't think FS dropped PAF per se, but rather decided to focus on things they do best like digitizing records.) My commitment is to support the development of seamless end-user to end-user file sharing capability, without requiring a genealogy website to be part of the equation.


Happy family tree climbing!
Myrt     :)
Your friend in genealogy.