CODA vs Data Scrubbing

Some discussions just don't fit into a well defined box. Use this forum to discuss general topics and issues revolving around the Church and the technology offerings we use and share.
Post Reply
ctrpapa
New Member
Posts: 3
Joined: Mon Oct 18, 2010 12:50 pm
Location: USA

CODA vs Data Scrubbing

#1

Post by ctrpapa »

While working at Microsoft I helped write a script to scrub all personally identifiable info from strings, phone numbers, credit info, etc.

This wasn't scrambled or encrypted but actually replaced strings with whatever was desired and in no way related to the original.

The database was at least 50GB but we were still able to make it rewrite all the info to another server minus anything that would be personally identifiable in a reasonable amount of time (less than a day)

Without knowing much about the data behind the live lds.org would it be possible to do something similar instead of using CODA?
User avatar
TimRiker
Church Employee
Church Employee
Posts: 432
Joined: Sun Dec 19, 2010 5:16 pm
Location: USA, Utah
Contact:

#2

Post by TimRiker »

The FORG initiative is looking into tools like this to clean live data.

CODA using a few tables to generate data. These include things like male and female first names, last names, etc. It's pretty basic at present. CODA does not have access to any live data.
Post Reply

Return to “General Discussions”