After doing a few pages of census indexing (5 pages from 1900), an idea hit me. When dealing with difficult handwriting, it would be helpful to have structured samples of how a given enumerator writes each letter, lowercase and uppercase, and digit.
On most of the few pages I have indexed, I have had to spend considerable time searching around the page to find examples of how the enumerator writes several different letters to try to decide which letter the enumerator intended by a particular strange-looking squiggle. In many cases, finding a particular example has been extremely important in correctly interpreting the handwriting.
I would propose a side/branch effort to capture a small number of samples of each letter and digit for each enumerator to make them available to indexers and perhaps arbitrators. I envision that a volunteer would go through a few or perhaps several pages by the same enumerator. Perhaps except for the few letters that are rarely used, this volunteer would capture a few examples of each letter and digit. These samples would be stored such that indexing volunteers could use the enumerator's name, the state, the year, and perhaps the county to pull up the samples. For cases where multiple enumerators in the same state, county, and year had the same name, the images of the enumerator's name could be used to select the right enumerator. Perhaps the creation of samples from a given enumerator could be triggered by indexing volunteer request. Or, perhaps the indexing volunteer could capture the samples himself/herself after finding particularly difficult handwriting.
I believe this would yield significant improvements in accuracy and efficiency of indexing volunteer time, which would increase the amount of end result achievable. I believe the amount of time saved will be far greater, perhaps several times greater, than the amount of time spent capturing the samples.
Comments?