We downloaded the final reels on April 6, so the data entry component of the project is now complete. Our vendor, Atlis Publishing & Graphics Services, which was acquired by Data Stream Content Solutions (DSCS) in January, delivered 42 XML files over six weeks. These files represent the 42 reels of microfilm that we sent them in January. DSCS converted the microfilm to JPG images, then transcribed and marked up the text according to the encoding guide we prepared for them. Using a combination of programmatic and manual conversion, DSCS converted 109,348 records into XML. As each file was completed, DSCS uploaded it to their FTP server, and we downloaded the new files every few days. They also transcribed the handwriting, which was not infrequent, with great success. In addition to the XML files, DSCS sent us PDF files of all the images, along with the corresponding unique ID number they assigned to each record in the encoding. This resource has been extremely helpful in proofreading.
Overall, we are very happy with their work and would highly recommend their services.
A note on estimates:
Our extensive planning over the past two years started with a major grant request as well as an RFP process, which forced us to calculate the number of records and the approximate keystrokes to better assess our options. Using Statistics 101 and a little common-sense sampling, we estimated a total of 108,400 records (the actual count was 109,348) and an average of 247 keystrokes per record (the actual turned out to be 246.625). Pretty good, since we only had three giant file cabinets, a ruler, and a calculator to figure it out!
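For anyone curious how close "pretty good" is, here is a minimal sketch that computes the relative error of each estimate from the figures quoted above. The function name `relative_error` is just an illustrative helper, not part of any tool we used on the project.

```python
def relative_error(estimate: float, actual: float) -> float:
    """Return the absolute relative error of an estimate against the actual value."""
    return abs(estimate - actual) / actual


# Figures quoted in this post.
records_estimate, records_actual = 108_400, 109_348
keystrokes_estimate, keystrokes_actual = 247, 246.625

records_error = relative_error(records_estimate, records_actual)
keystrokes_error = relative_error(keystrokes_estimate, keystrokes_actual)

# Both errors come out below one percent.
print(f"Record count error:          {records_error:.2%}")
print(f"Keystrokes per record error: {keystrokes_error:.2%}")
```

By this measure the record count was off by a little under 0.9 percent and the keystrokes per record by roughly 0.15 percent.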