2  Digitization

Although this course focuses on text recognition and analysis, it is important to note that digitization, meaning both the quality of the images and the consistent collection of meta-data, is key to all subsequent processing. If you start a project where the digitization is not yet complete, consider how the digitization step affects all subsequent post-processing and text recognition workflows.

The quality of the collected image data and the availability of meta-data have a profound impact on your workflow. Addressing image quality and meta-data issues up front can save significant time and effort later, even if it requires more time for planning and data collection.

Some general guidelines for digitization therefore include:

Finally, if these aspects fall outside your domain expertise, reach out to your local collection managers for support and input.

Figure 2.1: The COBECORE digitization station, including a reproduction stand, cold lights, a DSLR camera and a black matte background