Thinking through your intentions for the final OCR'd text will help you create a final text that is rich in all of the appropriate ways. We are digitizing documents faster than we can create the metadata needed to accompany them.

No special skills are required to use OCR software. If you don't have a digital document, or if what you have is poor quality, you can scan the original document using your OCR program as your scanning software. For more information on how to obtain a quality image, please consult the LibGuide on How to Use Digital Tools for Archival Research.

You should be aware that if your goal is 100% text accuracy, you will need to check and correct the text after it has gone through the original recognition process. The editing/correcting process could take a considerable amount of time for large amounts of text and/or poor quality original text.

An OCR software's ability to accurately analyze your document depends on the condition of the original and/or the quality of the digital file. The recommended best scanning resolution for OCR accuracy is 300 dpi. Brightness settings that are too high or too low can reduce the accuracy of recognition from your image, and low-contrast documents can result in poor OCR. The straightness of the initial scan also affects OCR quality: skewed pages can lead to inaccurate recognition. Older and discolored documents must be scanned in RGB mode in order to capture all of the image data. Typescript results in poorer OCR than printed type, and inconsistent use of font faces and sizes can lower OCR accuracy. Language matters as well: texts published before 1850 may not be the most compatible with OCR software.
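The scan-quality factors above can be turned into a simple check to run before sending a page to OCR. The following is only a rough sketch in Python: the 300 dpi figure and the RGB rule for discolored originals come from the text, while the function name `preflight_warnings` and the skew, brightness, and contrast thresholds are hypothetical values chosen for illustration, not settings from any particular OCR program.

```python
from statistics import mean, pstdev

RECOMMENDED_DPI = 300  # recommended scanning resolution from the text above

# Hypothetical thresholds, chosen only for illustration.
MAX_SKEW_DEGREES = 1.0      # tolerated page rotation
MIN_MEAN_BRIGHTNESS = 60    # below this the scan is treated as too dark
MAX_MEAN_BRIGHTNESS = 230   # above this the scan is treated as too bright
MIN_CONTRAST = 30           # standard deviation of grayscale values


def preflight_warnings(dpi, color_mode, skew_degrees, gray_pixels, discolored=False):
    """Return a list of warnings about a scan before it is sent to OCR.

    gray_pixels is a non-empty sequence of grayscale values from 0 to 255.
    """
    warnings = []

    if dpi < RECOMMENDED_DPI:
        warnings.append(
            f"resolution {dpi} dpi is below the recommended {RECOMMENDED_DPI} dpi"
        )

    if discolored and color_mode != "RGB":
        warnings.append("older or discolored original should be scanned in RGB mode")

    if abs(skew_degrees) > MAX_SKEW_DEGREES:
        warnings.append(f"page is skewed by {skew_degrees} degrees")

    brightness = mean(gray_pixels)
    if brightness < MIN_MEAN_BRIGHTNESS:
        warnings.append("brightness is too low")
    elif brightness > MAX_MEAN_BRIGHTNESS:
        warnings.append("brightness is too high")

    # Population standard deviation as a crude measure of contrast.
    if pstdev(gray_pixels) < MIN_CONTRAST:
        warnings.append("contrast is low")

    return warnings


if __name__ == "__main__":
    # A washed-out, tilted, low-resolution grayscale scan of a discolored page.
    for warning in preflight_warnings(150, "L", 3.0, [240] * 100, discolored=True):
        print("warning:", warning)
```

A check like this only flags likely problems. It does not replace proofreading the recognized text if your goal is full accuracy.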