Skip header
 

Embedding Text Information in Scanned Data

You can use the OCR function to embed the text information in the scanned document without processing the data on your computer.

Important

  • For details about the optional units required for this function, see "Functions Requiring Optional Configurations", Getting Started.

  • This function supports the following file types: [PDF], [High Compression PDF], and [PDF/A].

  • If [Black & White: Photo] is selected from [Original Type] when originals are being scanned, the text is scanned in shades of gray, and the characters and the top and bottom of the page may not be recognized correctly. If OCR accuracy has a higher priority than the image quality, select [Black & White: Text] in [Original Type] when scanning the original.

  • You cannot use the OCR function in the following cases:

    • [TIFF / JPEG] or [TIFF] is selected as the file type.

    • [100 dpi] is selected as the resolution.

    • When the WSD or DSM destination list is used.

1Place originals.

2Press [Send File Type / Name].

Operation panel screen illustration

3Press [PDF] in [File Type].

4Press [OCR Settings] in the PDF File Setting, and then press [On].

5Configure the settings such as [Add Extrct.Text to File Name], [Delete Blank Page], and [Cognitive Language] as required.

6Press [OK] twice.

7To send an e-mail, configure the destination address and other required settings.

8Press [Start].

Note

  • The OCR function can process texts up to 40,000 characters per page.

  • The OCR function can recognize the following languages:

    • English, German, French, Italian, Spanish, Dutch, Portuguese, Polish, Swedish, Finnish, Hungarian, Norwegian, Danish, Japanese.

  • The effective resolution may be less than 200 dpi when an image scanned at 200 dpi or greater resolution is reduced by specifying the reproduction ratio. You can apply the OCR function in such cases, but the text recognition accuracy may deteriorate.

  • [Add Extrct.Text to File Name] cannot be specified when [Store to HDD] in [Store File] is selected.

  • Depending on character shapes or types, characters may not be recognized correctly.

  • A PDF file without embedded text is generated if the scanned page does not contain a section that can be recognized as characters.

  • If a page contains large blank areas, the top and bottom of the page may not be recognized correctly.

  • No PDF file is generated if all pages in a document are determined as blank pages. If this happens, make sure to set the originals correctly, and try again.

  • A blank page or the top and bottom of a page may not be recognized correctly if the scanned page has smears or dirty spots or an image on the back side of the page can be seen through.

  • No type faces are identified while the OCR function is being applied to scanning. If the widths of the printed and embedded characters differ, the position of the embedded text may not match that of the printed text on the scanned page.

  • When you scan documents with the OCR function enabled to send them to an e-mail or folder destination, the consecutive scanning jobs may take time to start.