myPDFconvert - Powerful Compression + OCR
Via its GUI, myPDFconvert also offers processing options for high-performing compression of documents in several PDF and graphics formats as well as an integrated OCR processing
High compression
New ISO-standardized compression methods such as JBIG2 (ISO/IEC 14492) and JPEG 2000 (ISO/IEC 15444) considerably reduce the storage requirement for compressed documents compared to traditional methods such as TIFF G3 and G4. Both new algorithms, JBIG2 and JPEG 2000, allow lossless as well as lossy compression and can therefore be customized regarding quality and file size.
Normally, PDF/A-compliant (acc. to ISO 19005-1b) PDF files created with lossless JBIG2 compression are finally only half as large as the originally existing or scanned TIFF files. These documents allow full-text research by archiving systems and other automatic processing, and users can immediately convert their incoming mail in compliance with the standard during scanning – and even sign it if required.
The functions used in myPDFconvert normally compress black-and-white scans by a factor 5 to 10 and color scans by a factor 10 to 100.
OCR (Optical character recognition)
If bitmap documents (e.g. TIFF, BMP) are converted to PDF documents using myPDFconvert, even an OCR processing can be performed besides compression. The text data retrieved by OCR can be stored in the created PDF as an invisible text layer, which lies above the decompressed image when viewing the document in the Acrobat viewer. In this way, users receive PDF/A-compliant, parsable PDF files as a processing result – created on the basis of raster data or of PDFs with embedded image documents.
OCR: Automated
Even the OCR engine in myPDFconvert is designed for industrial-strength scanning and OCR in business applications. myPDFconvert automates the OCR process by watched folders, so that users need not monitor document processing. In the corresponding mode, files undergo an OCR analysis if they are just drawn into a folder. To manage large quantities of scans, OCR batch processing runs with a high speed of approx. 1 scanned page per second.

