Glossary
Need help with some technical terms?
This glossary provides short definitions of key terms related to PaperScan.
Need help with some technical terms?
This glossary provides short definitions of key terms related to PaperScan.
PaperScan allows image rotation at any angle besides the classic 90, 180 and 270 degrees rotation features.
Skew is an artifact that can occur in scanned images because of the camera being misaligned, imperfections in the scanning or surface, or simply because the paper was not placed completely flat when scanned. Deskew is the process of removing skew from images.
Automatic color detection is delivered in PaperScan PRO through a patent pending, proprietary to ORPALIS technology, allowing users to scan any kind of documents (black/white, greyscale or colored) using color scan mode and automatically detect the original color-type of the document (black/white, greyscale or colored).
This allows users to save the document in best fitted bits-per-pixel (bpp) encoding, thus providing best quality for smallest possible file size.
You can find more detailed explanations in this dedicated blog article.
JBIG2 encoding is a revolutionary breakthrough in captured document technology allowing scanned images to be compressed up to 10x smaller than with TIFF G4 and TIFF-based PDFs. It allows documents of any types to be viewed and manipulated efficiently over the Internet, and affords digital copiers/printers efficient network transmission of digitally copied documents. More on JBIG2
JPEG 2000 is an image compression standard which, as opposed to the classic JPEG standard, offers important advantages, such as : provides both lossy and lossless compression in the same file stream, higher compression ratios for lossy compression than the JPEG standard, provides the ability to display same image in different resolutions and sizes, provides the capability of viewing a certain Region of Interest (ROI) in high quality while the rest of the image is displayed in lower quality, etc. There are numerous other advantages of JPEG 2000 over JPEG compression standard you can find out about from dedicated section in the official site.
Sharing electronic colored documents is often complicated for many reasons: the file size is usually very big, and when the size becomes acceptable, it’s –most of the time- to the detriment of quality. However, it should be possible to share and view color as well as black and white. Mixed raster content (MRC) is a method for compressing images that contain both binary-compressible text and continuous-tone components, using image segmentation methods to improve the level of compression and the quality of the rendered image. By separating the image into components with different compressibility characteristics, the most efficient and accurate compression algorithm for each component can be applied. Read more.
PDF/A is a file format for the long-term archiving of electronic documents. PDF/A is in fact a subset of PDF, obtained by leaving out PDF features not suited to long-term archiving.
PDF Searchable Image (often called PDF-OCR) is a PDF image-based document which, in addition to the visual representation of the original document (bitmapped
layer), also contains a (hidden) text layer resulted from Optical Character Recognition (OCR) process so you can search for any word on any page of the PDF document.
PaperScan uses Tesseract OCR engine to create PDF-OCR files. Text can be viewed in PDF Viewer apps, such as Adobe Reader, PaperLight, etc.
You can remove the black dots punch holes from scanned document images after scanning punched paper documents.
PaperScan allows configurable punch holes removal.
For better understanding, please see our screenshots with images before/after punch holes removal using PaperScan.
TWAIN is an industry standard API for image acquisition, strongly oriented toward scanners. The TWAIN standard is maintained by the TWAIN Working Group. More on TWAIN
WIA (Microsoft Windows Image Acquisition) API is standardized for acquiring digital images from devices that are primarily used to capture still images and for managing these devices.WIA is a COM interface, very different from but overlapping with TWAIN. Compared to TWAIN it offers much better support for digital cameras, and much less sophisticated support for scanners. More on WIA