ABBYY FineReader 10 User’s Guide
59
Inverted image is an image with white characters against a dark background.
L
License Manager is a utility used for managing ABBYY FineReader licenses and activating ABBYY FineReader 10 Corporate
Edition.
Ligature is a combination of two or more "glued" characters (such as fi, fl, ffi). These characters are difficult to separate because
they are usually "glued" in print. Treating them as a single compound character improves OCR accuracy.
M
Monospaced font is a font (such as Courier New) in which all characters are equally spaced. For better OCR results on
monospaced fonts, select Tools>Options..., click the Document tab, and select Typewriter under Document print type.
O
Omnifont system is a recognition system that recognizes characters set in any font and font size without prior training.
Optional hyphen is a hyphen (¬) that indicates exactly where a word or word combination should be split if it occurs at the end
of a line (e.g. "autoformat" should be split into "auto–format"). ABBYY FineReader replaces all hyphens found in dictionary
words with optional hyphens.
P
Page layout is the arrangement of text, tables, pictures, paragraphs, and columns on a page, as well as fonts, font sizes, font
colors, text background, and text orientation.
Page layout analysis is the process of detecting areas on a page image. Areas can be of five types: text, picture, table, barcode,
and recognition area. Page layout analysis can be performed automatically when clicking the Read button, or manually by the
user prior to OCR.
Paradigm is the set of all grammatical forms of a word.
Pattern is a set of pairs (each pair contains a character image and the character itself) that is created during pattern training.
PDF security settings are restrictions that can prevent a PDF document from being opened, edited, copied or printed. These
settings include Document Open Passwords, Permissions Passwords, and encryption levels.
Permissions Password is a password which prevents other users from printing and editing a PDF document unless they type the
password the author specified. If some security settings are selected for the document, other users will not be able to change these
settings until they type the password the author specified.
Picture area is an area that is used for image areas that contain pictures. This type of area may enclose an actual picture or any
other object that should be displayed as a picture (e.g. a section of text).
Primary form is the form of a headword in a dictionary entry.
Print type is a parameter reflecting how the source text was printed (on a laser printer or equivalent, on a typewriter, etc.). For
laser–printed texts, select Autodetect; for typewritten texts, select Typewriter; for faxes, select Fax.
Product ID is the parameter that is automatically generated based on the hardware configuration when activating ABBYY
FineReader on a particular computer.
Prohibited characters — if certain characters will never be found in recognized text, they may be specified in a set of prohibited
characters in the language group properties. Specifying these characters increases the speed and quality of OCR.
R
Resolution is a s
canning parameter that determines how many dpi to use during scanning. Resolution of 300 dpi should be used
for texts set in 10pt font size and larger, 400 to 600 dpi is preferable for texts of smaller font sizes (9pt and less).
Recognition area is an area enclosing a section of an image that ABBYY FineReader should analyze automatically.
S
Scanner is a device for inputting images into a computer.
Separators are symbols that can separate words (e.g. /, \, dash) and that are separated by spaces from the words themselves.
T
Table area is an area that is used for table image areas or for areas of text that are structured as a table. When the application
reads this type of area, it draws vertical and horizontal separators inside the area to form a table. This area is the rendered as a
table in the output text.
Tagged PDF is a PDF document which contains information about the document structure such as its logical parts, pictures,
tables, etc. This structure is encoded in PDF tags. A PDF file equipped with the tags may be reflowed to fit different screen sizes
and will display well on handheld devices.
Text area is an area that contains text. Note that text areas should only contain single–column text.
Training is establishing a correspondence between a character image and the character itself. (For details, see Recognition with
Training section.)
U
Uncertain characters are characters that may have been recognized incorrectly. ABBYY FineReader highlights uncertain
characters.
Uncertain words are words containing one or several uncertain characters.
Unicode is a standard developed by the Unicode Consortium (Unicode, Inc.). The standard is a 16–bit international encoding
system for processing texts written in the main world languages. The standard is easily extended. The Unicode Standard
determines the character encoding, as well as properties and procedures used in processing texts written in a certain language.
Supported Image Formats
The table below lists the image formats supported in ABBYY FineReader 10.
Format Extension Open Save