- IMAGE ACQUISITION -- "GETTING THE PAGE"
- LOCATING
- RECOGNIZING
- PROOFING INFORMATION
- EXPORT
- OPEN FEATURES
- POWERFUL AND OPEN API DESIGN
1. IMAGE ACQUISITION -- "GETTING THE PAGE"
The OpenRTK® (SDK) reads image files automatically, with the
following capabilities:
- Image formats supported: PCX, DCX, PDF, BMP, TIFF Uncompressed,
TIFF PackBits, TIFF Group 3, TIFF Group 4, JPEG.
- Each image can be read in any one of four orientations and
rotated appropriately when read: Portrait, Landscape, Flipped
Portrait, and Flipped Landscape.
- The OpenRTK® also provides developers with APIs to detect the
orientation of an image, rotate automatically if necessary, and
straighten any skewed images.
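The four orientations differ only by quarter turns, so bringing a page upright amounts to rotating the bitmap by a multiple of 90 degrees. The following is a minimal sketch in plain Python, not the OpenRTK® API: the names `rotate_cw`, `QUARTER_TURNS`, and `to_upright` are hypothetical, and the mapping of each orientation to a number of clockwise turns is an assumption made for illustration.

```python
def rotate_cw(rows):
    """Rotate a bitmap (list of pixel rows) 90 degrees clockwise."""
    return [list(col) for col in zip(*rows[::-1])]

# Assumed mapping from orientation name to clockwise quarter turns
# needed to bring the page upright (hypothetical, for illustration).
QUARTER_TURNS = {
    "portrait": 0,
    "landscape": 1,
    "flipped_portrait": 2,
    "flipped_landscape": 3,
}

def to_upright(rows, orientation):
    """Return a copy of the bitmap rotated into portrait orientation."""
    result = [list(r) for r in rows]
    for _ in range(QUARTER_TURNS[orientation]):
        result = rotate_cw(result)
    return result

page = [[1, 2],
        [3, 4]]
upright = to_upright(page, "flipped_portrait")  # 180-degree rotation
```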
Note: If you use your own image reader, it is strongly suggested
that you implement scan line doubling of standard resolution images
for better recognition accuracy.
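As an illustration of the note above, scan line doubling simply repeats every row of the image. This sketch assumes that "standard resolution" refers to fax-style images whose vertical resolution is roughly half the horizontal resolution, so that doubling restores approximately square pixels. The function name `double_scan_lines` and the list-of-rows bitmap representation are assumptions, not part of the OpenRTK®.

```python
def double_scan_lines(rows):
    """Return a new bitmap in which every scan line appears twice."""
    doubled = []
    for row in rows:
        doubled.append(list(row))  # original line
        doubled.append(list(row))  # duplicate directly beneath it
    return doubled

# A tiny 2-line, 4-pixel-wide bilevel image (1 = black, 0 = white).
image = [
    [0, 1, 1, 0],
    [1, 0, 0, 1],
]
doubled = double_scan_lines(image)
```

A custom image reader would apply this step before handing the bitmap to the recognition engine.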
2. LOCATING
Locating, the process of identifying and ordering the areas of text on a page, can be accomplished:
- Automatically, using the OpenRTK®'s built-in Locate features. This is useful for processing a heterogeneous set of documents where the location of the text to be recognized on each page is unknown.
- Manually, defining areas of text explicitly and bypassing the OpenRTK®'s built-in Locate features. This may be useful when processing a form, where the location of the text to be recognized is the same on every page.
It is also possible to use the OpenRTK®'s built-in Locate features and then adjust the results manually. Text regions, whether produced by automatic or manual Locating, can be inserted, deleted, and modified.
You can also choose whether to locate pictures, which improves format preservation.
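To make the manual workflow concrete, here is a minimal sketch of how a calling application might hold a set of regions and insert, modify, delete, and order them. It does not use the OpenRTK® API: the `Region` class, its fields, the pixel coordinates, and the simple top-to-bottom, left-to-right ordering rule are all assumptions for illustration.

```python
from dataclasses import dataclass, replace

@dataclass(frozen=True)
class Region:
    """Hypothetical rectangular area on a page, in pixels."""
    left: int
    top: int
    right: int
    bottom: int
    kind: str = "text"  # "text" or "picture"

def reading_order(regions):
    """Order regions top to bottom, then left to right (a simple heuristic)."""
    return sorted(regions, key=lambda r: (r.top, r.left))

# Manually defined fields of a fixed form (made-up coordinates).
regions = [
    Region(100, 400, 900, 460),  # address field
    Region(100, 200, 900, 260),  # name field
]
regions.append(Region(100, 600, 500, 900, kind="picture"))  # insert
regions[0] = replace(regions[0], bottom=520)                # modify
del regions[2]                                              # delete
ordered = reading_order(regions)
```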
3. RECOGNIZING
The OpenRTK® uses ExperExchange's exclusive MLFA technology, which accurately recognizes more than 2600 font types and is supplemented by built-in and auxiliary (user) dictionaries. Recognizing options include:
- setting code page (ANSI or OEM)
- setting OCR language (currently supports English, French, German, Italian, Spanish, Portuguese, Danish, Dutch, Swedish, Norwegian, Hungarian, Polish, Finnish, and Polynesian)
- setting paper quality (DQDM or Letter, with or without degraded document recognition option)
- setting illegible character symbol
The OpenRTK® also recognizes and retains the following text attributes:
- text style (bold, italic, underline, superscript, and subscript)
- point size (6 to 64 points at 300 dpi)
- font family (serif, sans serif, or monospace)
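The point-size range can be translated into pixels using the standard definition of 72 points per inch. The short sketch below shows the arithmetic for 300 dpi; the helper name `points_to_pixels` is hypothetical, and the result describes nominal type size only, not a documented limit on glyph height.

```python
def points_to_pixels(points, dpi=300):
    """Convert a type size in points (1/72 inch) to pixels at the given dpi."""
    return points * dpi / 72

smallest = points_to_pixels(6)   # 6 pt at 300 dpi is 25 pixels
largest = points_to_pixels(64)   # 64 pt at 300 dpi is about 267 pixels
```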
4. PROOFING INFORMATION
The proofing step is, for the most part, a user interface (UI) function. Although the OpenRTK® has no UI elements, it exposes the position, confidence level, and candidates of each character (including suspect and illegible characters) through the Open data structure.
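A proofing UI built on this information would typically pick out the characters that need human review. The sketch below shows one way to do that. It is not the Open data structure itself: the `CharResult` class, the 0.0 to 1.0 confidence scale, the `~` illegible symbol, and the 0.80 threshold are all assumptions chosen for illustration.

```python
from dataclasses import dataclass, field

ILLEGIBLE = "~"  # assumed illegible-character symbol

@dataclass
class CharResult:
    """Hypothetical per-character recognition record."""
    char: str
    box: tuple          # (left, top, right, bottom) in pixels
    confidence: float   # assumed scale: 0.0 (no confidence) to 1.0
    candidates: list = field(default_factory=list)

def needs_review(results, threshold=0.80):
    """Return the characters a proofing UI should show to the user."""
    return [r for r in results
            if r.char == ILLEGIBLE or r.confidence < threshold]

line = [
    CharResult("c", (10, 10, 30, 40), 0.98, ["c", "e"]),
    CharResult("l", (32, 10, 40, 40), 0.55, ["l", "1", "I"]),
    CharResult(ILLEGIBLE, (42, 10, 60, 40), 0.10, []),
]
flagged = needs_review(line)
```

The UI can then highlight each flagged character at its `box` position and offer its `candidates` as replacements.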
5. EXPORT
The OpenRTK® converts its internal data model to various application file formats, drawing on its superior page format retention capabilities. Supported formats include the following; availability depends on the platform.
- ASCII (Plain Text)
- ASCII with line breaks (Text with line breaks)
- Comma-delimited ASCII (Comma Delimited Text)
- Lotus 1-2-3 v2.x, 3.x
- Lotus Ami Professional v1.2, 2.0, 3.0
- Microsoft Excel v2.x, 3.0, 4.0
- Microsoft Rich Text Format (RTF)
- Microsoft Word for Windows (RTF)
- Native/TypeReader (can be opened by the OpenRTK® (SDK) for later proofing)
- Native/TypeReader Text Only (can be opened by the OpenRTK®)
- "Smart" ASCII (Formatted Text)
- Tab Delimited Text
- WordPerfect 5.0
- WordPerfect 5.1, 5.2
- HTML (Internet ready!)
- Portable Document Format (PDF) (Normal, Image with Hidden Text, and Image only)
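To show how the delimited text formats in the list above differ, the following sketch writes the same recognized table as comma-delimited and tab-delimited text using Python's standard `csv` module. It does not call the OpenRTK® export functions; `export_delimited` and the sample rows are hypothetical.

```python
import csv
import io

def export_delimited(rows, delimiter=","):
    """Serialize rows of cell text as delimited text, quoting where needed."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter=delimiter, lineterminator="\n")
    writer.writerows(rows)
    return buffer.getvalue()

# Cell text as it might come back from recognizing a small table.
rows = [["Item", "Price"], ["Widget, large", "4.50"]]
comma_text = export_delimited(rows)
tab_text = export_delimited(rows, delimiter="\t")
```

In the comma-delimited output, the cell that contains a comma is quoted so it is not split in two; the tab-delimited output needs no quoting for the same data.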
6. OPEN FEATURES
The features listed in this section are particularly useful for advanced customers who want to combine the power of ExperVision’s OCR with other available technologies to strengthen their competitive position in the market.
(Available OCR Engine Internal Information)
- Document structure
- Page structure
- Line structure
- Word structure
- Character recognition and alternatives
(Works with foreign recognition engine, e.g. ICR, Japanese OCR, or Chinese OCR)
- Iterative recognition with foreign recognition engine
- Page layout re-analysis based on foreign recognition
- TypeReader proofing format available for foreign recognition results
7. POWERFUL AND OPEN API DESIGN
The OpenRTK® is provided as a Dynamic Link Library (DLL) or in runtime format. All data is accessed through the Application Programming Interface (API), which isolates the developer from internal design and code changes in the OpenRTK®.
More than 200 APIs, together with abundant analytical information about the document, are exposed to calling applications. They support:
- Image Acquisition,
- Image Preprocessing,
- Layout Location,
- Content Recognition,
- Proofing Information,
- Result Exporting,
- Format Conversion,
- etc.