OpenRTK 6.0 Feature List
 

  1. IMAGE ACQUISITION -- "GETTING THE PAGE"
  2. LOCATING
  3. RECOGNIZING
  4. PROOFING INFORMATION
  5. EXPORT
  6. OPEN FEATURES
  7. MISCELLANEOUS

1. IMAGE ACQUISITION -- "GETTING THE PAGE"

The OpenRTK can automatically read image files in any one of the following formats:

  • JPEG
  • TIFF Uncompressed
  • TIFF Packbits
  • TIFF Group 3
  • TIFF Group 4
  • PCX
  • DCX
  • BMP

Each image can be read in any one of four orientations and rotated appropriately when read: Portrait, Landscape, Flipped Portrait, and Flipped Landscape. The OpenRTK also provides developers with APIs to detect the orientation of an image, rotate it automatically if necessary, and straighten skewed images.
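
To illustrate what rotating to one of the four orientations involves, here is a minimal sketch in Python that works on a plain grid of pixel values. It does not use the OpenRTK API. The function names and the mapping from orientation name to number of clockwise quarter turns are assumptions made for this example only; the toolkit's own convention may differ.

```python
from typing import List

# Assumed mapping (hypothetical): clockwise quarter turns needed to make the page upright.
QUARTER_TURNS = {
    "portrait": 0,
    "landscape": 1,
    "flipped_portrait": 2,
    "flipped_landscape": 3,
}

def rotate_cw(pixels: List[List[int]]) -> List[List[int]]:
    """Rotate a row-major pixel grid by 90 degrees clockwise."""
    # Reversing the row order and then transposing yields a clockwise turn.
    return [list(column) for column in zip(*pixels[::-1])]

def to_upright(pixels: List[List[int]], orientation: str) -> List[List[int]]:
    """Return a rotated copy of the grid for the given orientation name."""
    result = [list(row) for row in pixels]  # copy so the input is left untouched
    for _ in range(QUARTER_TURNS[orientation]):
        result = rotate_cw(result)
    return result

page = [[1, 2],
        [3, 4]]
upright = to_upright(page, "flipped_portrait")  # two quarter turns = 180 degrees
```

A real image reader would rotate packed scan lines rather than nested lists, but the index arithmetic is the same.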

Note: OpenRTK 6.0 now supports recognition of TrueColor (24-bit) and 256-shade grayscale images in JPEG and uncompressed TIFF formats.

Note: If you use your own image reader, it is strongly suggested that you implement scan line doubling of standard resolution images for better recognition accuracy.
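
As a rough illustration of scan line doubling, the Python sketch below repeats every row of an image so that its vertical resolution is doubled. It assumes that "standard resolution" refers to fax-style images whose vertical resolution is about half the horizontal one, and that the image is held as a list of packed scan lines. Both the representation and the function name are assumptions for this example, not part of the OpenRTK.

```python
from typing import List

def double_scan_lines(rows: List[bytes]) -> List[bytes]:
    """Return a new image in which every scan line appears twice."""
    doubled: List[bytes] = []
    for row in rows:
        doubled.append(row)
        doubled.append(row)  # repeat the row to double the vertical resolution
    return doubled

# A tiny two-row image, one bit per pixel, packed into bytes (made-up data).
image = [b"\xf0\x0f", b"\x00\xff"]
doubled = double_scan_lines(image)
```

If your reader reports resolution to the recognition step, remember to double the vertical resolution value as well, so that point sizes are estimated from the corrected image.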

2. LOCATING

Locating, the process of identifying and ordering the areas of text on a page, can be accomplished:

  • Automatically, using the OpenRTK's built-in Locate features. This is useful when you want to automatically process a heterogeneous set of documents where the location of text to be recognized on each page is unknown.
  • Manually, defining areas of text explicitly and ignoring the OpenRTK's built-in Locate features. This may be useful when trying to process a form, where the location of text to be recognized on each page is always the same.

Of course, it's also possible to use the OpenRTK's built-in Locate features and then manually adjust those results. Text regions, whether they are the result of automatic or manual Locating, can be inserted, deleted, and modified.
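
The combined workflow (locate automatically, then adjust by hand) can be pictured with the following Python sketch. The `Region` type, its fields, and the simple top-to-bottom ordering are hypothetical stand-ins chosen for illustration; they are not the OpenRTK's data structures or its ordering algorithm.

```python
from dataclasses import dataclass, replace
from typing import List

@dataclass(frozen=True)
class Region:
    """A rectangular text area in pixel coordinates (hypothetical type)."""
    left: int
    top: int
    right: int
    bottom: int

def reading_order(regions: List[Region]) -> List[Region]:
    """Order regions top to bottom, then left to right (a simplifying assumption)."""
    return sorted(regions, key=lambda r: (r.top, r.left))

# Pretend these two regions came back from an automatic Locate pass.
located = [Region(50, 400, 600, 700), Region(50, 100, 600, 300)]

regions = list(located)
regions.append(Region(50, 750, 600, 900))         # insert a region by hand
regions.remove(Region(50, 400, 600, 700))         # delete an unwanted region
regions[0] = replace(regions[0], bottom=350)      # modify: extend a region downward
ordered = reading_order(regions)
```

For a fixed form, you would skip the automatic pass entirely and build the region list once from the known field positions.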

You can choose whether to locate pictures for better format preservation.

3. RECOGNIZING

The OpenRTK uses ExperExchange's exclusive MLFA technology, which can accurately recognize more than 2600 font types and is supplemented with built-in and auxiliary (user) dictionaries. Recognition options include:

  • setting code page (ANSI or OEM)
  • setting OCR language (now supports English, French, German, Italian, Spanish, Portuguese, Danish, Dutch, Swedish, and Norwegian)
  • setting paper quality (DQDM or Letter, with or without degraded document recognition option)
  • setting illegible character symbol

The OpenRTK also recognizes and retains the following text attributes:

  • text style (bold, italic, underline, superscript, and subscript)
  • point size (6 to 64 points at 300 dpi)
  • font family (serif, sans serif, or monospace)

4. PROOFING INFORMATION

The proofing step is, for the most part, a user interface (UI) function. Although the OpenRTK has no UI elements, it reports the position, confidence level, and candidates of each character, including suspect and illegible characters, through the Open data structure.
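
A proofing UI built on this information might start from something like the Python sketch below. The `CharInfo` record, the 0.0 to 1.0 confidence scale, and the 0.8 threshold are all assumptions made for illustration; the actual layout of the Open data structure is defined by the toolkit and is not reproduced here.

```python
from dataclasses import dataclass
from typing import List, Tuple

@dataclass
class CharInfo:
    """Per-character proofing data (hypothetical record, not the OpenRTK layout)."""
    char: str                        # best guess for this character
    box: Tuple[int, int, int, int]   # (left, top, right, bottom) in pixels
    confidence: float                # assumed scale: 0.0 (lowest) to 1.0 (highest)
    candidates: List[str]            # alternative readings, best first

def find_suspects(chars: List[CharInfo], threshold: float = 0.8) -> List[CharInfo]:
    """Return the characters a proofing UI should highlight for review."""
    return [c for c in chars if c.confidence < threshold]

line = [
    CharInfo("O", (10, 20, 30, 50), 0.97, ["O", "0"]),
    CharInfo("l", (32, 20, 40, 50), 0.55, ["l", "1", "I"]),
    CharInfo("d", (42, 20, 60, 50), 0.93, ["d"]),
]

for suspect in find_suspects(line):
    # A UI would highlight suspect.box on the page image and offer the candidates.
    print(suspect.char, suspect.box, suspect.candidates)
```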

5. EXPORT

The OpenRTK converts its internal data model to various application file formats while retaining the page format. Supported formats include the following, although availability depends on the platform.

  • ASCII (Plain Text)
  • ASCII with line breaks (Text with line breaks)
  • Comma-delimited ASCII (Comma Delimited Text)
  • Lotus 1-2-3 v2.x, 3.x
  • Lotus Ami Professional v1.2, 2.0, 3.0
  • Microsoft Excel v2.x, 3.0, 4.0
  • Microsoft Rich Text Format (RTF)
  • Microsoft Word for Windows (RTF)
  • Native/TypeReader (can be opened by the OpenRTK for later proofing purpose)
  • Native/TypeReader Text Only (can be opened by the OpenRTK)
  • "Smart" ASCII (Formatted Text)
  • Tab Delimited Text
  • WordPerfect 5.0
  • WordPerfect 5.1, 5.2
  • HTML (Internet ready!)
  • Portable Document Format (PDF) (Normal, Image with Hidden Text, and Image only) [1]
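
To show how the comma-delimited and tab-delimited text formats in the list above differ, the Python sketch below writes the same recognized table both ways using the standard `csv` module. It is independent of the OpenRTK export filters, whose exact quoting and line-ending rules are not documented here; the sample rows and the function name are made up for the example.

```python
import csv
import io
from typing import List

def to_delimited(rows: List[List[str]], delimiter: str) -> str:
    """Serialize table rows as delimited text, quoting fields only when needed."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter=delimiter, lineterminator="\n")
    writer.writerows(rows)
    return buffer.getvalue()

# Cells as they might come out of a recognized table (made-up data).
table = [
    ["Item", "Qty"],
    ["Widget, large", "3"],
]

comma_text = to_delimited(table, ",")   # the field containing a comma gets quoted
tab_text = to_delimited(table, "\t")    # no quoting needed: no field contains a tab
```

The practical difference is that comma-delimited output must quote any cell that itself contains a comma, while tab-delimited output rarely needs quoting for ordinary text.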

6. OPEN FEATURES

(Available OCR Engine Internal Information)

  • Document structure
  • Page structure
  • Line structure
  • Word structure
  • Character recognition and alternatives

(Works with foreign recognition engines, e.g. ICR, Japanese OCR, or Chinese OCR)

  • Iterative recognition with foreign recognition engine
  • Page layout re-analysis based on foreign recognition
  • TypeReader proofing format available for foreign recognition results
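
One way to picture iterative recognition with a foreign engine is the Python sketch below: characters that the primary pass scored below a threshold are handed to a second recognizer, and the second result is kept only if it is more confident. The function name, the callback signature, the confidence scale, and the threshold are all hypothetical; the sketch illustrates the idea and is not the OpenRTK's mechanism.

```python
from typing import Callable, List, Tuple

Guess = Tuple[str, float]  # (character, confidence on an assumed 0.0 to 1.0 scale)

def refine(primary: List[Guess],
           foreign: Callable[[int], Guess],
           threshold: float = 0.8) -> List[Guess]:
    """Re-recognize low-confidence characters with a second engine."""
    refined: List[Guess] = []
    for index, (char, confidence) in enumerate(primary):
        if confidence < threshold:
            alt_char, alt_confidence = foreign(index)  # ask the foreign engine
            if alt_confidence > confidence:
                char, confidence = alt_char, alt_confidence
        refined.append((char, confidence))
    return refined

# Stand-in for a foreign engine: looks up canned answers by character index.
def fake_foreign_engine(index: int) -> Guess:
    answers = {1: ("x", 0.90), 2: ("c", 0.40)}
    return answers[index]

primary_pass = [("T", 0.95), ("?", 0.30), ("e", 0.60)]
result = refine(primary_pass, fake_foreign_engine)
```

In this run the second character is replaced, the third keeps its original reading because the foreign engine was less confident, and the first is never sent to the foreign engine at all.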

7. MISCELLANEOUS

Additionally, the OpenRTK provides the following features:

  • Application Programming Interface (API) access to all data, isolating the developer from internal design and code changes
  • Dynamic Link Library (DLL) runtime format
  • Customer training to achieve the highest recognition accuracy on documents of the same type (e.g., the recognition of an aged book)

[1] A run-time royalty fee for using the PDF export filter is charged separately. Permission from ExperVision is required before distributing the PDF export filter.
