Development Licensing Concept
Customer value received by Licensee
- Enables application developers to write, test, debug, and modify an application, using all of the APIs of OpenRTK®
- Customers can integrate the best OCR technology seamlessly with their own document management system to target and compete in this fast-growing market.
- Customers can customize the best OCR for their particular application, such as form processing, resume recognition, business card reading, invoice recognition, or check recognition.
- Customers can enhance the recognition technology by integrating their own domain specific knowledge.
Licensee rights and obligations
- Licensee will be granted a non-exclusive and non-transferable OpenRTK® license for the purpose of software development only.
- Licensee can make an archival copy of OpenRTK®, the use of which shall be limited solely to back-up purposes.
- OpenRTK® may be used only by Licensee for integration of OpenRTK® with and into the Licensee Applications. Licensee therefore needs to describe the application so that ExperVision can build a suitable OpenRTK® version for Licensee.
ExperVision® rights and support
- ExperVision® owns certain proprietary computer software programs commonly known as the Recognition Toolkit, as well as certain proprietary materials and other documentation relating to OpenRTK®.
- If Licensee requires assistance solely with respect to OpenRTK®, ExperVision® will make its engineers reasonably available to Licensee by telephone or at ExperVision®'s facility during ExperVision®'s normal business hours to provide up to 20 hours of free technical support.
- Besides the OCR technology license, ExperVision® can provide a comprehensive OCR Consulting & Customizing Service to help clients solve special problems that arise when applying RTK.
The pricing of OpenRTK® has two parts:
- The OpenRTK® development license is $5,190.
- OpenRTK® run-time licenses are purchased separately, after a client has purchased the OpenRTK® development license. For details please email OCR_Consulting_Team@ExperVision.com
Multiple Platforms
- Platform List
- Windows 98, 2000, NT, XP, Vista
- UNIX, Solaris
- Linux, Fedora, Ubuntu
- Macintosh OS 7, 8, 9, 10 & X
- Windows Mobile
- Palm WinCE
- Symbian
- FreeBSD
- MIPS, etc.
Classification of APIs
API classification and the flexibility OpenRTK® provides
OpenRTK is provided as a Dynamic Link Library (DLL) or in runtime format. All data is accessed through the Application Programming Interface (API), which isolates the developer from internal design and code changes in OpenRTK.
Over 200 API functions and abundant analytical information about the document have been designed and opened to calling applications.
Basic APIs of OpenRTK
o Doc-Image Acquisition
o Image Pre-Processing
o Layout Analysis/Locating
o Char/Font Recognition
o Proofing Information
o Formatting & Export
o Internal Data Openings
1) Doc-Image Acquisition
o Read image from files or scanning buffer into memory
o Read multiple image file formats
- TIFF uncompressed, pack bits, G3, G4
- PCX, DCX, BMP
- JPEG, JPEG 2000
- PDF
o Image lock/unlock for massive image data handling
- Images are stored in memory or temp file
- An application loads an image into memory with the lock operation
- An application unloads an image to a temp file with the unlock operation
- Thousands of images can be handled using minimal memory
o Multiple images per page
- Color/Grey image
- B/W image
- Thumbnail image
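The lock/unlock idea described above can be illustrated with a small sketch. This is not OpenRTK code: the `ImageStore` class, its method names, and the use of plain byte strings as image data are all assumptions made for illustration. It only shows how keeping unlocked images in temp files lets many images exist while few are resident in memory.

```python
import os
import tempfile


class ImageStore:
    """Hypothetical image pool: unlocked images live in temp files, locked ones in memory."""

    def __init__(self):
        self._memory = {}  # image id -> bytes currently held in memory
        self._paths = {}   # image id -> temp file holding the unlocked bytes

    def add(self, image_id, data):
        """Register an image and immediately spill it to a temp file."""
        self._memory[image_id] = data
        self.unlock(image_id)

    def lock(self, image_id):
        """Load the image into memory (if needed) and return its bytes."""
        if image_id not in self._memory:
            with open(self._paths[image_id], "rb") as f:
                self._memory[image_id] = f.read()
        return self._memory[image_id]

    def unlock(self, image_id):
        """Write the image to its temp file and drop the in-memory copy."""
        data = self._memory.pop(image_id, None)
        if data is None:
            return  # already unlocked
        path = self._paths.get(image_id)
        if path is None:
            fd, path = tempfile.mkstemp(suffix=".img")
            os.close(fd)
            self._paths[image_id] = path
        with open(path, "wb") as f:
            f.write(data)

    def in_memory(self):
        """Number of images currently resident in memory."""
        return len(self._memory)

    def close(self):
        """Delete all temp files and forget all images."""
        for path in self._paths.values():
            os.remove(path)
        self._paths.clear()
        self._memory.clear()


store = ImageStore()
for i in range(100):
    store.add(i, bytes([i % 256]) * 64)  # stand-in for real pixel data

pixels = store.lock(42)                  # only this image is resident now
resident_while_locked = store.in_memory()
store.unlock(42)
resident_after_unlock = store.in_memory()
store.close()
```

In this sketch an image is written to disk on every unlock; a real engine would likely avoid rewriting unchanged data.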
2) Image Pre-Processing
o Image conversion from color/grey to binary
o Auto orientation and auto de-skew
o Book handling
- Gap detection
- Page Splitting, etc.
o Line detection and optional removal
o Noise detection and de-speckling for OCR
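As a rough illustration of the color/grey to binary conversion step, the sketch below thresholds a grey image at its mean intensity. The method OpenRTK actually uses is not described here, so the mean threshold, the function names `to_grey` and `binarize`, and the list-of-rows image layout are all assumptions.

```python
def to_grey(rgb):
    """Convert one (r, g, b) pixel to a grey level using ITU-R BT.601 luma weights."""
    r, g, b = rgb
    return round(0.299 * r + 0.587 * g + 0.114 * b)


def binarize(grey_rows, threshold=None):
    """Return rows of 1 (ink) and 0 (background).

    Pixels darker than the threshold count as ink. When no threshold is
    given, the mean grey level of the whole image is used.
    """
    pixels = [p for row in grey_rows for p in row]
    if threshold is None:
        threshold = sum(pixels) / len(pixels)
    return [[1 if p < threshold else 0 for p in row] for row in grey_rows]


# A tiny grey "page": dark strokes (values near 25) on a light background.
page = [
    [250, 248, 30, 251],
    [249, 25, 20, 247],
    [252, 250, 28, 250],
]
binary = binarize(page)
```

A global mean threshold is only adequate for evenly lit scans; uneven backgrounds usually call for an adaptive threshold.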
3) Layout Analysis/Locating
o GTS – Graphics/Text/Table Regions Separation and Ordering
- Document layout analysis
- Form layout analysis
- Graphic & text separation
- Reading order analysis
- Normal, Force Single Column
- Template method
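The difference between the Normal and Force Single Column reading orders can be sketched on region bounding boxes. The grouping rule below (regions that overlap horizontally belong to the same column) is a simplification chosen for illustration, and the `Region` type and `reading_order` function are hypothetical, not part of OpenRTK.

```python
from dataclasses import dataclass


@dataclass
class Region:
    name: str
    left: int
    top: int
    right: int
    bottom: int


def reading_order(regions, force_single_column=False):
    """Order regions for reading.

    Single-column mode reads strictly top to bottom across the page.
    Normal mode groups horizontally overlapping regions into columns,
    then reads each column top to bottom, columns left to right.
    """
    if force_single_column:
        return sorted(regions, key=lambda r: (r.top, r.left))

    columns = []  # built left to right because regions are visited by left edge
    for region in sorted(regions, key=lambda r: r.left):
        for column in columns:
            if any(region.left < o.right and o.left < region.right for o in column):
                column.append(region)
                break
        else:
            columns.append([region])

    ordered = []
    for column in columns:
        ordered.extend(sorted(column, key=lambda r: r.top))
    return ordered


# A two-column page given in arbitrary order.
page = [
    Region("D", 120, 60, 220, 120),
    Region("A", 0, 0, 100, 50),
    Region("C", 120, 0, 220, 50),
    Region("B", 0, 60, 100, 120),
]
normal = [r.name for r in reading_order(page)]
single = [r.name for r in reading_order(page, force_single_column=True)]
```

On this page the normal order reads the left column before the right one, while the forced single-column order reads row by row.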
4) Char/Font Recognition
o Font: Extract and retain accurate font information for each text region
o Char: Higher text recognition accuracy with font information
o Iterative “Segmentation – Recognition - Post-processing” for best accuracy
o Further enhanced recognition by other proprietary techniques
Example – Handling 2,600 Fonts
o RTK.DB – organized character shape information of 2,600 fonts
o Identify font(s) of a given text region
o Recognize characters with the tree classifier for the identified font(s)
o Super fast and accurate algorithm – see Q. R. Wang’s IEEE papers from the 1980s
5) Proofing Information
o Font Style of the Text Region
o Coordinates of Recognized Chars, Words, Lines & Paragraphs
o Marks for Operator’s Attention (best performer in the UNLV Competition)
o WYSIWYG as in TypeReader® & TextProofer®
o Correct OCR result to 100% accuracy
6) Formatting & Export
o Save recognition result in common formats
- ASCII Text, CSV
- Microsoft Excel & Lotus 1-2-3
- Microsoft Word, RTF & WordPerfect
- HTML
- Various PDF Formats
o And more advanced PDF Settings
- Watermark
- Encryption
- Thumbnail
- Metadata
- ...
7) Internal Data Opening & RTK Object Model
- RTKClient = Document
- RTKImage = Image
- RTKPage = Page
- RTKRgn = Paragraph
- RTKLine = Line
- REWord = Word
- RECharacter = Character
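One way to picture the object model above is as a containment hierarchy. The sketch below assumes that a document holds pages, a page holds an image and paragraphs, and so on down to characters; that nesting, the Python class names, and the `text()` helpers are illustrative assumptions, not the actual OpenRTK types.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Character:  # corresponds to RECharacter in the list above
    value: str


@dataclass
class Word:  # REWord
    characters: List[Character] = field(default_factory=list)

    def text(self):
        return "".join(c.value for c in self.characters)


@dataclass
class Line:  # RTKLine
    words: List[Word] = field(default_factory=list)

    def text(self):
        return " ".join(w.text() for w in self.words)


@dataclass
class Paragraph:  # RTKRgn
    lines: List[Line] = field(default_factory=list)

    def text(self):
        return "\n".join(line.text() for line in self.lines)


@dataclass
class Page:  # RTKPage; image_path stands in for RTKImage
    image_path: str
    paragraphs: List[Paragraph] = field(default_factory=list)


@dataclass
class Document:  # RTKClient
    pages: List[Page] = field(default_factory=list)


def make_word(s):
    """Build a Word from a plain string, one Character per letter."""
    return Word([Character(ch) for ch in s])


def document_text(document):
    """Walk the hierarchy top-down and join paragraph texts with blank lines."""
    return "\n\n".join(
        p.text() for page in document.pages for p in page.paragraphs
    )


doc = Document(pages=[
    Page(image_path="scan-001.tif", paragraphs=[
        Paragraph(lines=[Line(words=[make_word("Open"), make_word("RTK")])]),
    ]),
])
result = document_text(doc)
```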
Eight Steps to OCR
- Initialize the engine: RTKInit
- Get a document ID: RTKClientCreate
- Create & lock a page: RTKClientPageInsert
- Fill in the image: RTKImageFileRead
- Locate the text: RTKPageLocate
- Recognize: RTKPageRecognize
- Export: RTKClientExport
- Shut down the engine: RTKTerm
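The eight steps and their entry points can be sketched as a call sequence. Only the function names come from the lists above. The `StubRTK` class, the argument lists, the return values, and the file names are hypothetical, since the real signatures are not given here; the stub records calls instead of performing OCR.

```python
class StubRTK:
    """Hypothetical stand-in for the engine: logs each call, does no OCR."""

    def __init__(self):
        self.calls = []
        self.ready = False

    def _log(self, name):
        if name != "RTKInit" and not self.ready:
            raise RuntimeError(name + " called before RTKInit")
        self.calls.append(name)

    def RTKInit(self):
        self.ready = True
        self._log("RTKInit")

    def RTKClientCreate(self):
        self._log("RTKClientCreate")
        return 1  # made-up document ID

    def RTKClientPageInsert(self, client_id):
        self._log("RTKClientPageInsert")
        return 1  # made-up page handle

    def RTKImageFileRead(self, page_id, image_path):
        self._log("RTKImageFileRead")

    def RTKPageLocate(self, page_id):
        self._log("RTKPageLocate")

    def RTKPageRecognize(self, page_id):
        self._log("RTKPageRecognize")

    def RTKClientExport(self, client_id, out_path):
        self._log("RTKClientExport")

    def RTKTerm(self):
        self._log("RTKTerm")
        self.ready = False


def run_ocr(engine, image_path, out_path):
    """Walk through the eight steps in order, always shutting the engine down."""
    engine.RTKInit()                               # 1. initialize the engine
    try:
        client = engine.RTKClientCreate()          # 2. get a document ID
        page = engine.RTKClientPageInsert(client)  # 3. create & lock a page
        engine.RTKImageFileRead(page, image_path)  # 4. fill in the image
        engine.RTKPageLocate(page)                 # 5. locate the text
        engine.RTKPageRecognize(page)              # 6. recognize
        engine.RTKClientExport(client, out_path)   # 7. export
    finally:
        engine.RTKTerm()                           # 8. shut down the engine


engine = StubRTK()
run_ocr(engine, "invoice.tif", "invoice.txt")
```

Placing the shutdown call in a `finally` block is a design choice of this sketch, so that the engine is released even if an intermediate step fails.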
Options in OpenRTK
o Combinations of Available Options
- 10+ Optional Formats to Import
- 4 Options of Image Processing
- 2 Options of Layout Analysis
- 4 Options of Char/Font Recognition
- 10 Optional Formats to Export
- Template & Small Region Recognition
o Cater to various Doc & Form OCR needs
More APIs Available
o Basic APIs for fundamental OCR applications
o Extended APIs to support applications like TypeReader®
o Advanced APIs for more convenient RTK integration
o API customization available on client’s request
For details please email OCR_Consulting_Team@ExperVision.com to request the Open Recognition Toolkit Introduction.