Parascript works with a growing number of service providers that perform data extraction, which must be as accurate as possible for business operations to be successful and dependable. This article looks at how service providers can best ensure accurate data extraction and ground truth data.
When a document management challenge calls for data extraction, best practice is to fully understand the types of documents involved. The “nature” of the document is fundamental in determining the most appropriate technologies and techniques to use. For example, OCR alone cannot provide a comprehensive solution in many cases. Instead, OCR acts as an underlying, supporting technology that helps produce the final result.
Here we cover how to address the problems posed by a legacy system whose metadata is inadequate in both detail and coverage. Existing document types don’t have the metadata needed to support efficient governance and use, and new document types need to be created. Unless an organization is familiar with the […]
Selecting the right threshold for a specific application is important because it determines both the accuracy of the accepted answers and the amount of manual labor required. Finding this threshold takes some work, but the effort pays off. How does it work? When performing recognition or verification, the software evaluates an image and provides […]
“There’s no business like new business!” Or perhaps better said, “there’s no more expensive business than new business!” In most businesses, developing and onboarding new business takes a lot more effort than retaining or growing existing business. That’s part of the reason so much attention has been turned to customer loyalty, retention, and engagement. Regardless, […]
Form design plays a critical role in the success of capture and recognition. The right design elements improve both speed and accuracy rates, just as poorly designed forms increase manual processing. While most forms are now available online, allowing users to fill out the information directly on the computer, there […]
Image capture is the first step to successful document recognition. Image capture might not be under your control, but if it is, here are 3 simple rules to follow:
A great way to improve recognition results in general is to use a drop out form whenever possible. These forms are printed with ink (usually red) that the scanner can “drop out” during image capture. What you’re left with is a clean white page with well-positioned data. Using drop out templates is […]
Form design is essential to the success of recognition technology. Forms should be well designed to reduce illegible, inaccurate, misinterpreted or missing data, and to improve data recognition. It’s not always the case that you will be able to control the forms that you need to handle, but even if you can’t, often you can […]
Do you need to scan a variety of documents from various sources? Do the originals contain some color, a little color or a lot of color affecting the form elements? Do you wonder what settings will produce the best quality? Or what is the best choice for scanning documents into your workflow? These questions are […]
What’s the secret to getting project buy-in when you don’t have a specific budget or authority? We’ve all heard it a million times. If you want to get something done inside an organization, and you don’t have budget or authority, you need a sponsor. That’s easy for a management professor to say. But in the trenches […]
Let’s explore the mathematical model for optimizing the tradeoff between errors and rejects. The reject mechanism helps to guarantee the specified error level required by an application. Recognition engines usually return an answer accompanied by a parameter called the confidence value. The confidence value ranges from 0 to 100 and indicates how confident the engine is that […]
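As a rough illustration of the reject mechanism described above, the sketch below rejects every answer whose confidence falls below a threshold and measures the resulting error and reject rates against a ground-truth set. It is a minimal sketch under stated assumptions, not any particular engine's method: the names `Result`, `rates_at`, and `lowest_threshold_meeting` are hypothetical, the sample data is made up for illustration, and the error rate is assumed to be measured over accepted answers only.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Result:
    """One recognition result checked against ground truth (hypothetical type)."""
    confidence: int  # engine confidence, 0 to 100
    correct: bool    # True if the answer matched the ground truth


def rates_at(results: List[Result], threshold: int) -> Tuple[float, float]:
    """Return (error_rate, reject_rate) when answers below threshold are rejected.

    Assumption: error_rate is the share of wrong answers among accepted
    answers; reject_rate is the share of all answers sent to manual review.
    """
    if not results:
        raise ValueError("results must not be empty")
    accepted = [r for r in results if r.confidence >= threshold]
    errors = sum(1 for r in accepted if not r.correct)
    error_rate = errors / len(accepted) if accepted else 0.0
    reject_rate = (len(results) - len(accepted)) / len(results)
    return error_rate, reject_rate


def lowest_threshold_meeting(results: List[Result],
                             max_error_rate: float) -> Optional[int]:
    """Return the lowest threshold (0 to 100) whose error rate is within target.

    The lowest qualifying threshold is returned first, which keeps rejects
    (manual work) low. Returns None if no threshold meets the target.
    """
    for threshold in range(0, 101):
        error_rate, _ = rates_at(results, threshold)
        if error_rate <= max_error_rate:
            return threshold
    return None


# Made-up sample: confidence values paired with ground-truth correctness.
sample = [
    Result(95, True),
    Result(90, True),
    Result(80, False),
    Result(70, True),
    Result(40, False),
]

threshold = lowest_threshold_meeting(sample, max_error_rate=0.0)
if threshold is not None:
    error_rate, reject_rate = rates_at(sample, threshold)
    print(f"threshold={threshold} error={error_rate:.2f} reject={reject_rate:.2f}")
```

A simple scan like this stops at the first qualifying threshold. In practice the error rate is not guaranteed to fall steadily as the threshold rises, so it can be worth inspecting the full error and reject curve on a representative ground-truth set before settling on a value.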