Big Data forces a lot of big questions, and for good reason: it’s complex stuff. It encompasses both structured and unstructured information, and what’s new about Big Data is the tremendous volume of this information and the access organizations now have to it.
As computing power continues to grow, more sophisticated machine learning can be applied to all sorts of data, and attention naturally turns to how to apply these powerful technologies in practice to answer relevant questions. The problem becomes more complex when organizations wish to incorporate information stored within documents into their Big Data strategies.
Documents Core To Big Data Strategy
Rather than just tech-talk, let’s start with how documents provide significant value to an organization’s internal and external processes through Big Data. On the accounting side, there’s spend management. Every organization receives invoices and incurs employee expenses, both of which need to be managed to ensure that payment obligations are vetted and settled.
To perform these activities, proof of purchase or a record of the order is required. For business expenses, the record is the invoice and the purchase order. For individual employee expenses, receipts are often the primary record.
While many companies manage policy effectively for business-level expenses and individual employee reimbursements, rare is the company that uses the data in these documents within a Big Data strategy aimed at making business operations more efficient. And yet, consider what data these documents contain: they typically include vendor names, dates, amounts, addresses, shipping charges, and individual descriptions of items or services purchased.
Extracting The Right Data
What if these data were extracted and aggregated into monthly or quarterly reports based not only on spending categories, but also on the individual prices of items, the vendors involved in each sale, and the locations where purchases were made, and then blended and cross-referenced with publicly available retail data? All of a sudden, an organization has a benchmark of its spending on an item-by-item basis compared with known retail prices, along with insight into where and when the best available prices were offered. It can also identify substitutions for some items that enable cost reductions. The result is a more coordinated and comprehensive sourcing capability.
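To make the idea concrete, here is a minimal Python sketch of that kind of roll-up. It assumes the line items have already been extracted from invoices and receipts. The vendor names, items, prices, the `retail_prices` lookup, and the `benchmark` function are all hypothetical, invented purely for illustration. It groups purchases by quarter and item, then compares the average price paid with a retail reference price.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical line items, as they might look after extraction from documents.
line_items = [
    {"vendor": "Acme Supply", "item": "toner cartridge", "quarter": "2024-Q1", "unit_price": 82.00},
    {"vendor": "OfficeHub", "item": "toner cartridge", "quarter": "2024-Q1", "unit_price": 74.00},
    {"vendor": "Acme Supply", "item": "copy paper (case)", "quarter": "2024-Q1", "unit_price": 41.00},
]

# Hypothetical retail reference prices standing in for publicly available data.
retail_prices = {"toner cartridge": 75.00, "copy paper (case)": 45.00}


def benchmark(items, retail):
    """Aggregate line items per quarter and item, then compare with retail prices."""
    grouped = defaultdict(list)
    for row in items:
        grouped[(row["quarter"], row["item"])].append(row)

    report = []
    for (quarter, item), rows in sorted(grouped.items()):
        avg_paid = mean(r["unit_price"] for r in rows)
        best = min(rows, key=lambda r: r["unit_price"])
        retail_price = retail.get(item)  # None when no reference price is known
        report.append({
            "quarter": quarter,
            "item": item,
            "avg_paid": round(avg_paid, 2),
            "best_vendor": best["vendor"],
            "retail_price": retail_price,
            # Positive means the organization paid more than retail on average.
            "vs_retail": None if retail_price is None else round(avg_paid - retail_price, 2),
        })
    return report


for line in benchmark(line_items, retail_prices):
    print(line)
```

In a real deployment the same grouping could be extended with purchase location or spending category, and the retail lookup would come from whatever external price source the organization trusts.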
Next, let’s look at the marketing department. Several years ago, Gartner analyst Craig Roth introduced the concept of big content as a marketing-specific application dealing with hard-to-produce content such as eBooks, white papers, presentations, and videos. While there is no direct tie between Big Data and big content, the ultimate goal is to utilize big content to engage prospective customers, take their digital exhaust, and analyze it to understand where the prospect is with their purchasing decision and what materials work best…