Many organizations need to extract data or text from contracts, expense reports, tax documents and more — but the catch is that they have to hire someone to do it. But Amazon Textract could eliminate the need for manual data extraction.
According to Amazon Web Services, the technology “goes beyond simple optical character recognition (OCR) to identify the contents of fields in forms, information stored in tables, and the context in which the information is presented, such as a name or social security number from a tax form or the product SKU or quantity in a warehouse from an inventory report.”
That data can then be used to build searches on large datasets, or loaded into a database for use in applications.
More from Amazon Web Services:
Amazon Textract makes it easy for customers to accurately process millions of document pages in just a few hours, significantly lowering document processing costs, and allowing customers to focus on deriving business value from their text and data instead of wasting time and effort on post-processing. Results are delivered via an API that can be easily accessed and used without requiring any machine learning experience.
What Amazon Textract supports
Amazon says Textract’s API supports scans, PDFs, and photos. Customers can use it with database and analytics services like Amazon Elasticsearch Service, Amazon DynamoDB, and AmazonAthena and other machine learning services.
Who has used it?
- “We are building data pipelines to turn scanned medical charts into useful clinical information to improve care coordination, drive quality outcomes, and ensure appropriate reimbursement for members under our coverage,” said Steve Prewitt, Chief Analytics Officer at Healthfirst,.
- “We hope to use AmazonTextract to digitize millions of historical weather observations from document archives,” said Philip Brohan, Climate Scientist at Met Office, the UK’s national weather service. “Making these observations available to science will improve our understanding of climate variability and change.”
- “We have already used Amazon Textract to analyze tens of thousands of loan documents on behalf of financial institutions, and our own software-as-a-service offering has been enhanced by the service, enabling us to identify 95% of the defects in loan application packages and help banks reduce their manual data entry,” said Justin Wickett, Founder and CEO, Informed Inc.
Learn more on Amazon Web Service’s website