Amazon Textract is a machine learning (ML) service that automatically extracts text, handwriting, and data from any document or image. With the new Queries feature in the Analyze Document API, you can now specify exactly which data you need to extract from a document. You don’t need to know the structure of the data in the document (table, form, implied field, nested data) or worry about variations across document versions and formats.
In this post, we discuss the following topics:
Success stories from AWS customers and benefits of the new Queries feature
How the Analyze Document Queries API helps extract information from documents
A walkthrough of the Amazon Textract console
Code examples that use the Analyze Document Queries API
How to process the response with the Amazon Textract parser library
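As a quick preview of the idea described above, the following is a minimal sketch of how a Queries request might be built and its answers read back with the AWS SDK for Python (boto3). The bucket name, file name, query text, and alias are hypothetical placeholders, and the helper functions `build_queries_request`, `extract_answers`, and `analyze` are illustrative names, not part of any AWS library. The sketch assumes a single-page document stored in Amazon S3 and valid AWS credentials.

```python
from typing import Any, Dict, List, Optional, Tuple


def build_queries_request(
    bucket: str, name: str, queries: List[Tuple[str, str]]
) -> Dict[str, Any]:
    """Build keyword arguments for the Textract analyze_document call.

    `queries` is a list of (question text, alias) pairs. The alias is an
    optional label that makes the answer easier to find in the response.
    """
    return {
        "Document": {"S3Object": {"Bucket": bucket, "Name": name}},
        "FeatureTypes": ["QUERIES"],
        "QueriesConfig": {
            "Queries": [{"Text": text, "Alias": alias} for text, alias in queries]
        },
    }


def extract_answers(response: Dict[str, Any]) -> Dict[str, Optional[str]]:
    """Map each query (by alias, or by text if no alias) to its first answer.

    QUERY blocks point at QUERY_RESULT blocks through an ANSWER relationship.
    A query with no answer maps to None.
    """
    blocks = response.get("Blocks", [])
    results = {b["Id"]: b for b in blocks if b["BlockType"] == "QUERY_RESULT"}
    answers: Dict[str, Optional[str]] = {}
    for block in blocks:
        if block["BlockType"] != "QUERY":
            continue
        key = block["Query"].get("Alias") or block["Query"]["Text"]
        answer_ids = [
            result_id
            for rel in block.get("Relationships", [])
            if rel["Type"] == "ANSWER"
            for result_id in rel["Ids"]
        ]
        found = [results[i].get("Text") for i in answer_ids if i in results]
        answers[key] = found[0] if found else None
    return answers


def analyze(
    bucket: str, name: str, queries: List[Tuple[str, str]]
) -> Dict[str, Optional[str]]:
    """Call Amazon Textract and return the answers (requires AWS credentials)."""
    import boto3  # imported here so the helpers above work without the SDK

    client = boto3.client("textract")
    response = client.analyze_document(**build_queries_request(bucket, name, queries))
    return extract_answers(response)
```

With credentials configured, a call such as `analyze("my-example-bucket", "paystub.png", [("What is the year to date gross pay?", "YTD_GROSS")])` would return a dictionary keyed by alias. Keeping the request builder and the response reader separate from the network call makes both easy to check without calling the service.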
Benefits of the new Queries feature
Traditional OCR solutions struggle to