Skip to content

Building an OCR solution for document analysis with AWS Textract and AWS StepFunction

License

Notifications You must be signed in to change notification settings

gabrieltorreswm/poc-pdf-textract

Repository files navigation

Building an OCR solution for document analysis with AWS Textract and AWS StepFunctions.

In this post, I’ll talk about AWS Textract and AWS Step Functions and how they could combine to build remarkable solutions like a serverless OCR (Optical character recognition) processor, useful for Table extraction, like Bank Extract, Invoices or SpreadSheets for instance. But you can add to this as much “sugar” as you want to process any other type of files with different structures.

See more here: https://gabrieltorreswm.medium.com/building-an-ocr-solution-for-document-analysis-with-aws-textract-and-aws-stepfunctions-81b4932c9443

Screen Shot 2022-07-06 at 10 12 01

Installation

Use the package manager npm to install the dependency and then run serverless to deploy in your AWS account.

npm install 
serverless deploy 

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to update tests as appropriate.

License

MIT

About

Building an OCR solution for document analysis with AWS Textract and AWS StepFunction

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages