
AWS Textract and NLP

vkankar
Level 1

1. Please share a video or an example of using the Tesseract OCR plugin in Dataiku.

2. Does Dataiku provide a solution similar to AWS Textract, with comparable output?

3. Please share a reference link for NLP.

 

Regards,

Vikram

 

2 Replies
GregW
Dataiker

Hi Vikram,

You can find the documentation for the Tesseract OCR plugin, which contains a step-by-step example, here: https://www.dataiku.com/product/plugins/tesseract-ocr/

You can also find another OCR solution here: https://www.dataiku.com/product/plugins/natif-idp/

We don't currently have a pre-built plugin for Textract. However, our reference documentation for working with text data is here: https://doc.dataiku.com/dss/latest/unstructured-data/text/index.html

A full list of NLP-related plugins is available here: https://www.dataiku.com/product/plugins/?filter%5Bplugins-topic%5D=nlp
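Since there is no pre-built Textract plugin, one possible workaround is to call Textract yourself from a Python code recipe. The following is only a minimal sketch, not an official Dataiku integration: it assumes `boto3` is installed in the code environment and that AWS credentials are already configured. The function names `extract_lines` and `detect_text` and the default region are hypothetical choices made for illustration.

```python
from typing import Any, Dict, List


def extract_lines(response: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Flatten a Textract DetectDocumentText response into one row per text line.

    Textract returns a list of "Blocks"; only blocks of type LINE are kept here.
    """
    rows = []
    for block in response.get("Blocks", []):
        if block.get("BlockType") == "LINE":
            rows.append(
                {
                    "text": block.get("Text", ""),
                    "confidence": block.get("Confidence"),
                }
            )
    return rows


def detect_text(image_bytes: bytes, region_name: str = "us-east-1") -> List[Dict[str, Any]]:
    """Send image bytes to Textract and return the detected lines.

    Assumption: boto3 is available and credentials are configured.
    The region default is a placeholder, not a recommendation.
    """
    import boto3  # imported here so extract_lines can be used without boto3

    client = boto3.client("textract", region_name=region_name)
    response = client.detect_document_text(Document={"Bytes": image_bytes})
    return extract_lines(response)
```

The rows returned by `detect_text` are plain dictionaries, so they could for example be loaded into a pandas DataFrame and written to an output dataset from the recipe.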

Regards,

Greg

Anjaney
Dataiker

We also have a business solution that you can deploy from your DSS instance, which shows how to use our OCR plugin: https://knowledge.dataiku.com/10.0/kb/industry-solutions/interactive-doc-intelligence-esg/interactiv...
