AWS Textract and NLP
1. Please share a video or an example of using the Tesseract OCR plugin in Dataiku.
2. Does Dataiku provide a solution similar to AWS Textract output?
3. Please share a reference link for NLP.
Regards,
Vikram
Answers
-
Greg (Dataiker)
Hi Vikram,
You can find the documentation for the Tesseract OCR plugin, which contains a step-by-step example, here: https://www.dataiku.com/product/plugins/tesseract-ocr/
You can also find another OCR solution here: https://www.dataiku.com/product/plugins/natif-idp/
We don't currently have a pre-built plugin for Textract. For NLP, our reference documentation for working with text data is here: https://doc.dataiku.com/dss/latest/unstructured-data/text/index.html
A full list of NLP-related plugins is here: https://www.dataiku.com/product/plugins/?filter%5Bplugins-topic%5D=nlp
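Since there is no pre-built Textract plugin, one possible workaround is to call Textract yourself from a Python recipe. The snippet below is only a rough sketch, not an official Dataiku integration. It assumes that `boto3` is installed in the recipe's code environment and that AWS credentials with Textract access are already configured. The function names `extract_lines` and `ocr_with_textract` and the default region are hypothetical and chosen for illustration.

```python
from typing import Any, Dict, List


def extract_lines(response: Dict[str, Any]) -> List[str]:
    """Collect the text of every LINE block from a Textract response dict."""
    return [
        block["Text"]
        for block in response.get("Blocks", [])
        if block.get("BlockType") == "LINE" and "Text" in block
    ]


def ocr_with_textract(image_bytes: bytes, region_name: str = "us-east-1") -> List[str]:
    """Send one image (for example PNG or JPEG bytes) to Textract.

    Assumption: region_name is a placeholder; use the region of your account.
    """
    import boto3  # imported lazily so extract_lines works without boto3

    client = boto3.client("textract", region_name=region_name)
    response = client.detect_document_text(Document={"Bytes": image_bytes})
    return extract_lines(response)
```

In a recipe you would read the file bytes from a managed folder and pass them to `ocr_with_textract`, then write the returned lines to an output dataset. Keeping the parsing in `extract_lines` means it can be checked on a saved response without calling AWS.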
Regards,
Greg
-
Anjaney (Dataiker)
We also have a business solution, which you can spin up from your DSS instance, that shows how to use our OCR plugin: https://knowledge.dataiku.com/10.0/kb/industry-solutions/interactive-doc-intelligence-esg/interactive-doc-intelligence-esg.html