Sign up to take part
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
1. Please share video or example of using Tesseract OCR plugin in Dataiku
2. Do Dataiku provides similar solution to AWS Textract output
3. Please share reference link for NLP
You can find the documentation for the Tesseract OCR plugin, which contains a step-by-step example, here: https://www.dataiku.com/product/plugins/tesseract-ocr/
You can also find another OCR solution here: https://www.dataiku.com/product/plugins/natif-idp/
We don't currently have a pre-built plugin for Textract. However, you can find here the link to our reference documentation for working with text data: https://doc.dataiku.com/dss/latest/unstructured-data/text/index.html
And also here, a full list of related plugins: https://www.dataiku.com/product/plugins/?filter%5Bplugins-topic%5D=nlp
We also have a business solution that you can spin up from your DSS instance that shows how to leverage our OCR plugin - https://knowledge.dataiku.com/10.0/kb/industry-solutions/interactive-doc-intelligence-esg/interactiv...