In addition to the Python package pytesseract, the Tesseract system package must be installed on the machine that runs Dataiku. This is covered in the "How to setup" section of the plugin webpage: https://www.dataiku.com/product/plugins/tesseract-ocr/.
The Python package is only a wrapper that calls the Tesseract system package, and Dataiku cannot install that system package for you.
You can check that Tesseract has been installed by typing the tesseract command in your terminal.
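If you would rather run the same check from Python (for example in a notebook or recipe that uses the plugin's code env), here is a minimal sketch. It only uses the standard library; the helper name tesseract_status is made up for illustration, and it assumes the executable is called tesseract and is expected on the PATH.

```python
import shutil
import subprocess


def tesseract_status(binary="tesseract"):
    """Return (found, detail) for the Tesseract executable on the PATH.

    `tesseract_status` is a hypothetical helper, not part of pytesseract
    or of the Dataiku plugin.
    """
    path = shutil.which(binary)
    if path is None:
        return False, f"'{binary}' not found on PATH"
    # `tesseract --version` prints a version banner; some builds write it
    # to stderr instead of stdout, so look at both.
    result = subprocess.run([path, "--version"], capture_output=True, text=True)
    banner = (result.stdout or result.stderr).strip()
    first_line = banner.splitlines()[0] if banner else "unknown version"
    return True, f"{path}: {first_line}"


if __name__ == "__main__":
    found, detail = tesseract_status()
    print("OK" if found else "MISSING", detail)
```

Run it in the same environment that executes the recipe (the Dataiku server, or the container when using containerized execution), since that is where the system package has to be present.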
Yes, exactly. If you install the Tesseract library in the base image and also build the plugin code env for your container image, then containerized execution should work.