-
ModuleNotFoundError: No module named 'modellightgbm'
I'm training a LightGBM classification model on Dataiku using a Python 3.6 environment with the following packages on dataiku 13.3.1 version : scikit-learn>=0.20,<0.21 scipy>=1.2,<1.3 xgboost==1.5.2 statsmodels>=0.10,<0.11 jinja2>=2.10,<2.11 flask>=1.0,<1.1 cloudpickle>=1.3,<1.6 matplotlib==3.3.4 numpy==1.19.5…
-
Governing old projects : easing the archive or deletion
Hi, we have hundreds of dataiku projects on our instances, many of them are just old versions or failed experiments and are wondering if someone already worked on this. Today we are listing projects through the Python API, exposing to end users a list of projet keys, associated with the owner and lastModifiedOn timestamp…
-
Extract tables from PDF
Hello community, to perform RAG, I want to extract tables from PDFs. I would like to do this using Dataiku plugins, but the quality is not what I expect. Do you know of other methods to do this? Thanks !
-
Dataiku cloudでSparkを使う方法について
GCP環境で立ち上げたDataiku cloudでsparkを使おうとするとエラーが出てしまいます。 ドキュメントにはDataiku cloudではデフォルトでSparkが使えると書いてあるのですがPysparkレシピを実行すると画像のようなエラーが出ます。何か設定をする必要があるのでしょうか。 (globalのほうで質問をしてしまったのですが日本コミュニティのほうに再度質問させていただきます。失礼しました。) Operating system used: Windows11
-
Custom trigger to run itself?
Hi everyone, I want to execute scenario again if its fail, so it can try 3 times, sometimes kubernetes fails or, spark fails could be fixed after run again so, i dont miss time range between fail and fix manually. This is the code created with LLM, it used python for that. I changed project name to variable "project name",…
-
How to use "Execute Python unit test" scenario step
A new scenario step was added in a recent DSS version which is to execute a Python unit test. I'd like to start using this. However, the documentation is pretty brief: "this step executes one or more Python pytest tests from a project’s Libraries folder using a Pytest selector". Anyone have more details on or an example of…
-
Dataiku cloudでSparkを使う方法について
GCP環境で立ち上げたDataiku cloudでsparkを使おうとするとエラーが出てしまいます。 ドキュメントにはDataiku cloudではデフォルトでSparkが使えると書いてあるのですがPysparkレシピを実行すると画像のようなエラーが出ます。何か設定をする必要があるのでしょうか。 Operating system used: Windows Operating system used: Windows
-
How to use Notebook
Hi Team, I have signed in with my gmail account and not able to use environment. Can someone help me how to start using this enviornment or notebooks to work a sample hacthaton
-
How does the evaluation store threshold actually work?
In the documentation for the evaluation store, when doing a two-class (binary) classification, there is a slider for the threshold used. The documentation for this threshold reads in part: When doing binary classification, most models don’t output a single binary answer, but instead a continuous “score of being positive”.…
-
Learning more about Dataiku administration
Hi All, do we have any specific courses within Dataiku more specific towards administration part, in case of someone like to persue his carrier towards the administration part of the tool do we have and certification/ courses for them Mostly like how the other tools have specifically separated section for administration…