-
Python import from Library
I am new to Dataiku and am using the free version on my Macbook M1 laptop. I have created a python recipe and include a file from a library I created. The library has the following structure: ```Python/__init__.py rankfm_ge/__init__.pycodeA.py codeB.py```Inside my Jupyter notebook I import `codeA.py` as follows: ```…
-
time series forecasting recipe has bugs?
I'm trying to use time series forecasting recipe to predict multi-time series data. But it's always stopped at resampling steps in time series forecasting recipe as showed below. To debug, I created dummy data attached below with my time sequence which is same as my original data . Q is sequence number (1,2,3,4....), Q1 is…
-
How to scrub for keywords in an excel sheet for an email inbox
I currently have a excelsheet (sample data in image attached) showing the emails in my email with their body, subject, date sent etc. How do i make Dataiku scrub through the bodies to retrieve common keywords? eg) in the 9 emails there, dataiku will have 3 of them show up as "marketing enquiries" and so on. I believe text…
-
How to scrub for keywords in an excel sheet for an email inbox
I currently have a excelsheet (sample data in image attached) showing the emails in my email with their body, subject, date sent etc. How do i make Dataiku scrub through the bodies to retrieve common keywords? eg) in the 9 emails there, dataiku will have 3 of them show up as "marketing enquiries" and so on. I believe text…
-
View the tuning results
I would like to understand Optimization strategies. SESSION 1 is the result of setting multiple hyperparameters (default: 8 values), and SESSION 2 is the result of setting only one of those parameters, but SESSION 2 has a higher score. I thought that when I created several models with hyperparameters, the one with the best…
-
evolution of hyperparameter values going down
As shown in the image, sometimes the score is adjusted in the direction of going down, how does this work? I understood that in most cases the score would go up.
-
Spark Cluster mode
Hello, As we are using Spark heavily, we are having the problem of slowness of application launching in yarn cluster mode. The slowness comes from having many DSS related files and also many jars files has to be uploaded for every single spark application. We checked the feature of using Cluster mode. However, we know that…
-
How do I find an Archived Project?
I changed the Status of a project to "Archive" on several project home pages. I then realized that I want to look at my Archived Projects. However, I can not find them. Does anyone know how to resurrect an archived project? Operating system used: Sonoma 14.3.1
-
ERROR: poppler no found on variable PATH
Hello everyone, I'm facing an issue with a Python function embedded in a Dataiku API. The business case involves applying OCR to PDFs, and I'm encountering two problems: * The PATH environment variable throws an error, stating that it cannot find the poppler-utils library within that variable. * Failed: Failed to run…
-
Snowflake dataset override default connection details
Hi I was wondering what the exact names for the parameters were that need to be added in the 'specificSettings' to get a new managed dataset on snowflake be materialized in a catalog and schema different from the default details of the connection. I added a screenshot of the UI (I am looking for the database and schema…