-
Maintenance of Dataiku
I have read through the documentation for Dataiku logs, macros, etc. and was wondering if there is a solid plan with guidelines already out there that someone could just follow regarding maintenance of projects by the owner as well as maintenance of the product on the server. The documentation is way to technical. I am…
-
How can we get the best deep learning model from the multiple deep learning models?
Hello Team, I'm implementing an image classification use-case in dataiku using multiple deep learning models such as CNN, RESNET, DENSENET and etc.,. and want to get champion model out of it. Is there any out of box capability that gives champion model out of tried deep learning models based on chosen metric? I'm aware…
-
Dataiku and Dataverse
Dear all, is there exemple of connecting a table from Microsoft Dataverse on Dataiku ? Regards, Mike Operating system used: centOS
-
Visual Timer Series exit code 132
Hi, whenever I try to execute a Time Series training, I get: Training failed Read the logsProcess died (exit code: 132) This is the Logs (LONG):[2022/07/17-10:25:05.693] [FT-TrainWorkThread-xNsxm5L8-298] [INFO] [dku.analysis.prediction] - ******************************************[2022/07/17-10:25:05.694]…
-
Concurrency Run of the Scenario is Supported?
Hi, I have a scenario that I want to be triggered by many users at the same time. However, I found this particular information in the documentation. https://doc.dataiku.com/dss/latest/scenarios/triggers.html#concurrent-triggers:~:text=sorts%20of%20checks.-,Concurrent%20triggers,-In%20order%20to That seems to give tips to…
-
Best practice for setting up QA, stage, and production data-pipelines
Hello all, I've been building a large data-pipeline, and the project is starting to get messy, as I have been creating new branches as I develop new versions of the pipeline. So I want to ask: what are the best practices for separating projects into development, staging, and production? Should I separate development,…
-
Slow performance questions
Hello, I am having some performance issues. I have about 5000 rows of data where one of the columns is a a large amount of text, and importing this data from JSON was quick. But everything else I try to do is painfully slow. Since data-science tasks can include millions of rows, something must be wrong with my…
-
Catalog DSS
Hi All, We are trying to find TEXT is reference anywhere within a DATAIKU project. Say, for example, I have a DATASET "CUSTOMER" and I need to identify all the recipes/Scenarios using the same. Note that there are cases where on few recipes we don't dataset marked as input in the recipes however, its being referenced in…
-
Dataiku 10.0.6 released
Quick heads up that Dataiku 10.0.6 has been released and the full release notes are available at https://doc.dataiku.com/dss/latest/release_notes/10.0.html#version-10-0-6-may-20th-2022 DSS 10.0.6 is a very significant new release with both new features, performance enhancements and bugfixes. My highlights are these two API…
-
clustering algorithm for mixed data
Hello, I cant find any AutoML Clustering model for mixed data or maybe i dont know which one is usefull for mixed data. I created K-prototypes model in python notebook but it is not effective as AutoML Clustering. Mixed data: Binary (gender) and continues(age, income) data together Operating system used: windows 10