-
How to set my dataset to upload automatically when my tables are updated.
Hello! I want to know how to set my dataset to upload automatically when my tables are updated. My data is coming from two excel - SharePoint online. And then i concat then into one. Does someone knows how to help me?
-
Remove duplicate and returns 0 rows in the end
Hello! I`ve been trying to create a python recipe that remove duplicates based in a column and keep the last but when I try to use it in my database it remove every single row in the dataframe. My code was: # -*- coding: utf-8 -*-import dataikuimport pandas as pd# Recipe inputsdf =…
-
I am getting this error
Cannot start Chromium Web browser on server [1m[43m[30m Puppeteer old Headless deprecation warning:[0m[33m In the near future `headless: true` will default to the new Headless mode for Chrome instead of the old Headless implementation. For more information, please see…
-
Failed to synchronize Hive Metastore for dataset error
Hi, I am running 3 flows under the same project with the same prepare recipe and I get the following error message on the 3rd flow and not on the other 2 flow. HIVE_METASTORE_SYNCHRONIZE occurred 1 times Failed to synchronize Hive metastore for dataset The source of the dataset are CSV files from a SharePoint. The flow is…
-
Excel file import [R recipe]
I have an Excel file in a managed folder, I can't read my file using : data <- dkuManagedFolderDownloadPath("dsdsJk", "file") Even using the different options of as = : "raw", "text", "parsed" I get nothing.
-
attach Dynamic file in folder in scenario
Hello everyone, After the creation of a scenario there is an option of adding Reporter to it, i tried creating a reporter as mail and add the file that the scenario create (scenario runs weekly and name the file based on the date). I want to attach the file within the folder with all the other files present, in the mail…
-
How to filter experiment runs using start time [revised]
Hi Dataiku Community, I would like to know how we can filter experiment runs using the experiment run's start time. filter_string = "attributes.start_time > '2024-03-26 03:05:48'"order_by = ["metrics.accuracy DESC"]with project.setup_mlflow(managed_folder=experiment_managed_folder) as mlflow: experiment =…
-
Writing to partitioned dataset using the writer
I'm trying to overwrite a table using data from another table (with the same schema). I keep running into the issue that both datasets are partitioned and the writer does not like that (same case with the copy_to function). Here is what I'm trying to do: ERROR:dataiku.core.dataset_write:Exception caught while…
-
Backset and forcast is one month ahead - Time series forecasting
I have a problem with the time series forecasting, the backset is one month ahead and also the forecast. Other problem is that I have a 36 month dataset and the forecast output that dataiku is giving me is in month 36, so actually the time series is not giving me month 37 I am uploading a picture of the problem. Anyone…
-
Improve time granularity in charts when using a date object
Hello, I had a time serie with timestamps on the horizontal axis. The best granularity I could have was ticks for hours in "date ranges", but the event I was watching was taking place during seconds/minutes: it would be awesome to have a "minutes" or "seconds" in "date ranges". This is exactly the same idea of what's…