Hi, I am working on data in S3 which is partitioned by timestamp on filename. I need to repartition the data using a column value as the files contain unordered timestamp data. I tried redispatch part…
Hi All, I'm looking for advice on a solution that I am looking to develop. Specifically we have a series of projects (19 in total) that contain the same outputs but slightly different logic in terms o…
Hello When I tried to deploy my API to the deployer, I got a warning said "Deployment is incomplete on an API node". The API Node already installed and started though. What should I do? I also got an …
Hi! I have a dash webapp running. I would like to be able to check whether the webapp backend is still running , so that I can restart it with a scenario if it has stopped . Dataiku scenarios do suppo…
From a Python recipe, I am trying to write a dataframe from a SQL query that has a little over 13k rows. Using the following code, DSS is only writing 10k rows. How do I get the remaining rows? DataWi…
Hi i would like to ingest all partitions into python dataframes, updating these partitions by applying some calculations and then write these updated partitions back to the output DSS dataset. can any…
Going through the hard time while challenging the schema change effect from the sources. can anyone please help on the below query. i have a partitioned whole dataset (consist the data of all the mont…
Hello, I'm getting this error whilst trying to train a time series model on GPU. OSError: libnvToolsExt.so.1: cannot open shared object file: No such file or directory I have done the following so far…
Hi Dataiku Community, Could be please help me with this scenario? I would like to know if there are any options in Dataiku which accomplish the below task. I have uploaded a file from my local directo…
Hi, from the formula language in the prepare recipe, i am looking to filter the data with multiple columns with a specific value or in a list for each column. Ex : column 'A' has 5 values, out of 5, 3…