-
folder.get_download_stream I/O operation on closed file
Hi experts, Dataiku prompts this error ValueError: I/O operation on closed file. when I tried to execute the script below: with folder.get_download_stream(path_csv) as f: data = pd.read_csv(f, encoding='latin-1') The reason I put that encoding is due to the utf-8 error if encoding is not specified ( 'utf-8' codec can't…
-
Recipe for implementing PCA.
Hi there, I am trying to implement PCA on a dataset with 700 columns. How do i do this in DSS
-
want to load multiple dataframes to seperate csv files in a folder
want to extract 5 files (100s more later) to 5 csv files in a folder, error is <type 'exceptions.AttributeError'>: 'str' object has no attribute 'to_csv' or is there a better way to do it test1 = dataiku.Dataset("test1")df1 = test1.get_dataframe()test2 = dataiku.Dataset("test2")df2 = test2.get_dataframe()test3 =…
-
How can I query Dataiku's commit log from the public API client?
I would like to use DSS's public API to query the internal commit log. In particular, I would like to obtain: 1. The project key 2. The commit timestamp 3. The commit ID of the latest timestamp I know that it is possible to list all Project and Commits using an Internal DSS dataset. However, I would like to accomplish the…
-
Project duplication with API not keeping triggers
Hello, I'm using the API to duplicate my projects on releases from development to production (as opposed to the UI duplication to save time as there are 30+ projects we duplicate every time). On duplicating the projects I noticed that the scenario triggers are not automatically enabled in the newly duplicated project, even…
-
Cumulative sum on raws by group
Hi Dataiku Folks, Do you know if there is a visual way to go from the table 1 to table 2 (values are all "1" to simplify the example). Output dataset will have: two new columns, no aggregation but cumulative sum by two ids. Window recipe with custom aggregation seems not able to create new columns ; prepare recipes seems…
-
Unfold HR Hierarchy
Hi! I am looking for some advise on extracting relationships within a HR hierarchy. This is a problem I know how to resolve with code but I just want to understand if I am missing any solution to this using the visual recipes that exist. Below are sample input and outputs... Before…
-
Memory of DSS
Is it possible to train a text aware scenario to find patterns with the 200k lines? For example extracting keywords and so on, would the size of data be a problem?
-
Dash Webapp not working after a while
I have dash Webapp published at Dashboard, the drop down list works fine when I open it. However, it does not work after a while, then I have to refresh the browser (chrome) to make it work. Then it happens again after a while, what could be the reason and how to fix it? thanks a lot. Operating system used: Win10
-
Saving a dataframe from a notebook to a specific zone as a csv file in DSS
hi i have a dataframe that needs to be saved to a specific zone in my project. how do i achieve this? output_ds = dataiku.Dataset("myoutputdataset")output_ds.write_with_schema(my_dataframe)