Using Dataiku

How to prevent DSS from replacing NA with null?
Hi, I'm using a Python recipe to query and insert data into the output SQL Server dataset, as below.
import dataiku
import pandas as pd
from dataiku import SQLExecutor2
# Read recipe inputs
p787PDMItem = dataiku.Dataset("_P787PDMItem_src")
p787PDMItem_df = p787PDMItem.get_dataframe()
# Initialize an empty DataFrame to collect all…
UI looks wrong when joining a dataset with itself
Hello, For a project I'm currently working on, I need to join a dataset with itself using different filters and computed fields each time. => something that would look like SELECT * FROM (SELECT computed_field_1 FROM data WHERE filter_1) AS data_1 JOIN (SELECT computed_field_2 FROM data WHERE filter_2) AS data_2 ON…
Dataset in Email Body
Hi Team, Can we use data from a DSS table when creating an email body? I want data prepared in my DSS workflow to be displayed in tabular form in the email body. I am fine with writing custom Python code for it. Thanks in advance.
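One possible approach with custom Python, offered as a minimal sketch rather than a Dataiku feature: render a pandas DataFrame as an HTML table and place it in an HTML email body. The function name `dataframe_to_html_email`, the sample data, and the addresses are hypothetical; in a recipe the DataFrame would presumably come from `dataiku.Dataset(...).get_dataframe()`. The sketch only builds the message and does not send it.

```python
import pandas as pd
from email.mime.text import MIMEText


def dataframe_to_html_email(df, subject, sender, recipient):
    """Build an HTML email whose body contains df rendered as a table."""
    # DataFrame.to_html produces a <table> element; index=False drops the row index.
    html_table = df.to_html(index=False, border=1)
    body = "<html><body><p>Latest data:</p>" + html_table + "</body></html>"
    msg = MIMEText(body, "html")
    msg["Subject"] = subject
    msg["From"] = sender
    msg["To"] = recipient
    return msg


# Hypothetical sample data standing in for a dataset read in a recipe.
df = pd.DataFrame({"region": ["North", "South"], "sales": [120, 80]})
msg = dataframe_to_html_email(
    df, "Daily report", "dss@example.com", "team@example.com"
)
```

The resulting `msg` can be handed to any SMTP client (for example `smtplib.SMTP.send_message`), assuming a mail server is reachable from the node.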
Managed Folder contents location unicity
Good day! Are managed folder contents entered into any type of logging system, backup system, or version control? I.e., can those contents be found in places other than the managed folders themselves? I'm assuming that's not the case, and the actual managed folder is the only place where the contents/data are actually stored…
Finding flow zones by tag name?
Hi, Would like to be able to find flow zones by tag name using the Python DSS, such that users can select matching flow zones for execution using their own parameterized variables. Is this possible? thx Operating system used: Windows 10
Calculating percentage individual sales based on monthly total sales
I have a dataset with two columns representing individual sales values and another column representing the month. I want to create an additional column that calculates the percentage of individual sales values relative to the total sales values for each month.
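A minimal pandas sketch of the calculation described above, for use in a Python recipe. The column names `month`, `sales`, and `pct_of_month` and the sample values are assumptions, since the post does not give them; the sketch also assumes a single sales column.

```python
import pandas as pd

# Hypothetical data: one row per individual sale, with its month.
df = pd.DataFrame({
    "month": ["2024-01", "2024-01", "2024-02", "2024-02", "2024-02"],
    "sales": [100.0, 300.0, 50.0, 50.0, 100.0],
})

# transform("sum") returns the monthly total aligned to every row of that month,
# so each sale can be divided by its own month's total directly.
monthly_total = df.groupby("month")["sales"].transform("sum")
df["pct_of_month"] = df["sales"] / monthly_total * 100
```

Using `transform` rather than `agg` keeps the original row count, which is what a per-row percentage column needs.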
How to create dynamic regex function
I have a column called filename and the field is formatted as name1_name2_name3_name4_name5_YYYYMMDDHHMMSS.txt.pgp name1_name2_name3_name4_YYYYMMDDHHMMSS.txt.pgp name1_name2_name3_name4_name5_name6_YYYYMMDDHHMMSS.txt.pgp up until now, the number of names can differ, but the filename always ends with YYYYMMDDHHMMSS.txt.pgp…
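A minimal sketch of one pattern that handles a variable number of name parts, assuming the goal is to separate the names from the trailing timestamp (the post is truncated, so the exact requirement is not known). The names `FILENAME_RE` and `split_filename` are hypothetical.

```python
import re

# Any run of underscore-separated names, then "_", a 14-digit timestamp
# (YYYYMMDDHHMMSS), and the fixed ".txt.pgp" suffix anchored at the end.
FILENAME_RE = re.compile(r"^(?P<names>.+)_(?P<ts>\d{14})\.txt\.pgp$")


def split_filename(filename):
    """Return (list_of_names, timestamp) or None if the name does not match."""
    match = FILENAME_RE.match(filename)
    if match is None:
        return None
    return match.group("names").split("_"), match.group("ts")


four = split_filename("name1_name2_name3_name4_20240131235959.txt.pgp")
six = split_filename(
    "name1_name2_name3_name4_name5_name6_20240131235959.txt.pgp"
)
```

Anchoring on the fixed suffix, rather than counting name parts, is what lets the same pattern cover four, five, or six names.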
Using a Dataset's Column in Email Body
Hi all, simple question here: how can I add a dataset's column to my mail body? ${column_name} doesn't work. I thought I could add it as a local variable, but I don't know about the configs. Could you please help out? Thanks.
Percentage of individual values based on total value
I have a dataset with two columns representing individual sales values and another column representing the month. I want to create an additional column that calculates the percentage of individual sales values relative to the total sales values for each month.
BigQuery connection
Hello, Dataiku Team. If I want to deploy a project on an Automation node, but my project has a connection to BigQuery, I need to create the same connection on the Automation node, right? The thing is that I do not have the same options on the Automation node as on the Design node. On the Design node I have set a section called Path…

Trending Discussions

Docs for "pandasutils"?
Hello, My apologies if this is a remedial question, but at the start of every Python recipe the boilerplate code includes an import of: from dataiku import pandasutils as pdu Is there documentation for pandasutils? Is it a package that can be used in Python recipes? I've tried looking through the Dataiku Developer Guide,…
Run a Time Series Forecasting Model
I get the following error message: Failed to train : <class 'ImportError'> : libcuda.so.1: cannot open shared object file: No such file or directory. Operating system used: 13.1.4
Identifying the Node Type in a DSS Notebook using Python
In Python, in a DSS notebook, I want to know if the code is running in the design node or the automation node. How can I do that?
