Using Dataiku

Data Modeling
Hi all, could you please share the steps to create a star schema data model using Dataiku? The tables are Time_Dimesion, emp_Dimension, Customer_Dimension, Product_Dimension, and Sales_Fact. Thank you.
AI code assistant in Jupyter not displaying full response
See the image below.
How do I find the file path of images in a folder so I can extract them and put them in a PDF?
I have the flow below, in which I want to use the images stored in the Photos folder. There are two images in the folder; how can I get them and add them to the script below, which creates PDF files? This is the part of the script where the error occurs: pdf.image('/project/XX1111/ABC12DDD/apple.jpeg')…
Divide a column by another one
Hi, I need to compute GDP per capita. I have a GDP column and a population column. How can I divide GDP by population?
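For the division asked about above, here is a minimal sketch in plain Python. It assumes each row is a dictionary with `GDP` and `population` keys. The sample values and the helper name `gdp_per_capita` are illustrative and do not come from the post. In a Dataiku Python recipe the same arithmetic would typically be applied to DataFrame columns rather than to a list of dictionaries.

```python
from typing import Optional


def gdp_per_capita(gdp: Optional[float], population: Optional[float]) -> Optional[float]:
    """Return gdp / population, or None when the ratio is undefined."""
    # Guard against missing values and division by zero.
    if gdp is None or population is None or population == 0:
        return None
    return gdp / population


# Hypothetical sample rows; the real dataset would come from the flow.
rows = [
    {"country": "A", "GDP": 1000.0, "population": 50},
    {"country": "B", "GDP": 2500.0, "population": 0},
    {"country": "C", "GDP": None, "population": 10},
]

# Add the derived column to every row.
for row in rows:
    row["GDP_per_capita"] = gdp_per_capita(row["GDP"], row["population"])
```

Returning `None` for a zero or missing population keeps bad rows visible instead of raising an error partway through the dataset.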
Admin API
Hi, I'm asking about the Jobs section on the page "/admin/monitoring/background-tasks". How can I get this data using the HTTP API or the Python API?
Where is the PDF output from my python script stored using fpdf?
I installed the fpdf Python package in a code env, created a Python notebook, and ran this code:
    import fpdf
    from fpdf import FPDF
    # Create instance of FPDF class
    pdf = FPDF()
    # Add a page
    pdf.add_page()
    # Set font
    pdf.set_font("Arial", size = 15)
    # Add a cell
    pdf.cell(200, 10, txt = "Hello", ln = True, align = 'C')
    #…
Create a report template and add dynamic values to it
Hello, I must create a report template with some fixed text, dynamic values, and calculated graphs. As a simple example, I want my report to have the following info. This is the annual summary of Client A's YYYY orders. [Images of some graphs for Client A's orders ] Where Client A and YYYY values are populated dynamically…
Metrics and Checks
I am archiving the runs of a recipe as a CSV dataset in a folder on Dataiku and syncing the latest run to a separate dataset. Can I set up a check that compares the latest run with the previous run rather than with predefined numbers?
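The comparison described above can be sketched in plain Python as a relative-change test between two metric values, for example row counts from the previous and latest runs. The function name `compare_runs`, the 10% tolerance, and the status strings are assumptions made for illustration. Wiring this logic into a Dataiku custom check is not shown here.

```python
def compare_runs(previous: float, latest: float, tolerance: float = 0.10) -> str:
    """Classify the relative change between two runs as OK, WARNING or ERROR."""
    # A zero baseline has no meaningful relative change; treat any move as an error.
    if previous == 0:
        return "OK" if latest == 0 else "ERROR"
    change = abs(latest - previous) / abs(previous)
    if change <= tolerance:
        return "OK"
    # Hypothetical policy: up to twice the tolerance is only a warning.
    if change <= 2 * tolerance:
        return "WARNING"
    return "ERROR"


# Example: row counts from the archived previous run and the latest run.
status = compare_runs(previous=1000, latest=1050)
```

Because the baseline is the previous run rather than a fixed number, the threshold adapts as the dataset grows or shrinks over time.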
Need to remove duplicate rows or replace values
My rows repeat the same information because I now have two columns that contain a computer name. One of those columns holds different computer names (from another database), so the results are duplicated for each computer name that differs. I need to get rid of…
Impersonation
We have set up two nodes, design and automation, and were able to move users and connections from the design node to the automation node using the .json files. Is there a similar way to copy the user impersonation details from design to automation? Operating system used: Linux

Trending Discussions

Docs for "pandasutils"?
Hello, My apologies if this is a remedial question, but at the start of every Python recipe the boilerplate code includes an import of: from dataiku import pandasutils as pdu Is there documentation for pandasutils? Is it a package that can be used in Python recipes? I've tried looking through the Dataiku Developer Guide,…
Run a Time Series Forecasting Model
I get the following error message: Failed to train : <class 'ImportError'> : libcuda.so.1: cannot open shared object file: No such file or directory Operating system used: 13.1.4
Identifying the Node Type in a DSS Notebook using Python
In Python, in a DSS notebook, I want to know if the code is running in the design node or the automation node. How can I do that?
