-
How do i create a categorisation model for a reviews dataset
Hi there - new to dataiku, Lets say i have an excel sheet of 2 columns where one has app reviews and the other has dates they were posted. Is there a video tutorial anywhere or example where i can create a model to categorise the app reviews into categories eg) ux/ui problem or customer service problem as well as include…
-
Converting Python script to Python Recipe to output multiple files locally
Hi, I have a basic python script that reads in a file and splits this file (based on a supplier) into multiple CSV file and stores them in a dataiku managed folder. I now want to convert this notebook into a python recipe but am unable to do so as it looks as if the recipe is looking for a singular output.. Help.... Code…
-
Import a PDF file
I am trying to import a PDF file to read data and checking this tutorial:Tutorial | Managed folders - Dataiku Knowledge Base. But I meet a new problem: I don't have admin permission to create or edit python env. to install the package. Is there any way to install the package only in my project?
-
Dataset approval workflow similar to Dataiku Govern
Hi @Everyone, I would like to implement the approval workflow for one or different versions of Datasets while publishing it to a Collection. This will assist us in avoiding any drastic change in data which could cause an issue in downstream flows. So, intentional pipeline would be: * A flow will be triggered, which will…
-
Happy 10th anniversary to all Dataikers out there!
It's a great achievement in such a short time!
-
Update Dataset with Dash
Hi there, I am relatively new to Dataiku, I have looked in the forum for a similar question, but I haven't found anything. I am developping a webapp with Dash. I want the user to modify some values of a dataset, and these changes to be updated in the flow. Given on some inputs of the user, the webapp displays a…
-
time series forecasting using prophet model get error during run time
######## python and prophet version details prophet version=1.1.4 python_version=3.9.12 I am using the same version in my local machine its working file. but i am facing error on DATAIKU ########### Code Sample ######## lag_periods = [1,5, 10, 15] for lag in lag_periods: df_prophet[f'Lag_{lag}'] =…
-
Data transformation
Hi All, This is a mock data of my input : Input RuleAccountEntityAuditR1a1,a2e1 R2 e2-c2 This has to be split and transformed to this form : RuleDimensionValueR1Accounta1R1Accounta2R1Entitye1R2Entitye2R2Audit-c2 Then have to search a databse, say "db1" and find the levels :…
-
Fuzzy Match
i need to do fuzzy match based on jaro distance .I have two columns (X, Y). I have two unique values in the Y column.The fuzzy match need to take the shortest string from the X column and it should compare with another Y column's X values.likewise it need to do for all the X column values. If it satisfied the predefined…
-
Project deployment using python script on deployer
Hi Team, we are using the below script to deploy project but getting the error : DataikuException: com.dataiku.dip.server.controllers.NotFoundException: API service does not exist: TESTING_2 import dataikuapi import json host = "https://dss-dataiku:1999" apiKey= "oNnek9DtMI" client = dataikuapi.DSSClient(host,apiKey)…