-
How do i create a categorisation model for a reviews dataset
Hi there - new to dataiku, Lets say i have an excel sheet of 2 columns where one has app reviews and the other has dates they were posted. Is there a video tutorial anywhere or example where i can create a model to categorise the app reviews into categories eg) ux/ui problem or customer service problem as well as include…
-
Thoughts on Building a Unique Query In Dataiku?
Hello All! I'm looking to build a specific query in Dataiku and, I'm sorry to say, I'm quite the Noob. I have an extract of all transactions within a market that includes item descriptions, transaction dates, unique customer IDs, and total price. I'm specifically looking to analyse the purchases that an average customer…
-
SPLIT & PREPARE RECIPE
Hi All, I have a task to create a split recipe as well as a prepare recipe & has to be automated using python code. I'm new to Dataiku & would like to know how it can be done. The model data can be assumed as follows : we have 20 columns of which only 4 needs to selected. This data needs to split as per the following rule…
-
Does dataiku support schema Evolution ?
Hello guys, We have many cases that required adding new columns and the dataset. However, The issue of this dataset is shared across many projects and used downstream. Does dataiku support schema Evolution ? is adding new coulmns affecting visual or code recipe in downstream datasets ? Thanks Kind regards
-
Custom aggregation in specific set of rows
Please see an example of data table I have. I would like to convert the absolute counts in table 1 into percentages such that denominator is total count of all categories in each account as shown in table 2 below. Please help Table 1 (Current Format) Accounts Category Count Account A X 95 Account A Y 100 Account B Z 500…
-
How to combine several rows to one rows?
Hello, My data looks like this: Recordsvaluesrecords_0_NameJimmyrecords_0_Number1records_0_StatusStudentrecords_1_NamesMarierecords_1_Number2records_1_StatusWorker And i want it looks like this: NameNumberStatusJimmy1StudentMarie2Worker Any ideas?
-
Removing duplicate columns
Dear community, I have such a case: - I have a large database that needs cleaning. - while performing the typical cleaning activities (parsing etc.) I discovered that I have numerous columns that are just duplicates of one another (judging by basic analysis it's hundreds) but with different names. Example: 1 column name is…
-
Function does not reduce error
Hi, I'm facing some trouble in the following python recipe. # -*- coding: utf-8 -*-import dataikuimport pandas as pd, numpy as npfrom dataiku import pandasutils as pdufrom statsmodels.stats.stattools import medcouple# Read recipe inputsCOLETA_f_datas = dataiku.Dataset("COLETA_f_datas")COLETA_f_datas_df =…
-
Using Dataiku DSS for EDA purposes
Hi I'm a big fan and use DSS often and would like to know if everyone finds the visualisations in DSS fascinating? The latest version (I saw in the demo) has even more and really good visualisations. I was wondering if anyone uses DSS for EDA purposes as well and if so you can share some tips with me on how you go about -…
-
Tell us about your Dataiku DSS experience!
Hi Dataiku DSS Community, Part of the fun stuff we get to do here is to continually make improvements to DSS to make it an even better analytics tool for you to use. We are launching a research study to get insights on how you, our wonderful users, use Dataiku Data Science Studio and are particularly interested in getting…