Using Dataiku
- I have 2 files / sheets, one that shows file names and one that shows colors. What I'm trying to do is somehow link these two datasets, if filename contains one of the color…
- Hi, I have a basic Python script that reads in a file and splits this file (based on a supplier) into multiple CSV files and stores them in a Dataiku managed folder. I now want to convert this notebook…
Last answer by Turribeach
Please always post code using the code block (the </> icon in the toolbar), as otherwise the indentation is lost, and as you probably know, Python code cannot run without the proper indentation.
There should be no reason why your code doesn't work in a recipe. Please post the error/issue/behavior that you see, as "unable to do so" doesn't really say much.
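For reference, the splitting step described in the question could look roughly like the sketch below. It is only an illustration, not the poster's code: the column name `supplier`, the function name `split_by_supplier`, and the one-file-per-supplier naming are all assumptions. Writing each result into the managed folder is deliberately left out, since it depends on the folder and on the Dataiku folder API.

```python
import csv
import io
from collections import defaultdict


def split_by_supplier(rows, supplier_column="supplier"):
    """Group rows by supplier and return {file name: CSV text}.

    `rows` is a list of dicts, one per input line (hypothetical input shape).
    """
    groups = defaultdict(list)
    for row in rows:
        groups[row[supplier_column]].append(row)

    outputs = {}
    for supplier, group in groups.items():
        buffer = io.StringIO()
        writer = csv.DictWriter(
            buffer, fieldnames=list(group[0].keys()), lineterminator="\n"
        )
        writer.writeheader()
        writer.writerows(group)
        # Assumed naming scheme: one CSV per supplier value.
        outputs[f"{supplier}.csv"] = buffer.getvalue()
    return outputs


sample = [
    {"supplier": "acme", "item": "bolt"},
    {"supplier": "zenith", "item": "nut"},
    {"supplier": "acme", "item": "gear"},
]
for name, text in split_by_supplier(sample).items():
    print(name)
    print(text)
```

Keeping the grouping logic in a plain function like this makes it possible to run the same code in a notebook and in a recipe, with only the read and write steps differing.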
- I'm starting to work with the Fuzzy Joins and having good luck. However, I'm trying to figure out when I might want to use a Relative Threshold related to the Right or Left Table when doing an overall …
- I have a table that is presorted that shows teams and players. What I'm trying to do is create some kind of calculation to fill my blank cells for team, I want the cells to be filled with the last non…
Last answer by Turribeach
Nice one @ZachM, but it didn't work for me as PostgreSQL doesn't support First/Last not null. This can be done in a SQL recipe relatively easily, though:
SELECT
    t1."Player",
    t1."Team",
    (
        SELECT t2."Team"
        FROM "CT_TEST_book3_copy" t2
        WHERE t2."Player" = (
            SELECT MAX(t3."Player")
            FROM "CT_TEST_book3_copy" t3
            WHERE t3."Player" <= t1."Player"
              AND t3."Team" IS NOT NULL
        )
    ) AS "Calc_Team"
FROM "CT_TEST_book3_copy" t1;
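The same "fill with the last non-blank value" idea can also be sketched in plain Python, for example in a Python recipe. This is a minimal illustration rather than the approach from the answer above: the function name `forward_fill` is hypothetical, and it assumes the rows are already in the desired order and that blanks appear as `None` or empty strings.

```python
from typing import List, Optional, Sequence


def forward_fill(values: Sequence[Optional[str]]) -> List[Optional[str]]:
    """Replace each blank entry with the most recent non-blank entry before it."""
    filled = []
    last_seen = None
    for value in values:
        # Treat both None and the empty string as blank cells.
        if value is not None and value != "":
            last_seen = value
        filled.append(last_seen)
    return filled


teams = ["Red", None, None, "Blue", "", "Green"]
print(forward_fill(teams))
```

Leading blanks stay as `None` because there is no earlier value to copy from.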
- If I have a string value formatted as firstname_middlename_lastname_initials_age_gender and I only need everything before age, is there a formula I can use to give me everything before the fourth inst…
Solution by Sarina
Hi @abalo006,
You can indeed use a formula to accomplish this. If you are quite certain that you always want the first 4 strings, you could use a formula like this, where `line` is the column name:
if(arrayLen(split(line, '_')) > 3, join(split(line, '_')[0,4], '_'), 'none')
This checks that there are indeed at least 4 strings separated by the "_" character, and then re-joins the first four strings with a "_" character (though you could of course modify this as you like).
You may wish to add a few additional if statements within the formula step to ensure that you always return expected results in case any data is missing, etc.
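For comparison, the same logic can be written as a short Python function. This is only a sketch that mirrors the formula above; the function name `first_four_parts` and the sample value are made up for illustration.

```python
def first_four_parts(value: str, sep: str = "_") -> str:
    """Return the first four separator-delimited parts, or 'none' if there are fewer than four."""
    parts = value.split(sep)
    if len(parts) > 3:
        # Re-join only the first four parts with the same separator.
        return sep.join(parts[:4])
    return "none"


print(first_four_parts("john_paul_smith_jps_42_m"))
print(first_four_parts("a_b_c"))
```

As with the formula, the `'none'` fallback is a placeholder and can be swapped for whatever default suits the data.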
Thanks,
Sarina
- I have data that has a column with the days of the week, source and season. I'm trying to split my days column to rows. I have picture of my current input and expected output. I tried to use the split…
- I have a dataset that has a list of dates and days of the week, is there a way I can filter my dataset for dates that are less than or equal to the current date? I tried the formula date = now() but I…
Solution by tgb417
Once you have parsed the dates, you can filter in many ways.
There is a dedicated date filter step you can add to a visual Prepare recipe.
https://doc.dataiku.com/dss/latest/preparation/processors/filter-on-date.html
You can also use the formula filter step. Here is the documentation for that type of step.
https://doc.dataiku.com/dss/latest/preparation/processors/filter-on-formula.html
Finally, the date range section of this page may help.
https://doc.dataiku.com/dss/latest/preparation/filter-flag.html#filter-on-date-range
However, you must first have a parsed date to do any filtering.
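The "parse first, then filter" order can be illustrated outside the visual recipe with a small Python sketch. It is not the Dataiku processor itself: the column name `date`, the `%Y-%m-%d` format, and the function name `on_or_before` are assumptions chosen for the example.

```python
from datetime import date, datetime


def on_or_before(rows, today, column="date", fmt="%Y-%m-%d"):
    """Keep rows whose date string, once parsed, is on or before `today`."""
    kept = []
    for row in rows:
        # Parsing must happen before any comparison; raw strings are not dates.
        parsed = datetime.strptime(row[column], fmt).date()
        if parsed <= today:
            kept.append(row)
    return kept


rows = [
    {"date": "2024-01-10", "day": "Wednesday"},
    {"date": "2024-03-01", "day": "Friday"},
]
# In practice `today` would be date.today(); a fixed value keeps the example repeatable.
print(on_or_before(rows, today=date(2024, 2, 1)))
```

Note the comparison is `<=` rather than `==`, which is why a formula such as `date = now()` would not return the earlier dates.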
- I have a table with 3 columns: date, color and time. I'm trying to create some kind of sequence field that takes into consideration the first time the color was seen on a specific day. Ex. green w…
- I have a column with round floats (e.g. only 1.0, 100.0, etc.) and manually defined it as type "double" in a visual recipe. However, once the dataset is loaded into another visual recipe, the visual r…
- Hi there, I encounter the sudden issue of not being able to load datasets into a Jupyter Notebook. Changing environment/Kernel doesn't help. System reboot doesn't help. Force reloading doesn't help ne…