Split recipe : Keeping null values
shahas71
Dataiku DSS Core Designer, Registered Posts: 12 ✭
Hi All,
Consider this as my input :
Account | Value |
A | |
B | 1,2 |
C | 3,4 |
And I want the output as:
Account | Value |
A | |
B | 1 |
B | 2 |
C | 3 |
C | 4 |
But what I'm getting is this:
Account | Value |
B | 1 |
B | 2 |
C | 3 |
C | 4 |
The null value cell is getting removed.
Can anybody provide a solution?
Thanks in advance
Operating system used: windows
Answers
Hello @shahas71,
One workaround is to first use a replace processor to replace the empty cells with a unique arbitrary value (e.g., "REPLACE_THIS"), then run the split and fold processor (which should still preserve the cells with "REPLACE_THIS"), then add another replace processor to replace "REPLACE_THIS" with null again.
Another potential solution is to use the tokenize processor with "One token per row" selected under "Operation", but this will depend on what your actual data looks like.
Hope this helps,
Max
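For anyone who wants to see the logic of the placeholder workaround outside the visual recipe, here is a minimal sketch in plain Python. It is not Dataiku's API: `split_and_fold` and `split_keeping_empty` are hypothetical helper names, and the "drop rows with an empty value" behaviour is only modelled on what the question describes. The placeholder string must be something that never occurs in the real data.

```python
PLACEHOLDER = "REPLACE_THIS"  # arbitrary marker; assumed not to occur in real data


def split_and_fold(rows, column, sep=","):
    """Emit one row per separated value.

    Rows whose value is empty are dropped, mimicking the behaviour
    reported in the question (an assumption, not Dataiku's actual code).
    """
    out = []
    for row in rows:
        value = row[column]
        if value is None or value == "":
            continue  # the empty cell is lost here
        for part in value.split(sep):
            out.append({**row, column: part.strip()})
    return out


def split_keeping_empty(rows, column, sep=","):
    # Step 1: replace empty cells with the placeholder.
    marked = [
        {**r, column: PLACEHOLDER if r[column] in (None, "") else r[column]}
        for r in rows
    ]
    # Step 2: split and fold; placeholder rows survive because they are not empty.
    folded = split_and_fold(marked, column, sep)
    # Step 3: turn the placeholder back into a null.
    return [
        {**r, column: None if r[column] == PLACEHOLDER else r[column]}
        for r in folded
    ]


rows = [
    {"Account": "A", "Value": None},
    {"Account": "B", "Value": "1,2"},
    {"Account": "C", "Value": "3,4"},
]

without_fix = split_and_fold(rows, "Value")
result = split_keeping_empty(rows, "Value")
for r in result:
    print(r["Account"], "|", r["Value"])
```

In this sketch, calling `split_and_fold` directly loses account A, while `split_keeping_empty` keeps it with a null value, which matches the output asked for in the question.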