Sign up to take part
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Certain cells in an Excel dataset have an array of data in it but instead of a comma or any visible separator- some users end up doing an Alt+Enter that will put the next value below the first.
When we load it to Dataiku, it shows as a Pilcrow symbol (¶) and even if I create a step in the recipe to remove it, it just stays there. Can I please see your advise how I can fix this? Please see attached file.
Thanks so much and happy to be part of this community.
This is occurring because the underlying rows/data does not actually contain a pilcrow. You can confirm this is the case by adding a row in your excel file (or another file) that actually includes a pilcrow, and you'll find that the find and replace works. In actuality, it is still a line break and the pilcrow is merely indicating that this is the case.
As my colleague suggested, you could try using "\n" instead in the Find & Replace processor. Otherwise, code recipes could be another option as well.