I have a column with some comments (original file excel), which contain various information. I would like to seperate them unter different comments/catalogues. Is that possible with DSS?
|Process Comment||comment 1||comment 2||comment 3||comment 4|
|no heating, rework||rework|
|with Coating, 20kW||20kw|
|TC defect, steel||steel||TC defect|
|TC defect||TC defect|
Here is a quick one-step visual recipe that will get you very close to what you want to do. If you need to rename the columns to comment 1, comment 2, comment 3... You will have to do some additional work.
This is a very useful layout for ML Models.
Hope this helps.
Thanks a lot for your quick reply. The thing is that in the real data the comments are very various, when I use this approach, there are more than 100 column created... Then I get a error. The question is whether I can extract 'mode' of those comments, I mean the comment words that appear very often. For the comment word, that appears once or twice, it can be ignored.
If you are planing to use the results of this in a DSS visual ml then you might want to use the built in feature handling. Treating this column as text. Or converting this column into a JSON vector.
Here is a brief video that is part of the Dataiku Academy that talks about feature handling in DSS.