Python recipe removing null records

Hi all,

I noticed that when using Python recipes, records in which every column is null are removed.

You can test this by preparing a dataset with a single column that contains some null values. Any other recipe applied to this dataset keeps all records, but a Python recipe (the default one, which simply copies the input to the output) removes the records with null values.

Is this the expected behaviour?

 

Thanks!

Dataiker

Hi,

Unfortunately, this is the expected behavior in the specific case of datasets with a single column.

This comes from how the data is serialized internally, which is as CSV. In a single-column dataset there is no separator, so an all-null row is written as an empty line, and there is no real difference between that and an invalid empty line. This means that it would not be easy to change this behavior.
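
To make the ambiguity concrete, here is a minimal sketch in plain Python. It is not the actual internal serialization code: `serialize` and `parse` are hypothetical helpers that only mimic the general idea (comma-separated lines, nulls written as empty fields, empty lines skipped on read).

```python
# Illustrative sketch only: these helpers are hypothetical and do not
# reproduce the real internal serialization. They assume values never
# contain commas or newlines, so no quoting is needed.

def serialize(rows):
    """Write each row as one line, joining values with commas.
    A null (None) is written as an empty field."""
    return "\n".join(
        ",".join("" if value is None else str(value) for value in row)
        for row in rows
    )

def parse(text):
    """Read the lines back, skipping empty lines as invalid,
    and turning empty fields back into None."""
    rows = []
    for line in text.split("\n"):
        if line == "":
            continue  # an empty line carries no separator, so it is dropped
        rows.append([field if field != "" else None for field in line.split(",")])
    return rows

one_col = [["a"], [None], ["b"]]
two_col = [["a", "x"], [None, None], ["b", "y"]]

one_col_back = parse(serialize(one_col))
two_col_back = parse(serialize(two_col))

print(one_col_back)  # the null row is gone
print(two_col_back)  # the all-null row survives, because its line is ","
```

With two columns, the all-null row is serialized as `,`, which is still a non-empty line. With one column, it is serialized as an empty line, so a reader that skips empty lines has no way to recover it.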