space character handling in Dataiku dataset

Retta
Level 1
space character handling in Dataiku dataset

Hi,

I have a Greenplum database table as an input for my dataset, let's name this as gp_tableA.

From this dataset, I have a Prepare recipe, we name it as compute_gp_tableA_prepared. I only use this recipe to rename some columns and remove unwanted ones.

I notice at the output dataset (gp_tableA_prepared), one of the record, originally have ' ' as value (a single space), converted to null/blank.

example:

in Greenplum Database: record 1 in ColumnA, value ' ' (a single space)

in Dataiku : record 1 in ColumnA, value '' (blank)

is there any tips to retain the ' ' (space) value, as i need this as part of the logic?


Operating system used: Windows 10

0 Kudos
1 Reply
Manuel
Dataiker Alumni

Hi,

I am not sure where the values are being replaced and without access to Greenplum, I cannot test.

However, a workaround is that, in your prepare recipe, you replace the empty values with a space:

  • On the column dropdown, select Analyse
  • On the "empty values" click the pencil to edit the value as a space
  • This adds a new processor to the recipe (see attached image).

I hope this helps

0 Kudos