Duplicates generated by recipe

Batpig
Level 1

Hi everyone,

My DSS server exhibits a strange behaviour:

Every table generated by a recipe contains duplicates: if I feed in a 100,000-row table without performing any operation on it, the output also has 100,000 rows, but x of them are duplicates that replace the original data.

Has anyone already experienced the same issue?

Below is a simple example:

The input table:

DSS_1.GIF

In / Out recipe

DSS_2.GIF

 

Output with duplicates:

DSS_3.GIF
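
To put a number on the "x duplicates", one option is to compare the input and output row by row. The sketch below is only an illustration using the Python standard library: the `compare_rows` helper and the toy rows are hypothetical and not taken from the actual project, and it assumes both tables have already been loaded as sequences of row tuples.

```python
from collections import Counter


def compare_rows(input_rows, output_rows):
    """Compare two tables given as iterables of rows (hypothetical helper)."""
    in_counts = Counter(map(tuple, input_rows))
    out_counts = Counter(map(tuple, output_rows))
    # Copies present in the output beyond what the input contains
    extra = out_counts - in_counts
    # Input rows that are absent (or under-represented) in the output
    missing = in_counts - out_counts
    return {
        "input_rows": sum(in_counts.values()),
        "output_rows": sum(out_counts.values()),
        "extra_copies": sum(extra.values()),
        "missing_rows": sum(missing.values()),
    }


# Toy data for illustration only: one row is duplicated and one is lost,
# so the row count stays the same, as in the symptom described above.
input_rows = [(1, "a"), (2, "b"), (3, "c"), (4, "d")]
output_rows = [(1, "a"), (2, "b"), (2, "b"), (4, "d")]

report = compare_rows(input_rows, output_rows)
print(report)
```

If `extra_copies` and `missing_rows` are equal and non-zero while the row counts match, that fits the behaviour described here: same size, but some original rows replaced by duplicates.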

Best regards,

Baptiste

JeremieP
Dataiker

Hi Baptiste,

In your Python recipe, can you go to the "Input/Output" tab and check that the option "Append instead of overwrite" is not activated for your output dataset?

 

Batpig
Level 1
Author

Hi Jeremie,

Thanks for your answer. Indeed, the option wasn't activated.

It looks like the problem is with my Python env, as the problem doesn't occur with virtualenvs.
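
For anyone trying to narrow down a similar environment difference, a small check like the one below could be run once in the failing env and once in a working virtualenv, and the two outputs compared. It is only a sketch: the `describe_env` helper is hypothetical, and the package names (`pandas`, `numpy`) are assumptions, since the thread does not say which package is actually involved.

```python
import sys
from importlib import metadata


def describe_env(packages=("pandas", "numpy")):
    """Collect interpreter details and versions of a few packages (assumed names)."""
    info = {
        "executable": sys.executable,      # path of the running interpreter
        "prefix": sys.prefix,              # root of the active environment
        "python": sys.version.split()[0],  # e.g. "3.9.18"
    }
    for name in packages:
        try:
            info[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # Package not installed in this environment
            info[name] = None
    return info


env_info = describe_env()
for key, value in env_info.items():
    print(f"{key}: {value}")
```

Any difference in interpreter path or package versions between the two runs is a reasonable place to start looking.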

Baptiste
