Feature handling Dummy encoding
stoch
Registered Posts: 1 ✭
Dataiku's category handling = Dummy encoding with dropping dummy option seems to be using a level with the least exposure/volume as a dummy.
Q1. Is there a way to set this dummy manually instead of Dataiku's default method? Want to avoid using category handling = custom preprocessing option.
Q2. Using Variable type = Categorical with Drop one dummy option on input variable of double type seems to be dropping 2 levels. For example, there are only 3 regression coefficients from a variable with 5 levels). I would of expected there would be 4 regression coefficients since 1 is used as a dummy). Does anyone know the reason for this?
Many thanks in advance.
Tagged: