How are derived features computed in the visual analysis module?

NicolasB
Level 1
How are derived features computed in the visual analysis module?

Hi,

I was wondering how the derived features (square, root, log) are actually computed. Is the function applied after rescaling if any ?

In particular for square root and logarithm which cannot take non negative values, what is the actual preprocessing used here? Is there a different rescaling used to generate the derived columns compared to the user-specified one?

0 Kudos
3 Replies
CoreyS
Dataiker Alumni

Hi, @NicolasB! Can you provide any further details on the thread to assist users in helping you find a solution (insert examples like DSS version etc.) Also, can you let us know if youโ€™ve tried any fixes already?This should lead to a quicker response from the community.

Looking for more resources to help you use Dataiku effectively and upskill your knowledge? Check out these great resources: Dataiku Academy | Documentation | Knowledge Base

A reply answered your question? Mark as โ€˜Accepted Solutionโ€™ to help others like you!
0 Kudos
tgb417

@NicolasB ,

If you read python.  You can get the code in the form of a Jupyter Notebook that the Visual ML users for a particular model you are creating.

Although this does not work for all model types.  This may help you understand exactly what DSS is doing in your case.

This can be found in the Action Drop down on any Model Results.

Geting code for Python Model.jpg

 

 

--Tom
0 Kudos
NicolasB
Level 1
Author

Thanks for your answers, we are using dataiku version 8.

I did take a look at the jupyter notebook generated from the visual ML but it does not seem to replicate the piece of code needed to generate the extra features.

I tried looking at some of .json files in the corresponding model folders but I didn't find anything to help me understand how the extra features are actually computed.

Thanks,

Nicolas

0 Kudos