But none of them explain how to do it (in my comprehension). Python and sklearn are well explained and shown but not MLLib. I wonder if there is a code example I could use to help me or a more sepcific documentation.
I am using dataiku V7, not 8. I don't know if there is an impact on custom model.
Inside of the custom model editor there is a sample provided, I have pasted it below.
// This sample code uses a standard MLlib algorithm, the RandomForestRegressor.
// import the Estimator from spark.ml
// instantiate the Estimator
.setLabelCol("ot_avg") // Must be the target column
.setFeaturesCol("__dku_features") // Must always be __dku_features
.setPredictionCol("prediction") // Must always be prediction