Survey banner
Share your feedback on the Dataiku documentation with this 5 min survey. Thanks! TAKE THE SURVEY

Custom python model with library editor

Odin
Level 1
Custom python model with library editor

Hey,

I’m trying to define a custom Python model that should be fitted adjusting for the variable "exposure". The code runs without an error in the notebook, but this error message occurs when the model is fitted in the visual analyses. 

 "Failed to train : <class 'IndexError'> : only integers, slices (`:`), ellipsis (`...`), numpy.newaxis (`None`) and integer or boolean arrays are valid indices. "

The code in the library editor looks like this: 

class Poisson_gbrt(BaseEstimator): 
    
    def __init__(self):
        self.model = Pipeline([
        ("regressor",
          HistGradientBoostingRegressor(loss="poisson",max_leaf_nodes=12,random_state=42),),])
    
    def fit(self,X,y,regressor__sample_weight=None):
        self.model.fit(X,y,regressor__sample_weight=X["Exposure"]) 
        return None 
    
    
    def predict(self, X):
        return self.model.predict(X)

          

Thanks, 

Odin 

0 Kudos
1 Reply
AlexT
Dataiker

Hi @Odin ,

The error suggests you are passing X["Exposure"] which is not what HistGradientBoostingRegressor you could try to convert this to numpy array instead :

exposure_weights = X["Exposure"].to_numpy()

  

0 Kudos