Dataiku is telling me that is_numeric_dtype does not exist.

Iceberg
Level 1

Hi,

 

I created a Python recipe to standardize columns:

# -*- coding: utf-8 -*-
import dataiku
import pandas as pd, numpy as np
from dataiku import pandasutils as pdu


# apply the z-score method in Pandas using the .mean() and .std() methods
def z_score(df):
    # copy the dataframe
    df_std = df.copy()
    # apply the z-score method
    for column in df_std.columns:
        if is_numeric_dtype(df_std[column]):
            df_std[column] = (df_std[column] - df_std[column].mean()) / df_std[column].std()

    return df_std

# Read recipe inputs
spam_prepared = dataiku.Dataset("spam_prepared")
spam_prepared_df = spam_prepared.get_dataframe()


spam_standarized = z_score(spam_prepared_df)

# Write recipe outputs
spam_standarized = dataiku.Dataset("spam_standarized")
spam_standarized.write_with_schema(spam_standarized_df)

 

But Dataiku is telling me that is_numeric_dtype does not exist. What am I doing wrong?

 

Thanks

1 Reply
AgatheG
Dataiker

Hi Iceberg,

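is_numeric_dtype is a function in the pandas.api.types module, so the bare name is not defined unless you import it. In your code snippet, replace each occurrence of is_numeric_dtype with pd.api.types.is_numeric_dtype, or add "from pandas.api.types import is_numeric_dtype" to your imports.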
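
For illustration, here is a minimal sketch of the z_score function with the import added. The sample_df data and its column names are made up so the snippet runs outside Dataiku; in the recipe you would pass spam_prepared_df instead.

```python
import pandas as pd
from pandas.api.types import is_numeric_dtype


def z_score(df):
    """Return a copy of df with every numeric column standardized."""
    df_std = df.copy()
    for column in df_std.columns:
        # Only numeric columns have a meaningful mean and standard deviation
        if is_numeric_dtype(df_std[column]):
            df_std[column] = (df_std[column] - df_std[column].mean()) / df_std[column].std()
    return df_std


# Hypothetical stand-in for spam_prepared_df (not the real dataset)
sample_df = pd.DataFrame({"word_count": [10, 20, 30], "label": ["spam", "ham", "spam"]})
standardized_df = z_score(sample_df)
print(standardized_df)
```

Separately, the recipe as posted stores the result of z_score in spam_standarized, then reuses that name for the output Dataset and passes spam_standarized_df, which is never defined, to write_with_schema. Assigning the result to spam_standarized_df should avoid a second NameError once the first one is fixed.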

 

Best regards,

Agathe
