How do I define types on loading data in Python ?

Registered Posts: 11 ✭✭✭✭
While loading my data, Pandas translates some types wildly, transforming integer to double, strings (with only numbers in it) to decimal, removes leading 0 in strings such as "00042", and so on.

I'd like to set the types of the column while loading the data, to avoid wild type conversion and screwing up my data, how can I do that ?

my_df = dataiku.Dataset("my_data")
# my_df= my_df.get_dataframe(infer_with_pandas=False)
my_df= my_df.get_dataframe()

infer_with_pandas=False didn't work : Job failed : Error in Python process: : Integer column has NA values in column 13. I get what it's telling me, but I do have NAs in some of my columns, and I need to keep them.

Welcome!

It looks like you're new here. Sign in or register to get started.

Answers

Welcome!

It looks like you're new here. Sign in or register to get started.

Welcome!

It looks like you're new here. Sign in or register to get started.