Added on May 21, 2024 9:23AM
Greetings!
I'm currently on a platform running Dataiku 11.3.1 and writing datasets to HDFS. Our IT team requires all datasets to be written in Parquet, but the default format is CSV (Hive), which can cause errors.
Is there a way to configure the connection so that the default format is Parquet?
Best regards,
Hi @Charly,
You can configure the instance-level preferred format under Administration -> "Preferred storage formats" by placing PARQUET_HIVE as the first option.
This can also be controlled at the project level by overriding the global Datasets creation settings.