Saving spark dataframe as Parquet using standalone Spark on Local server

Registered Posts: 8 ✭✭✭

Hi Team,

My Dataiku server has not been integrated with Hadoop cluster but I have standalone spark installed in the DSS server. While creating a new dataset, the only file format that is available for me is csv. I wanted to know, whether it is possible to save my datasets as 'parquet' into my local DSS server.

Best Answer

Answers

  • Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 2,399 Neuron

    You can save datasets as parquet but you will need to handle them manually using Dataiku managed folders. In other words if you want to use Dataiku datsets in your flow you are stuck with the Dataiku format.

  • Registered Posts: 8 ✭✭✭

    Hey Turribeach, Thanks for your response. But I am not talking about Dataiku view. It is about the file format in which I can save the dataframe in my local server. Right now, I can see only csv as an option (screenshot provided) but I believe parquet can also be used.

    Let me know if we can enable parquet also in this option.

Welcome!

It looks like you're new here. Sign in or register to get started.

Welcome!

It looks like you're new here. Sign in or register to get started.