How to save a PySpark DataFrame to a managed folder

Solved!
stephl
Hello, Community.

How can I use a PySpark recipe to save my PySpark DataFrame as a CSV file in an output managed folder?

I have searched the community, but most posts cover pandas DataFrames only.

 

1 Solution
Turribeach

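A PySpark DataFrame is by definition a DataFrame that only exists on your Spark engine, so in order to save it to a managed folder in Dataiku you first need to bring it into local memory. You can do that by calling the toPandas() method. Note that toPandas() collects every row to the Spark driver, so the data has to fit in the driver's memory.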
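
As a rough sketch of that approach (not a definitive implementation), the helper below converts the Spark DataFrame with toPandas(), serializes it to CSV in memory, and uploads the bytes to the folder. The names save_spark_df_to_folder, run_in_recipe, "my_output_folder" and "result.csv" are hypothetical and chosen only for illustration. The sketch assumes the folder object is a dataiku.Folder and uses its upload_data(path, data) method.

```python
def save_spark_df_to_folder(spark_df, folder, path="output.csv"):
    """Collect a PySpark DataFrame to the driver and upload it as one CSV file.

    spark_df: a PySpark DataFrame (anything exposing toPandas()).
    folder:   a managed folder handle exposing upload_data(path, data),
              assumed here to be a dataiku.Folder.
    Returns the number of bytes uploaded.
    """
    # Bring the distributed data into local memory as a pandas DataFrame.
    pandas_df = spark_df.toPandas()
    # With no target path, to_csv returns the CSV content as a string.
    csv_bytes = pandas_df.to_csv(index=False).encode("utf-8")
    # Write the bytes to the given path inside the managed folder.
    folder.upload_data(path, csv_bytes)
    return len(csv_bytes)


def run_in_recipe(spark_df):
    """Hypothetical usage inside a Dataiku PySpark recipe."""
    import dataiku  # only available inside Dataiku

    # "my_output_folder" is a placeholder for your folder's name or id.
    folder = dataiku.Folder("my_output_folder")
    return save_spark_df_to_folder(spark_df, folder, "result.csv")
```

If the DataFrame is too large for the driver's memory, this approach will not work as written; filtering or aggregating in Spark before calling toPandas() is one way to reduce the size.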
