No module named pyspark.sql in Jupyter

Solved!
UserBird
Dataiker

While trying to run the sample code provided in the Jupyter Python Spark notebook, I get the error "No module named pyspark.sql".

Do I need to configure something in order to use pyspark?

I'm running DSS community on an EC2 AMI.

1 Solution
Clément_Stenac
Dataiker
Hi,

You need to set up the DSS / Spark integration.

* For DSS 3.1: https://doc.dataiku.com/dss/3.1/installation/spark.html

* For DSS 4: https://doc.dataiku.com/dss/latest/spark/installation.html
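
To confirm whether the integration is in place, a quick check from the notebook can help. The following is only a minimal sketch and not part of DSS: the helper name `pyspark_status` is made up for illustration, and it assumes that a missing integration shows up as `pyspark` not being importable from the notebook kernel. `SPARK_HOME` is the standard Spark environment variable; it is printed only as a hint and may be unset depending on how Spark was installed.

```python
import importlib.util
import os


def pyspark_status():
    """Report whether pyspark.sql can be imported from the current kernel.

    Hypothetical helper for illustration; not a DSS API.
    """
    try:
        # find_spec imports the parent package ("pyspark") first, so it
        # raises ModuleNotFoundError (an ImportError) when pyspark is absent.
        spec = importlib.util.find_spec("pyspark.sql")
    except ImportError:
        spec = None
    return {
        "pyspark_sql_importable": spec is not None,
        # Location of the Spark installation, if the environment exposes it.
        "SPARK_HOME": os.environ.get("SPARK_HOME"),
    }


status = pyspark_status()
print(status)
```

If `pyspark_sql_importable` is `False`, the kernel cannot see pyspark, which would be consistent with the integration not having been set up yet.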
