The February release for the Community is live! Read More

No module named pyspark.sql in Jupyter

Solved!
UserBird
Dataiker
Dataiker
No module named pyspark.sql in Jupyter

While trying to run the sample code provided in the Jupyter Python Spark Notebook, I get an error "no module named pyspark.sql" :





Do I need to configure something in order to use pyspark ?

I'm running DSS community on an EC2 AMI.

0 Kudos
1 Solution
Clément_Stenac
Dataiker
Dataiker
Hi,

You need to setup the DSS / Spark integration.

* For DSS 3.1: https://doc.dataiku.com/dss/3.1/installation/spark.html

* For DSS 4: https://doc.dataiku.com/dss/latest/spark/installation.html

View solution in original post

0 Kudos
1 Reply
Clément_Stenac
Dataiker
Dataiker
Hi,

You need to setup the DSS / Spark integration.

* For DSS 3.1: https://doc.dataiku.com/dss/3.1/installation/spark.html

* For DSS 4: https://doc.dataiku.com/dss/latest/spark/installation.html

View solution in original post

0 Kudos
A banner prompting to get Dataiku DSS