Community Conundrum 27: Stacks of Questions is live! Read More

No module named pyspark.sql in Jupyter

Dataiker
Dataiker
No module named pyspark.sql in Jupyter

While trying to run the sample code provided in the Jupyter Python Spark Notebook, I get an error "no module named pyspark.sql" :





Do I need to configure something in order to use pyspark ?

I'm running DSS community on an EC2 AMI.

0 Kudos
1 Reply
Dataiker
Dataiker
Hi,

You need to setup the DSS / Spark integration.

* For DSS 3.1: https://doc.dataiku.com/dss/3.1/installation/spark.html

* For DSS 4: https://doc.dataiku.com/dss/latest/spark/installation.html
0 Kudos