Join us on Wednesday, June 3rd for a deep dive into Customer Predictive Analytics Learn more

No module named pyspark.sql in Jupyter

Dataiker
Dataiker
No module named pyspark.sql in Jupyter

While trying to run the sample code provided in the Jupyter Python Spark Notebook, I get an error "no module named pyspark.sql" :





Do I need to configure something in order to use pyspark ?

I'm running DSS community on an EC2 AMI.

0 Kudos
1 Reply
Dataiker
Dataiker
Hi,

You need to setup the DSS / Spark integration.

* For DSS 3.1: https://doc.dataiku.com/dss/3.1/installation/spark.html

* For DSS 4: https://doc.dataiku.com/dss/latest/spark/installation.html
0 Kudos
Labels (2)