Discover the winners & finalists of the 2022 Dataiku Frontrunner Awards!READ THEIR USE CASES

No module named pyspark.sql in Jupyter

Solved!
UserBird
Dataiker
No module named pyspark.sql in Jupyter

While trying to run the sample code provided in the Jupyter Python Spark Notebook, I get an error "no module named pyspark.sql" :





Do I need to configure something in order to use pyspark ?

I'm running DSS community on an EC2 AMI.

0 Kudos
1 Solution
Clément_Stenac
Dataiker
Hi,

You need to setup the DSS / Spark integration.

* For DSS 3.1: https://doc.dataiku.com/dss/3.1/installation/spark.html

* For DSS 4: https://doc.dataiku.com/dss/latest/spark/installation.html

View solution in original post

0 Kudos
1 Reply
Clément_Stenac
Dataiker
Hi,

You need to setup the DSS / Spark integration.

* For DSS 3.1: https://doc.dataiku.com/dss/3.1/installation/spark.html

* For DSS 4: https://doc.dataiku.com/dss/latest/spark/installation.html
0 Kudos

Labels

?
Labels (2)
A banner prompting to get Dataiku