Submit your innovative use case or inspiring success story to the 2023 Dataiku Frontrunner Awards! LET'S GO

No module named pyspark.sql in Jupyter

Solved!
UserBird
Dataiker
No module named pyspark.sql in Jupyter

While trying to run the sample code provided in the Jupyter Python Spark Notebook, I get an error "no module named pyspark.sql" :





Do I need to configure something in order to use pyspark ?

I'm running DSS community on an EC2 AMI.

0 Kudos
1 Solution
Clément_Stenac
Dataiker
Hi,

You need to setup the DSS / Spark integration.

* For DSS 3.1: https://doc.dataiku.com/dss/3.1/installation/spark.html

* For DSS 4: https://doc.dataiku.com/dss/latest/spark/installation.html

View solution in original post

0 Kudos
1 Reply
Clément_Stenac
Dataiker
Hi,

You need to setup the DSS / Spark integration.

* For DSS 3.1: https://doc.dataiku.com/dss/3.1/installation/spark.html

* For DSS 4: https://doc.dataiku.com/dss/latest/spark/installation.html
0 Kudos

Labels

?
Labels (2)
A banner prompting to get Dataiku