Dataiku + Spark on Blob Datasets

Solved!
yashpuranik

Hi Folks,

 

Curious about this link: https://doc.dataiku.com/dss/latest/spark/datasets.html#other. This mentions HDFS and S3 as better suited for Spark computation. I am curious why Blob Storage is not included as well. Is this a case of incomplete documentation? Or is Dataiku still working on implementing support for Spark + Azure Blob Storage?

 

Yash

AlexT
Dataiker

Hi,
The documentation is out of date. Dataiku does work with Spark on Azure Blob Storage.

You will need to set the HDFS interface in the connection settings of the Azure Blob connection:

Screenshot 2023-10-23 at 5.51.42 PM.png
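
For context, here is a minimal sketch of what an HDFS interface implies. Spark typically reaches Azure storage through a Hadoop filesystem connector, which addresses data by URI. The helper below is hypothetical and not part of Dataiku; it only builds URIs in the standard `wasbs` and `abfss` forms used by the Hadoop Azure connectors. The container, account, and path names are placeholders, and which scheme DSS actually uses is assumed to depend on the option selected in the connection settings.

```python
# Hypothetical helper: illustrates the Hadoop-style URI forms for Azure storage.
# It does not call Dataiku or Spark; names below are placeholders.

# Host suffix associated with each Hadoop Azure connector scheme.
AZURE_HOSTS = {
    "wasbs": "blob.core.windows.net",  # Blob Storage connector (TLS)
    "abfss": "dfs.core.windows.net",   # ADLS Gen2 connector (TLS)
}


def hadoop_uri(scheme, container, account, path=""):
    """Return a Hadoop filesystem URI for an Azure container and path."""
    if scheme not in AZURE_HOSTS:
        raise ValueError("unsupported scheme: %s" % scheme)
    # Normalize the path so the URI never contains a double slash.
    clean_path = path.lstrip("/")
    return "%s://%s@%s.%s/%s" % (
        scheme, container, account, AZURE_HOSTS[scheme], clean_path
    )


if __name__ == "__main__":
    print(hadoop_uri("abfss", "mycontainer", "myaccount", "data/events"))
```

With the HDFS interface set on the connection, DSS is expected to handle this addressing itself, so recipes normally reference datasets by name rather than by URI.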

Thanks,
