Sign up to take part
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Hi,
I need to upload a parquet file from my AWS S3 bucket but I'm getting this error message :
Question : Is it possible with Dataiku upload parquet files ? Which file types are allowed to upload from AWS S3 ?
BTW, I'm able to upload CSV files from my AWS S3 bucket
org/apache/hadoop/conf/Configuration, caused by: ClassNotFoundException: org.apache.hadoop.conf.Configuration
org/apache/hadoop/conf/Configuration, caused by: ClassNotFoundException: org.apache.hadoop.conf.Configuration
Hi @Carl ,
To use Parquet files in DSS you must first run the Hadoop integration.
https://doc.dataiku.com/dss/latest/hadoop/installation.html#setting-up-dss-hadoop-integration
If you don't have Hadoop installed on the DSS machine you can use standaloneArchive available here:
https://downloads.dataiku.com/public/studio/10.0.7/dataiku-dss-hadoop-standalone-libs-generic-hadoop...
First download dataiku-dss-hadoop-standalone-libs-generic-hadoop3-10.0.7.tar.gz and then stop dss and run :
DATADIR/dssadmin install-hadoop-integration -standaloneArchive dataiku-dss-hadoop-standalone-libs-generic-hadoop3-10.0.7.tar.gz
Start DSS and they you should be able to upload parquet files.
Let me know if that helps.