-
Conversion to Parquet fails in Hadoop HDFS
$ hadoop version
Hadoop 3.1.2
Source code repository https://github.com/apache/hadoop.git -r 1019dde65bcf12e05ef48ac71e84550d589e5d9a
Compiled by sunilg on 2019-01-29T01:39Z
Compiled with protoc 2.5.0
From source with checksum 64b8bdd4ca6e77cce75a93eb09ab2a9
This command was run using…
-
No connection defined to upload files/jars
I am trying to execute a PySpark recipe on a remote AWS EMR Spark cluster and I am getting: Your Spark settings don't define a temporary storage for yarn-cluster mode in act.compute_prepdataset1_NP: No connection defined to upload files/jars. I am using this runtime configuration: I also tried adding: spark.yarn.stagingDir…
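For reference, here is a minimal sketch of how spark.yarn.stagingDir would look when set on a session built by hand; the HDFS path and app name are assumptions, and in DSS the equivalent key normally goes into the recipe's Spark configuration (backed by an HDFS connection) rather than into the code itself:

```python
from pyspark.sql import SparkSession

# Sketch: point YARN at an HDFS staging directory so files/jars can be uploaded.
# The path below is a placeholder; a working HDFS connection must exist for it.
spark = (
    SparkSession.builder
    .appName("prepdataset1_NP")                                              # assumed app name
    .config("spark.yarn.stagingDir", "hdfs:///user/dataiku/.sparkStaging")   # hypothetical path
    .getOrCreate()
)
```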
-
How to save a Keras model from a Python recipe in a folder?
I would like to save a Keras model in a folder. I cannot figure out how to save the weights of my models because I cannot find the correct filepath. The code needed to achieve this is: model.save_weights(filepath) Even with this syntax: path = str(trained_LSTM_info['accessInfo']['root'])…
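One way to get a usable filepath is to resolve a managed folder to a local path; a minimal sketch, assuming the target folder lives on the local filesystem and using placeholder folder and file names:

```python
import os
import dataiku

# Sketch: resolve a local managed folder to a filesystem path and save the weights there.
# "lstm_models" and "lstm_weights.h5" are placeholder names.
folder = dataiku.Folder("lstm_models")
folder_path = folder.get_path()          # only works for folders on the local filesystem

weights_path = os.path.join(folder_path, "lstm_weights.h5")
model.save_weights(weights_path)         # `model` is the trained Keras model from the recipe
```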
-
HDFS - Force Parquet as default settings for recipe output
Greetings! I'm currently on a platform with Dataiku 11.3.1, writing datasets to HDFS. IT requires all datasets to be written in Parquet, but the default setting is CSV (Hive), which can generate errors. Is there a way to configure the connection to force the default format to be Parquet? Best regards,
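While the connection-level default is configured in the connection settings, existing datasets can also be switched programmatically; a sketch using the public API, where the project key, dataset names, and the exact format parameter names are assumptions:

```python
import dataiku

# Sketch: switch the storage format of existing HDFS datasets to Parquet.
# "MYPROJECT" and the dataset list are placeholders.
client = dataiku.api_client()
project = client.get_project("MYPROJECT")

for name in ["prepared_orders", "prepared_customers"]:            # hypothetical dataset names
    settings = project.get_dataset(name).get_settings()
    raw = settings.get_raw()
    raw["formatType"] = "parquet"                                  # assumed format key for HDFS datasets
    raw["formatParams"] = {"parquetCompressionMethod": "SNAPPY"}   # assumed parameter name
    settings.save()
```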
-
Enabling Parquet format in Dataiku DSS
Hi, Currently when we write into the Dataiku filesystem we only get CSV and Avro formats. How can I enable the Parquet format in Dataiku DSS running on a Linux platform on an EC2 instance? I need the steps for that. We also don't have any HDFS connection set up. Regards, Ankur.
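Until Parquet is available as a dataset format (which requires the Hadoop/Spark integration), one workaround is to write Parquet files into a managed folder from a Python recipe; a sketch assuming pyarrow is installed in the code env and using placeholder dataset and folder names:

```python
import os
import tempfile
import dataiku

# Sketch: read a dataset into pandas and push a Parquet file into a managed folder.
# "input_ds" and "parquet_out" are placeholder names; requires pyarrow in the code env.
df = dataiku.Dataset("input_ds").get_dataframe()
folder = dataiku.Folder("parquet_out")

with tempfile.TemporaryDirectory() as tmp:
    local_path = os.path.join(tmp, "data.parquet")
    df.to_parquet(local_path, engine="pyarrow")
    folder.upload_file("data.parquet", local_path)
```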
-
Permission Denied Installing Standalone Hadoop Integration
I am trying to install the standalone Hadoop integration for Dataiku. My Dataiku instance is hosted on a Linux server, and when I follow the directions for standalone installation here (Setting up Hadoop integration — Dataiku DSS 11 documentation), I get a permission denied error because it's treating the…
-
How to add a file to the Resources directory so that it is accessible at runtime
How can I quickly update the code environment, upload a zipped certificate file to the resources directory, and then make the certificate file accessible at runtime? I upload the file, modify the script so that an environment variable pointing to the folder is included, and grant the folder permissions. The path to the…
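A sketch of the runtime side, assuming the resources initialization script exports a variable such as MY_CERT_DIR pointing at the unzipped resources folder (the variable name, file name, and URL are all placeholders):

```python
import os
import requests

# Sketch: locate the certificate inside the resources directory via an environment
# variable set by the code env's resources initialization script.
cert_dir = os.environ["MY_CERT_DIR"]                   # hypothetical variable name
cert_path = os.path.join(cert_dir, "internal_ca.pem")  # hypothetical file name

# Example use: verify TLS against the uploaded CA bundle.
response = requests.get("https://internal.example.com/api", verify=cert_path)
print(response.status_code)
```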
-
Accessing Spark web UI
Hello, I am a beginner in Spark and I am trying to set up Spark on our Kubernetes cluster. The cluster is now working and I can run Spark jobs; however, I want to access the Spark web UI to inspect how my job is being distributed. We usually port-forward a port (4040), but I am not able to check which pod is the driver pod…
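One way to find the driver pod before port-forwarding, sketched with the Kubernetes Python client and relying on the spark-role=driver label that Spark on Kubernetes puts on driver pods (the namespace is a placeholder):

```python
from kubernetes import client, config

# Sketch: list running driver pods so you know which one to port-forward 4040 to.
config.load_kube_config()                  # or load_incluster_config() inside the cluster
v1 = client.CoreV1Api()

pods = v1.list_namespaced_pod(
    namespace="spark-jobs",                # placeholder namespace
    label_selector="spark-role=driver",
)
for pod in pods.items:
    print(pod.metadata.name, pod.status.phase)
# Then: kubectl port-forward <driver-pod-name> 4040:4040
```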
-
PySpark exit recipe with Warning status
I have a PySpark recipe which reads a dataset and extracts a column based on the first index (first row). When the input dataset partition is empty, it throws a normal error: 'index out of range'. To handle this I created a try/except block and want to end the recipe in that except block. I tried sys.exit(1)…
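A sketch of that pattern in a DSS PySpark recipe, using placeholder dataset and column names; exiting with a non-zero code marks the activity as failed, so the except branch logs a warning and ends with sys.exit(0):

```python
import sys
import dataiku
import dataiku.spark as dkuspark
from pyspark import SparkContext
from pyspark.sql import SQLContext

sc = SparkContext.getOrCreate()
sqlContext = SQLContext(sc)

# "prepared_input" and "my_column" are placeholder names.
input_ds = dataiku.Dataset("prepared_input")
df = dkuspark.get_dataframe(sqlContext, input_ds)

try:
    # take(1) returns an empty list on an empty partition, so [0] raises IndexError
    first_value = df.take(1)[0]["my_column"]
except IndexError:
    print("WARNING: input partition is empty, nothing to extract")
    sys.exit(0)   # exit cleanly; a non-zero code would fail the job rather than warn
```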
-
NoClassDefFoundError when reading a Parquet file
I have set up an HDFS connection to access a Google Cloud Storage bucket on which I have Parquet files. After adding GoogleHadoopFileSystem to the Hadoop configuration I can access the bucket and files. However, when I create a new dataset and select a Parquet file (including a standard sample found at…