-
How to save a Keras model from a Python recipe in a folder?
I would like to save a Keras model in a folder. I cannot figure out how to save the weights of my model because I cannot find the correct filepath. The code needed to achieve this is: model.save_weights(filepath). Even with this syntax: path = str(trained_LSTM_info['accessInfo']['root'])…
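A minimal sketch of building that filepath, assuming the DSS `dataiku.Folder` API, where `Folder("my_folder").get_path()` returns the filesystem path of a local managed folder; the path and the weights filename below are hypothetical stand-ins:

```python
import os

# Hedged sketch: in a DSS Python recipe, dataiku.Folder("my_folder").get_path()
# returns the filesystem path of a *local* managed folder. The path and
# filename below are hypothetical stand-ins for that call.
folder_path = "/data/dss/managed_folders/my_folder"  # stand-in for Folder.get_path()
filepath = os.path.join(folder_path, "lstm_weights.h5")

# model.save_weights(filepath)  # Keras would then write the weights into the folder
print(filepath)
```

Note that `get_path()` typically only applies to folders hosted on the local filesystem; non-local folders would need the stream-based APIs instead.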
-
How to read file with Python from HDFS managed folder
Hello, could you give an example of how to read a CSV file with Python/pandas from an HDFS managed folder? Thanks, Milko
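A minimal sketch, assuming the DSS `dataiku.Folder` API, where `get_download_stream("data.csv")` returns a file-like stream for files in a non-local (e.g. HDFS) managed folder; an in-memory `BytesIO` stands in for that stream here so the snippet is self-contained, and the folder and file names are hypothetical:

```python
import io
import pandas as pd

# Hedged sketch: for a non-local (e.g. HDFS) managed folder, the DSS call
# dataiku.Folder("hdfs_folder").get_download_stream("data.csv") returns a
# file-like stream. An in-memory stream stands in for it here; the folder
# and file names are hypothetical.
stream = io.BytesIO(b"name,score\nalice,10\nbob,20\n")
df = pd.read_csv(stream)
print(df.shape)
```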
-
Spark can’t read my HDFS datasets
Hello, Spark won't see hdfs:/// and just looks for file:/// when I'm trying to process an HDFS managed dataset. I followed the How-To link on: https://www.dataiku.com/learn/guide/spark/tips-and-troubleshooting.html However, I couldn't figure out what to edit. Here is my env-spark.sh in DATA_DIR/bin/ ``` export…
-
Spark IllegalArgumentException using partitioning
I am using Dataiku to create partitions on an HDFS dataset as the result of a Spark recipe. I noticed that if the dataframe in the preceding recipe contains the column you are partitioning by, Spark throws an IllegalArgumentException (here the column in the dataframe, and the one I am partitioning by, is called…
-
Error Running HDFS Command in Python Recipe
I have some code where I need to run an HDFS command in Python to check if a file is present. See below for an example:
```
import subprocess
command = 'hdfs dfs -ls /sandbox'
ssh = subprocess.Popen(command, shell=True, stdout=subprocess.PIPE).communicate()
print(ssh)
```
When I run this in a Jupyter notebook in Dataiku, the…
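For a pure existence check, `hdfs dfs -test -e <path>` exits with code 0 when the path exists, so inspecting the return code avoids parsing `-ls` output. A minimal sketch (the real command would be `["hdfs", "dfs", "-test", "-e", "/sandbox/myfile"]`; a portable stand-in is used so the snippet runs anywhere):

```python
import subprocess
import sys

# Hedged sketch: `hdfs dfs -test -e <path>` returns exit code 0 when the
# path exists, so the return code alone answers the question. A portable
# stand-in command is used here instead of the real hdfs call.
command = [sys.executable, "-c", "raise SystemExit(0)"]  # stand-in for the hdfs call
result = subprocess.run(command, capture_output=True)
exists = (result.returncode == 0)
print(exists)
```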
-
DSS unable to connect to Hiveserver2 (MapR)
Hi, I'm unable to get started with establishing connectivity between DSS and HiveServer2. HDFS integration works; I have added the jars required for the Hive client to a folder owned by the dss user and added that folder's path to the classpath. I'm specifying auth=maprsasl;saslQop=auth-conf in the extra URL, but the connection isn't…
-
Available engines
Hi, what defines the list of available engines for data-processing recipes such as Prepare? I have an HDFS dataset created by Impala, then a Prepare or Sync to another HDFS dataset, but only Spark/MR (or LocalStream) is available. Why is DSS not allowing SQL-based engines? The source dataset has a hive synced…
-
Can changes in HDFS datasets be automatically tracked?
Hi, I am using HDFS datasets in my workflow which update on a daily basis, and I would like to find out whether these daily changes can be tracked by DSS and saved to a separate "delta" file through a scenario or some other automation capability. Thanks!
-
[CDH Cluster] Unable to start HiveServer2 Connection
Hi, I am evaluating DSS, so I installed it on my server and added a 2-week enterprise trial license. I am connecting to a kerberized Cloudera CDH 5.12 cluster. I can connect to and browse HDFS, but the Hive connection is not working. This is the error in the log file backlog.log: [2018/05/30-07:47:19.991] [qtp1440621772-171] [INFO]…
-
validation failed: Cannot insert into target table because number/types are different.
Hi, I get this message from a Hive recipe on a partitioned dataset stored on HDFS: validation failed: Cannot insert into target table because number/types are different "2018-02": Table inclause-0 has 27 columns, but query has 28 columns. My query is: SELECT * FROM MyTable