-
Splitting dataset into groups and process each group seperately
I'm new to dataiku and any help would be appreciated, I have a scenario where I have to send mail to sales representative which contains questions asked by its associated client (customer that he manages) eg Input SR_Mail | Client_name | Question | — - - - - - - - - - - - - - - - - - abc@def.com | SBC | ABCD ? |…
-
Send email plugin sends email one at a time and its very slow
Hey I'm doing PoC using send emails plugin, but when I tried to send 1.6k emails it took around 1 hr 26 min to complete. Is there better solution to this ? Operating system used: Linux
-
Connect to Oracle NetSuite
Hello, team. A client tells me that his data is hosted on Oracle NetSuite. When he wants to view them, he installs the NeuSuite odbc driver on his Windows machine where he only enters the host, port, and service name. Subsequently, to display information in tables, it uses SQL Server with Link Server where he enters again…
-
Problem with loading large files
I try to upload datasets from my location to dataiku but it only allows me to upload data smaller than 1 GB in weight. I have tried several types but it has not been possible since it generates an error when loading the information. I don't know if this is directly due to the instance or the license I have.
-
How to go from flat relational data to nested object oriented data
I am trying to combine multiple rows into a single nested json object. I know how to do the opposite (i.e. flatten), but cannot find the right tool to go the opposite direction. As an example, I start with this data: Class, Student, Grade 1, Sally, A 1, Matt, A 1, Phil, C What I want as an output is a single record: Class,…
-
How to rate limit number of records processed per second
Hello, I'm new to dataiku and I have a scenario where I have to send emails. Now I am using send_emails plugin to send emails, but my smtp server has a limit that it can process only 40 mails/second and I want to ensure that only 30 mails are being sent per second. Is there anything that can limit only 30 records being…
-
Write the schema of a pandas dataframe using the "dataikuapi" variant
Hello, Dataiku team. How can I write the schema of a pandas dataframe using the "dataikuapi" variant? I'm doing it this way because I want to send information from the api node to a table in my project in the layout node. I want to write the schema like when using the dataiku library: df_final.write_with_schema(df). But,…
-
Remote references can't be fetched
Hi all, Getting the following error when specifying the Git remote in the libraries (Import from Git). Please assist me with this matter. Remote references can't be fetched Branches and tags could not be fetched from remote Git, caused by: IOException: Process failure, caused by: IOException: Process execution failed…
-
Using darts python library to create a custom Saved Model
I've followed the tutorial here: Importing serialized scikit-learn pipelines as Saved Models for MLOps - Dataiku Developer Guide and I've been able to develop a model using the darts==0.30.0 library, having wrapped it in the standard scikit-learn pipeline. My issue is with the very last part of step 3 of this tutorial…
-
Using Neo4j plug-in to create relationships duplicates nodes
Hi, I have created unique identifier(s) for two types nodes in my graph. I first push the data on the nodes into Neo4j, using Export nodes recipe: Primary key is set to a column containing the unique identifier for the node. Then I push the data using the Export relationships recipe. Primary keys for source and target are…