-
Reading Datasets using names stored in another dataset
Hi, I have a project 'test' which has 6 datasets: student_name and s_1, s_2, s_3, ..., s_5. The student_name dataset has 2 columns: name and content. The content column holds the name of the dataset pertaining to each student (s_1 etc.). I am creating a dashboard where each student will have a tab to show the contents of the related s_n dataset.…
-
Error in trying to register to Saved Model with prediction type 'OTHER'
Hi, I am trying to follow the MLOps best practices to deploy a model in production. I am trying to register a model after adding it to experiment tracking. The model has been added to experiment tracking successfully as an 'OTHER' model using the syntax below: mlflow.pyfunc.log_model( artifact_path=f"model_age}",…
-
How do I read a yml file from the dataiku library?
I have a yml config file and want to read it in my notebooks. How do I do that?
-
Dataframe pivot table from python doesn't save properly as Dataiku dataset
Hi everyone, I am creating a pivot table using a pandas DataFrame in Python: df_pivot = pd.pivot_table(df, values=['Units'], index=['Name', 'City', 'State', 'Zip'], columns=['Date'], aggfunc={'Units': 'sum'}, margins=True, fill_value=0) However, when I save it using the following: dataset_new = dataiku.Dataset("Dataset…
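The question is cut off, so the exact failure is unknown. One plausible cause, offered as an assumption, is that the pivot table has a row MultiIndex and two-level column labels such as ('Units', date), which do not map cleanly onto a flat dataset schema. A minimal sketch of flattening the result before saving follows; the sample data and the helper name `flatten_pivot` are hypothetical, and the Dataiku write step is not shown.

```python
import pandas as pd

# Hypothetical sample data; the column names follow the question.
df = pd.DataFrame({
    "Name": ["A", "A", "B"],
    "City": ["X", "X", "Y"],
    "State": ["S1", "S1", "S2"],
    "Zip": ["111", "111", "222"],
    "Date": ["2024-01", "2024-02", "2024-01"],
    "Units": [1, 2, 3],
})

df_pivot = pd.pivot_table(
    df,
    values=["Units"],
    index=["Name", "City", "State", "Zip"],
    columns=["Date"],
    aggfunc={"Units": "sum"},
    margins=True,
    fill_value=0,
)


def flatten_pivot(pivot):
    """Return a copy with single-level string columns and no row index."""
    flat = pivot.copy()
    # Join the two column levels, e.g. ('Units', '2024-01') -> 'Units_2024-01'.
    flat.columns = [
        "_".join(str(level) for level in col if str(level) != "")
        for col in flat.columns
    ]
    # Turn the Name/City/State/Zip index levels back into ordinary columns.
    return flat.reset_index()


df_flat = flatten_pivot(df_pivot)
```

After this step `df_flat` is an ordinary table with plain string column names, which is the shape a tabular dataset writer generally expects.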
-
Function calling for custom llm connections
Hi, when can we expect function calling capabilities for custom LLM connections? Is there an estimated release timeline? Tool calls are supported on the compatible completion models of some LLM connections: OpenAI, Azure OpenAI, Azure LLM, Anthropic Claude, Anthropic Claude models on AWS Bedrock connections, MistralAI. Thank you!
-
How can you export a CSV file to a folder using Python code, with specified path names, i.e. 2024-09?
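As a rough illustration of the path-naming part of this question, the sketch below builds a year-month subfolder such as 2024-09 and writes a CSV into it using only the Python standard library. The function name `export_csv`, its parameters, and the default file name are hypothetical. It writes to the local filesystem; with a Dataiku managed folder, the same relative path would presumably be handed to the folder API (for example through `dataiku.Folder`) instead, which is not shown here.

```python
import csv
from datetime import date
from pathlib import Path


def export_csv(rows, header, base_dir, when=None, filename="export.csv"):
    """Write rows to <base_dir>/<YYYY-MM>/<filename> and return the path."""
    when = when or date.today()
    # Build the dated subfolder name, e.g. "2024-09".
    target_dir = Path(base_dir) / when.strftime("%Y-%m")
    target_dir.mkdir(parents=True, exist_ok=True)
    target = target_dir / filename
    with target.open("w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(header)
        writer.writerows(rows)
    return target
```

Passing an explicit `when` value, such as `date(2024, 9, 15)`, makes the output folder reproducible rather than dependent on the day the code runs.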
-
Creating a dropdown in an exported end-user dataset
I have created a report that is downloaded as an Excel file and stored in a fileshare. This report is then shared with external stakeholders for feedback. I would like this feedback to be in a column with a dropdown of options. Due to the volume of stakeholders, I cannot do this for each report manually and would like to…
-
Convert files name in a folder to a column
Hi community, I have a specific use case where my data is stored in a Dataiku folder on S3 as separate CSV files, one per day. The extract date is only stored in the name of each file and not inside the file itself. I want to use the 'Create a dataset' function to create a new dataset from this folder and I…
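The question asks about the visual 'Create a dataset' function, and its ending is cut off. Purely as an illustration of the underlying idea in code, the sketch below pulls a date out of a file name and attaches it to every row of that file. The file-name pattern, the `extract_date` column name, and the function name are all assumptions; reading the files from the S3 folder is not shown.

```python
import csv
import io
import re

# Hypothetical naming convention: a YYYY-MM-DD date somewhere in the file name.
DATE_IN_NAME = re.compile(r"(\d{4}-\d{2}-\d{2})")


def rows_with_extract_date(filename, text):
    """Yield each CSV row as a dict with an added 'extract_date' column."""
    match = DATE_IN_NAME.search(filename)
    # None when the file name carries no recognizable date.
    extract_date = match.group(1) if match else None
    for row in csv.DictReader(io.StringIO(text)):
        row["extract_date"] = extract_date
        yield row
```

Rows produced this way from each daily file can then be concatenated into a single table that keeps the extract date alongside the data.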
-
How to run multiple flows at a time with different datasets?
We have created a flow which takes in different input datasets and updates many tables in Snowflake. There are many scenarios to trigger the recipes, and the scenarios are run using API endpoints which are triggered from a frontend website. What we want to do is run the flow for multiple inputs. Each of the inputs will…
-
Application as Recipe Inputs are Broken (or insanely obtuse to use)
I have a project that I built out to be an Application-As-A-Recipe to upload a Dataiku dataset as a file to our API. I will refer to my Application-as-a-Recipe as my "child process" for brevity's sake. Another Project calls this recipe within its flow. The child process has a scenario to build out all datasets and…