-
How to use selenium chrome driver in python recipe by authenticating thorugh digital cert
I need step by step help with installing chrome driver. I have already installed package selenium in my code env lets assume the name of the Code Env is "web_reports". Please provide the dependencies or guide to open an URL by authenticating via digital certificate. I want to download some reports from that website using…
-
What is globalExplanationsTopImportances ? How is it calculated?
I'm using the API to retrieve the following information for an regression model. details.get_performance_metrics()['globalExplanationsTopImportances'] This returns a dictionary list of dictionaries with keys and "s" and "d" as follows [{s:"Feature1",d:0.25},{s:"Feature2",d:0.15}] What value is given as d? What are the…
-
Develop Shiny app in code studio before deployment
Is it possible to use a code studio to take advantage of autocomplete, plugins, etc in order to create an RShiny app and then deploy it as a webapp? Specifically I would like to use RStudio. If so how would I do this
-
Using Group Recipe Without Aggregations
Hello everyone, I believe that python allows people to use the group_by() method without any aggregations; however, in dataiku, we must aggregate when we use the group recipe. In other words, I would like to group by a specific column and keep all other columns without aggregating, is that possible in any way? Note that I…
-
min between 2 dates
Hello, I'm trying to find the 1st date between 2 date fields. I was thinking of using a min formula but there may be missing values in these fields and the formula doesn't seem to work in this case. Is there another solution other than an "if then" formula ?
-
remapping connections for API services
Goodday! In the API Designer, we can define connections to use with SQL Query Endpoints. How do we remap these connections based on deployments to different API nodes? (ie. use different connection for deployments to a production API node vs. deployments to an acceptance API node) I don't see any option in the deployer UI…
-
Remove duplicate
Hi, I have gone through few of the post on the remove duplicate but none of that give the clear answer on the same. Can you pls. provide the path to showcase how can i use some column with condition if that value repeats it would stop counting the same value with entire row in the output? K.Rgds, Kalpesh
-
Filter recipe : How to avoid stop processing when there are no matched records
Hi dataiku users, I want to know how to resolve the situation in subject. I use filter recipe only for processing the exception data, and stack with main data after that, so if there are no filtered records in output dataset, no problem. but in dataiku, if there are not all data sets in stack recipe, return error and stop…
-
How to use streaming python
Hi All! I'm trying to use streaming Python with the example given in documentation: https://doc.dataiku.com/dss/latest/streaming/cpython.html#writing-to-datasets If i try to follow it , it doesn't work exactly: 1) .get_continuous_writer() expects a source-id as one of the arguments 2) if i give something like…
-
Kafka - Restart Failed Process
I get random errors on my Kafka due to GCS bucket failures and Bigquery size limits. I'm working with my teams to resolve, but I'm wanting to know if there is an easy way to restart a continuous process in the event of a failure? I thought about setting a scenario to start the process every 30 minutes or so, but I'm sure…