-
Can I call Databases on a Deployed API using Python?
I have several databases that I'd like to access from an API I'm deploying. The databases are stored in the following postgres table: api-node-referenced-data I currently have many different SQL queries that are used to access the tables, but I'd like to change them all to a parameterised query in Python to avoid repeated…
-
Using date in DataIKU
Hi, Despite going through documentation multiple times, I still don't really understand how dates work in DSS. I'm importing dataset from a connection. Without turning on any of the options in Date & Time handling, this is how data looks like: It says that the data type is string, while in the database itself it is, in…
-
Test Capability filter using SQL Expression in Join Recipe
Hi, I got this dataset tx and cards_prepared dataset from your Dataiku Academy https://academy.dataiku.com/path/core-designer/visual-recipes-overview-1/500697 I’m doing it by starting from scratch. I'm using filter in join recipe using SQL Expression using a default one; "purchase_date" LIKE CONCAT('%',…
-
container configuration for streamlit app
Dear Experts, I am having difficulties in starting Streamlit app. In the container configuration, as shown in the attached image, when I select None, it gives error "Unable to start run : No container config to run Code Studio on". However when I select container ai_exec gpu, it always say Pod is still missing. I do not…
-
Why Aren't Record Counts Computable by Default?
In my DSS flows, I always activate record counts to check the volume of my datasets. However, it is cumbersome to activate them one by one, and it seems that it is not possible to activate them by default for an entire project. Why doesn't DSS allow this? Operating system used: RHL 8
-
SQL API ENDPOINT (Passing multiple values in test query)
Hi, I am trying to create a SQL API endpoint using a select query to my database. select * from table where state in (?) and city in (?) parameters param1, param2 The query works fine, however problem arises when i try to filter records for multiple values of a parameter. Just wanted to check where am i going wrong with…
-
DSS visual recipes defaulting to max column length with Redshift tables
Hi everyone, When working with Redshift tables in DSS visual recipes we noticed that the table creation settings sometimes defaults to setting certain column lengths to the redshift max (65,000). In many cases this becomes excessive. For example, in the screenshot below the "brand" column has a length of 65k but most of…
-
How to create a column with a unique ID value
Hi, I have 4 flow with different data information but with several companies in all flows. How can I create a column with a unique ID for each company? This way I can merge the flows together based on the ID. Thanks
-
Dataset type change error after python recipe
I'm using Python recipe (pandas) to edit column names of my dataset (names changed after pivot recipe). after python recipe type of one column changes. The problem is i get following error "The schema of the dataset does not match the table" * Type mismatch for column 1 (serial) : 'NVARCHAR' in dataset, 'DOUBLE'(8:float)…
-
Extracting Archive files
I'm trying to extract set of archive files with different formats like [".zip", ".7z", ".rar", ".tar", ".gz", ".bz2", ".xz", ".iso", ".ZIP", ".7Z", ".RAR", ".TAR", ".GZ", ".BZ2", ".XZ", ".ISO"] I was using 7z executable api to extract locally in python . I could see that there's no in built execution api like 7z. Is there…