-
Best way to clean data for country meaning
I have a dataset that includes ISO 3166-1 alpha-3 country codes, which are detected properly, and full country names, of which about 14% are flagged as failing validation. Is there somewhere I can get the canonical list of country names Dataiku uses for the meaning? And if I can get that list, what's the most time…
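One possible way to approach the cleanup, sketched under assumptions: once a canonical list is available, failing names could be mapped to their closest canonical entry with fuzzy matching from the Python standard library. The `CANONICAL` list below is a small hypothetical sample, not the list Dataiku uses, and `normalize_country` and its `cutoff` value are illustrative names and settings rather than anything from Dataiku's API.

```python
import difflib

# Hypothetical canonical list: a tiny sample for illustration only,
# NOT the list Dataiku uses for its country meaning.
CANONICAL = ["United States", "United Kingdom", "France", "Germany", "South Korea"]


def normalize_country(name, canonical=CANONICAL, cutoff=0.8):
    """Return the closest canonical name, or None if nothing is close enough."""
    # Map case-folded canonical names back to their original spelling.
    lookup = {c.casefold(): c for c in canonical}
    # Collapse repeated whitespace and ignore case before comparing.
    key = " ".join(name.split()).casefold()
    if key in lookup:
        return lookup[key]
    # Fall back to fuzzy matching; cutoff is an assumed similarity threshold.
    matches = difflib.get_close_matches(key, list(lookup), n=1, cutoff=cutoff)
    return lookup[matches[0]] if matches else None
```

Names that return `None` would still need manual review or an explicit alias table, since fuzzy matching cannot resolve alternative names that are spelled very differently from the canonical form.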
-
install-graphics-export air-gapped
I want to install install-graphics-export so I can export my flow or a dashboard to PDF. I’m in an air-gapped environment. When I start the installation, I get the following error message: ./dssadmin install-graphics-export [+] Saving installation log to /data/dataiku/data/run/install.log [+] Checking dependencies +…
-
Crawl budget prediction for enhanced SEO with OnCrawl plugin
We’re pleased to share that Dataiku has published an OnCrawl plugin. At OnCrawl, we are convinced that data science, like technical SEO, is essential to strategic decision-making in forward-looking companies today. The complexity of today's markets, the sheer volume of data available affecting SEO, the growing opacity of…
-
Allow Scenario Trigger on dataset change for Google Sheets
The idea of a "Trigger on dataset change" is excellent, but it doesn't support all dataset types. It would help us a lot if it could trigger on dataset changes in Google Sheets.
-
Can't set up "Containerized execution", "Build image for containerized visual recipes" not working
I have a GCP / Ubuntu installation of DSS. I'm trying to set up GKE to run recipes on. I've used the GKE plugin to create a cluster, I can see it running (both from DSS and from the Google Cloud Console). I think the documentation is outdated and refers to an older version of the plugin and still relies on the Google…
-
Extract tables from PDF
Hello community, to perform RAG, I want to extract tables from PDFs. I would like to do this using Dataiku plugins, but the quality is not what I expect. Do you know of other methods to do this? Thanks!
-
googlesheets plugin feature: Ignore top n rows on import
Reading a Google Sheet with the plugin currently requires that the column headers are in row 1. In the wild, a lot of users don't build sheets like that, and the data begins some rows further down the sheet. I suggest adding a feature to ignore a given number of top rows so that the header row and table data are set correctly.
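To illustrate the requested behavior, here is a minimal sketch, assuming the sheet has already been read as a list of raw rows (for example in a Python recipe). It is not the plugin's implementation; `table_from_rows` and `skip_top` are hypothetical names.

```python
def table_from_rows(rows, skip_top=0):
    """Drop the first `skip_top` rows, then treat the next row as the header."""
    body = rows[skip_top:]
    if not body:
        return []
    header, *data = body
    # Pair each data row with the header names to build records.
    return [dict(zip(header, row)) for row in data]


# Example: two rows of preamble above the real header.
raw = [["Quarterly report"], [], ["id", "name"], ["1", "Alice"], ["2", "Bob"]]
records = table_from_rows(raw, skip_top=2)
```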
-
Integration with Microsoft Fabric and its OneLake
Hi, I couldn't find anything on an integration with Microsoft Fabric through OneLake yet in the Dataiku docs or release notes. Is this coming soon? As I read the Microsoft docs, I understand we can't connect directly via ADLS, only via APIs or SDKs. Thanks in advance, Jonathyan. Operating system used: RHEL 8
-
Using Neo4j plug-in to create relationships duplicates nodes
Hi, I have created unique identifier(s) for two types of nodes in my graph. I first push the node data into Neo4j using the Export nodes recipe: the primary key is set to a column containing the unique identifier for the node. Then I push the data using the Export relationships recipe. Primary keys for source and target are…
-
How to get the handle of the current plugin?
I'm developing a plugin and I'd like to get the handle of the current plugin to get its name and settings. Similar to client.get_default_project() to get the current project. Also, is there a way to know which scenario is triggering the plugin? Is there any way to achieve this?