In this post, we will show how to set up a sample CI/CD (continuous integration / continuous deployment) pipeline built on Jenkins for our Dataiku DSS project. It follows our blog post Continuous integration and continuous deployment (CI/CD) in Dataiku DSS that presents the concepts and some important questions in order to…
Greetings fellow Linguists, You can start by installing NLTK (Natural Language Toolkit) as any other Python package in DSS, by creating a code environment and adding "nltk" to your package requirements. To do so, follow this documentation. However, some functionalities of NLTK such as text corpora and language-specific…
Greetings fellow Linguists, To use spaCy models in DSS, you can start by installing it like any other Python package in DSS: by creating a code environment and adding "spacy" to your package requirements. To do so, follow this documentation. However, some functionalities of spaCy, such as language-specific tokenizers, rely…
A common requirement when you have a column of numbers is to format all numbers so that they have the same length, adding leading zeros if needed. This can be done in the DSS preparation recipe using a Formula. The formula function to use is format. For example, to ensure that all values of the column mycolumn are padded…
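For comparison outside the preparation recipe, the same zero-padding idea can be sketched in plain Python. This is not the DSS formula itself; the helper name `pad_with_zeros` and the width of 5 are assumptions chosen for illustration.

```python
def pad_with_zeros(value, width):
    """Return `value` as a string, left-padded with zeros to `width` characters."""
    # str.zfill leaves strings that are already long enough unchanged
    return str(value).zfill(width)


# Hypothetical sample values standing in for a numeric column
values = [7, 42, 12345]
padded = [pad_with_zeros(v, 5) for v in values]
print(padded)
```

Values that already have the target length are returned unchanged, so the helper can be applied to a whole column safely.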
React webapps are not natively supported by DSS, but it's still possible to integrate a React application into Dataiku DSS with the help of a DSS dev plugin and a visual webapp. In this article, I'll discuss a few ways you can do this.
Quick start
All of the steps below are implemented in a demo plugin found in this…
This article applies both to:
* Bokeh webapps
* Usage of the Bokeh library in a Jupyter notebook
Add your image to the "Static Web Resources"
* In the global menu of DSS, select "Global Shared Code". If you don't see this menu, your administrator needs to grant you additional permissions.
* Click on "Static Web Resources"…
While adding a "web content" insight to a Dataiku DSS dashboard, you may see either a blank insight or an "unhappy face" icon. One possible cause is that you are trying to embed a non-secure (http://) page within a secure (https://) DSS instance. Browsers forbid this kind of embedding as a security measure. Dataiku DSS cannot…
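The http-inside-https situation described above can be checked before adding the insight. The following is a minimal sketch using only the Python standard library; the function name and the example URLs are hypothetical, and it only compares URL schemes rather than reproducing every browser rule.

```python
from urllib.parse import urlparse


def is_mixed_content(host_url, embedded_url):
    """Return True when an http:// page would be embedded in an https:// page."""
    host_scheme = urlparse(host_url).scheme.lower()
    embedded_scheme = urlparse(embedded_url).scheme.lower()
    return host_scheme == "https" and embedded_scheme == "http"


# Hypothetical URLs for illustration only
print(is_mixed_content("https://dss.example.com", "http://intranet.example.com/page"))
print(is_mixed_content("https://dss.example.com", "https://intranet.example.com/page"))
```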
When working with large amounts of disparate, user-entered text data, we often need to standardize or collapse entries into a resolved form. For example, how can we get a computer to recognize that strings like "Abraham Lincoln", "Abe Lincoln", and "Abrahm Lincoln" are actually the same category? We want to map these close…
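As a rough illustration of mapping near-duplicate strings to one resolved form, here is a minimal sketch based on Python's standard `difflib` module. It is not necessarily the method the article goes on to describe; the canonical list, the `resolve` helper, and the 0.6 similarity cutoff are assumptions made for the example.

```python
import difflib


def resolve(entry, canonical, cutoff=0.6):
    """Map `entry` to its closest canonical form, or return it unchanged if none is close enough."""
    matches = difflib.get_close_matches(entry, canonical, n=1, cutoff=cutoff)
    return matches[0] if matches else entry


# Hypothetical list of resolved categories
canonical = ["Abraham Lincoln", "George Washington"]
entries = ["Abraham Lincoln", "Abe Lincoln", "Abrahm Lincoln"]

resolved = [resolve(e, canonical) for e in entries]
print(resolved)
```

A stricter cutoff reduces false merges at the cost of leaving more variants unresolved, so the threshold usually needs tuning on real data.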
Scenario executions within Dataiku DSS are a key part of operationalization (o16n), and your organization may have many of them running every day. If you are using Jira to organize your support flow, you may want to have an issue automatically created when a scenario fails. One way of doing this is to set up Dataiku DSS to…
You can add a group to a DSS project, and add the LDAP authorization group, in Python.
Using a Python Script
Here is a sample code snippet to achieve this:
import dataiku
client = dataiku.api_client()
client.create_project(name="SAMPLE_PROJECT", project_key="SAMPLE_PROJECT", owner="YOUR_USER")
project =…