-
HDFS - Force Parquet as default settings for recipe output
Greetings ! I'm currently on a platform with Dataiku 11.3.1 and writing datasets on HDFS. IT requires all dataset to be written in Parquet, but the default setting is on CSV (Hive) and it can generate errors. Is there a way to configure the connection to force the default settings to be Parquet ? Best regards,
-
how to use the Dataiku template
i am learning academy-sample. So, i want to learn more sample. Is there any way to download the Dataiku template (sample : "https://gallery.dataiku.com/projects/QS_JOB_POSTINGS/" , and "AML Alerts Triage" , etc ) and import it into my environment? Operating system used: ubuntu
-
I can not view geo-map ( i use free-edition)
Using the "Geocode" function of the "recipes" feature, an external API was used to add the latitude and longitude items. In addition, we will use "GeoPoint" to change to map data. The documentation says that by doing this, you can use "charts" to visualize the data using a map. I added a geo-point column. On the subsequent…
-
Messaging channels : SMTP with OAuth2
Hello, I would like to setup an SMTP (smtp.office365.com) messaging channel with OAuth 2 authentication (https://docs.microsoft.com/en-us/exchange/client-developer/legacy-protocols/how-to-authenticate-an-imap-pop-smtp-application-by-using-oauth) Is it supported by Dataiku and if yes what would be the Extra properties to…
-
Dss running Out of memory too fast
Hello ! I'm currently working on a DSS instance, having fun building data piplines, but it's running out of memory after 2hrs. DSS is installed manually on a docker image. We tryed several setup with 'xmx' configuration but we ends-up with many idle processes (~20 jek) that filled our allocated memory. The number of jek is…
-
Base Image for containerized execution
Hi All, DSS 12 made the switch for the base OS to AlmaLinux. The logs on the fleet manager seem to indicate that the base image for containerized execution is still centos7. Is that correct? What are the plans/timelines to update the base image if indeed we are still on centos7 that is fast approaching EOL? Thanks, Yash
-
IPython module error
I have a custom linux install of DSS. Recently, all my Python notebooks have been failing with the nginx error screen. On checking the logs, I see the following in ipython.log. [2023/07/13-02:11:33.293] [8630/MainThread] [WARNING] [NotebookApp] Notebook version 5 is no longer maintained. Please upgrade to version 6 or…
-
Connecting anaconda with dss windows version
Hello community, I am having a trouble connecting anaconda with DSS windows version. I have followed the instructions on this thread, I searched for the path of anaconda and then added it to the env-site.ps1. (env-site.ps1 for windows, env-site.sh for Linux) Here are the screenshots of the steps I have made. I have found…
-
Null fields in JSON are ignored and not ingested
Hi! I am new to using Dataiku and noticed that fields in my JSON files that are null are not being ingested. Is this common/expected behaviour? Is there a setting to force ingesting fields in JSON even though they are null in every JSON I have? I am extracting the JSON files from an S3 bucket Operating system used: MacOS…
-
Setting Up Python 3.7 and Anaconda Code Env
Hi All, I am having some trouble setting Dataiku code environments with any Python other than 2.7 & 3.6. Further more, I can't seem to get the "Use Conda" option either. I know anaconda is considered a tier 2 support item, but it's required for my current project. Does anyone know how to get these features working? Thank…