-
Re: Combining data sources from Blob Storage
While it's easy to dump the data in cloud storage like that it's very hard to retrieve and to perform any data retrieval when you have millions of files. So you really should move away from…1 · -
Re: Beginner Help: Deploying an API Service with Pickle Model from Jupyter Notebook in Dataiku
An API service running in the API node and doesn't have a project context. In fact the API node doesn't even have access to the project data. I am guessing that you are only testing your AP…1 · -
Re: [Samsung Fire & Marine] Need to improve the performance of Join and Group recipes
You are comparing apples to oranges and giving no context. SAS is an in-memory tool that's why it can perform joins fast assumning the data can be loaded into memory. But like all in-memory tool…1 · -
Re: How can I change the default location where a .conf file is created to any custom location?
This is not a Dataiku question but a great expectations package question so probably best to raise as a discussion in the great_expectations/discussions Github repo. It does look like you can pass th…1 · -
Re: Spark installation in standalone mode in DSS
Yes, this can be done but it is not a recommended setup. The documentation covers the use of an unmanaged Spark on Kubernetes: https://doc.dataiku.com/dss/latest/spark/installation.html https://doc.d…1 ·