-
RAG Webapp
Hello everyone, In my Dataiku Flow, I have a RAG setup that includes embeddings and prompts. I’d like to replicate this process—achieving the same results as in Prompt Studio—in a Dash web app. The goal is to reuse the knowledge base built in the Flow and leverage the augmented LLM created by the embedding recipe. Does…
-
Getting an error while importing a Dataiku project
While migrating from dev to val, I am getting the error below. Can you please tell me how I can fix it? Importing archive... Traceback (most recent call last): File "/app/dss_install/dataiku-dss-13.2.2/python/dataikuapi/dssclient.py", line 1490, in _perform_http response.raise_for_status() File…
-
MS SQL SERVER CONNECTION
Hello everyone, I created a connection to my Azure SQL DB using the MS SQL Server connector. The connection went well, but when I clicked on "Get table list", I got the following error message: Oops: an unexpected error occurred The connection is closed. Please see our options for getting help HTTP code: 500, type:…
-
Column index
I have a dataset which has columns Jan, Jan_1, Feb, Feb_1... I want to use the column index to pick the last column. Can you help me use the column index without Python?
-
Access to webapp back-ends from outside of DSS (with Postman)
I found this article but I have some questions; I hope someone can help me. https://community.dataiku.com/discussion/7648/securing-access-to-webapp-back-ends-from-outside-of-dss I created a standard webapp with a Python server, and I'm trying to access the endpoints from Postman. I'm sending my project API key as suggested by…
-
Seeking Optimization Tips for DSS Flow and Spark Configuration
Hello everyone, I am currently working on optimizing my DSS flow. I have a scenario that currently takes 20 minutes to execute, and I am looking to reduce this to 5 minutes. I would greatly appreciate any tips or strategies for optimization. Additionally, I am interested in understanding how to configure Spark…
-
Spark configuration for optimal resource allocation
Hello, I am interested in understanding how to configure Spark settings to ensure optimal resource allocation. Specifically, I am looking for guidance on configuring parameters like spark.driver.cores, spark.dynamicAllocation.initialExecutors, spark.executor.cores, spark.dynamicAllocation.enabled, spark.executor.instances,…
-
Pairwise distance for 2 geopoints
Is there any tool available to calculate pairwise distance? I have 2 different geopoints available in a dataframe.
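One common way to get the distance between two geopoints is the haversine (great-circle) formula. The snippet below is only a minimal sketch under assumptions: the function name `haversine_km`, the (latitude, longitude) tuple layout in decimal degrees, and the sample coordinates are all illustrative and not taken from the original post.

```python
import math

# Mean Earth radius in kilometres (spherical approximation).
EARTH_RADIUS_KM = 6371.0088


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two points given in decimal degrees."""
    phi1 = math.radians(lat1)
    phi2 = math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


# Hypothetical rows: each holds two geopoints as (lat, lon) tuples.
rows = [
    ((48.8566, 2.3522), (51.5074, -0.1278)),
    ((0.0, 0.0), (0.0, 90.0)),
]

# One distance per row, i.e. the pairwise distance between the two points.
distances = [haversine_km(a[0], a[1], b[0], b[1]) for a, b in rows]
```

The same function can be applied row by row to two geopoint columns of a dataframe. It treats the Earth as a sphere, so results are approximate.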
-
Dynamic Column
I have an Excel input file. From Col A to Col T, down to row 26, I have data where Col T holds the latest month's data. Col V to Col AO hold a second set of data, down to AO50. The data is dynamic: every month, a new column is added. For example, next month Col U will have December's data. The same applies to the second set of data, I will be…
-
Delete records based on multiple JOINs
Newbie here. Trying to convert a SQL query from Hive that pulls records partly based on several JOIN conditions but limits those records based on other JOIN conditions. In SQL it is a "WHERE NOT EXISTS" condition. The following is the code - SELECT x FROM y, z Multiple left joins… (and) WHERE NOT EXISTS ( SELECT 1 FROM…