Unusual Error with Group By Recipe in Dataiku

Marouane Registered Posts: 1
edited March 19 in Using Dataiku

Hello,

I’ve encountered an unusual error while using the Group By recipe in Dataiku. Here’s a summary of the issue:

  1. Context: I created a Group By recipe on three columns, applying three custom aggregations using SQL.
  2. Input Data: The recipe takes as input a PostgreSQL (PGSQL) table, which is the output of a JOIN operation between three different datasets. There were no errors when creating this table.
  3. Error Encountered:
    • When I run the job, Dataiku’s job runner returns an error. To debug the issue, I created a simple, basic Group By recipe on the same dataset, but I encountered the same error:
      • ERROR: syntax error at or near "AS" Position: 535
      • Please see our options for getting help
      • HTTP code: , type: org.postgresql.util.PSQLException
      • [15:20:54] [WARN] [dku.flow.sql] - Could not print execution plan
        org.postgresql.util.PSQLException: ERROR: syntax error at or near "AS" Position: 447
          at org.postgresql.core.v3.QueryExecutorImpl.receiveErrorResponse(QueryExecutorImpl.java:2676)
          at org.postgresql.core.v3.QueryExecutorImpl.processResults(QueryExecutorImpl.java:2366)
          at org.postgresql.core.v3.QueryExecutorImpl.execute(QueryExecutorImpl.java:356)
          at org.postgresql.jdbc.PgStatement.executeInternal(PgStatement.java:496)
          at org.postgresql.jdbc.PgStatement.execute(PgStatement.java:413)
          at org.postgresql.jdbc.PgStatement.executeWithFlags(PgStatement.java:333)
          at org.postgresql.jdbc.PgStatement.executeCachedSql(PgStatement.java:319)
          at org.postgresql.jdbc.PgStatement.executeWithFlags(PgStatement.java:295)
          at org.postgresql.jdbc.PgStatement.executeQuery(PgStatement.java:244)
          at com.dataiku.dip.queries.ExecutionPlanService.getGenericExecutionPlan(ExecutionPlanService.java:477)
          at com.dataiku.dip.queries.ExecutionPlanService.getExecutionPlan(ExecutionPlanService.java:309)
          at com.dataiku.dip.recipes.code.sql.AbstractSQLQueryRecipeRunner.safePrintExecutionPlan(AbstractSQLQueryRecipeRunner.java:275)
          at com.dataiku.dip.dataflow.exec.sql.SQLQueryRecipeRunner.runFullSQL(SQLQueryRecipeRunner.java:199)
          at com.dataiku.dip.dataflow.exec.sql.SQLQueryRecipeRunner.run(SQLQueryRecipeRunner.java:163)
          at com.dataiku.dip.dataflow.exec.MultiEngineRecipeRunner.run(MultiEngineRecipeRunner.java:206)
          at com.dataiku.dip.dataflow.jobrunner.ActivityRunner$FlowRunnableThread.run(ActivityRunner.java:378)

  4. At times, the error message changes to: "Could not find 'column name' at position 566." Unfortunately, I haven’t been able to reproduce this specific error consistently.
  5. Issue Progression:
    • As development progressed, every new Group By recipe on this dataset stopped working. The issue appears to be persistent.
  6. Flowchart State:
    • Datasets: 91 datasets in total, with 76 of them being PGSQL output tables.
    • Recipes: 72 recipes in the project.
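
In case it helps anyone narrow this down: PostgreSQL reports the error position as a 1-based character offset into the query text, so the generated SQL can be sliced around that offset to see what sits next to the offending "AS". Below is a minimal sketch of that idea. The helper name `sql_error_context` and the sample query are hypothetical (this is not the SQL generated by my recipe), and it assumes the generated query has been copied out as a plain string. Note that the two positions in the log above (535 and 447) may well refer to two different statements, so each would need to be checked against its own query text.

```python
def sql_error_context(sql, position, width=30):
    """Return the query text around a PostgreSQL error position.

    `position` is the 1-based character offset reported in the error message.
    The marker " >>> " is placed right before the character at that offset.
    """
    if not 1 <= position <= len(sql) + 1:
        raise ValueError("position is outside the query text")
    idx = position - 1  # convert to a 0-based Python index
    before = sql[max(0, idx - width):idx]
    after = sql[idx:idx + width]
    return before + " >>> " + after


# Hypothetical query with an empty expression before AS (illustration only).
query = 'SELECT "region", AS "total" FROM "sales" GROUP BY "region"'
pos = query.index("AS") + 1  # stand-in for the position PostgreSQL would report

print(sql_error_context(query, pos))
```

If the text just before the marker ends with a comma or an opening parenthesis, that would suggest one of the aggregation expressions was emitted empty, which is the kind of thing I would check first in the custom aggregations.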

I would appreciate any insights or suggestions on how to resolve this issue.

Answers

  • Alexandru
    Alexandru Dataiker, Dataiku DSS Core Designer, Dataiku DSS ML Practitioner, Dataiku DSS Adv Designer, Registered Posts: 1,270 Dataiker

    Hi,
    Could you please open a support ticket with the job diagnostics so we can review them further?

    If the issue is intermittent, please attach diagnostics for both a successful and a failed job.

    Thanks
