Hello,
I’ve encountered an unusual error while using the Group By recipe in Dataiku. Here’s a summary of the issue:
- Context: I created a Group By recipe on three columns, applying three custom aggregations using SQL.
- Input Data: The recipe takes as input a PostgreSQL (PGSQL) table, which is the output of a JOIN operation between three different datasets. There were no errors when creating this table.
- Error Encountered:
- When I run the job, Dataiku’s job runner returns an error. To debug the issue, I created a simple, basic Group By recipe, but it fails with the same error:
- ERROR: syntax error at or near "AS" Position: 535
- HTTP code: , type: org.postgresql.util.PSQLException
- [15:20:54] [WARN] [dku.flow.sql] - Could not print execution plan
  org.postgresql.util.PSQLException: ERROR: syntax error at or near "AS" Position: 447
    at org.postgresql.core.v3.QueryExecutorImpl.receiveErrorResponse(QueryExecutorImpl.java:2676)
    at org.postgresql.core.v3.QueryExecutorImpl.processResults(QueryExecutorImpl.java:2366)
    at org.postgresql.core.v3.QueryExecutorImpl.execute(QueryExecutorImpl.java:356)
    at org.postgresql.jdbc.PgStatement.executeInternal(PgStatement.java:496)
    at org.postgresql.jdbc.PgStatement.execute(PgStatement.java:413)
    at org.postgresql.jdbc.PgStatement.executeWithFlags(PgStatement.java:333)
    at org.postgresql.jdbc.PgStatement.executeCachedSql(PgStatement.java:319)
    at org.postgresql.jdbc.PgStatement.executeWithFlags(PgStatement.java:295)
    at org.postgresql.jdbc.PgStatement.executeQuery(PgStatement.java:244)
    at com.dataiku.dip.queries.ExecutionPlanService.getGenericExecutionPlan(ExecutionPlanService.java:477)
    at com.dataiku.dip.queries.ExecutionPlanService.getExecutionPlan(ExecutionPlanService.java:309)
    at com.dataiku.dip.recipes.code.sql.AbstractSQLQueryRecipeRunner.safePrintExecutionPlan(AbstractSQLQueryRecipeRunner.java:275)
    at com.dataiku.dip.dataflow.exec.sql.SQLQueryRecipeRunner.runFullSQL(SQLQueryRecipeRunner.java:199)
    at com.dataiku.dip.dataflow.exec.sql.SQLQueryRecipeRunner.run(SQLQueryRecipeRunner.java:163)
    at com.dataiku.dip.dataflow.exec.MultiEngineRecipeRunner.run(MultiEngineRecipeRunner.java:206)
    at com.dataiku.dip.dataflow.jobrunner.ActivityRunner$FlowRunnableThread.run(ActivityRunner.java:378)
- At times, the error message changes to: "Could not find 'column name' at position 566." Unfortunately, I haven’t been able to replicate this specific error consistently.
- Issue Progression:
- As development progressed, it seems that every new Group By recipe on this dataset has stopped working. The issue appears to be persistent.
- Flowchart State:
- Datasets: 91 datasets in total, with 76 of them being PGSQL output tables.
- Recipes: 72 recipes in the project.
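To narrow this down, a small helper like the one below could show which part of the generated SQL the reported position points at. This is only a sketch: `context_at` is a hypothetical helper name, and the query string is a made-up example, not the actual query Dataiku generated. It assumes PostgreSQL's convention that the error position is a 1-based character offset into the statement sent to the server, which may differ from the query text displayed in the recipe.

```python
def context_at(sql: str, position: int, width: int = 30) -> str:
    """Return the SQL text around a 1-based error position, plus a caret line.

    `position` is assumed to be the value PostgreSQL reports after
    "Position:", i.e. a 1-based character offset into the statement.
    """
    idx = position - 1  # convert to a 0-based Python index
    if not 0 <= idx < len(sql):
        raise ValueError(f"position {position} is outside the query (length {len(sql)})")
    start = max(0, idx - width)
    end = min(len(sql), idx + width)
    # Flatten newlines so the caret line stays aligned with the snippet.
    snippet = sql[start:end].replace("\n", " ")
    marker = " " * (idx - start) + "^"
    return snippet + "\n" + marker


if __name__ == "__main__":
    # Hypothetical query with a stray alias inside an aggregation,
    # used only to illustrate the helper.
    example_sql = 'SELECT "a", SUM("x" AS total) AS "total_sum"\nFROM "t"\nGROUP BY "a"'
    reported_position = example_sql.index("AS") + 1  # stand-in for the logged position
    print(context_at(example_sql, reported_position, width=15))
```

With the real generated query pasted in place of `example_sql` and 535 (or 447 for the execution-plan statement) as the position, the caret should land on the token PostgreSQL rejected.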
I would appreciate any insights or suggestions on how to resolve this issue.