-
Import Table from Hive JDBC Connection
I have set up a connection between Dataiku and Hive using an Apache Hive JDBC driver. When I select the "Import tables to dataset" option at the bottom of the connection, I can select a table and import it as a Dataset. However, when the Dataset gets created, I get the following error: Failed to read data from table,…
-
[CDH Cluster] Unable to start HiveServer2 Connection
Hi, I am evaluating DSS, so I installed it in my server and added a 2 weeks enterprise trial license. I am facing a Cloudera CDH 5.12 cluster, kerberized. I am able to connect and browse HDFS, but Hive connection is not working. This is the error in log file backlog.log [2018/05/30-07:47:19.991] [qtp1440621772-171] [INFO]…
-
validation failed: Cannot insert into target table because number/types are different.
Hi, I get this message from a hive recipe on a partitioned dataset stored on HDFS: validation failed: Cannot insert into target table because number/types are different "2018-02": Table inclause-0 has 27 columns, but query has 28 columns. my query is: SELECT * FROM MyTable
-
The partitioning column does not display in dataiku
Hi everybody, I an getting a real problem, when I import a table from hive , the partitioning column does not display in dataiku please any help ?
-
How can I change the recipe engine from DSS to Hive ?
-
Sync empty Hive table
I have a shell script which appends a Hive table (overwritten on every run of the workflow) to a Hive archive table. However, on occasion, the first Hive table is empty, which is breaking the workflow because I get an error when the shell script runs: Failed to access source dataset project_key.dataset_name. Either it is…
-
DSS is Overwriting DATE as TIMESTAMP in HIVE
Hi, In our project we are loading input file into Hive tables using DSS and build summary tables on top of it which are used by tableau. We are using Python recipe which will create HIve table based on input file. Input file has HEADER which will have Column names. So python recipe creates all these columns as STRING data…
-
Hive metastore synchronization fails (GSS initiate failed: Server not found in Kerberos database)
DSS 4.0 When trying to synchronize the metastore, I get this error: [18:20:40] [ERROR] [org.apache.thrift.transport.TSaslTransport] running compute_sfpd_incidents_sample_prepared_NP - SASL negotiation failure javax.security.sasl.SaslException: GSS initiate failed [Caused by GSSException: No valid credentials provided…
-
HIVE metastore sync issue
We have 2 tables with the same name in different DSS projects, and we only see one table with the name in the Hive metastore. It seems that when tables are synced in DSS to a Hive Metastore, the DSS project names are not used. Is there a way to distinguish the tables in Hive without changing the table names?
-
How to use UDF in hive recipe ?
Hi, I'm trying to use a UDF in a hive recipe, I've seen the additional configuration line where I can add a key and a value Can I put there : add (as key?) and /thepathtomyudf.jar (as value?) ? Or is it not possible in Hive recipes ? Best Regards, -- Kevin KHATAEI Data Analyst / Big Data Consultant | CGI