Excel uploading is not working as expected

Povilas
Level 2
Excel uploading is not working as expected

Strange things happens trying to upload simple excel file (.xlsx) to Dataiku from local computer. I will try to clearly explain how did I try to upload file and what went wrong:




  • I click "+Dataset" button and select "Upload your files".

  • Drag simple excel file with two columns.

  • In "Format/Preview" window I select "skip first lines" to 1 (to skip table names).

  • In "Schema" window I select new column name and formats I want. Table looks like as I expected in "Format/Preview" window.

  • Without any more changes in "Partitioning" and "Advanced" windows I press "CREATE".

  • Dataset that is created does not have columns that I specified (it has default columns col_0 and col_1 instead) and format of one of the columns is changed to boolean (Text is ecpected in both).



 I tried to redo the same procedure with .csv file and everything worked fine.

0 Kudos
7 Replies
Alex_Combessie
Dataiker Alumni
Could you provide a sample of the Excel file?
0 Kudos
Povilas
Level 2
Author
I sent excel to you personal email (not sure if it's possible to upload file here...)
0 Kudos
Alex_Combessie
Dataiker Alumni

Thanks, I tested it using the setting below:





 It is indeed a bug for Excel files when renaming the columns at the upload stage in the Settings menu. While we fix it, we recommend using a Prepare visual recipe to perform the renaming.

0 Kudos
B2oriel
Level 2

I have the same problem. 

0 Kudos
Turribeach

This thread is from 2017. Please raise a new thread and explain your issue properly. 

0 Kudos
B2oriel
Level 2

My problem is exactly the same than this thread from 2017... i read "until we fix it", today nothing has been settled

0 Kudos
Turribeach

Well the thread gives an alternative solution. if you still face the issue on the latest version of Dataiku then clearly it hasnโ€™t been fixed. In which case you can use the alternative solution suggested or if you canโ€™t then you can raise a thread explaining why you canโ€™t use the alternative solution. Either way you should raise a new thread because only the original poster of the thread can mark questions as Solution Accepted. And if you post in the same thread you are preventing questions from being marked as resolved. People are also less likely to respond to an old thread or a thread that has many responses as they need to read the whole thread to understand what is talking about. You can link to this post in your new thread for reference. 

0 Kudos

Labels

?
Labels (2)
A banner prompting to get Dataiku