Spanish accent in column names & records

mangeshp23
Level 2
Spanish accent in column names & records

Hello,

I have a input folder in that we have 29 csv files. In those files we have columns which has Spanish accent in their name. & we have some columns which has records with Spanish accent. So while reading the CSV files, we are facing issue of count mismatch. so  how we get the proper records with spanish accent?


Operating system used: Windows

0 Kudos
3 Replies
tgb417

@mangeshp23 

I know that you can import non-latian (ASCII) characters in data records as CSV files.

However, I've never tried using characters like รฑ and ร‘ for column headers.  

If it is not working I'd put in a support ticket.  The support team may know about some work arounds.  This may also be considered as a defect and might be able to change.  You could also suggest this as a product idea.  https://community.dataiku.com/t5/Product-Ideas/idb-p/Product_Ideas

If you have to work fast.  I might try in import the data without headers, just drop the first row of each file.  You will get default column names but you may be able to get the data into DSS to start working on it.

 

--Tom
LouisDHulst

Hi,

@mangeshp23 

Could you show an example of data loss? I was able to import a csv file with both รฑ and รก in the column headers and in the records using the visual interface and with a python script reading from a folder without any data loss.

welderm
Level 1

I have the same case, I'm reading a json file and the keys have accents.

I used the command below but it didn't work.

locale.setlocale(locale.LC_TIME, 'pt_BR.utf8')

0 Kudos