Survey banner
The Dataiku Community is moving to a new home! Some short term disruption starting next week: LEARN MORE

read excel file present in folder

degananda264
Level 1
read excel file present in folder

i could not able to read excel file  present in dataiku folders .could you please share me the code if possible thanks in advance 

0 Kudos
3 Replies
MiguelangelC
Dataiker

Hi,

From the Python API for managed folders we can get a handle to interact with the objects inside a folder. However, there is no specific Dataiku method for reading Excel files. One straightforward method to do so would be to use pandas. For example:

import dataiku
import pandas as pd
folder=dataiku.Folder('<name of folder>')
with folder.get_download_stream(<path of file inside folder>) as f:
  data=f.read()
  df=pd.read_excel(data,engine='openpyxl')
print(df)

Please note that depending on the details of your Python and DSS version and the extension of the files the code may need to be slightly different. Also, note I am using the 'openpyxl' engine to do the read. So the code env I am using has this package installed. You may choose to use a different one.

Chocoder
Level 1

Hi @MiguelangelC ! I had the exact same question and your proposition worked perfectly fine for me!

Thanks a lot,

PS: maybe you should write your code like that as the ' ' were missing around path!

import dataiku
import pandas as pd
folder=dataiku.Folder('<name of folder>')
with folder.get_download_stream('<path of file inside folder>') as f:
  data=f.read()
  df=pd.read_excel(data,engine='openpyxl')
print(df)

 

0 Kudos
Sv3n-Sk4
Level 3

Hello @degananda264,

Did you try to use the "Create Dataset" when you selected your folder in your flow?

0 Kudos