
Any latest update on Excel binary format (Xlsb) support in Dataiku?

tinaresh
Level 1

I know DSS only supports XLSX, not XLSB. Are there any alternative ways to load the XLSB (binary) format?

I am also wondering why DSS does not support the XLSB format 😞

Cheers!

Naresh

Import xlsb 

2 Replies
sergeyd
Dataiker

Hi @tinaresh 

I assume DSS does not support this file type because of the difference in the underlying storage format: XLSB stores the workbook in a binary format, whereas most other Excel files (such as XLSX) use XML.
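To make the format difference concrete, here is a minimal sketch that tells the two apart by looking inside the file. It assumes the standard package layout, where both formats are ZIP containers and the workbook part is stored as xl/workbook.bin in XLSB and xl/workbook.xml in XLSX. The function name excel_container_kind is made up for illustration and is not part of DSS or pandas.

```python
import io
import zipfile


def excel_container_kind(raw_bytes):
    """Return 'xlsb', 'xlsx', or 'unknown' for the given file content.

    Assumption: the workbook part lives at xl/workbook.bin (XLSB)
    or xl/workbook.xml (XLSX) inside the ZIP container.
    """
    buffer = io.BytesIO(raw_bytes)
    if not zipfile.is_zipfile(buffer):
        return "unknown"  # not a ZIP container at all (e.g. legacy .xls)
    with zipfile.ZipFile(buffer) as archive:
        names = set(archive.namelist())
    if "xl/workbook.bin" in names:
        return "xlsb"
    if "xl/workbook.xml" in names:
        return "xlsx"
    return "unknown"
```

This only inspects the archive listing, so it may be handy as a quick check before choosing a reader for a file whose extension you do not trust.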

You can always use Python to read XLSB files from a managed folder:

import dataiku
import pandas as pd
import io
import pyxlsb  # not called directly; pandas uses it as the engine (pip install pyxlsb)

folder_handle = dataiku.Folder("folder_id")  # replace with your folder id
paths = folder_handle.list_paths_in_partition()
for path in paths:
    print(path)
    # Download the file content and wrap it in an in-memory buffer for pandas
    with folder_handle.get_download_stream(path) as f:
        b = io.BytesIO(f.read())
        df = pd.read_excel(b, engine='pyxlsb')  # reads the first sheet
        print(df.shape)
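Note that the loop above tries to read every file in the folder, so it would likely fail on anything that is not an XLSB workbook. Below is a small sketch of two helpers that could sit around it: one keeps only the .xlsb paths, the other reads all sheets from the downloaded bytes. The names xlsb_paths and read_all_sheets are hypothetical, not part of the Dataiku API. The second helper relies on pandas' read_excel with sheet_name=None, which returns a dict mapping sheet names to DataFrames, and still needs pyxlsb installed.

```python
import io
import posixpath


def xlsb_paths(paths):
    """Keep only the paths whose extension is .xlsb (case-insensitive)."""
    return [p for p in paths if posixpath.splitext(p)[1].lower() == ".xlsb"]


def read_all_sheets(raw_bytes):
    """Read every sheet of an XLSB workbook into a dict of DataFrames."""
    import pandas as pd  # imported here so xlsb_paths works without pandas

    # sheet_name=None asks pandas for all sheets, keyed by sheet name
    return pd.read_excel(io.BytesIO(raw_bytes), engine="pyxlsb", sheet_name=None)
```

In the loop above you would iterate over xlsb_paths(paths) instead of paths, and pass f.read() to read_all_sheets.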


CoreyS
Community Manager

Hi @tinaresh, please feel free to use the Product Ideas board, which is here to let you share and exchange ideas on how to improve Dataiku. Here are some resources to help you get started:

How to suggest Dataiku ideas 
Participating on the Product Ideas board 
Suggest an idea

I hope this helps!
