Any latest update on Excel binary format (Xlsb) support in Dataiku?

tinaresh
Level 1
Any latest update on Excel binary format (Xlsb) support in Dataiku?

I know DSS only supports XLSX, not XLSB! Is there any alternative ways to load XLSB (binary) format? 

Also wondering why DSS is not supporting XLSB Format ๐Ÿ˜ž

Cheers!

Naresh

Import xlsb 

0 Kudos
2 Replies
sergeyd
Dataiker

Hi @tinaresh 

I assume DSS does not support this file type due to the underlying change in the storing format. Xlsb uses binary format instead of XML like with most other Excel files (like XLSX)

You can always use python to read xlsb files from a managed folder: 

 

import dataiku
import pandas as pd
import io
import pyxlsb  # need to pip install pyxlsb

folder_handle = dataiku.Folder("folder_id") #replace with folder id
paths = folder_handle.list_paths_in_partition()
for path in paths:
    print(path)
    with folder_handle.get_download_stream(path) as f:
        b = io.BytesIO(f.data)
        df = pd.read_excel(b, engine='pyxlsb')
        print(df.shape)

 

 

 

 

CoreyS
Dataiker Alumni

Hi @tinaresh  please feel free to utilize the Product Ideas board. The Product Ideas board is here to let you share and exchange your ideas on how to improve Dataiku. Here are some resources to help get you started:

How to suggest Dataiku ideas 
Participating on the Product Ideas board 
Suggest an idea

I hope this helps!

Looking for more resources to help you use Dataiku effectively and upskill your knowledge? Check out these great resources: Dataiku Academy | Documentation | Knowledge Base

A reply answered your question? Mark as โ€˜Accepted Solutionโ€™ to help others like you!