Any latest update on Excel binary format (Xlsb) support in Dataiku?

tinaresh
tinaresh Dataiku DSS Core Designer, Dataiku DSS ML Practitioner, Registered Posts: 3 ✭✭✭✭

I know DSS only supports XLSX, not XLSB! Is there any alternative ways to load XLSB (binary) format?

Also wondering why DSS is not supporting XLSB Format

Cheers!

Naresh

Tagged:

Answers

  • Sergey
    Sergey Dataiker, Dataiku DSS Core Designer, Dataiku DSS & SQL, Dataiku DSS Core Concepts Posts: 365 Dataiker
    edited July 17

    Hi @tinaresh

    I assume DSS does not support this file type due to the underlying change in the storing format. Xlsb uses binary format instead of XML like with most other Excel files (like XLSX)

    You can always use python to read xlsb files from a managed folder:

    import dataiku
    import pandas as pd
    import io
    import pyxlsb  # need to pip install pyxlsb
    
    folder_handle = dataiku.Folder("folder_id") #replace with folder id
    paths = folder_handle.list_paths_in_partition()
    for path in paths:
        print(path)
        with folder_handle.get_download_stream(path) as f:
            b = io.BytesIO(f.data)
            df = pd.read_excel(b, engine='pyxlsb')
            print(df.shape)

  • CoreyS
    CoreyS Dataiker Alumni, Dataiku DSS Core Designer, Dataiku DSS Core Concepts, Registered Posts: 1,150 ✭✭✭✭✭✭✭✭✭

    Hi @tinaresh
    please feel free to utilize the Product Ideas board. The Product Ideas board is here to let you share and exchange your ideas on how to improve Dataiku. Here are some resources to help get you started: Suggest an idea

    I hope this helps!

Setup Info
    Tags
      Help me…