Problem with np.bool dependency in "dataiku" package in python

Solved!
collig
Level 2
Problem with np.bool dependency in "dataiku" package in python

Hi,

I'm having a problem importing the  "dataiku"  package in python in the following environments since update to 11.2

python 3.6

python 3.7

python 3.9

 

import dataiku

 

 

/data1/dss/dataiku-dss-11.2.0/python/dataiku/core/schema_handling.py:17: FutureWarning: In the future `np.bool` will be defined as the corresponding NumPy scalar.  (This may have returned Python scalars in past versions.
  'boolean': np.bool

---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)
<ipython-input-4-3acb3cd0f9dd> in <cell line: 1>()
----> 1 import dataiku

/data1/dss/dataiku-dss-11.2.0/python/dataiku/__init__.py in <module>
     11 from .base import remoterun
     12 from .core.base import is_container_exec
---> 13 from .core.dataset import Dataset, _dataset_writer_atexit_handler
     14 from .core.schema_handling import get_schema_from_df
     15 

/data1/dss/dataiku-dss-11.2.0/python/dataiku/core/dataset.py in <module>
     33 
     34 # Module code
---> 35 from dataiku.core import flow, base, schema_handling, dkuio
     36 from dataiku.core.platform_exec import read_dku_json
     37 from dataiku.core.dkujson import dump_to_filepath, load_from_filepath

/data1/dss/dataiku-dss-11.2.0/python/dataiku/core/schema_handling.py in <module>
     15     'float': np.float32,
     16     'double': np.float64,
---> 17     'boolean': np.bool
     18 }
     19 

/data1/dss/dss_data/code-envs/python/GC_dbs_39/lib64/python3.9/site-packages/numpy/__init__.py in __getattr__(attr)
    282             return Tester
    283 
--> 284         raise AttributeError("module {!r} has no attribute "    285                              "{!r}".format(__name__, attr))
    286 

AttributeError: module 'numpy' has no attribute 'bool'

Appears at first to be a soft warning but any attempt to use the dataiku package fails

 

# Example: load a DSS dataset as a Pandas dataframe
mydataset = dataiku.Dataset("PWHRn6UM")

 

 

---------------------------------------------------------------------------
NameError                                 Traceback (most recent call last)
<ipython-input-3-4bdc522e44af> in <cell line: 2>()
      1 # Example: load a DSS dataset as a Pandas dataframe
----> 2 mydataset = dataiku.Dataset("PWHRn6UM")

NameError: name 'dataiku' is not defined


Operating system used: Windows


Operating system used: Windows

0 Kudos
1 Solution
MiguelangelC
Dataiker

Hi,


The error is not related to the newer DSS version. This behaviour comes from the numpy package version it uses.

While the deprecation warning is harmless, the attribute error comes from 'numpy.bool' not being available since numpy 1.24.0 (released on 19th of December): https://github.com/numpy/numpy/releases/tag/v1.24.0


​We are in process of fixing this globally for DSS. In the meantime, we can do a quick workaround on your code env:
 1) Go to Administration > Code Envs > Select your code env
 2) Add the following in 'Packages to install': numpy < 1.24'
 3) Click 'UPDATE'

View solution in original post

0 Kudos
7 Replies
collig
Level 2
Author

Issue resolves if I use numpy==1.23 on python 3.9

numpy 1.23 not compatible with python 3.6 / 3.7 however

0 Kudos
MiguelangelC
Dataiker

Hi,


The error is not related to the newer DSS version. This behaviour comes from the numpy package version it uses.

While the deprecation warning is harmless, the attribute error comes from 'numpy.bool' not being available since numpy 1.24.0 (released on 19th of December): https://github.com/numpy/numpy/releases/tag/v1.24.0


​We are in process of fixing this globally for DSS. In the meantime, we can do a quick workaround on your code env:
 1) Go to Administration > Code Envs > Select your code env
 2) Add the following in 'Packages to install': numpy < 1.24'
 3) Click 'UPDATE'

0 Kudos
collig
Level 2
Author

Thanks, this resolves issue with earlier versions of python

0 Kudos
TEChopra1000
Level 1

It looks like this hasn't been completed yet:

​We are in process of fixing this globally for DSS.

Is this still WIP on the Dataiku side? 

0 Kudos
NicolasD
Dataiker

Hello,

As of version 12.2.3, we've removed this deprecated usage from our codebase. You may still get the warning since it's in use in some common dependencies like pandas or scikit-optimize, meaning we still don't recommend using numpy>=1.24 in general.

Best regards

RichardScottOZ
Level 1

Using one now with numpy 1.23.5, same error.

0 Kudos

Please start a new thread.

0 Kudos

Labels

?

Setup info

?
A banner prompting to get Dataiku