
Pandas error - Python Code Environment Creation

mettuharish
Level 3

Hello Team,

 

We are seeing the errors below while creating Python code environments. Please help us get this fixed.


created virtual environment CPython3.6.8.final.0-64 in 316ms
creator CPython3Posix(dest=/opt/app/data/code-envs/python/test_env, clear=False, no_vcs_ignore=False, global=False)
seeder FromAppData(download=False, pip=bundle, setuptools=bundle, wheel=bundle, via=copy, app_data_dir=/home/svc_ikuadmin_prd/.local/share/virtualenv)
added seed packages: pip==21.3.1, setuptools==59.6.0, wheel==0.37.1
activators BashActivator,CShellActivator,FishActivator,NushellActivator,PowerShellActivator,PythonActivator
Updating code environment according to spec ...
Installing from Pip requirements
WARNING: Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
WARNING: Retrying (Retry(total=3, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
WARNING: Retrying (Retry(total=2, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
WARNING: Retrying (Retry(total=1, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
WARNING: Retrying (Retry(total=0, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
ERROR: Could not find a version that satisfies the requirement pandas<1.1,>=1.0 (from versions: none)
ERROR: No matching distribution found for pandas<1.1,>=1.0
/opt/app/data/code-envs/python/test_env/bin/python: No module named ipykernel


Operating system used: Redhat

0 Kudos
9 Replies
Turribeach

Does your DSS server have access to the internet to download packages from PyPI?

0 Kudos
mettuharish
Level 3
Author

No, it doesn't have access to the internet. Is there an alternative way to achieve this?

0 Kudos

Well, simply get internet connectivity for your server, either direct or via a proxy. Or get a pip mirror hosted internally in your company. Or pay for a hosted repository mirror and open up your company firewalls to it:

https://help.sonatype.com/repomanager3/nexus-repository-administration/formats/pypi-repositories

Without being able to install Python packages, Dataiku's ability to run custom Python is pretty much reduced to the built-in / pre-installed packages, which are pretty basic stuff like Pandas.
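If you go the internal mirror or proxy route, pip has to be told about it. Here is a minimal sketch that writes out a pip configuration using pip's standard `index-url`, `trusted-host` and `proxy` options. The mirror URL and host name are hypothetical placeholders, and the helper name `build_pip_conf` is mine, not something from Dataiku or pip. Where the file should live for the account running DSS depends on your setup, so treat that part as something to check on your side.

```python
import configparser
import io


def build_pip_conf(index_url, trusted_host=None, proxy=None):
    """Return pip.conf text that points pip at the given package index."""
    # interpolation=None so '%' characters in URLs are kept literally
    config = configparser.ConfigParser(interpolation=None)
    config["global"] = {"index-url": index_url}
    if trusted_host:
        # Only needed when the mirror's certificate is not trusted by the host
        config["global"]["trusted-host"] = trusted_host
    if proxy:
        config["global"]["proxy"] = proxy
    buffer = io.StringIO()
    config.write(buffer)
    return buffer.getvalue()


# Hypothetical internal mirror URL; replace it with your company's own.
print(build_pip_conf("https://nexus.example.internal/repository/pypi-proxy/simple"))
```

The printed text can be saved as a pip configuration file (pip also honours the `PIP_CONFIG_FILE` environment variable to locate it).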

0 Kudos
mettuharish
Level 3
Author

The server is able to connect to pypi.org on port 443, but I am still facing the same issue. How can I resolve this?

0 Kudos

Can you please post the actual error? Thanks 

0 Kudos
mettuharish
Level 3
Author

Here is the actual error:

created virtual environment CPython3.8.14.final.0-64 in 393ms
creator CPython3Posix(dest=/opt/app/data/code-envs/python/Harish_test, clear=False, no_vcs_ignore=False, global=False)
seeder FromAppData(download=False, pip=bundle, setuptools=bundle, wheel=bundle, via=copy, app_data_dir=/home/svc_ikuadmin_prd/.local/share/virtualenv)
added seed packages: pip==22.3.1, setuptools==65.6.3, wheel==0.38.4
activators BashActivator,CShellActivator,FishActivator,NushellActivator,PowerShellActivator,PythonActivator
Updating code environment according to spec ...
Installing from Pip requirements
WARNING: Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
WARNING: Retrying (Retry(total=3, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
WARNING: Retrying (Retry(total=2, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
WARNING: Retrying (Retry(total=1, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
WARNING: Retrying (Retry(total=0, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
ERROR: Could not find a version that satisfies the requirement pandas<1.1,>=1.0 (from versions: none)
ERROR: No matching distribution found for pandas<1.1,>=1.0
WARNING: There was an error checking the latest version of pip.
/opt/app/data/code-envs/python/Harish_test/bin/python: No module named ipykernel

0 Kudos

Well, nothing new there. The server is still unable to connect to the Python repository. More than one URL needs to be allowed (by default pip uses pypi.org for the index and files.pythonhosted.org for the package files). You also need working DNS and no outbound firewall blocking traffic from the DSS server.
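To narrow down where it breaks, something like the sketch below can be run as the DSS user. It checks DNS, the TCP connection and the TLS handshake separately for each host, since a "Connection reset by peer" can happen after the TCP connection has already opened. The function names and the timeout are my own choices, and it only tests a direct connection (it does not go through any proxy), so take it as a rough diagnostic rather than a definitive test.

```python
import socket
import ssl

# Hosts pip contacts by default: the index and the package file host.
PYPI_HOSTS = ["pypi.org", "files.pythonhosted.org"]


def check_host(host, port=443, timeout=10.0):
    """Return (stage, detail); stage is 'dns', 'tcp' or 'tls' on failure, else 'ok'."""
    try:
        socket.getaddrinfo(host, port, type=socket.SOCK_STREAM)
    except socket.gaierror as exc:
        return "dns", str(exc)
    try:
        sock = socket.create_connection((host, port), timeout=timeout)
    except OSError as exc:
        return "tcp", str(exc)
    try:
        context = ssl.create_default_context()
        with context.wrap_socket(sock, server_hostname=host) as tls:
            # Handshake succeeded; report the negotiated protocol version
            return "ok", tls.version()
    except (ssl.SSLError, OSError) as exc:
        sock.close()
        return "tls", str(exc)


def report(hosts=PYPI_HOSTS):
    """Print one line per host showing the stage reached."""
    for host in hosts:
        stage, detail = check_host(host)
        print(f"{host}: {stage} ({detail})")
```

Calling `report()` prints one line per host. A result of `tls` with a reset or certificate error would point at a firewall or inspecting proxy rather than at DSS itself.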

0 Kudos
mettuharish
Level 3
Author

Please check this.

[svc_ikuadmin_prd@nau2p-lapphsbqh python]$ curl -v telnet://pypi.org:443
* About to connect() to pypi.org port 443 (#0)
* Trying 151.101.192.223...
* Connected to pypi.org (151.101.192.223) port 443 (#0)
^C
[svc_ikuadmin_prd@nau2p-lapphsbqh python]$ curl -v telnet://pypi.python.org:443
* About to connect() to pypi.python.org port 443 (#0)
* Trying 199.232.32.223...
* Connected to pypi.python.org (199.232.32.223) port 443 (#0)
^C
[svc_ikuadmin_prd@nau2p-lapphsbqh python]$ curl -v telnet://pythonhosted.org:443
* About to connect() to pythonhosted.org port 443 (#0)
* Trying 151.101.65.55...
* Connected to pythonhosted.org (151.101.65.55) port 443 (#0)
^C
[svc_ikuadmin_prd@nau2p-lapphsbqh python]$ curl -v telnet://pythonhosted.org:80
* About to connect() to pythonhosted.org port 80 (#0)
* Trying 151.101.193.55...
* Connected to pythonhosted.org (151.101.193.55) port 80 (#0)
^C
[svc_ikuadmin_prd@nau2p-lapphsbqh python]$ curl -v telnet://pypi.python.org:80
* About to connect() to pypi.python.org port 80 (#0)
* Trying 199.232.32.223...
* Connected to pypi.python.org (199.232.32.223) port 80 (#0)
^C
[svc_ikuadmin_prd@nau2p-lapphsbqh python]$ curl -v telnet://pypi.org:80
* About to connect() to pypi.org port 80 (#0)
* Trying 151.101.128.223...
* Connected to pypi.org (151.101.128.223) port 80 (#0)

0 Kudos
Turribeach

OK, that’s a good sign, but it only shows that the TCP connection opens, which is not enough to prove the connection works. A corporate proxy or firewall could still reset the actual HTTPS request, or answer it with a “page blocked” response. Try to actually download a file using curl, like this one:

https://files.pythonhosted.org/packages/f8/de/24721cc6092060b2eecfb8561be44df963f12726eec047a3f24fc6...
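If curl is awkward to use on that box, the same check can be sketched in Python. The idea is to fetch the first bytes of a package file and confirm they look like an archive rather than an HTML block page from a proxy. The function names are mine, and the magic-byte check is only a heuristic: wheels are zip files (starting with `PK`) and most source distributions are gzip files.

```python
import urllib.request


def fetch_head(url, timeout=15.0, max_bytes=512):
    """Return (status, content_type, first_bytes) for an HTTP(S) GET of url."""
    request = urllib.request.Request(url, headers={"User-Agent": "connectivity-check"})
    # urllib picks up proxy settings from the environment (e.g. HTTPS_PROXY)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        content_type = response.headers.get("Content-Type", "")
        return response.status, content_type, response.read(max_bytes)


def looks_like_package_archive(first_bytes):
    """True if the bytes start like a zip (wheel) or gzip (sdist) file."""
    return first_bytes.startswith(b"PK\x03\x04") or first_bytes.startswith(b"\x1f\x8b")


def looks_like_html(first_bytes):
    """True if the bytes look like an HTML page, e.g. a proxy block page."""
    head = first_bytes.lstrip().lower()
    return head.startswith(b"<!doctype html") or head.startswith(b"<html")
```

For example, call `fetch_head(...)` with the full file URL and pass the returned bytes to `looks_like_package_archive`. If `looks_like_html` is true instead, something in between is answering in place of the package host.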

 

0 Kudos
