Pandas error - Python Code Environment Creation

Options
mettuharish
mettuharish Dataiku DSS Core Designer, Registered Posts: 15

Hello Team,

We are observing below errors while creation python code environments. Please help us in getting this fixed.


created virtual environment CPython3.6.8.final.0-64 in 316ms
creator CPython3Posix(dest=/opt/app/data/code-envs/python/test_env, clear=False, no_vcs_ignore=False, global=False)
seeder FromAppData(download=False, pip=bundle, setuptools=bundle, wheel=bundle, via=copy, app_data_dir=/home/svc_ikuadmin_prd/.local/share/virtualenv)
added seed packages: pip==21.3.1, setuptools==59.6.0, wheel==0.37.1
activators BashActivator,CShellActivator,FishActivator,NushellActivator,PowerShellActivator,PythonActivator
Updating code environment according to spec ...
Installing from Pip requirements
WARNING: Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
WARNING: Retrying (Retry(total=3, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
WARNING: Retrying (Retry(total=2, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
WARNING: Retrying (Retry(total=1, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
WARNING: Retrying (Retry(total=0, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
ERROR: Could not find a version that satisfies the requirement pandas<1.1,>=1.0 (from versions: none)
ERROR: No matching distribution found for pandas<1.1,>=1.0
/opt/app/data/code-envs/python/test_env/bin/python: No module named ipykernel


Operating system used: Redhat

Tagged:

Answers

  • Turribeach
    Turribeach Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 1,708 Neuron
    Options

    Does your DSS server got access to the internet to download packages from PyPI?

  • mettuharish
    mettuharish Dataiku DSS Core Designer, Registered Posts: 15
    Options

    No it doesnt have access to internet. Is there any other alternative way for this to achieve?

  • Turribeach
    Turribeach Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 1,708 Neuron
    Options

    Well simple get internet connectivity to your server, direct or via a proxy. Or get a pip mirror hosted internally in your company. Or pay for hosted repository mirror and open up your company firewalls to it:

    https://help.sonatype.com/repomanager3/nexus-repository-administration/formats/pypi-repositories

    Without being able to install Python packages Dataiku ability to run custom Python is pretty much reduced to built-in / pre-installed packages which pretty basic stuff like Pandas.

  • mettuharish
    mettuharish Dataiku DSS Core Designer, Registered Posts: 15
    Options

    Server is able to connect to pypi.org on port 443 but still I am facing the same issue. How to resolve this now?

  • Turribeach
    Turribeach Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 1,708 Neuron
    Options

    Can you please post the actual error? Thanks

  • mettuharish
    mettuharish Dataiku DSS Core Designer, Registered Posts: 15
    Options

    Here is the actual error:

    created virtual environment CPython3.8.14.final.0-64 in 393ms
    creator CPython3Posix(dest=/opt/app/data/code-envs/python/Harish_test, clear=False, no_vcs_ignore=False, global=False)
    seeder FromAppData(download=False, pip=bundle, setuptools=bundle, wheel=bundle, via=copy, app_data_dir=/home/svc_ikuadmin_prd/.local/share/virtualenv)
    added seed packages: pip==22.3.1, setuptools==65.6.3, wheel==0.38.4
    activators BashActivator,CShellActivator,FishActivator,NushellActivator,PowerShellActivator,PythonActivator
    Updating code environment according to spec ...
    Installing from Pip requirements
    WARNING: Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
    WARNING: Retrying (Retry(total=3, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
    WARNING: Retrying (Retry(total=2, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
    WARNING: Retrying (Retry(total=1, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
    WARNING: Retrying (Retry(total=0, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))': /simple/pandas/
    ERROR: Could not find a version that satisfies the requirement pandas<1.1,>=1.0 (from versions: none)
    ERROR: No matching distribution found for pandas<1.1,>=1.0
    WARNING: There was an error checking the latest version of pip.
    /opt/app/data/code-envs/python/Harish_test/bin/python: No module named ipykernel

  • Turribeach
    Turribeach Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 1,708 Neuron
    Options

    Well nothing new there. The server is still unable to connect to the Python repository. There is more than one URL that needs to be allowed. You also need DNS working too and no outbound firewall blocking traffic in the DSS server.

  • mettuharish
    mettuharish Dataiku DSS Core Designer, Registered Posts: 15
    Options

    Please check this.

    [svc_ikuadmin_prd@nau2p-lapphsbqh python]$ curl -v telnet://pypi.org:443
    * About to connect() to pypi.org port 443 (#0)
    * Trying 151.101.192.223...
    * Connected to pypi.org (151.101.192.223) port 443 (#0)
    ^C
    [svc_ikuadmin_prd@nau2p-lapphsbqh python]$ curl -v telnet://pypi.python.org:443
    * About to connect() to pypi.python.org port 443 (#0)
    * Trying 199.232.32.223...
    * Connected to pypi.python.org (199.232.32.223) port 443 (#0)
    ^C
    [svc_ikuadmin_prd@nau2p-lapphsbqh python]$ curl -v telnet://pythonhosted.org:443
    * About to connect() to pythonhosted.org port 443 (#0)
    * Trying 151.101.65.55...
    * Connected to pythonhosted.org (151.101.65.55) port 443 (#0)
    ^C
    [svc_ikuadmin_prd@nau2p-lapphsbqh python]$ curl -v telnet://pythonhosted.org:80
    * About to connect() to pythonhosted.org port 80 (#0)
    * Trying 151.101.193.55...
    * Connected to pythonhosted.org (151.101.193.55) port 80 (#0)
    ^C
    [svc_ikuadmin_prd@nau2p-lapphsbqh python]$ curl -v telnet://pypi.python.org:80
    * About to connect() to pypi.python.org port 80 (#0)
    * Trying 199.232.32.223...
    * Connected to pypi.python.org (199.232.32.223) port 80 (#0)
    ^C
    [svc_ikuadmin_prd@nau2p-lapphsbqh python]$ curl -v telnet://pypi.org:80
    * About to connect() to pypi.org port 80 (#0)
    * Trying 151.101.128.223...
    * Connected to pypi.org (151.101.128.223) port 80 (#0)

  • Turribeach
    Turribeach Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 1,708 Neuron
    Options

    Ok that’s a good sign but not good enough to prove the connection works. You could be getting a response from a corporate proxy saying “page blocked”. Try to actually download a file using curl, like this one:

    https://files.pythonhosted.org/packages/f8/de/24721cc6092060b2eecfb8561be44df963f12726eec047a3f24fc66aabb1/simple-pandas-0.2.1.tar.gz

Setup Info
    Tags
      Help me…