Google Cloud

josteece
josteece Dataiku DSS Core Designer, Dataiku DSS ML Practitioner, Dataiku DSS Adv Designer, Registered Posts: 2
edited July 16 in Using Dataiku

I added google-cloud to my environment so I can use Bigquery. But when I tried to import bigquery module into dataiku notebook, it came up with ModuleNotFoundError.

import google.cloud

---------------------------------------------------------------------------
ModuleNotFoundError                       Traceback (most recent call last)
<ipython-input-8-1035661e8528> in <module>
----> 1 from google.cloud import bigquery

ModuleNotFoundError: No module named 'google'

Please any help will be appreciated.

Tagged:

Answers

  • Turribeach
    Turribeach Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 2,090 Neuron

    Are you selecting the correct code environment as your kernel?

    While you can access BigQuery directly via the Google packages this is not the most efficient way. You should create a connection to BigQuery, add your datasets to the flow and access them like any other database. Once you have Dataiku datasets you can also access them in Jupyter Notebook. You can also write SQL queries using a SQL Notebook. Once you have something you like you can convert them to Dataiku datasets as well.

  • josteece
    josteece Dataiku DSS Core Designer, Dataiku DSS ML Practitioner, Dataiku DSS Adv Designer, Registered Posts: 2

    I later discovered that I installed the wrong module. The right module should have been google-cloud-bigquery. After adding this to my environment, I was able to access the bigquery from notebook. I will try the Bigquery connection alternative that you propose as well and compare the performance. Thanks.

Setup Info
    Tags
      Help me…