Google Cloud
I added google-cloud to my environment so I can use Bigquery. But when I tried to import bigquery module into dataiku notebook, it came up with ModuleNotFoundError.
import google.cloud
--------------------------------------------------------------------------- ModuleNotFoundError Traceback (most recent call last) <ipython-input-8-1035661e8528> in <module> ----> 1 from google.cloud import bigquery ModuleNotFoundError: No module named 'google'
Please any help will be appreciated.
Answers
-
Turribeach Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 2,046 Neuron
Are you selecting the correct code environment as your kernel?
While you can access BigQuery directly via the Google packages this is not the most efficient way. You should create a connection to BigQuery, add your datasets to the flow and access them like any other database. Once you have Dataiku datasets you can also access them in Jupyter Notebook. You can also write SQL queries using a SQL Notebook. Once you have something you like you can convert them to Dataiku datasets as well.
-
josteece Dataiku DSS Core Designer, Dataiku DSS ML Practitioner, Dataiku DSS Adv Designer, Registered Posts: 2 ✭
I later discovered that I installed the wrong module. The right module should have been google-cloud-bigquery. After adding this to my environment, I was able to access the bigquery from notebook. I will try the Bigquery connection alternative that you propose as well and compare the performance. Thanks.