Issues with the Google Drive plugin: file not appearing in the Managed Folder in Dataiku

Hello everyone!
I’m using the Google Drive plugin in Dataiku to read and write files directly from my Google Drive account. I managed to set it up, and it works correctly when creating datasets from Drive. However, I’m having trouble creating a Managed Folder in Dataiku that points to a file in Drive.
Here’s what I do:
- Install and enable the “Google Drive” plugin (version 1.1.5).
- Create a new Managed Folder in my project:
- Type:
fsprovider_googledrive_googledr
- Type of authentication: Access token
- Google Drive Token: Preset_test (a token I created and configured beforehand)
- Drive directory ID:
1N5bY3Xm69VuUqR...
(taken from the folder’s URL in my Google Drive) - Path:
/data.zip
(becausedata.zip
does exist in that Drive folder)
- Type:
- I click Browse or Save. No errors appear (I previously had “KeyError: credentials” issues, but I fixed them), and the settings seem to be saved.
- However, when I open the Managed Folder in Dataiku, it appears empty. Also,
data.zip
doesn’t show up or download, even though it does exist in my Drive folder. - The logs don’t show any particular error.
I’ve confirmed the file exists in my Google Drive folder, and the service account/token has the correct permissions. For some reason, Dataiku doesn’t display the file or “pull” it into the Managed Folder.
Has anyone encountered a similar issue? Is there any additional configuration I need so that Dataiku lists or imports the folder’s contents?
Thanks in advance for any advice!
Best Answer
-
Turribeach Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 2,248 Neuron
A Dataiku Managed Folder should point to a folder in your storage layer not a file. Also annoyingly Dataiku creates the actual physical folder a managed folder points to only after you upload a file to the folder. When you create the Dataiku managed folder using the GUI do not modify the path parameter. Once the Dataiku managed folder is visible in your flow go to view it and upload a dummy file to it. Now refresh your Google Drive and you should see something like this under your Google Drive directory:
Google Drive » PROJECT_ID » DSS_FOLDER_ID » dummy_file
Now you can add more files to the new folder DSS_FOLDER_ID created and you should see them in DSS if you refresh the Managed folder view.