Join us, on May 27th, for an introduction to the new Dataiku Academy Learn more

Load from HDFS to Oracle

Level 2
Load from HDFS to Oracle
When inserting data from HDFS to Oracle or another supported database such as Greenplum, does the data flow through the DSS server (memory)?
0 Kudos
3 Replies
Dataiker
Dataiker

Hi Ubethke,



No, hopefully you won't load all your data into the memory of the DSS server.



The data is "streamed" by chunk from HDFS to Oracle. 



You should check this learn article for more details.



Matt



 



 

Mattsco
0 Kudos
Level 2
Author
Based on the article it is streamed from HDFS to DSS server to Oracle. Correct?
If so then I would argue that this is not efficient as it requires an unnecessary roundtrip of the data through the DSS server.
Your thoughts?
Thanks
Uli
0 Kudos
Dataiker
Dataiker
Correct!
I agree but it's not always technically possible for us to avoid this roundtrip.
Mattsco
0 Kudos
Labels (4)