How to schedule a job to incrementally append data to an hdfs dataset from an oracle table ?
UserBird
Dataiker, Alpha Tester Posts: 535 Dataiker
I have an hdfs dataset which has been created and synced from an oracle table. How can I sync it incrementally every month? I want to schedule a job for this.
Best Answer
-
This is a typical use case for partitioning, see
http://doc.dataiku.com/dss/latest/partitions/index.html
(and be prepared for the steep learning curve ;-)