Hi, Let's say we have 2 datasets, partitioned by letters : Dataset 1 : - partition A - partition B Dataset 2 : - partition B - partition C I would like to get a "summed" dataset, where existing partit…
One of my dataset is partitioned along two dimensions (source and date), but I'd like to retrieve a non-partitioned dataset from it. When I try to run a simple "Sync" recipe using "All available" part…
Hi, I'm creating datasets based on files in a S3 bucket. The files in the bucket are in a single folder, but have several name patterns, such as "blue_01012017.csv", "red_02012017.csv", etc. Using par…
Hello, I have a filesytem organized this way: /folder/YEAR/MONTH/DDHH I tried to partition at the DDHH level, with one folder per partition. Since it is not a 'regular' structure (such as %Y/%M/%DD/.*…
Hello, I have a filesytem organized this way: /folder/YEAR/MONTH/DDHH I tried to partition at the DDHH level, with one folder per partition. Since it is not a 'regular' structure (such as %Y/%M/%DD/.*…
Hi, I use DSS v4.0.1. I have a CSV input dataset patitionned by year in files (/%Y_dataset_src), and a reciepe for preparing data into a vertica dataset (partionned among %Y in a column date). I need …
Hi, How can I use partitionning variable substitution in a data preparation recipe (for example in a Formula step)? Thanks PS: I've already ask my question a while ago: http://answers.dataiku.com/159/…