
dataset running

Solved!
mingyeong
Level 1

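Hello. I have a question about building datasets.

I am following the tutorial. I ran about half of the datasets and recipes in the Flow, then shut down the server and tried to continue the next day.

The Flow itself is still intact, but sometimes the data in the datasets is missing.

If I shut down the server and reconnect, do I have to rebuild everything?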

1 Solution
ATsao
Dataiker

Hi, 

Where is your underlying data stored? Datasets in DSS are simply pointers to the underlying data, whether that data is tables (for SQL databases) or files (for file-based connections), so they shouldn't need to be rebuilt as long as that data remains accessible. Typically, you only need to rebuild your Flow if the input data has changed, you've made changes to your Flow, or certain datasets have been cleared.
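
To make the rule above concrete, here is a minimal sketch of the decision in plain Python. It does not call DSS or its API: `DatasetState`, `next_action`, and the returned labels are hypothetical names chosen for illustration, and the "check storage" branch is an assumption about what to do first when the underlying table or files cannot be reached.

```python
from dataclasses import dataclass


@dataclass
class DatasetState:
    """Hypothetical snapshot of one dataset; not a DSS object."""
    storage_accessible: bool  # can the underlying table or files still be reached?
    input_changed: bool       # has the upstream input data changed?
    flow_changed: bool        # were recipes or settings in the Flow edited?
    cleared: bool             # was the dataset's data cleared?


def next_action(state: DatasetState) -> str:
    """Return a label describing what to do with the dataset."""
    if not state.storage_accessible:
        # Assumption: find out where the data lives before rebuilding anything.
        return "check storage"
    if state.input_changed or state.flow_changed or state.cleared:
        return "rebuild"
    # Data is reachable and nothing changed, so the existing build can be reused.
    return "reuse"
```

For example, a dataset whose storage survived the server restart and whose inputs and recipes are unchanged would give `next_action(DatasetState(True, False, False, False))`, which returns "reuse".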

Thanks,

Andrew
