Processing a continuous stream of data

Gustavo_Brian Registered Posts: 10 ✭✭✭✭

Hi,



I have a service that continuously emits data. You start receiving data as soon as you open a TCP connection, and it never stops until you terminate the connection.



I'd like to develop a custom plugin to process that data in Dataiku. How can I do that, given that the data never ends?



Will the "build" log overload the server?



Thanks

UPDATE:

We are loading data from a flight metasearch service. They expose a data stream that we consume by polling a TCP connection (https://github.com/gbrian/Flightmate-Stream). We plan to use Dataiku to parse, sanitize, ... the data and then drop it into Hadoop, in addition to applying the corresponding analysis in the lab ;)
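
To make the question more concrete, here is a minimal sketch (not taken from the linked repository) of one way the endless stream could be cut into bounded batches so that each run terminates. It assumes the records are newline-delimited UTF-8 text, which may not match the real feed. `BatchReader`, `read_batch`, and the limits are hypothetical names and values chosen for illustration, and only the Python standard library is used.

```python
import socket
import time


class BatchReader:
    """Cuts an endless TCP stream into bounded batches (illustrative sketch).

    Assumption: records are newline-delimited UTF-8 text.
    """

    def __init__(self, sock):
        self.sock = sock
        self.buffer = b""   # bytes of a record that is not complete yet
        self.pending = []   # complete records not yet handed out

    def read_batch(self, max_records=1000, max_seconds=5.0):
        """Return up to max_records records, waiting at most max_seconds."""
        deadline = time.monotonic() + max_seconds
        while len(self.pending) < max_records:
            remaining = deadline - time.monotonic()
            if remaining <= 0:
                break
            self.sock.settimeout(remaining)
            try:
                chunk = self.sock.recv(4096)
            except socket.timeout:
                break  # time budget used up
            if not chunk:
                break  # the peer closed the connection
            self.buffer += chunk
            # Everything before the last newline is complete; keep the rest.
            *lines, self.buffer = self.buffer.split(b"\n")
            self.pending.extend(
                line.decode("utf-8", errors="replace") for line in lines if line
            )
        batch = self.pending[:max_records]
        self.pending = self.pending[max_records:]
        return batch
```

The idea would be that each call returns a finite batch that can be written out as one bounded unit of work, so nothing has to run (or log) forever. Whether this fits how a Dataiku plugin is expected to work is exactly what I am unsure about.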

@alexander Hope this helps

Answers
