Elasticsearch index with custom settings?

pvannies
pvannies Partner, Dataiku DSS Core Designer, Dataiku DSS ML Practitioner, Dataiku DSS Core Concepts, Neuron, Dataiku DSS Adv Designer, Neuron 2022, Frontrunner 2022 Finalist, Frontrunner 2022 Winner, Frontrunner 2022 Participant, Neuron 2023 Posts: 18 Neuron

Hi community! 

I am wondering how I can create an Elasticsearch index with custom settings for the analyzer, filter and tokenizer. The documentation (doc.dataiku.com/dss/latest/connecting/elasticsearch.html) mentions "you can use an index template before building the managed dataset for the first time", however, it does not explain how to do so. I've tried to first create the index (with custom settings) in ES and then build the dataset in Dataiku. However, this approach fully overwrites the index and, thereby, removes the custom settings. 

My question: how can I build an ES dataset in Dataiku with custom (analyzer, filter, tokenizer) settings? Any help is appreciated! 

Kind regards,
Pauline

Answers

  • pvannies
    pvannies Partner, Dataiku DSS Core Designer, Dataiku DSS ML Practitioner, Dataiku DSS Core Concepts, Neuron, Dataiku DSS Adv Designer, Neuron 2022, Frontrunner 2022 Finalist, Frontrunner 2022 Winner, Frontrunner 2022 Participant, Neuron 2023 Posts: 18 Neuron

    So far, we found that creating the index in ElasticSearch with custom settings, then creating an empty ES dataset in dataiku that will contain the same name as in the index (mind the project key), and using this as an 'existing dataset' for an output can work.

Setup Info
    Tags
      Help me…