BERT Model

vinhdiesal

Dataiku published a natural language processing article that lists several models that can be used, including BERT. I would really like to use BERT inside DSS to train on our datasets.

Can you provide some guidance on how to load the BERT model inside DSS? Perhaps we can start with the models from TF Hub. 

Thanks, 

Vinh

2 Replies
Alex_Combessie
Dataiker Alumni

Hi,

As of today, you can use BERT in DSS through Python code. You can use any library, such as tensorflow-hub or Hugging Face transformers, as you would outside of DSS.
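To make this concrete, here is a minimal sketch of what such code could look like in a Python recipe or notebook, using the Hugging Face route. It assumes `torch` and `transformers` are installed in the code environment; the model name `bert-base-uncased`, the mean-pooling choice, and the helper names `iter_batches` and `embed_texts` are illustrative assumptions, not something DSS prescribes.

```python
# Sketch only: assumes `torch` and `transformers` are installed in the
# code environment used by the DSS Python recipe or notebook.


def iter_batches(items, batch_size):
    """Yield consecutive slices of `items` with at most `batch_size` elements."""
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


def embed_texts(texts, model_name="bert-base-uncased", batch_size=16):
    """Return one mean-pooled BERT vector (a list of floats) per input text."""
    # Imported lazily so that iter_batches stays usable without these packages.
    import torch
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name)
    model.eval()  # inference mode: disables dropout

    vectors = []
    for batch in iter_batches(list(texts), batch_size):
        enc = tokenizer(batch, padding=True, truncation=True,
                        max_length=128, return_tensors="pt")
        with torch.no_grad():
            out = model(**enc)
        # Average the token vectors, ignoring padding positions.
        mask = enc["attention_mask"].unsqueeze(-1).float()
        summed = (out.last_hidden_state * mask).sum(dim=1)
        counts = mask.sum(dim=1).clamp(min=1.0)
        vectors.extend((summed / counts).tolist())
    return vectors
```

In a recipe you might then call something like `embed_texts(df["text"].tolist())`, where `df` and the `text` column are hypothetical names for your own dataset, and feed the resulting vectors to a downstream model.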

Optionally, you can use the dataiku API to store model files in a DSS managed folder.
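As a rough sketch of that option, the snippet below caches the downloaded model files in a managed folder and reloads them from there on later runs. It assumes the folder lives on the local filesystem (`Folder.get_path()` does not work for remote storage such as S3) and that `transformers` is available in the code environment; the folder name you pass in and the helpers `model_subdir` and `load_or_cache_bert` are hypothetical.

```python
import os


def model_subdir(folder_path, model_name):
    """Map a hub model name to a directory inside the managed folder."""
    # Hub names can contain "/", which would otherwise create nested directories.
    return os.path.join(folder_path, model_name.replace("/", "__"))


def load_or_cache_bert(folder_name, model_name="bert-base-uncased"):
    """Load tokenizer and model from a managed folder, downloading them once."""
    # Imported lazily: `dataiku` is only importable inside DSS.
    import dataiku
    from transformers import AutoModel, AutoTokenizer

    # Assumption: the managed folder is stored on the local filesystem.
    folder_path = dataiku.Folder(folder_name).get_path()
    target = model_subdir(folder_path, model_name)

    # config.json is written by save_pretrained, so use it as the cache marker.
    if not os.path.isfile(os.path.join(target, "config.json")):
        AutoTokenizer.from_pretrained(model_name).save_pretrained(target)
        AutoModel.from_pretrained(model_name).save_pretrained(target)

    return AutoTokenizer.from_pretrained(target), AutoModel.from_pretrained(target)
```

For folders on remote storage, the folder's stream-based methods (for example `upload_stream` and `get_download_stream`) would be the route to take instead of a local path.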

Note that we plan to add transformer models such as BERT to the NLP Embedding plugin.

Best regards,

Alex

CoreyS
Dataiker Alumni

As an update: while you can always integrate it via code, you can now also use BERT and any other model on Hugging Face in the Visual ML design interface, using the text preprocessing method called Sentence Embedding.
