Submit your innovative use case or inspiring success story to the 2023 Dataiku Frontrunner Awards! LET'S GO

Extract text from html stored in column

Extract text from html stored in column
How would one extract the text and strip all the html. parseHTML() gives me just the html back, and htmlText() gives me the html as text (no brackets)
2 Replies
Dataiker Alumni
Object functions of the formula language have some more advanced capabilities,

To do better you will need to use code, the easiest is to use Python, the package BeautifulSoup will help you.
0 Kudos
Level 2

htmlText(parseHtml(field to parse)) worked for me

0 Kudos