Interactive Document Intelligence With NLP

MichaelG · ‎01-18-2022

Many firms have a large document corpus made up of both digitized and raw images. Now more than ever, financial institutions are turning towards unstructured data sources to capture additional attributes in order to, ultimately, adjust or confirm their analyses and discover new trends and insights. Many organizations rely on individuals to read sections of these documents or search for relevant materials in an ad hoc manner, with no systematic way of categorizing and understanding the information and trends.

Join us for this Dataiku session on interactive document intelligence, where we will showcased a modular and reusable pipeline to rapidly and automatically digitize documents, extract text, and consolidate data into a unified and searchable database. We focused on NLP techniques applied to prepare, categorize, and analyse textual data based on themes of interest (in this project: ESG), with additional theme modules available. Lastly, we will demoed a purpose-built dashboard to provide business users with a simple and interactive tool to analyse high-level trends and drill down into aggregated insights.

I hope I helped! Do you Know that if I was Useful to you or Did something Outstanding you can Show your appreciation by giving me a KUDOS?

Looking for more resources to help you use DSS effectively and upskill your knowledge? Check out these great resources: Dataiku Academy | Documentation | Knowledge Base

A reply answered your question? Mark as ‘Accepted Solution’ to help others like you!

shankarhn · ‎01-24-2022

I am keen to attend this. I find it valuable.

ebbingcasa · ‎01-24-2022

Sounds great, Michael! You might want to have a look how your endeavour overlaps with Haystack (open source), too.

Sign up to take part

Interactive Document Intelligence With NLP

Interactive Document Intelligence With NLP