How to extract text from doc files?

Rakesh (Dataiku DSS Core Designer, Registered, Posts: 3)

I have a few .doc files in a managed folder, and I want to extract the text from them using a Python recipe.

Could you please guide me on how to achieve this?

Alternatively, is there a way to convert the .doc files to .docx programmatically and then extract the text from the converted files?

Thank you in advance.

Operating system used: Windows


Answers
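
One possible approach, offered as a rough sketch rather than a confirmed solution: copy each .doc file out of the managed folder, convert it to .docx with headless LibreOffice, then read the paragraph text from the converted file. This assumes the `soffice` binary is installed and on the PATH of the machine that runs the recipe, which may not be the case on your instance. The function names and the folder name are hypothetical. The .docx reader below uses only the standard library and handles plain body text; it ignores tabs, line breaks inside a paragraph, headers and footers.

```python
import subprocess
import tempfile
import zipfile
import xml.etree.ElementTree as ET
from pathlib import Path

# WordprocessingML namespace used inside word/document.xml of a .docx file
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def convert_doc_to_docx(doc_path, out_dir):
    """Convert a legacy .doc file to .docx with headless LibreOffice.

    Assumption: `soffice` is installed and on the PATH of the machine
    that runs the recipe.
    """
    subprocess.run(
        ["soffice", "--headless", "--convert-to", "docx",
         "--outdir", str(out_dir), str(doc_path)],
        check=True,
    )
    return Path(out_dir) / (Path(doc_path).stem + ".docx")


def extract_docx_text(docx_file):
    """Return the body text of a .docx file, one line per paragraph.

    `docx_file` may be a path or a binary file-like object.
    """
    with zipfile.ZipFile(docx_file) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))
    paragraphs = []
    for para in root.iter(W_NS + "p"):
        # A paragraph's text is split across one or more <w:t> runs
        runs = [node.text or "" for node in para.iter(W_NS + "t")]
        paragraphs.append("".join(runs))
    return "\n".join(paragraphs)


def extract_folder_texts(folder_name):
    """Map each .doc path in a managed folder to its extracted text.

    `folder_name` is a placeholder for your own managed folder name or id.
    """
    import dataiku  # only importable inside DSS

    folder = dataiku.Folder(folder_name)
    texts = {}
    with tempfile.TemporaryDirectory() as tmp:
        for path in folder.list_paths_in_partition():
            if not path.lower().endswith(".doc"):
                continue
            # Copy the file out of the managed folder to local disk
            local_doc = Path(tmp) / Path(path).name
            with folder.get_download_stream(path) as stream:
                local_doc.write_bytes(stream.read())
            docx_path = convert_doc_to_docx(local_doc, tmp)
            texts[path] = extract_docx_text(docx_path)
    return texts
```

In a Python recipe this could be called as `texts = extract_folder_texts("doc_files")`, where `doc_files` stands in for your folder name. Files that share a name in different subfolders would overwrite each other in the temporary directory, so adjust the local naming if that applies. If the python-docx package is available in your code environment, it could replace `extract_docx_text`.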
