This module offers functionality of casting documents to text. Documents are expected to be in either of formats: .html, .doc, .docx, .pdf, .odt.
In your terminal go to the folder where you would like to locate doc_to_text package. Clone doc_to_text
git clone https://github.com/artreven/doc_to_text.git
and install it:
[sudo] pip install doc_to_text