Converts word processor files (e.g. .DOC files) to OpenDocument, DocBook, and structured HTML.
This is Docvert for Python 2. For the Python 3 version, see http://github.com/holloway/docvert-python3/
python2 ./docvert-web.py [-p PORT] [-H HOST]
python2 ./docvert-cli.py
usage: docvert-cli.py [-h] [--version] --pipeline PIPELINE
                      [--response {auto,path,stdout}]
                      [--autopipeline {Break up over Heading 1.default,Nothing one long page}]
                      [--url URL]
                      [--list-pipelines]
                      [--pipelinetype {tests,auto_pipelines,pipelines}]
                      infile [infile ...]
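As an illustration of the usage text above, the following sketch assembles a docvert-cli.py command line from Python. The helper name build_cli_command and its parameters are hypothetical and not part of Docvert; only the --pipeline and --response options and the infile arguments come from the usage text. The pipeline name is whatever you pass in; real names should be available from --list-pipelines.

```python
# Hypothetical helper, not part of Docvert: builds an argument list
# for docvert-cli.py using only options shown in the usage text.

RESPONSE_CHOICES = ("auto", "path", "stdout")


def build_cli_command(infiles, pipeline, response="auto",
                      python="python2", script="./docvert-cli.py"):
    """Return the command as a list suitable for subprocess calls."""
    if not infiles:
        # The usage text requires at least one infile.
        raise ValueError("at least one input file is required")
    if response not in RESPONSE_CHOICES:
        raise ValueError("response must be one of %r" % (RESPONSE_CHOICES,))
    command = [python, script, "--pipeline", pipeline, "--response", response]
    command.extend(infiles)
    return command
```

The returned list can be passed to subprocess.check_call from inside the docvert directory. Building a list rather than a single string avoids shell quoting problems with pipeline names or file names that contain spaces.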
http://lists.catalyst.net.nz/mailman/listinfo/docvert
Python 2.6 or 2.7
libreoffice
python-uno
python-lxml
python-imaging
pdf2svg
librsvg2-2
sudo apt-get install libreoffice python-uno python-lxml python-imaging pdf2svg librsvg2-2
/usr/bin/soffice --headless --norestore --nologo --nofirststartwizard --accept="socket,port=2002;urp;"
Then, in another terminal:
cd ~
git clone https://github.com/holloway/docvert.git
cd docvert
python2 ./docvert-web.py
and browse to http://localhost:8080
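If conversions fail, one thing worth checking is whether the soffice process started above is actually accepting connections on port 2002. The sketch below is a generic TCP check and not part of Docvert; the function name listener_is_up is hypothetical, and the default port is taken from the --accept option in the soffice command. It only confirms that something is listening, not that it is LibreOffice.

```python
import socket


def listener_is_up(host="localhost", port=2002, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        conn = socket.create_connection((host, port), timeout)
    except socket.error:
        # Connection refused, timed out, or host not resolvable.
        return False
    conn.close()
    return True
```

Calling listener_is_up() with no arguments checks localhost on port 2002. The same function can be pointed at port 8080 to see whether docvert-web.py is running.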
Released under the GPL3; see LICENCE.