Converting HTML to odt, doc, docx

user1081056 picture user1081056 · Dec 5, 2011 · Viewed 26.1k times · Source

Is there an easy way to convert HTML(with CSS styles and embedded images) to ODT, DOCX, DOC from the command line on linux server. I searched a lot but have not found a good option. There was a problem the same way to convert to PDF, decided by wkhtmltopdf. Perhaps there are ways to convert the resulting PDF documents to other formats?

Answer

Zsolt Botykai picture Zsolt Botykai · Dec 5, 2011

To convert to odt it's pretty easy after installing pandoc.

After the relatively hard part: from odt (or even html) you can script (Open|Libre)Office via e.g. unoconv

Or you can like:

abiword --to=doc filename.odt

Also see this thread, and this blog post.

HTH