Special characters not showing in pandoc html output

user2603493 picture user2603493 · Jan 20, 2014 · Viewed 9.3k times · Source

I am trying to get special characters (for foreign surnames) working in pandoc. I followed the instructions here and made sure all special characters are represented using UTF encoding (as per this page. I chose HTML Entity (decimal) option. The resulting files work well when converting to docx or pdf but not html. Is there an encoding that will work for all three output types, or do I need to include some other option?

Here is a line of markdown code for conversion using the special character encoding

some example text with special characters Å, ä,ö

(Sorry, can't figure out how to stop the UTF strings converting to symbols on stackexchange)

pandoc commands

pandoc example.md -o example.docx # Works

pandoc example.md -o example.pdf # Works

pandoc example.md -o example.html # Doesn't work

running via inconv does not change output behaviour

iconv -t utf-8 example.md | pandoc -o example.html # Doesn't work

Answer

Toxaris picture Toxaris · Feb 12, 2014

Try

pandoc example.md -s -o example.html

instead. The additional -s (for "stand-alone") makes pandoc insert the necessary metadata to create a full HTML file instead of just the HTML snippet that directly corresponds to the text in example.md. As part of the metadata, pandoc also generates the information that the file is UTF8 encoded. Your browser needs this piece of information to display the special characters correctly.

If you cannot use the -s flag for some reason, make sure to tell the browser about the UTF8 some other way.