Why are ePub files so much smaller than mobi or PDF files for the same book

Finn picture Finn · Jan 25, 2012 · Viewed 17.3k times · Source

When I buy ebooks I download all of the available formats. I've noticed that the file sizes for the various formats can be markedly different and epub is typically much smaller.

For example:

  • PDF - 5.7mb;
  • ePub - 2.7mb;
  • Mobi - 8.1mb.

Or:

  • PDF - 4.5mb;
  • ePub - 1.8mb;
  • Mobi - 5.3mb.

I've flipped through them and tried to confirm that the contents are the same and they seem to be (i.e. no large images missing). Can anyone explain why epub is so much smaller than the other two?

Answer

ampt picture ampt · Jan 25, 2012

The mobi versions can be larger because they include the legacy mobi format, the new KF8 format and a copy of the original epub, this is assuming the mobi file was generated with the latest version of kindlegen.

For the PDF's I'm guessing (and that's all it is here) that embedded fonts may be the cause of a larger file size, another thing that comes into play here is image optimisation. Depending on the image optimisation settings used when the PDF was created will largely affect the final file size.

Epub's are basically just a bunch HTML, CSS and image files with a few XML files for defining the books metadata, chapter order and table of contents navigation. The epub file is really just a zip file with a .epub extension and since it doesn't have 3 copies of the same book like the Kindle version does it will always be much smaller.