docspell/modules/microsite/docs/dev/adr/0007_convert_html_files.md
2020-02-11 22:41:44 +01:00

1.6 KiB

layout title
docs Convert HTML Files

{{ page.title }}

Context and Problem Statement

How can HTML documents be converted into a PDF file that looks as much as possible like the original?

It would be nice to have a java-only solution. But if an external tool has a better outcome, then an external tool is fine, too.

Since Docspell is free software, the tools must also be free.

Considered Options

Native (firefox) view:

Note: the example html is from here.

I downloaded the HTML file to disk together with its resources (using Save as... in the browser).

Pandoc

Not showing the version using context pdf-engine, since it looked very similiar to the latex variant.

wkhtmltopdf

Unoconv

Decision Outcome

wkhtmltopdf.

It shows the best results.