mirror of
https://github.com/TheAnachronism/docspell.git
synced 2024-11-13 02:31:10 +00:00
3d49ceaab5
- Use another external tool to convert PDF to PDF; it also adds the extracted text to the PDF as an additional layer.
- Although not used, the external conversion routine now checks for an existing text file named after the PDF file with the extension `.txt`. If present, it is included in the conversion result and used as the extracted text.
- Text extraction for PDF files now happens on the converted file, because it may already contain the text from the conversion step; this avoids running OCR twice.
- Errors during conversion are not fatal; processing continues without a converted file.

| File |
|---|
| buildvm.sh |
| configuration-test.nix |
| module-consumedir.nix |
| module-joex.nix |
| module-server.nix |
| pkg.nix |
| release.nix |
| solr.nix |