mirror of
https://github.com/TheAnachronism/docspell.git
synced 2024-11-13 02:31:10 +00:00
3d49ceaab5
- Use another external tool to convert pdf to pdf which also adds the extracted text as another layer into the pdf - Although not used, the external conversion routine will now check for an existing text file that is named as the pdf file with extension `.txt`. If present it is included in the conversion result and will be used as the extracted text. - text extraction for pdf files happens now on the converted file, because it may already contain the text from the conversion step and thus avoids running OCR twice. - All errors during conversion are not fatal; processing continues without a converted file. |
||
---|---|---|
.. | ||
tools | ||
configure.md | ||
curate.md | ||
emailsettings.md | ||
finding.md | ||
install.md | ||
joex.md | ||
mailitem.md | ||
metadata.md | ||
nix.md | ||
notifydueitems.md | ||
processing.md | ||
reverseproxy.md | ||
scanmailbox.md | ||
tools.md | ||
uploading.md |