docspell/docker
Eike Kettner 3d49ceaab5 Use ocrmypdf tool to create pdf/a during conversion
- Use another external tool (ocrmypdf) to convert PDF to PDF/A; it also
  adds the extracted text as an additional layer in the PDF.

- Although not currently used, the external conversion routine now
  checks for an existing text file that has the same name as the PDF
  file but the extension `.txt`. If present, it is included in the
  conversion result and used as the extracted text.

- Text extraction for PDF files now happens on the converted file,
  because it may already contain the text from the conversion step;
  this avoids running OCR twice.

- Errors during conversion are not fatal; processing continues
  without a converted file.
2020-07-18 17:19:29 +02:00
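
The behaviour described in the commit message above can be illustrated with a minimal sketch. It is not Docspell's implementation (which is not shown on this page). The names `ConversionResult`, `convert_pdf`, and `extract_text` are hypothetical, as is the command-line shape `command + [source, target]`. The sketch also assumes the text file sits next to the converted PDF, with `.pdf` replaced by `.txt`.

```python
import subprocess
from dataclasses import dataclass
from pathlib import Path
from typing import Callable, Optional, Sequence


@dataclass
class ConversionResult:
    pdf: Optional[Path]   # converted file, or None if the conversion failed
    text: Optional[str]   # contents of the sidecar .txt file, if one exists


def convert_pdf(source: Path, target: Path, command: Sequence[str]) -> ConversionResult:
    """Run an external converter; a failure is reported, never raised."""
    try:
        # Hypothetical calling convention: <command...> <source> <target>
        subprocess.run([*command, str(source), str(target)],
                       check=True, capture_output=True)
    except (OSError, subprocess.CalledProcessError):
        # Non-fatal: the caller continues without a converted file.
        return ConversionResult(pdf=None, text=None)
    if not target.is_file():
        return ConversionResult(pdf=None, text=None)

    # Assumed naming: "out.pdf" -> "out.txt" in the same directory.
    sidecar = target.with_suffix(".txt")
    text = sidecar.read_text(encoding="utf-8") if sidecar.is_file() else None
    return ConversionResult(pdf=target, text=text)


def extract_text(result: ConversionResult, source: Path,
                 ocr: Callable[[Path], str]) -> str:
    """Prefer text produced during conversion; otherwise extract from the
    converted file, falling back to the original if conversion failed."""
    if result.text is not None:
        return result.text
    return ocr(result.pdf if result.pdf is not None else source)
```

Passing the converter as a `command` list keeps the sketch independent of any particular tool's flags. The `ocr` callable stands in for whatever text extraction step follows.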
..
build-images.sh Update nix/nixos and docker setups 2020-06-29 21:01:07 +02:00
consumedir.dockerfile Update nix/nixos and docker setups 2020-06-29 21:01:07 +02:00
docker-compose.yml Prepare docker setup for fulltext search 2020-06-28 13:37:39 +02:00
docspell.conf Prepare docker setup for fulltext search 2020-06-28 13:37:39 +02:00
entrypoint-joex.sh Update nix/nixos and docker setups 2020-06-29 21:01:07 +02:00
joex.dockerfile Use ocrmypdf tool to create pdf/a during conversion 2020-07-18 17:19:29 +02:00
restserver.dockerfile Update nix/nixos and docker setups 2020-06-29 21:01:07 +02:00