Commit Graph

114 Commits

Author SHA1 Message Date
Eike Kettner
29ddcccbba Use a base image for joex containing all the tools 2020-09-09 22:59:34 +02:00
Eike Kettner
dc88fcb960 Update nix and docker setup 2020-09-09 22:31:35 +02:00
Eike Kettner
8e5e198098 Update nix and docker setups 2020-09-08 00:32:17 +02:00
Eike Kettner
d68d076c84 Update nix and docker setups 2020-08-15 00:34:33 +02:00
Eike Kettner
66793080d8 Update docker setup 2020-08-01 19:01:49 +02:00
Eike Kettner
3d49ceaab5 Use ocrmypdf tool to create pdf/a during conversion
- Use another external tool to convert pdf to pdf which also adds the
  extracted text as another layer into the pdf

- Although not used, the external conversion routine will now check
  for an existing text file that is named as the pdf file with extension
  `.txt`. If present it is included in the conversion result and will be
  used as the extracted text.

- text extraction for pdf files happens now on the converted file,
  because it may already contain the text from the conversion step and
  thus avoids running OCR twice.

- All errors during conversion are not fatal; processing continues
  without a converted file.
2020-07-18 17:19:29 +02:00
Eike Kettner
ec7b34ee6f Update nix/nixos and docker setups 2020-06-29 21:01:07 +02:00
Eike Kettner
f883648839 Add missing entrypoint script for docker 2020-06-28 13:50:14 +02:00
Eike Kettner
d3b3c6289b Prepare docker setup for fulltext search 2020-06-28 13:37:39 +02:00
Eike Kettner
41964027d1 Update docker files 2020-06-17 22:28:04 +02:00
Eike Kettner
3d902c3273 Add a docker image for watching a directory 2020-05-25 19:43:06 +02:00
Eike Kettner
0b7cc0ec6b Update nix and docker setups 2020-05-25 17:57:41 +02:00
Eike Kettner
8f46f6b57b Update docker setup 2020-04-30 22:38:53 +02:00
Eike Kettner
5b21a876aa Try provide docker setup 2020-03-31 00:45:43 +02:00