Merge branch 'current-docs'

This commit is contained in:
eikek
2023-03-12 21:25:37 +01:00
2 changed files with 5 additions and 5 deletions

View File

@ -32,9 +32,9 @@ component.
## External Programs for Joex ## External Programs for Joex
- [Ghostscript](http://pages.cs.wisc.edu/~ghost/) (the `gs` command) - [Ghostscript](https://www.ghostscript.com/) (the `gs` command) is
is used to extract/convert PDF files into images that are then fed used to extract/convert PDF files into images that are then fed to
to ocr. It is available on most GNU/Linux distributions. ocr. It is available on most GNU/Linux distributions.
- [Unpaper](https://github.com/Flameeyes/unpaper) is a program that - [Unpaper](https://github.com/Flameeyes/unpaper) is a program that
pre-processes images to yield better results when doing ocr. If this pre-processes images to yield better results when doing ocr. If this
is not installed, docspell tries without it. However, it is is not installed, docspell tries without it. However, it is

View File

@ -251,8 +251,8 @@ machine/setup.
Another limit is `max-image-size` which defines the size of an image Another limit is `max-image-size` which defines the size of an image
in pixel (`width * height`) where processing is skipped. in pixel (`width * height`) where processing is skipped.
Then [ghostscript](http://pages.cs.wisc.edu/~ghost/) is used to Then [ghostscript](https://www.ghostscript.com/) is used to extract
extract single pages into image files and single pages into image files and
[unpaper](https://github.com/Flameeyes/unpaper) is used to optimize [unpaper](https://github.com/Flameeyes/unpaper) is used to optimize
the images for ocr. Unpaper is optional, if it is not found, it is the images for ocr. Unpaper is optional, if it is not found, it is
skipped, which may be a compromise on slow machines. skipped, which may be a compromise on slow machines.