mirror of
https://github.com/TheAnachronism/docspell.git
synced 2025-06-22 02:18:26 +00:00
Merge branch 'current-docs'
@@ -32,9 +32,9 @@ component.
 
 ## External Programs for Joex
 
-- [Ghostscript](http://pages.cs.wisc.edu/~ghost/) (the `gs` command)
-is used to extract/convert PDF files into images that are then fed
-to ocr. It is available on most GNU/Linux distributions.
+- [Ghostscript](https://www.ghostscript.com/) (the `gs` command) is
+used to extract/convert PDF files into images that are then fed to
+ocr. It is available on most GNU/Linux distributions.
 - [Unpaper](https://github.com/Flameeyes/unpaper) is a program that
 pre-processes images to yield better results when doing ocr. If this
 is not installed, docspell tries without it. However, it is
@@ -251,8 +251,8 @@ machine/setup.
 Another limit is `max-image-size` which defines the size of an image
 in pixel (`width * height`) where processing is skipped.
 
-Then [ghostscript](http://pages.cs.wisc.edu/~ghost/) is used to
-extract single pages into image files and
+Then [ghostscript](https://www.ghostscript.com/) is used to extract
+single pages into image files and
 [unpaper](https://github.com/Flameeyes/unpaper) is used to optimize
 the images for ocr. Unpaper is optional, if it is not found, it is
 skipped, which may be a compromise on slow machines.
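
The per-page behaviour described in the second hunk (skip oversized images, treat unpaper as optional) can be sketched roughly as follows. This is an illustration only, not docspell's actual code: the function names `exceeds_max_image_size` and `plan_page_steps` are hypothetical, and the sketch assumes that "skipped" applies to images whose pixel count is strictly greater than `max-image-size`.

```python
import shutil


def exceeds_max_image_size(width: int, height: int, max_image_size: int) -> bool:
    """Return True when the pixel count (width * height) is above the limit.

    Assumption: the limit is exclusive, i.e. an image exactly at the
    limit is still processed.
    """
    return width * height > max_image_size


def plan_page_steps(width: int, height: int, max_image_size: int,
                    which=shutil.which) -> list[str]:
    """Return the (hypothetical) step names to run for one page image.

    `which` defaults to shutil.which and is injectable so the lookup of
    the `unpaper` binary can be replaced in tests.
    """
    if exceeds_max_image_size(width, height, max_image_size):
        return []  # image too large: processing is skipped entirely
    steps = []
    if which("unpaper") is not None:
        steps.append("unpaper")  # optional clean-up, only if installed
    steps.append("ocr")
    return steps


if __name__ == "__main__":
    # An A4 page at 300 dpi is roughly 2480 x 3508 pixels.
    print(plan_page_steps(2480, 3508, max_image_size=14_000_000))
```

Leaving unpaper out of the plan when the binary is missing mirrors the "if it is not found, it is skipped" wording above; the result depends on what is installed on the machine running the sketch.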