diff --git a/website/site/content/docs/install/prereq.md b/website/site/content/docs/install/prereq.md
index 4e711ce8..a11166c4 100644
--- a/website/site/content/docs/install/prereq.md
+++ b/website/site/content/docs/install/prereq.md
@@ -32,9 +32,9 @@ component.
 
 ## External Programs for Joex
 
-- [Ghostscript](http://pages.cs.wisc.edu/~ghost/) (the `gs` command)
-  is used to extract/convert PDF files into images that are then fed
-  to ocr. It is available on most GNU/Linux distributions.
+- [Ghostscript](https://www.ghostscript.com/) (the `gs` command) is
+  used to extract/convert PDF files into images that are then fed to
+  ocr. It is available on most GNU/Linux distributions.
 - [Unpaper](https://github.com/Flameeyes/unpaper) is a program that
   pre-processes images to yield better results when doing ocr. If this
   is not installed, docspell tries without it. However, it is
diff --git a/website/site/content/docs/joex/file-processing.md b/website/site/content/docs/joex/file-processing.md
index 3aee1edf..5a4da053 100644
--- a/website/site/content/docs/joex/file-processing.md
+++ b/website/site/content/docs/joex/file-processing.md
@@ -251,8 +251,8 @@ machine/setup.
 Another limit is `max-image-size` which defines the size of an image
 in pixel (`width * height`) where processing is skipped.
 
-Then [ghostscript](http://pages.cs.wisc.edu/~ghost/) is used to
-extract single pages into image files and
+Then [ghostscript](https://www.ghostscript.com/) is used to extract
+single pages into image files and
 [unpaper](https://github.com/Flameeyes/unpaper) is used to optimize
 the images for ocr. Unpaper is optional, if it is not found, it is
 skipped, which may be a compromise on slow machines.