mirror of
				https://github.com/TheAnachronism/docspell.git
				synced 2025-10-31 17:50:11 +00:00 
			
		
		
		
	Merge branch 'current-docs'
This commit is contained in:
		| @@ -32,9 +32,9 @@ component. | ||||
|  | ||||
| ## External Programs for Joex | ||||
|  | ||||
| - [Ghostscript](http://pages.cs.wisc.edu/~ghost/) (the `gs` command) | ||||
|   is used to extract/convert PDF files into images that are then fed | ||||
|   to ocr. It is available on most GNU/Linux distributions. | ||||
| - [Ghostscript](https://www.ghostscript.com/) (the `gs` command) is | ||||
|   used to extract/convert PDF files into images that are then fed to | ||||
|   ocr. It is available on most GNU/Linux distributions. | ||||
| - [Unpaper](https://github.com/Flameeyes/unpaper) is a program that | ||||
|   pre-processes images to yield better results when doing ocr. If this | ||||
|   is not installed, docspell tries without it. However, it is | ||||
|   | ||||
| @@ -251,8 +251,8 @@ machine/setup. | ||||
| Another limit is `max-image-size` which defines the size of an image | ||||
| in pixel (`width * height`) where processing is skipped. | ||||
|  | ||||
| Then [ghostscript](http://pages.cs.wisc.edu/~ghost/) is used to | ||||
| extract single pages into image files and | ||||
| Then [ghostscript](https://www.ghostscript.com/) is used to extract | ||||
| single pages into image files and | ||||
| [unpaper](https://github.com/Flameeyes/unpaper) is used to optimize | ||||
| the images for ocr. Unpaper is optional, if it is not found, it is | ||||
| skipped, which may be a compromise on slow machines. | ||||
|   | ||||
		Reference in New Issue
	
	Block a user