Set default value for min-text-len to 500

This value is used to decide whether to try OCR or not. If text is
below this value, OCR is run and both results are compared. It was set
to 10, which is just one or two words. Since the context for docspell
are documents, this value is too low.
This commit is contained in:
Eike Kettner 2020-08-01 15:44:46 +02:00
parent a06d20a479
commit 1fc57fc2b2

View File

@ -169,7 +169,7 @@ docspell.joex {
# below, OCR is run afterwards. Then both extracted texts are
# compared and the longer will be used.
pdf {
min-text-len = 10
min-text-len = 500
}
# Extracting text using OCR works for image and pdf files. It will