mirror of
https://github.com/TheAnachronism/docspell.git
synced 2025-04-04 10:29:34 +00:00
Set default value for min-text-len to 500
This value is used to decide whether to try OCR or not. If text is below this value, OCR is run and both results are compared. It was set to 10, which is just one or two words. Since the context for docspell are documents, this value is too low.
This commit is contained in:
parent
a06d20a479
commit
1fc57fc2b2
@ -169,7 +169,7 @@ docspell.joex {
|
||||
# below, OCR is run afterwards. Then both extracted texts are
|
||||
# compared and the longer will be used.
|
||||
pdf {
|
||||
min-text-len = 10
|
||||
min-text-len = 500
|
||||
}
|
||||
|
||||
# Extracting text using OCR works for image and pdf files. It will
|
||||
|
Loading…
x
Reference in New Issue
Block a user