mirror of
https://github.com/TheAnachronism/docspell.git
synced 2025-06-22 02:18:26 +00:00
Reorganize nlp pipeline and add nlp-unsupported language italian
Improves and reorganizes how nlp pipelines are setup. Now users can choose from many options, depending on their hardware and usage scenario. This is the base to use more languages without depending on what stanford-nlp supports. Support then is involves to text extraction and simple regex-ner processing.
This commit is contained in:
13
modules/files/src/test/resources/examples/letter-ita.txt
Normal file
13
modules/files/src/test/resources/examples/letter-ita.txt
Normal file
@ -0,0 +1,13 @@
|
||||
Pontremoli, 9 aprile 2013
|
||||
|
||||
Spettabile Villa Albicocca
|
||||
Via Francigena, 9
|
||||
55100 Pontetetto (LU)
|
||||
|
||||
Oggetto: Prenotazione
|
||||
|
||||
Gentile Direttore,
|
||||
|
||||
Vorrei prenotare una camera matrimoniale …….
|
||||
|
||||
In attesa di una Sua pronta risposta, La saluto cordialmente
|
Reference in New Issue
Block a user