501c6f2988
Updating stanford corenlp to 4.3.2; adding more languages
...
Models for Spanish are now available and have been added. Hungarian
has also been added to the list of supported languages (mainly for
tesseract; there are no nlp models).
2021-11-20 14:31:39 +01:00
7a4a444da6
Adopt nix setup and joex module
2021-11-18 00:23:17 +01:00
d793d2286c
Update nix setup
2021-10-27 22:07:16 +02:00
cbc6877040
Update nix/nixos setup
2021-09-23 19:46:57 +02:00
cc9e3a31d8
Add openid to the nixos module
2021-09-06 14:47:42 +02:00
5cc2ad92a2
Update nix setup
...
Removes the consumedir module; use the module in the dsc project [0]
instead.
[0] https://github.com/docspell/dsc/blob/master/nix/module.nix
2021-08-28 12:52:43 +02:00
5d33b3841a
Add a task to check for updates periodically
...
It must be enabled and configured by the admin.
Refs: #990
2021-08-20 00:25:27 +02:00
cd1330a3cd
Update nix setup
2021-07-29 23:11:11 +02:00
a7a4e8e144
Update nix setup
2021-07-29 18:09:49 +02:00
e22683c9f6
Update nix setup
2021-06-18 23:53:21 +02:00
6fed858629
Update nix expressions for 21.05
...
Also remove lots of old releases declared in release.nix.
2021-06-18 22:27:29 +02:00
bdc7822f50
Add documentation about docker setup
2021-05-31 22:19:49 +02:00
a047a3f137
Update nix setup
2021-05-29 16:16:42 +02:00
5a4a9c63d7
Update nix setup
2021-04-11 23:35:14 +02:00
7a8a50c3e9
Update nix setup
2021-03-13 18:07:46 +01:00
a355b7c63b
Update nix setup
2021-02-19 23:30:59 +01:00
d7bc963450
Clean up nodes that are not reachable anymore
2021-02-18 00:37:18 +01:00
c45d72df34
Update nix setup
2021-01-25 10:17:25 +01:00
606a6756c8
Fix nix package for tools
2021-01-23 20:30:48 +01:00
98aa3423c1
Fix joex config for nix package
2021-01-23 20:30:48 +01:00
189f202b21
Move tool scripts into a separate dir
2021-01-23 20:30:48 +01:00
a6c31be22f
Update documentation
2021-01-20 22:47:15 +01:00
f01646aeb5
Reorganize nlp pipeline and add nlp-unsupported language italian
...
Improves and reorganizes how nlp pipelines are set up. Users can now
choose from many options, depending on their hardware and usage
scenario.
This is the basis for supporting more languages without depending on
what stanford-nlp supports. Support then involves text extraction and
simple regex-ner processing.
2021-01-18 17:41:40 +01:00
b57ceb6764
Use jdk11 in nix packages
2021-01-12 01:09:15 +01:00
646eedadf7
Update nix setup
2021-01-11 23:24:29 +01:00
a563ba33e7
Add new joex option to nix module
2021-01-06 23:06:13 +01:00
2a172ce720
Remove fulltext recreate-key config value
...
It's now in the admin routes, protected by the
`admin-endpoint.secret`.
2021-01-04 15:18:02 +01:00
21cd778c37
Merge branch 'fix-0.17' into master
2020-12-15 20:45:39 +01:00
e8433be862
Update nix package
2020-12-15 20:43:01 +01:00
9c106fcda6
Update nix package
2020-12-14 14:56:32 +01:00
5681581bf8
Add missing configs to nix modules
2020-12-14 14:51:56 +01:00
ab0fe34ee8
Update nix setup
2020-11-28 18:11:41 +01:00
f261aeae6e
Update nix/nixos setup
2020-11-15 13:10:19 +01:00
acbfb9464f
Update nix module with new config values
2020-11-15 00:01:48 +01:00
79c781db79
Update nix setup
2020-11-01 20:59:04 +01:00
0114bb4d72
Use source name from config file for integration endpoint uploads
...
Fixes: #389
2020-10-26 22:37:30 +01:00
d25448d8d9
Update nix setup
2020-10-19 14:55:49 +02:00
13daa99933
Update docker and nix setup
2020-09-28 01:10:44 +02:00
2773eb5341
Merge branch 'current-docs'
2020-09-25 22:03:41 +02:00
a36c499fb1
Update docs on rpi regarding requirements for joex
2020-09-25 21:58:02 +02:00
dc88fcb960
Update nix and docker setup
2020-09-09 22:31:35 +02:00
9547d6ffac
Allow setting jvm arguments in nixos modules
2020-09-08 18:07:04 +02:00
8e5e198098
Update nix and docker setups
2020-09-08 00:32:17 +02:00
1dcccbcf7d
Allow hiding classification settings in the webapp
2020-09-05 16:00:19 +02:00
afbe9554b6
Update joex nixos module
2020-09-02 22:23:12 +02:00
3473cbb773
Use collective data with NER annotation
2020-08-25 20:40:44 +02:00
d68d076c84
Update nix and docker setups
2020-08-15 00:34:33 +02:00
09d74b7e80
Return item notes with search results
...
To keep the response from becoming very large, an admin can define a
limit on how much to return.
2020-08-05 00:09:37 +02:00
7f20c80e51
Update nix setup
2020-08-01 16:54:06 +02:00
3d49ceaab5
Use ocrmypdf tool to create pdf/a during conversion
...
- Use another external tool to convert pdf to pdf, which also adds the
  extracted text as another layer to the pdf.
- Although not used, the external conversion routine now checks for an
  existing text file that has the same name as the pdf file but with
  the extension `.txt`. If present, it is included in the conversion
  result and used as the extracted text.
- Text extraction for pdf files now happens on the converted file,
  because it may already contain the text from the conversion step;
  this avoids running OCR twice.
- Errors during conversion are not fatal; processing continues
  without a converted file.
2020-07-18 17:19:29 +02:00