Commit Graph

75 Commits

Author SHA1 Message Date
eikek
501c6f2988 Updating stanford corenlp to 4.3.2; adding more languages
There are models for Spanish, that have been added now. Also the
Hungarian language has been added to the list of supported
languages (for tesseract mainly, no nlp models)
2021-11-20 14:31:39 +01:00
eikek
7a4a444da6 Adopt nix setup and joex module 2021-11-18 00:23:17 +01:00
eikek
d793d2286c Update nix setup 2021-10-27 22:07:16 +02:00
eikek
cbc6877040 Update nix/nixos setup 2021-09-23 19:46:57 +02:00
eikek
cc9e3a31d8 Add openid to the nixos module 2021-09-06 14:47:42 +02:00
eikek
5cc2ad92a2 Update nix setup
Removes the consumedir module, use the module in the dsc project [0]
instead.

[0] https://github.com/docspell/dsc/blob/master/nix/module.nix
2021-08-28 12:52:43 +02:00
eikek
5d33b3841a Add a task to check for updates periodically
It must be enabled and configured by the admin.

Refs: #990
2021-08-20 00:25:27 +02:00
eikek
cd1330a3cd Update nix setup 2021-07-29 23:11:11 +02:00
eikek
a7a4e8e144 Update nix setup 2021-07-29 18:09:49 +02:00
eikek
e22683c9f6 Update nix setup 2021-06-18 23:53:21 +02:00
eikek
6fed858629 Update nix expressions for 21.05
Also remove lots of old releases declared in release.nix.
2021-06-18 22:27:29 +02:00
eikek
bdc7822f50 Add documentation about docker setup 2021-05-31 22:19:49 +02:00
Eike Kettner
a047a3f137 Update nix setup 2021-05-29 16:16:42 +02:00
Eike Kettner
5a4a9c63d7 Update nix setup 2021-04-11 23:35:14 +02:00
Eike Kettner
7a8a50c3e9 Update nix setup 2021-03-13 18:07:46 +01:00
Eike Kettner
a355b7c63b Update nix setup 2021-02-19 23:30:59 +01:00
Eike Kettner
d7bc963450 Cleanup nodes that are not reachable anymore 2021-02-18 00:37:18 +01:00
Eike Kettner
c45d72df34 Update nix setup 2021-01-25 10:17:25 +01:00
Eike Kettner
606a6756c8 Fix nix package for tools 2021-01-23 20:30:48 +01:00
Eike Kettner
98aa3423c1 Fix joex config for nix package 2021-01-23 20:30:48 +01:00
Eike Kettner
189f202b21 Move tool scripts into a separate dir 2021-01-23 20:30:48 +01:00
Eike Kettner
a6c31be22f Update documentation 2021-01-20 22:47:15 +01:00
Eike Kettner
f01646aeb5 Reorganize nlp pipeline and add nlp-unsupported language italian
Improves and reorganizes how nlp pipelines are setup. Now users can
choose from many options, depending on their hardware and usage
scenario.

This is the base to use more languages without depending on what
stanford-nlp supports. Support then is involves to text extraction and
simple regex-ner processing.
2021-01-18 17:41:40 +01:00
Eike Kettner
b57ceb6764 Use jdk11 in nix packages 2021-01-12 01:09:15 +01:00
Eike Kettner
646eedadf7 Update nix setup 2021-01-11 23:24:29 +01:00
Eike Kettner
a563ba33e7 Add new joex option to nix module 2021-01-06 23:06:13 +01:00
Eike Kettner
2a172ce720 Remove fulltext recreate-key config value
It's now in the admin routes, protected by the
`admin-endpoint.secret`.
2021-01-04 15:18:02 +01:00
Eike Kettner
21cd778c37 Merge branch 'fix-0.17' into master 2020-12-15 20:45:39 +01:00
Eike Kettner
e8433be862 Update nix package 2020-12-15 20:43:01 +01:00
Eike Kettner
9c106fcda6 Update nix package 2020-12-14 14:56:32 +01:00
Eike Kettner
5681581bf8 Add missing configs to nix modules 2020-12-14 14:51:56 +01:00
Eike Kettner
ab0fe34ee8 Update nix setup 2020-11-28 18:11:41 +01:00
Eike Kettner
f261aeae6e Update nix/nixos setup 2020-11-15 13:10:19 +01:00
Eike Kettner
acbfb9464f Update nix module with new config values 2020-11-15 00:01:48 +01:00
Eike Kettner
79c781db79 Update nix setup 2020-11-01 20:59:04 +01:00
Eike Kettner
0114bb4d72 Use source name from config file for integration endpoint uploads
Fixes: #389
2020-10-26 22:37:30 +01:00
Eike Kettner
d25448d8d9 Update nix setup 2020-10-19 14:55:49 +02:00
Eike Kettner
13daa99933 Update docker and nix setup 2020-09-28 01:10:44 +02:00
Eike Kettner
2773eb5341 Merge branch 'current-docs' 2020-09-25 22:03:41 +02:00
Eike Kettner
a36c499fb1 Update docs on rpi regarding requirements for joex 2020-09-25 21:58:02 +02:00
Eike Kettner
dc88fcb960 Update nix and docker setup 2020-09-09 22:31:35 +02:00
Eike Kettner
9547d6ffac Allo setting jvm arguments in nixos modules 2020-09-08 18:07:04 +02:00
Eike Kettner
8e5e198098 Update nix and docker setups 2020-09-08 00:32:17 +02:00
Eike Kettner
1dcccbcf7d Allow to hide classification settings in the webapp 2020-09-05 16:00:19 +02:00
Eike Kettner
afbe9554b6 Update joex nixos module 2020-09-02 22:23:12 +02:00
Eike Kettner
3473cbb773 Use collective data with NER annotation 2020-08-25 20:40:44 +02:00
Eike Kettner
d68d076c84 Update nix and docker setups 2020-08-15 00:34:33 +02:00
Eike Kettner
09d74b7e80 Return item notes with search results
In order to not make the response very large, a admin can define a
limit on how much to return.
2020-08-05 00:09:37 +02:00
Eike Kettner
7f20c80e51 Update nix setup 2020-08-01 16:54:06 +02:00
Eike Kettner
3d49ceaab5 Use ocrmypdf tool to create pdf/a during conversion
- Use another external tool to convert pdf to pdf which also adds the
  extracted text as another layer into the pdf

- Although not used, the external conversion routine will now check
  for an existing text file that is named as the pdf file with extension
  `.txt`. If present it is included in the conversion result and will be
  used as the extracted text.

- text extraction for pdf files happens now on the converted file,
  because it may already contain the text from the conversion step and
  thus avoids running OCR twice.

- All errors during conversion are not fatal; processing continues
  without a converted file.
2020-07-18 17:19:29 +02:00