1902 Commits

Author SHA1 Message Date
Eike Kettner
e9a4f904c9 Update consumedir docs 2021-01-25 08:13:55 +01:00
Eike Kettner
0d7e92be40 Update changelog 2021-01-25 08:13:55 +01:00
Eike Kettner
6b5382add1 Update docs 2021-01-25 08:13:55 +01:00
mergify[bot]
8de4b67b45
Merge pull request #595 from eikek/h2-fix
H2 fix
2021-01-25 00:56:10 +00:00
Eike Kettner
394aeeccb6 Introduce a sql literal and constants in query builder
The h2 jdbc driver could not translate the union query in QCollective
when the `kind` was set via a constant value. Using literals works
here. Renamed the corresponding elements in the query builder.
2021-01-25 00:18:24 +01:00
Eike Kettner
bb744668ca Remove unused code 2021-01-24 23:24:33 +01:00
mergify[bot]
69c0254bcf
Merge pull request #593 from eikek/emil-update
Update emil to 0.7.1
2021-01-24 19:48:07 +00:00
Eike Kettner
1e48346361 Update emil to 0.7.1 2021-01-24 20:07:09 +01:00
mergify[bot]
bbc8f0774e
Merge pull request #592 from scala-steward/update/http4s-blaze-client-0.21.16
Update http4s-blaze-client, ... to 0.21.16
2021-01-24 18:19:37 +00:00
Scala Steward
01eca0dd14
Update http4s-blaze-client, ... to 0.21.16 2021-01-24 18:31:47 +01:00
mergify[bot]
6cc9c159d6
Merge pull request #590 from eikek/scan-mailbox-filter
Refactor scan mailbox form and add flag for post-processing
2021-01-24 01:06:51 +00:00
Eike Kettner
96612e0e59 Refactor scan mailbox form and add flag for post-processing
Mails are filtered once by using an imap search and then by some globs
to filter files and subjects. Imap can search by subject via a
string-contains, but not via globs or patterns (afaik). The subject
filter is applied to all downloaded mail headers. Now for post
processing (moving to some target folder or deleting), it can be
chosen to post-process all "seen" mails or only those that matched all
filters.
2021-01-24 01:46:31 +01:00
mergify[bot]
d3e5bca9f6
Merge pull request #589 from eikek/nix-tools-package
Nix tools package
2021-01-24 00:43:34 +00:00
mergify[bot]
2f9671f5c1
Merge pull request #588 from scala-steward/update/imageio-jpeg-3.6.2
Update imageio-jpeg, imageio-tiff to 3.6.2
2021-01-23 21:49:40 +00:00
Scala Steward
529dca091d
Update imageio-jpeg, imageio-tiff to 3.6.2 2021-01-23 22:20:29 +01:00
mergify[bot]
0089021f92
Merge pull request #587 from eikek/save-settings
Fix classifier_settings table
2021-01-23 20:59:28 +00:00
Eike Kettner
1b66e2af5c Fix classifier_settings table 2021-01-23 21:30:26 +01:00
Eike Kettner
606a6756c8 Fix nix package for tools 2021-01-23 20:30:48 +01:00
Eike Kettner
98aa3423c1 Fix joex config for nix package 2021-01-23 20:30:48 +01:00
Eike Kettner
189f202b21 Move tool scripts into a separate dir 2021-01-23 20:30:48 +01:00
mergify[bot]
e6d67c368b
Merge pull request #586 from eikek/optional-text-limit
Make the text length limit optional
2021-01-22 23:47:38 +00:00
Eike Kettner
c7e850116f Make the text length limit optional 2021-01-22 23:06:50 +01:00
mergify[bot]
8dd1672c8c
Merge pull request #583 from eikek/fix-baseurl-setting
Render baseurl without trailing slash
2021-01-21 23:44:14 +00:00
mergify[bot]
38e0a50942
Merge pull request #582 from eikek/delete-org-fix
Fix deleting organization
2021-01-21 22:56:56 +00:00
mergify[bot]
df5f9e8c51
Merge pull request #581 from eikek/text-analysis-improvements
Text analysis improvements
2021-01-21 22:01:50 +00:00
Eike Kettner
0ec620fcf0 Render baseurl without trailing slash
The webapp expects it like this currently, because the url is only a
string.
2021-01-21 21:42:08 +01:00
Eike Kettner
f4a03e7c69 Fix deleting organization
The foreign key in person must be resetted.
2021-01-21 21:27:02 +01:00
Eike Kettner
4cba96f390 Always return classifier results as suggestion
The classifier results are spliced into the suggestion list at second
place. When linking they are only used if nlp didn't find anything.
2021-01-21 21:05:28 +01:00
mergify[bot]
b9b554980a
Merge pull request #580 from scala-steward/update/flyway-core-7.5.1
Update flyway-core to 7.5.1
2021-01-21 19:20:53 +00:00
Eike Kettner
021ac568ae Update documentation for text analysis 2021-01-21 20:06:53 +01:00
Scala Steward
cea9fa0d38
Update flyway-core to 7.5.1 2021-01-21 18:27:10 +01:00
Eike Kettner
9957c3267e Add constraints from config to classifier training
For large and/or many documents, training the classifier can lead to
OOM errors. Some limits have been set by default.
2021-01-21 17:46:39 +01:00
Eike Kettner
363cf5aef0 Quote names in sql changesets 2021-01-21 00:22:58 +01:00
Eike Kettner
38387e00a0 Fix mariadb migration 2021-01-21 00:22:53 +01:00
Eike Kettner
a6c31be22f Update documentation 2021-01-20 22:47:15 +01:00
Eike Kettner
85ddc61d9d Move date proposal setting to nlp config 2021-01-20 19:17:29 +01:00
Eike Kettner
5d366c3bd6 Make labels in classifier settings more clear 2021-01-20 01:05:59 +01:00
Eike Kettner
2059095581 Update README 2021-01-20 00:59:35 +01:00
Eike Kettner
b12d965223 Improve logging 2021-01-20 00:40:58 +01:00
Eike Kettner
27c24c128d Store tags guessed with classifier in database 2021-01-20 00:30:40 +01:00
Eike Kettner
9d83cb7fe4 Store item based proposals in separate table
Classifier don't work on each attachment, but on all. So the results
must not be stored at an attachment. This reverts some previous
changes to put the classifier results for item entities into its own
table.
2021-01-19 23:48:09 +01:00
Eike Kettner
3ff9284a64 Return classifier results as suggestions 2021-01-19 23:13:51 +01:00
Eike Kettner
75573c905e Use classifier results as fallback when linking proposed metadata 2021-01-19 23:13:34 +01:00
Eike Kettner
8455d1badf Lookup results from classifier
The model may be out of date, data may change. Then it should be
looked up to fetch the id to be compatible with next stages.
2021-01-19 22:56:01 +01:00
Eike Kettner
1cd3441462 Run classifier for item entities (concerned, correspondent)
Store the results separately from nlp results in attachment metadata.
2021-01-19 22:08:29 +01:00
Eike Kettner
d124f0c1a9 Rename db changeset
It's not just a fix, but adds new things
2021-01-19 22:08:29 +01:00
Eike Kettner
5c487ef7a9 Refactor running classifier in text analysis 2021-01-19 21:30:02 +01:00
Eike Kettner
99dcaae66b Learn classifiers for item entities
Learns classifiers for concerned and correspondent entities. This can
be used as an alternative to or after nlp.
2021-01-19 20:54:47 +01:00
Eike Kettner
a6f29153c4 Control what tag categories to use for auto-tagging 2021-01-19 01:20:13 +01:00
Eike Kettner
cce8878898 Exclude tags w/o category from classifying; remove obsolete models 2021-01-18 21:51:49 +01:00