946204e809
Update documentation
2021-01-25 09:11:54 +01:00
e9a4f904c9
Update consumedir docs
2021-01-25 08:13:55 +01:00
0d7e92be40
Update changelog
2021-01-25 08:13:55 +01:00
6b5382add1
Update docs
2021-01-25 08:13:55 +01:00
8de4b67b45
Merge pull request #595 from eikek/h2-fix
...
H2 fix
2021-01-25 00:56:10 +00:00
394aeeccb6
Introduce a sql literal and constants in query builder
...
The h2 jdbc driver could not translate the union query in QCollective
when the `kind` was set via a constant value. Using literals works
here. Renamed the corresponding elements in the query builder.
2021-01-25 00:18:24 +01:00
bb744668ca
Remove unused code
2021-01-24 23:24:33 +01:00
69c0254bcf
Merge pull request #593 from eikek/emil-update
...
Update emil to 0.7.1
2021-01-24 19:48:07 +00:00
1e48346361
Update emil to 0.7.1
2021-01-24 20:07:09 +01:00
bbc8f0774e
Merge pull request #592 from scala-steward/update/http4s-blaze-client-0.21.16
...
Update http4s-blaze-client, ... to 0.21.16
2021-01-24 18:19:37 +00:00
01eca0dd14
Update http4s-blaze-client, ... to 0.21.16
2021-01-24 18:31:47 +01:00
6cc9c159d6
Merge pull request #590 from eikek/scan-mailbox-filter
...
Refactor scan mailbox form and add flag for post-processing
2021-01-24 01:06:51 +00:00
96612e0e59
Refactor scan mailbox form and add flag for post-processing
...
Mails are filtered first by an imap search and then by some globs
that filter files and subjects. Imap can search by subject via a
string-contains, but not via globs or patterns (afaik). The subject
filter is applied to all downloaded mail headers. For
post-processing (moving to some target folder or deleting), it can now
be chosen to post-process all "seen" mails or only those that matched
all filters.
2021-01-24 01:46:31 +01:00
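The two-stage filtering described in the commit above can be sketched roughly as follows. This is a minimal Python illustration, not the project's actual (Scala) code: the names `Mail`, `matches`, `select_for_post_processing` and the `only_matched` flag are hypothetical, and the imap search stage is assumed to have already produced the list of "seen" mails. Treating an empty glob list as "no restriction" and matching case-insensitively are also assumptions.

```python
from dataclasses import dataclass
from fnmatch import fnmatchcase
from typing import List


@dataclass
class Mail:
    # Hypothetical, simplified mail header: subject plus attachment file names.
    subject: str
    attachments: List[str]


def matches(mail: Mail, subject_globs: List[str], file_globs: List[str]) -> bool:
    """Second filter stage: glob-match the subject and the attachment names.

    Assumption: an empty glob list places no restriction on that field.
    """
    subject_ok = not subject_globs or any(
        fnmatchcase(mail.subject.lower(), g.lower()) for g in subject_globs
    )
    files_ok = not file_globs or any(
        fnmatchcase(name.lower(), g.lower())
        for name in mail.attachments
        for g in file_globs
    )
    return subject_ok and files_ok


def select_for_post_processing(
    seen: List[Mail],
    subject_globs: List[str],
    file_globs: List[str],
    only_matched: bool,
) -> List[Mail]:
    """Choose the mails to move or delete after a scan.

    `seen` is assumed to be the result of the first stage (the imap search).
    With `only_matched` set, only mails that passed all filters are returned;
    otherwise every seen mail is returned.
    """
    if only_matched:
        return [m for m in seen if matches(m, subject_globs, file_globs)]
    return list(seen)
```

With the flag off, every mail returned by the imap search is post-processed; with it on, mails that failed the glob filters stay untouched in the mailbox.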
d3e5bca9f6
Merge pull request #589 from eikek/nix-tools-package
...
Nix tools package
2021-01-24 00:43:34 +00:00
2f9671f5c1
Merge pull request #588 from scala-steward/update/imageio-jpeg-3.6.2
...
Update imageio-jpeg, imageio-tiff to 3.6.2
2021-01-23 21:49:40 +00:00
529dca091d
Update imageio-jpeg, imageio-tiff to 3.6.2
2021-01-23 22:20:29 +01:00
0089021f92
Merge pull request #587 from eikek/save-settings
...
Fix classifier_settings table
2021-01-23 20:59:28 +00:00
1b66e2af5c
Fix classifier_settings table
2021-01-23 21:30:26 +01:00
606a6756c8
Fix nix package for tools
2021-01-23 20:30:48 +01:00
98aa3423c1
Fix joex config for nix package
2021-01-23 20:30:48 +01:00
189f202b21
Move tool scripts into a separate dir
2021-01-23 20:30:48 +01:00
e6d67c368b
Merge pull request #586 from eikek/optional-text-limit
...
Make the text length limit optional
2021-01-22 23:47:38 +00:00
c7e850116f
Make the text length limit optional
2021-01-22 23:06:50 +01:00
8dd1672c8c
Merge pull request #583 from eikek/fix-baseurl-setting
...
Render baseurl without trailing slash
2021-01-21 23:44:14 +00:00
38e0a50942
Merge pull request #582 from eikek/delete-org-fix
...
Fix deleting organization
2021-01-21 22:56:56 +00:00
df5f9e8c51
Merge pull request #581 from eikek/text-analysis-improvements
...
Text analysis improvements
2021-01-21 22:01:50 +00:00
0ec620fcf0
Render baseurl without trailing slash
...
The webapp expects it like this currently, because the url is only a
string.
2021-01-21 21:42:08 +01:00
f4a03e7c69
Fix deleting organization
...
The foreign key in person must be reset.
2021-01-21 21:27:02 +01:00
4cba96f390
Always return classifier results as suggestion
...
The classifier results are spliced into the suggestion list in second
position. When linking, they are only used if nlp didn't find anything.
2021-01-21 21:05:28 +01:00
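The behaviour described in the commit above can be illustrated with a small sketch. It is written in Python for brevity and is not the project's code; the function names `splice_second` and `pick_link` are hypothetical, and suggestions are modelled as plain strings.

```python
from typing import List, Optional


def splice_second(suggestions: List[str], classifier_results: List[str]) -> List[str]:
    """Insert the classifier results after the first existing suggestion.

    If there are no existing suggestions, the classifier results come first.
    """
    return suggestions[:1] + classifier_results + suggestions[1:]


def pick_link(nlp_results: List[str], classifier_results: List[str]) -> Optional[str]:
    """When linking, fall back to the classifier only if nlp found nothing."""
    candidates = nlp_results or classifier_results
    return candidates[0] if candidates else None
```

The suggestion list always includes the classifier results, while automatic linking still prefers the nlp results whenever there are any.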
b9b554980a
Merge pull request #580 from scala-steward/update/flyway-core-7.5.1
...
Update flyway-core to 7.5.1
2021-01-21 19:20:53 +00:00
021ac568ae
Update documentation for text analysis
2021-01-21 20:06:53 +01:00
cea9fa0d38
Update flyway-core to 7.5.1
2021-01-21 18:27:10 +01:00
9957c3267e
Add constraints from config to classifier training
...
For large and/or many documents, training the classifier can lead to
OOM errors. Some limits have been set by default.
2021-01-21 17:46:39 +01:00
363cf5aef0
Quote names in sql changesets
2021-01-21 00:22:58 +01:00
38387e00a0
Fix mariadb migration
2021-01-21 00:22:53 +01:00
a6c31be22f
Update documentation
2021-01-20 22:47:15 +01:00
85ddc61d9d
Move date proposal setting to nlp config
2021-01-20 19:17:29 +01:00
5d366c3bd6
Make labels in classifier settings more clear
2021-01-20 01:05:59 +01:00
2059095581
Update README
2021-01-20 00:59:35 +01:00
b12d965223
Improve logging
2021-01-20 00:40:58 +01:00
27c24c128d
Store tags guessed with classifier in database
2021-01-20 00:30:40 +01:00
9d83cb7fe4
Store item based proposals in separate table
...
Classifiers don't work on each attachment, but on all of them. So the
results must not be stored with an attachment. This reverts some
previous changes in order to put the classifier results for item
entities into their own table.
2021-01-19 23:48:09 +01:00
3ff9284a64
Return classifier results as suggestions
2021-01-19 23:13:51 +01:00
75573c905e
Use classifier results as fallback when linking proposed metadata
2021-01-19 23:13:34 +01:00
8455d1badf
Lookup results from classifier
...
The model may be out of date, since data may change. The results
should therefore be looked up to fetch the id, so that they are
compatible with the next stages.
2021-01-19 22:56:01 +01:00
1cd3441462
Run classifier for item entities (concerned, correspondent)
...
Store the results separately from nlp results in attachment metadata.
2021-01-19 22:08:29 +01:00
d124f0c1a9
Rename db changeset
...
It's not just a fix, but adds new things
2021-01-19 22:08:29 +01:00
5c487ef7a9
Refactor running classifier in text analysis
2021-01-19 21:30:02 +01:00
99dcaae66b
Learn classifiers for item entities
...
Learns classifiers for concerned and correspondent entities. This can
be used as an alternative to or after nlp.
2021-01-19 20:54:47 +01:00
a6f29153c4
Control what tag categories to use for auto-tagging
2021-01-19 01:20:13 +01:00