99dcaae66b
Learn classifiers for item entities
...
Learns classifiers for concerned and correspondent entities. This can
be used as an alternative to nlp or after it.
2021-01-19 20:54:47 +01:00
a6f29153c4
Control what tag categories to use for auto-tagging
2021-01-19 01:20:13 +01:00
cce8878898
Exclude tags w/o category from classifying; remove obsolete models
2021-01-18 21:51:49 +01:00
3e28ce1254
Add the sql concat function to query builder
2021-01-18 21:51:45 +01:00
249f9e6e2a
Extend guessing tags to all tag categories
2021-01-18 21:51:45 +01:00
3f75af0807
Add 9 more languages to the list of document languages
2021-01-18 17:41:40 +01:00
94bb18c152
Refactor solr language fields
2021-01-18 17:41:40 +01:00
26dff18ae0
Add spanish as an example
...
Adding a new language without nlp now only requires filling in these
pieces:
- define a list of month names to support date recognition
- add it to joex' dockerfile to be available for tesseract
- update the solr migration/field definitions
- update the elm file so it shows up on the client
2021-01-18 17:41:40 +01:00
360cad3304
Refactoring solr/fts migration
...
When re-indexing everything, skip populating the index in between
and do this as the very last step.
Parameterize adding new fields by their language.
2021-01-18 17:41:40 +01:00
ff121d462c
Disable memory intensive tests on travis
2021-01-18 17:41:40 +01:00
f01646aeb5
Reorganize nlp pipeline and add nlp-unsupported language italian
...
Improves and reorganizes how nlp pipelines are set up. Now users can
choose from many options, depending on their hardware and usage
scenario.
This is the base to use more languages without depending on what
stanford-nlp supports. Support then involves only text extraction and
simple regex-ner processing.
2021-01-18 17:41:40 +01:00
a70e9ab614
Store used language for processing on attachmentmeta
...
Issue: #570
2021-01-17 22:56:33 +01:00
6cf3f9be5a
Fix joex version endpoint in spec
2021-01-17 22:56:33 +01:00
aa937797be
Choose nlp mode in config file
2021-01-17 22:56:33 +01:00
54a09861c4
Use model cache with basic annotator
2021-01-17 22:56:33 +01:00
a77f67d73a
Make pipeline cache generic to be used with BasicCRFAnnotator
2021-01-17 22:56:33 +01:00
4462ebae0f
Resurrect the basic ner classifier
2021-01-17 22:56:33 +01:00
a699e87304
Separate ner from classification
2021-01-17 22:56:33 +01:00
f02f15e5bd
Move blocker into constructor of text analyser
2021-01-17 22:56:33 +01:00
b2b8ad625a
scalafmt
2021-01-17 20:11:58 +01:00
f0f0e6e0d4
Search for categories case-insensitive
...
The string was already lowercased, but the comparison was not.
Fixes #568
2021-01-17 20:10:24 +01:00
623a61dbb6
Introduce a lowerEq operator to the query builder
2021-01-17 20:10:00 +01:00
54bd75e99e
Set version to 0.19.0-SNAPSHOT
2021-01-11 23:27:47 +01:00
0d1b55a205
Set version to 0.18.0
2021-01-11 22:39:40 +01:00
d77b5855e4
Set default pool-size to 1
2021-01-11 22:30:59 +01:00
38ae7a9027
Make source a quick link on card and detail
2021-01-11 21:37:36 +01:00
33458766fe
Correctly reset search menu when clicking on custom-field quick link
2021-01-11 14:03:23 +01:00
7beda302b2
Fix and improve tag search menu
...
Also show "empty tags", where the count is 0. Before only tags with a
count > 0 were displayed. When searching this is fine, but when using
drag&drop to attach tags to items, it is good to see all. They can be
hidden via a button.
The tags are now ordered by their count descending, but with respect to
the overall count, not the current view. Otherwise the tags are
reordered when clicking on them, which is confusing. Also it then
shows the "more important" (most used) tags first, even when the
result is a subset.
A fix was made related to updating the menu. When coming back from
the detail view where a tag with prior count=0 was associated, the
menu didn't show it, because it relied on a previous state, where this
tag was not included.
2021-01-11 13:01:38 +01:00
3fccc3df39
Return all tags in search stats result
...
Before only tags with a count > 0 were included. Now those that are
not attached to any item are returned as well.
2021-01-11 12:13:13 +01:00
75986c461f
Fix ner date label boundary reporting
2021-01-10 09:10:39 +01:00
fb05e997ab
Provide multiple date suggestions for English
...
Issue: #561
2021-01-10 09:02:26 +01:00
bddafa7d28
Fix looping over already seen mails when they are skipped
...
When mails are skipped due to a filter, they must still enter the
post-handling step. Otherwise they will be seen again on the next run.
Issue: #551
2021-01-09 15:07:18 +01:00
d712f8303d
Make glob matching case-insensitive by default
2021-01-09 13:23:15 +01:00
cbca4d234f
Fix scrolling to card
...
That was broken due to the independent scroll in commit #bcb1b8.
2021-01-09 02:00:01 +01:00
cef1c38cc4
Restrict height of job output
2021-01-09 01:49:55 +01:00
0abd7dea10
Fix scrolling to top in detail view
2021-01-09 01:16:59 +01:00
c0d7aba5d5
Improve selecting attachments of an item
...
Use a list of small thumbnails instead of just names.
Closes: #396
2021-01-09 01:16:59 +01:00
9bc2084499
Allow to click on custom fields in detail view
...
Closes: #514
2021-01-09 01:16:59 +01:00
48d182667d
Harmonize login and register page
2021-01-09 01:16:59 +01:00
752c8f9be2
Show new-invite as normal page
2021-01-09 01:16:59 +01:00
bcb1b87fc0
Enable independent scrolling of search menu and list
...
Fixes some other minor css issues.
Closes: #541
2021-01-09 01:16:59 +01:00
3c12e3678f
Allow to search for * in custom date fields
...
This requires passing the raw input through to the caller.
Closes: #550
2021-01-09 01:16:59 +01:00
14dacaa837
Fix typo
2021-01-09 10:41:53 +11:00
716252721c
Fix cache clearing
...
It must be cancelled when obtaining a pipeline.
2021-01-07 23:31:01 +01:00
a670bbb6c2
Make idle interval when clearing nlp cache configurable
2021-01-06 23:03:00 +01:00
73a9572835
Poc for clearing stanford pipeline after some idle time
2021-01-05 23:56:20 +01:00
b08e88cd69
Add (unofficial) routes to get system information
2021-01-05 20:54:53 +01:00
30df887934
Sort custom field options in dropdown
2021-01-05 18:04:54 +01:00
668abf2140
Add a reset-password admin route
2021-01-04 20:59:31 +01:00
2a172ce720
Remove fulltext recreate-key config value
...
It's now in the admin routes, protected by the
`admin-endpoint.secret`.
2021-01-04 15:18:02 +01:00