3003 Commits

Author SHA1 Message Date
cce8878898 Exclude tags w/o category from classifying; remove obsolete models 2021-01-18 21:51:49 +01:00
3e28ce1254 Add the sql concat function to query builder 2021-01-18 21:51:45 +01:00
249f9e6e2a Extend guessing tags to all tag categories 2021-01-18 21:51:45 +01:00
c5778880d9 Update documentation 2021-01-18 17:41:40 +01:00
3f75af0807 Add 9 more languages to the list of document languages 2021-01-18 17:41:40 +01:00
94bb18c152 Refactor solr language fields 2021-01-18 17:41:40 +01:00
26dff18ae0 Add spanish as an example
Adding a new language without nlp now only requires filling in these
pieces:

- define a list of month names to support date recognition
- add it to joex' dockerfile to be available for tesseract
- update the solr migration/field definitions
- update the elm file so it shows up on the client
2021-01-18 17:41:40 +01:00
360cad3304 Refactoring solr/fts migration
When re-indexing everything, skip populating the index in intermediate
steps and do it as the very last step.

Parameterize adding new fields by their language.
2021-01-18 17:41:40 +01:00
ff121d462c Disable memory intensive tests on travis 2021-01-18 17:41:40 +01:00
f01646aeb5 Reorganize nlp pipeline and add nlp-unsupported language italian
Improves and reorganizes how nlp pipelines are set up. Users can now
choose from many options, depending on their hardware and usage
scenario.

This is the basis for using more languages without depending on what
stanford-nlp supports. Support then involves text extraction and
simple regex-ner processing.
2021-01-18 17:41:40 +01:00
a70e9ab614 Store used language for processing on attachmentmeta
Issue: #570
2021-01-17 22:56:33 +01:00
6cf3f9be5a Fix joex version endpoint in spec 2021-01-17 22:56:33 +01:00
aa937797be Choose nlp mode in config file 2021-01-17 22:56:33 +01:00
54a09861c4 Use model cache with basic annotator 2021-01-17 22:56:33 +01:00
a77f67d73a Make pipeline cache generic to be used with BasicCRFAnnotator 2021-01-17 22:56:33 +01:00
4462ebae0f Resurrect the basic ner classifier 2021-01-17 22:56:33 +01:00
a699e87304 Separate ner from classification 2021-01-17 22:56:33 +01:00
f02f15e5bd Move blocker into constructor of text analyser 2021-01-17 22:56:33 +01:00
ffbec3502f Merge pull request #575 from eikek/category-search-fix
Category search fix
2021-01-17 20:28:06 +00:00
b2b8ad625a scalafmt 2021-01-17 20:11:58 +01:00
f0f0e6e0d4 Search for categories case-insensitive
The string was already lowercased, but the comparison was not.

Fixes #568
2021-01-17 20:10:24 +01:00
623a61dbb6 Introduce a lowerEq operator to the query builder 2021-01-17 20:10:00 +01:00
0be89687dc Ignore poi updates for now
The new version 5.0.0 brings an additional 18M of transitive dependencies.
It's worth checking what is needed/changed before upgrading.
2021-01-17 19:49:15 +01:00
f40290d4ef Merge pull request #571 from scala-steward/update/swagger-ui-3.40.0
Update swagger-ui to 3.40.0
2021-01-15 23:45:44 +00:00
d337e8bb6a Update swagger-ui to 3.40.0 2021-01-16 00:19:17 +01:00
53aeb365d1 Merge pull request #569 from scala-steward/update/sbt-scalafix-0.9.25
Update sbt-scalafix to 0.9.25
2021-01-15 20:08:37 +00:00
99641e502b Update sbt-scalafix to 0.9.25 2021-01-15 20:23:51 +01:00
96fcd92c83 Merge pull request #566 from scala-steward/update/yamusca-core-0.8.0
Update yamusca-core to 0.8.0
2021-01-14 01:40:09 +00:00
cd73dd4c08 Update yamusca-core to 0.8.0 2021-01-14 02:13:43 +01:00
b57ceb6764 Use jdk11 in nix packages 2021-01-12 01:09:15 +01:00
f18b763dee Merge branch 'current-docs' 2021-01-12 00:16:41 +01:00
a40779d73a Fix external link 2021-01-12 00:15:37 +01:00
c18c5db717 Remove space in changelog that breaks formatting 2021-01-12 00:14:07 +01:00
54bd75e99e Set version to 0.19.0-SNAPSHOT 2021-01-11 23:27:47 +01:00
646eedadf7 Update nix setup 2021-01-11 23:24:29 +01:00
0d1b55a205 Set version to 0.18.0 2021-01-11 22:39:40 +01:00
83c89d0678 Update changelog date 2021-01-11 22:39:40 +01:00
d77b5855e4 Set default pool-size to 1 2021-01-11 22:30:59 +01:00
8910ac6954 Add docs for file processing 2021-01-11 22:30:59 +01:00
96c337f7db Update screencasts in documentation 2021-01-11 21:38:13 +01:00
31fe4f817a Update documentation 2021-01-11 21:38:13 +01:00
8cf1e38323 Update Changelog 2021-01-11 21:38:13 +01:00
38ae7a9027 Make source a quick link on card and detail 2021-01-11 21:37:36 +01:00
63535408c9 Merge pull request #564 from scala-steward/update/flyway-core-7.5.0
Update flyway-core to 7.5.0
2021-01-11 15:43:40 +00:00
b86783a4a8 Update flyway-core to 7.5.0 2021-01-11 16:22:39 +01:00
33458766fe Correctly reset search menu when clicking on custom-field quick link 2021-01-11 14:03:23 +01:00
35277d9dbf Merge branch 'current-docs' 2021-01-11 13:24:10 +01:00
1b79d7b36d Merge pull request #563 from eikek/tag-menu-fixes
Tag menu fixes
2021-01-11 12:22:18 +00:00
7beda302b2 Fix and improve tag search menu
Also show "empty tags", where the count is 0. Before, only tags with a
count > 0 were displayed. When searching this is fine, but when using
drag&drop to attach tags to items, it is good to see all of them. They
can be hidden via a button.

The tags are now ordered by their count descending, but with respect to
the overall count, not the current view. Otherwise the tags are
reordered when clicking on them, which is confusing. It also
shows the "more important" (most used) tags first, even when the
result is a subset.

A fix was made related to updating the menu. When coming back from
the detail view where a tag with prior count=0 was associated, the
menu didn't show it, because it relied on a previous state in which
this tag was not included.
2021-01-11 13:01:38 +01:00
3fccc3df39 Return all tags in search stats result
Before, only tags with a count > 0 were included. Now those that are
not attached to any item are returned as well.
2021-01-11 12:13:13 +01:00