Commit Graph

2370 Commits

Author SHA1 Message Date
Eike Kettner
3f316ab4d0 Update config file doc 2020-02-20 21:10:00 +01:00
eikek
7e23f206ab Merge pull request #35 from scala-steward/update/flyway-core-6.2.4
Update flyway-core to 6.2.4
2020-02-20 16:58:02 +01:00
Scala Steward
f03c893148 Update flyway-core to 6.2.4 2020-02-20 16:16:42 +01:00
Eike Kettner
fbe0c1aec5 Allow more chars for mimetype 2020-02-20 00:39:31 +01:00
Eike Kettner
97305d27ff Integrate support for more files into processing and upload
The restriction that only pdf files can be uploaded is removed. All
files can now be uploaded. The processing may not process all. It is
still possible to restrict file uploads by types via a configuration.
2020-02-19 23:27:00 +01:00
Eike Kettner
9b1349734e Convert some files to pdf 2020-02-19 02:03:10 +01:00
Eike Kettner
5869e2ee6e Streamline extern-conv stdin/infile 2020-02-18 12:43:47 +01:00
Eike Kettner
0dcc00836b Make logger configurable in system commands 2020-02-18 12:02:43 +01:00
Eike Kettner
bd605b8c94 Add first drafts for converting 2020-02-18 01:31:22 +01:00
Eike Kettner
c665c212a0 Early draft for running wkhtmltopdf 2020-02-17 14:02:23 +01:00
Eike Kettner
e0682464b5 Configure pdf extraction; move Logger and DataType to common 2020-02-17 14:01:36 +01:00
Eike Kettner
3d615181e0 Early draft for text extraction 2020-02-17 01:57:22 +01:00
Eike Kettner
1a5546fe99 scalafmt: change align most->more 2020-02-16 22:03:27 +01:00
Eike Kettner
756f8bcb4c Merge remote-tracking branch 'origin/master' into feature/file-types 2020-02-16 21:53:28 +01:00
Eike Kettner
8143a4edcc Adding extraction primitives 2020-02-16 21:37:26 +01:00
eikek
e32880e5b4 Merge pull request #34 from scala-steward/update/scalafmt-core-2.4.1
Update scalafmt-core to 2.4.1
2020-02-16 11:22:50 +01:00
Scala Steward
13700dbda7 Update scalafmt-core to 2.4.1 2020-02-16 04:26:36 +01:00
Eike Kettner
851ee7ef0f Reorganize processing code
Use separate modules for

- text extraction
- conversion to pdf
- text analysis
2020-02-15 21:25:25 +01:00
Eike Kettner
919381be1e More research on how to create pdfs from other files 2020-02-15 13:57:21 +01:00
Eike Kettner
3deba44282 Rename example files 2020-02-15 12:52:24 +01:00
eikek
bdd89e16e5 Merge pull request #33 from scala-steward/update/scalafmt-core-2.4.0
Update scalafmt-core to 2.4.0
2020-02-15 12:46:04 +01:00
Scala Steward
46506ccf97 Update scalafmt-core to 2.4.0 2020-02-15 12:27:50 +01:00
Eike Kettner
1309c8b7fa Move mimetype detection to docspell-files 2020-02-14 22:06:18 +01:00
Eike Kettner
5c3d2b2e28 Rename example-files to files 2020-02-14 11:14:09 +01:00
Eike Kettner
bf9bf25502 Rename example files 2020-02-14 11:10:54 +01:00
eikek
3492ecb684 Merge pull request #32 from scala-steward/update/http4s-blaze-client-0.21.1
Update http4s-blaze-client, ... to 0.21.1
2020-02-14 10:35:17 +01:00
Scala Steward
492ab23973 Update http4s-blaze-client, ... to 0.21.1 2020-02-14 06:22:44 +01:00
eikek
f1b7f8dc32 Merge pull request #31 from scala-steward/update/flyway-core-6.2.3
Update flyway-core to 6.2.3
2020-02-13 18:35:59 +01:00
Scala Steward
dcf9edc50d Update flyway-core to 6.2.3 2020-02-13 14:23:03 +01:00
Eike Kettner
569aae3038 Add example files into its own project
The text and convert module can use them in their tests.
2020-02-11 22:46:23 +01:00
Eike Kettner
2c0425433e Move File class to common module 2020-02-11 22:42:04 +01:00
Eike Kettner
3026f199f7 Some research on pdf conversion 2020-02-11 22:41:44 +01:00
Eike Kettner
ce22b727b1 Add new convert module and sketch its integration 2020-02-11 00:33:52 +01:00
Eike Kettner
3be90d64d5 Move SystemCommand to common module 2020-02-10 22:23:06 +01:00
Eike Kettner
ba3865ef5e Starting to support more file types
First, files are converted to PDF for archiving. It is also easier
to create a preview. This is done via the `ConvertPdf` processing
task (which is not yet implemented).

Text extraction then tries first with the original file. If that
fails, OCR is done on the (potentially) converted pdf file.

To not lose information of the original file, it is saved using the
table `attachment_source`. If the original file is already a pdf, or
the conversion did not succeed, the `attachment` and
`attachment_source` record point to the same file.
2020-02-10 12:42:45 +01:00
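The flow described in commit ba3865ef5e can be illustrated with a minimal sketch. This is not the project's code: the `Attachment` class, the `archive` and `extract_text` functions, and the callback parameters are all hypothetical names chosen for illustration. Only the rules are taken from the commit message (convert to PDF first, extract text from the original and fall back to OCR on the converted file, point both records at the same file when no conversion happened).

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Attachment:
    """Hypothetical stand-in for the `attachment` / `attachment_source` records."""
    source_file: str  # the original upload (what `attachment_source` points to)
    file: str         # the archived file (what `attachment` points to)


def archive(original: str,
            is_pdf: Callable[[str], bool],
            convert_to_pdf: Callable[[str], Optional[str]]) -> Attachment:
    """Convert the upload to PDF for archiving, keeping the original.

    `convert_to_pdf` is an assumed callback returning the converted file
    name, or None when the conversion did not succeed.
    """
    if is_pdf(original):
        # Already a PDF: both records point to the same file.
        return Attachment(source_file=original, file=original)
    converted = convert_to_pdf(original)
    if converted is None:
        # Conversion failed: both records point to the same file.
        return Attachment(source_file=original, file=original)
    return Attachment(source_file=original, file=converted)


def extract_text(att: Attachment,
                 extract: Callable[[str], Optional[str]],
                 ocr: Callable[[str], str]) -> str:
    """Try the original file first; fall back to OCR on the archived file."""
    text = extract(att.source_file)
    if text is not None:
        return text
    return ocr(att.file)
```

In this sketch a failed extraction is signalled by `None`; a real implementation would more likely carry an error value or raise, which the commit message does not specify.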
eikek
57ec8eec53 Merge pull request #30 from scala-steward/update/swagger-ui-3.25.0
Update swagger-ui to 3.25.0
2020-02-10 10:28:14 +01:00
Scala Steward
e08ef5997b Update swagger-ui to 3.25.0 2020-02-10 10:18:20 +01:00
eikek
5d00adc0f7 Merge pull request #29 from scala-steward/update/http4s-blaze-client-0.21.0
Update http4s-blaze-client, ... to 0.21.0
2020-02-09 22:42:50 +01:00
Scala Steward
b653f0c57c Update http4s-blaze-client, ... to 0.21.0 2020-02-09 22:19:17 +01:00
Eike Kettner
5c37efeaba Apply scalafmt to all files 2020-02-09 01:54:26 +01:00
eikek
6a9ec42a03 Merge pull request #28 from scala-steward/update/http4s-blaze-client-0.21.0-RC5
Update http4s-blaze-client, ... to 0.21.0-RC5
2020-02-09 01:29:22 +01:00
Scala Steward
6b391cfde9 Update http4s-blaze-client, ... to 0.21.0-RC5 2020-02-09 00:19:54 +01:00
Eike Kettner
c3fb538f37 Update readme 2020-02-08 18:05:21 +01:00
Eike Kettner
533396d386 Using the new preview route to show the attachment in webui 2020-02-08 18:02:31 +01:00
Eike Kettner
8908ad2561 Add attachment preview url based on ViewerJS
The viewerJS library can display PDF files easily using pdfjs. Another
attachment route redirects to the viewerjs application to display the
current attachment.

The attachment responses have been improved in that now the response
headers are added to all responses. Additionally, a HEAD route has been
added to support the viewerJS application.
2020-02-08 18:02:31 +01:00
Eike Kettner
e1826f39ac Disable revolver plugin on non-app projects
This allows typing `reStart` in the root sbt project to start both
applications.
2020-02-08 18:02:31 +01:00
Eike Kettner
9b66604b96 Include item notes in search 2020-02-08 13:39:06 +01:00
Eike Kettner
d2edddd238 Show attachment meta data in ui
Allow viewing the extracted text and results from text analysis of an
attachment.
2020-02-08 12:23:59 +01:00
eikek
070b4f8452 Merge pull request #24 from scala-steward/update/http4s-blaze-client-0.21.0-RC4
Update http4s-blaze-client, ... to 0.21.0-RC4
2020-02-08 10:09:08 +01:00
eikek
0cbc94b9e7 Merge pull request #27 from scala-steward/update/circe-generic-0.13.0
Update circe-generic, circe-parser to 0.13.0
2020-02-08 10:08:40 +01:00