Eike Kettner
8cfecfb3dd
Update docs
2020-02-22 00:48:58 +01:00
eikek
6d9272af65
Merge pull request #36 from scala-steward/update/sbt-microsites-1.1.2
...
Update sbt-microsites to 1.1.2
2020-02-21 16:55:26 +01:00
Scala Steward
3893fb449e
Update sbt-microsites to 1.1.2
2020-02-21 16:35:43 +01:00
Eike Kettner
98576a5fb5
Add link to original file
2020-02-20 22:40:27 +01:00
Eike Kettner
72fd3b1a25
Implement downloading original file
2020-02-20 22:33:57 +01:00
Eike Kettner
39809f9d05
Sketch route for retrieving original file
2020-02-20 22:12:27 +01:00
Eike Kettner
7e1678da98
Remove pdf filter in ds.sh script
2020-02-20 21:52:33 +01:00
Eike Kettner
7fe8843893
Update documentation sites
2020-02-20 21:43:37 +01:00
Eike Kettner
3f316ab4d0
Update config file doc
2020-02-20 21:10:00 +01:00
eikek
7e23f206ab
Merge pull request #35 from scala-steward/update/flyway-core-6.2.4
...
Update flyway-core to 6.2.4
2020-02-20 16:58:02 +01:00
Scala Steward
f03c893148
Update flyway-core to 6.2.4
2020-02-20 16:16:42 +01:00
Eike Kettner
fbe0c1aec5
Allow more chars for mimetype
2020-02-20 00:39:31 +01:00
Eike Kettner
97305d27ff
Integrate support for more files into processing and upload
...
The restriction that only pdf files can be uploaded is removed. All
files can now be uploaded. The processing may not process all. It is
still possible to restrict file uploads by types via a configuration.
2020-02-19 23:27:00 +01:00
Eike Kettner
9b1349734e
Convert some files to pdf
2020-02-19 02:03:10 +01:00
Eike Kettner
5869e2ee6e
Streamline extern-conv stdin/infile
2020-02-18 12:43:47 +01:00
Eike Kettner
0dcc00836b
Make logger configurable in system commands
2020-02-18 12:02:43 +01:00
Eike Kettner
bd605b8c94
Add first drafts for converting
2020-02-18 01:31:22 +01:00
Eike Kettner
c665c212a0
Early draft for running wkhtmltopdf
2020-02-17 14:02:23 +01:00
Eike Kettner
e0682464b5
Configure pdf extraction; move Logger and DataType to common
2020-02-17 14:01:36 +01:00
Eike Kettner
3d615181e0
Early draft for text extraction
2020-02-17 01:57:22 +01:00
Eike Kettner
1a5546fe99
scalafmt: change align most->more
2020-02-16 22:03:27 +01:00
Eike Kettner
756f8bcb4c
Merge remote-tracking branch 'origin/master' into feature/file-types
2020-02-16 21:53:28 +01:00
Eike Kettner
8143a4edcc
Adding extraction primitives
2020-02-16 21:37:26 +01:00
eikek
e32880e5b4
Merge pull request #34 from scala-steward/update/scalafmt-core-2.4.1
...
Update scalafmt-core to 2.4.1
2020-02-16 11:22:50 +01:00
Scala Steward
13700dbda7
Update scalafmt-core to 2.4.1
2020-02-16 04:26:36 +01:00
Eike Kettner
851ee7ef0f
Reorganize processing code
...
Use separate modules for
- text extraction
- conversion to pdf
- text analysis
2020-02-15 21:25:25 +01:00
Eike Kettner
919381be1e
More research on how to create pdfs from other files
2020-02-15 13:57:21 +01:00
Eike Kettner
3deba44282
Rename example files
2020-02-15 12:52:24 +01:00
eikek
bdd89e16e5
Merge pull request #33 from scala-steward/update/scalafmt-core-2.4.0
...
Update scalafmt-core to 2.4.0
2020-02-15 12:46:04 +01:00
Scala Steward
46506ccf97
Update scalafmt-core to 2.4.0
2020-02-15 12:27:50 +01:00
Eike Kettner
1309c8b7fa
Move mimetype detection to docspell-files
2020-02-14 22:06:18 +01:00
Eike Kettner
5c3d2b2e28
Rename example-files to files
2020-02-14 11:14:09 +01:00
Eike Kettner
bf9bf25502
Rename example files
2020-02-14 11:10:54 +01:00
eikek
3492ecb684
Merge pull request #32 from scala-steward/update/http4s-blaze-client-0.21.1
...
Update http4s-blaze-client, ... to 0.21.1
2020-02-14 10:35:17 +01:00
Scala Steward
492ab23973
Update http4s-blaze-client, ... to 0.21.1
2020-02-14 06:22:44 +01:00
eikek
f1b7f8dc32
Merge pull request #31 from scala-steward/update/flyway-core-6.2.3
...
Update flyway-core to 6.2.3
2020-02-13 18:35:59 +01:00
Scala Steward
dcf9edc50d
Update flyway-core to 6.2.3
2020-02-13 14:23:03 +01:00
Eike Kettner
569aae3038
Add example files into its own project
...
The text and convert module can use them in their tests.
2020-02-11 22:46:23 +01:00
Eike Kettner
2c0425433e
Move File class to common module
2020-02-11 22:42:04 +01:00
Eike Kettner
3026f199f7
Some research on pdf conversion
2020-02-11 22:41:44 +01:00
Eike Kettner
ce22b727b1
Add new convert module and sketch its integration
2020-02-11 00:33:52 +01:00
Eike Kettner
3be90d64d5
Move SystemCommand
to common module
2020-02-10 22:23:06 +01:00
Eike Kettner
ba3865ef5e
Starting to support more file types
...
First, files are be converted to PDF for archiving. It is also easier
to create a preview. This is done via the `ConvertPdf` processing
task (which is not yet implemented).
Text extraction then tries first with the original file. If that
fails, OCR is done on the (potentially) converted pdf file.
To not loose information of the original file, it is saved using the
table `attachment_source`. If the original file is already a pdf, or
the conversion did not succeed, the `attachment` and
`attachment_source` record point to the same file.
2020-02-10 12:42:45 +01:00
eikek
57ec8eec53
Merge pull request #30 from scala-steward/update/swagger-ui-3.25.0
...
Update swagger-ui to 3.25.0
2020-02-10 10:28:14 +01:00
Scala Steward
e08ef5997b
Update swagger-ui to 3.25.0
2020-02-10 10:18:20 +01:00
eikek
5d00adc0f7
Merge pull request #29 from scala-steward/update/http4s-blaze-client-0.21.0
...
Update http4s-blaze-client, ... to 0.21.0
2020-02-09 22:42:50 +01:00
Scala Steward
b653f0c57c
Update http4s-blaze-client, ... to 0.21.0
2020-02-09 22:19:17 +01:00
Eike Kettner
5c37efeaba
Apply scalafmt to all files
2020-02-09 01:54:26 +01:00
eikek
6a9ec42a03
Merge pull request #28 from scala-steward/update/http4s-blaze-client-0.21.0-RC5
...
Update http4s-blaze-client, ... to 0.21.0-RC5
2020-02-09 01:29:22 +01:00
Scala Steward
6b391cfde9
Update http4s-blaze-client, ... to 0.21.0-RC5
2020-02-09 00:19:54 +01:00