Eike Kettner 
							
						 
					 
					
						
						
							
						
						e0682464b5 
					 
					
						
						
							
							Configure pdf extraction; move Logger and DataType to common  
						
						 
						
						
						
						
					 
					
						2020-02-17 14:01:36 +01:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eike Kettner 
							
						 
					 
					
						
						
							
						
						3d615181e0 
					 
					
						
						
							
							Early draft for text extraction  
						
						 
						
						
						
						
					 
					
						2020-02-17 01:57:22 +01:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eike Kettner 
							
						 
					 
					
						
						
							
						
						1309c8b7fa 
					 
					
						
						
							
							Move mimetype detection to docspell-files  
						
						 
						
						
						
						
					 
					
						2020-02-14 22:06:18 +01:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eike Kettner 
							
						 
					 
					
						
						
							
						
						bf9bf25502 
					 
					
						
						
							
							Rename example files  
						
						 
						
						
						
						
					 
					
						2020-02-14 11:10:54 +01:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eike Kettner 
							
						 
					 
					
						
						
							
						
						2c0425433e 
					 
					
						
						
							
							Move File class to common module  
						
						 
						
						
						
						
					 
					
						2020-02-11 22:42:04 +01:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eike Kettner 
							
						 
					 
					
						
						
							
						
						3be90d64d5 
					 
					
						
						
							
							Move SystemCommand to common module  
						
						 
						
						
						
						
					 
					
						2020-02-10 22:23:06 +01:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eike Kettner 
							
						 
					 
					
						
						
							
						
						ba3865ef5e 
					 
					
						
						
							
							Starting to support more file types  
						
						 
						
						
						
						
						First, files are converted to PDF for archiving, which also makes it
easier to create a preview. This is done via the `ConvertPdf` processing
task (which is not yet implemented).
Text extraction first tries the original file. If that
fails, OCR is run on the (potentially) converted PDF file.
To avoid losing information from the original file, it is saved using the
table `attachment_source`. If the original file is already a PDF, or
the conversion did not succeed, the `attachment` and
`attachment_source` records point to the same file. 
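
The flow described in this commit message can be sketched roughly as follows. This is an illustrative Python sketch only, not the project's code: the names `process`, `Stored`, `convert_pdf`, `extract_text` and `ocr` are hypothetical, and the three callables stand in for processing steps that return `None` on failure.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Stored:
    attachment: str           # file the `attachment` record points to
    attachment_source: str    # file the `attachment_source` record points to
    text: Optional[str]       # extracted text, if any


def process(
    original: str,
    is_pdf: bool,
    convert_pdf: Callable[[str], Optional[str]],   # hypothetical; None = failed
    extract_text: Callable[[str], Optional[str]],  # hypothetical; None = failed
    ocr: Callable[[str], Optional[str]],           # hypothetical; None = failed
) -> Stored:
    # Convert to PDF for archiving unless the file already is a PDF.
    converted = None if is_pdf else convert_pdf(original)
    # If nothing was converted, both records point to the same file.
    archived = converted if converted is not None else original

    # Text extraction tries the original file first ...
    text = extract_text(original)
    if text is None:
        # ... and falls back to OCR on the (potentially) converted PDF.
        text = ocr(archived)

    return Stored(attachment=archived, attachment_source=original, text=text)
```

With this shape, an already-PDF upload or a failed conversion yields identical `attachment` and `attachment_source` values, matching the behaviour described above.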
						
						
					 
					
						2020-02-10 12:42:45 +01:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eike Kettner 
							
						 
					 
					
						
						
							
						
						5c37efeaba 
					 
					
						
						
							
							Apply scalafmt to all files  
						
						 
						
						
						
						
					 
					
						2020-02-09 01:54:26 +01:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eike Kettner 
							
						 
					 
					
						
						
							
						
						88efe13209 
					 
					
						
						
							
							Fix item route responses  
						
						 
						
						
						
						
						Also avoid storing empty strings in a nullable field. 
						
						
					 
					
						2020-01-11 12:58:04 +01:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eike Kettner 
							
						 
					 
					
						
						
							
						
						4490a444a9 
					 
					
						
						
							
							Allow dots in identifiers  
						
						 
						
						
						
						
					 
					
						2020-01-07 00:20:41 +01:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eike Kettner 
							
						 
					 
					
						
						
							
						
						9020d9aa3b 
					 
					
						
						
							
							Don't require a prefix when configuring byte arrays  
						
						 
						
						
						
						
					 
					
						2020-01-05 15:29:58 +01:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eike Kettner 
							
						 
					 
					
						
						
							
						
						8814de3c38 
					 
					
						
						
							
							Allow simple search when listing meta data  
						
						 
						
						
						
						
					 
					
						2020-01-02 20:21:49 +01:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eike Kettner 
							
						 
					 
					
						
						
							
						
						fc3e22e399 
					 
					
						
						
							
							Apply scalafmt to all files  
						
						 
						
						
						
						
					 
					
						2019-12-30 21:44:13 +01:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eike Kettner 
							
						 
					 
					
						
						
							
						
						a9e70401de 
					 
					
						
						
							
							Update dependencies  
						
						 
						
						
						
						
					 
					
						2019-12-28 12:38:11 +01:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eike Kettner 
							
						 
					 
					
						
						
							
						
						07a23b9611 
					 
					
						
						
							
							Fix percent encoding  
						
						 
						
						
						
						
						Must use UTF-8 bytes, of course… 
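
As a rough illustration of the point (not the project's actual fix), percent-encoding operates on the UTF-8 bytes of a string rather than on its characters. The helper name `percent_encode` below is hypothetical; the unreserved character set follows RFC 3986.

```python
import string

# Unreserved characters per RFC 3986, as byte values.
UNRESERVED = set((string.ascii_letters + string.digits + "-._~").encode("ascii"))


def percent_encode(s: str) -> str:
    out = []
    # Encode to UTF-8 first, then escape byte by byte.
    for b in s.encode("utf-8"):
        if b in UNRESERVED:
            out.append(chr(b))
        else:
            out.append(f"%{b:02X}")
    return "".join(out)
```

A non-ASCII character such as `ä` therefore becomes two escaped bytes (`%C3%A4`) instead of a single escape of its code point.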
						
						
					 
					
						2019-12-11 21:56:31 +01:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eike Kettner 
							
						 
					 
					
						
						
							
						
						2ad1586d00 
					 
					
						
						
							
							Set stricter compile options and fix cookie data  
						
						 
						
						
						
						
					 
					
						2019-09-28 22:17:45 +02:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eike Kettner 
							
						 
					 
					
						
						
							
						
						831cd8b655 
					 
					
						
						
							
							Initial version.  
						
						 
						
						
						
						
						Features:
- Upload PDF files and let them be analyzed
- Manage meta data and items
- See processing in webapp 
						
						
					 
					
						2019-09-21 22:02:36 +02:00