mirror of
				https://github.com/TheAnachronism/docspell.git
				synced 2025-10-24 22:30:10 +00:00 
			
		
		
		
	
		
			
				
	
	
		
			313 lines
		
	
	
		
			10 KiB
		
	
	
	
		
			Markdown
		
	
	
	
	
	
			
		
		
	
	
			313 lines
		
	
	
		
			10 KiB
		
	
	
	
		
			Markdown
		
	
	
	
	
	
| +++
 | |
| title = "Docker"
 | |
| weight = 20
 | |
| +++
 | |
| 
 | |
| # Docker Images
 | |
| 
 | |
| The docker images are at
 | |
| [hub.docker.com](https://hub.docker.com/u/docspell). The `latest` tag
 | |
| always points to the latest *release*. The releases are also tagged
 | |
| with their respective version number. Additionally, there are images
 | |
| tagged with `nightly` which are built from the `master` branch.
 | |
| Therefore the `nightly` packages should be used with care, because
 | |
| things might break in between. But they are useful for trying out
 | |
| something.
 | |
| 
 | |
| There are images for all components that are available from the github
 | |
| release page. The images contain all the necessary
 | |
| [prerequisites](@/docs/install/prereq.md).
 | |
| 
 | |
| - `docspell/restserver` this images contains the http server
 | |
| - `docspell/joex` this image contains the job executor and all
 | |
|   required software (ocrmypdf, unoconv etc) mentioned in
 | |
|   [prerequisites](@/docs/install/prereq.md).
 | |
| - `docspell/dsc` this is an image containing a
 | |
|   [cli](@/docs/tools/cli.md) for docspell that can be used to watch
 | |
|   directories for new files. It doesn't specify a `CMD` or
 | |
|   `ENTRYPOINT`, so you must specify the exact command to run. Here, it
 | |
|   is used to watch a directory for uploading files. This runs the `dsc
 | |
|   watch` command.
 | |
| 
 | |
| ## Examples
 | |
| 
 | |
| These examples use `docker run` to start the restserver and
 | |
| jobexecutor. Both must be connected to the same database. For this
 | |
| example, a shared directory is used and the in-process database H2.
 | |
| For a real setup, using PostgreSQL is recommended.
 | |
| 
 | |
| This requires to change the default config. This example creates a new
 | |
| config file. Please refer to the [configuration
 | |
| page](@/docs/configure/_index.md) for more details.
 | |
| 
 | |
| ``` bash
 | |
| $ cat > /tmp/docspell.conf <<-"EOF"
 | |
| # common settings
 | |
| db_url = "jdbc:h2:///var/docspell/db;MODE=PostgreSQL;DATABASE_TO_LOWER=TRUE;AUTO_SERVER=TRUE"
 | |
| 
 | |
| # job executor settings
 | |
| docspell.joex.jdbc.url = ${db_url}
 | |
| docspell.joex.base-url = "http://10.4.3.2:7878"
 | |
| docspell.joex.bind.address = "0.0.0.0"
 | |
| 
 | |
| # restserver settings
 | |
| docspell.server.backend.jdbc.url = ${db_url}
 | |
| docspell.server.bind.address = "0.0.0.0"
 | |
| docspell.server.integration-endpoint {
 | |
|   enabled = true
 | |
|   http-header {
 | |
|     enabled = true
 | |
|     header-value = "test123"
 | |
|   }
 | |
| }
 | |
| EOF
 | |
| ```
 | |
| 
 | |
| This sets the db url to the same value for both components; thus we
 | |
| can use the same file for both components. It also sets the bind
 | |
| address to bind the server socket on all interfaces. Another thing to
 | |
| note is the `base-url` setting for joex. This publishes joex by this
 | |
| ip, such that the server component can notify the job executor for new
 | |
| work. The `integration-endpoint` setting is explained later.
 | |
| 
 | |
| After creating a common network, we start the rest server and joex:
 | |
| ```
 | |
| $ docker network create --subnet 10.4.3.0/24 dsnet
 | |
| $ docker run -d --name ds-restserver \
 | |
|     --network dsnet --ip 10.4.3.3 \
 | |
|     -p 127.0.0.1:7880:7880 \
 | |
|     -v /tmp/testdb:/var/docspell \
 | |
|     -v /tmp/docspell.conf:/opt/docspell.conf \
 | |
|     docspell/restserver:latest /opt/docspell.conf
 | |
| $ docker run -d --name ds-joex \
 | |
|     --network dsnet --ip 10.4.3.2 \
 | |
|     -v /tmp/testdb:/var/docspell \
 | |
|     -v /tmp/docspell.conf:/opt/docspell.conf \
 | |
|     docspell/joex:latest /opt/docspell.conf
 | |
| ```
 | |
| 
 | |
| After this `docker ps` should show these two containers. Go to
 | |
| `http://localhost:7880` and sign up/login and start playing around.
 | |
| When signing up, use the same name for collective and user and then
 | |
| login with this name.
 | |
| 
 | |
| For the last part, we use the `docspell/dsc` image to create another
 | |
| container that watches a directory and pushes files to docspell.
 | |
| 
 | |
| ``` bash
 | |
| $ docker run -d --name ds-consume \
 | |
|     --network dsnet --ip 10.4.3.4 \
 | |
|     -v /tmp/inbox:/var/inbox \
 | |
|     docspell/dsc:latest dsc -v -d http://10.4.3.3:7880 watch -r --delete -i \
 | |
|       --header "Docspell-Integration:test123" /var/inbox
 | |
| ```
 | |
| 
 | |
| This starts the [dsc](@/docs/tools/cli.md) tool that watches a
 | |
| directory and uploads arriving files to the docspell server. This
 | |
| requires the value from the `integration-endpoint` setting to be
 | |
| allowed to upload files. It also requires you to explicitely enable
 | |
| this: go to *Collective Profile → Settings* and enable the
 | |
| *Integration Endpoint*. Then create a subdirectory in `/tmp/inbox`
 | |
| with the name of the *collective* that you registered and place a file
 | |
| into the `/tmp/inbox/[collective]` directory. The file is pushed to
 | |
| docspell and processed shortly after.
 | |
| 
 | |
| To see all available options, run `dsc` with the `--help` option:
 | |
| ``` bash
 | |
| $ docker run docspell/dsc:latest dsc --help
 | |
| ```
 | |
| 
 | |
| Or just [download the
 | |
| binary](https://github.com/docspell/dsc/releases/latest), no docker
 | |
| required.
 | |
| 
 | |
| Note that this is just an example and is only to demonstrate how to
 | |
| use the docker images. For instance, this setup does not provide
 | |
| fulltext search. For a more sophisticated docker setup, use
 | |
| appropriate tools, for example `docker-compose` which is explained
 | |
| below.
 | |
| 
 | |
| # Docker Compose
 | |
| 
 | |
| There is a [docker-compose](https://docs.docker.com/compose/) setup
 | |
| available in the `/docker/docker-compose` folder. This setup is
 | |
| similiar to the example above, but adding fulltext search and a
 | |
| PostgreSQL database by using just one command. It's only a few steps
 | |
| to get started.
 | |
| 
 | |
| ## Start Docspell
 | |
| ### 1. Get the docker-compose files
 | |
| 
 | |
| There are two options. You can clone the whole repository:
 | |
| 
 | |
| ```bash
 | |
| $ git clone https://github.com/eikek/docspell
 | |
| ```
 | |
| 
 | |
| This downloads all sources. What you actually need is only 3 files. So
 | |
| if you don't have git or don't want to clone the whole repo, use these
 | |
| steps instead:
 | |
| 
 | |
| ``` bash
 | |
| $ mkdir -p docspell/docker/docker-compose
 | |
| $ cd docspell/docker/docker-compose
 | |
| $ wget https://raw.githubusercontent.com/eikek/docspell/master/docker/docker-compose/docker-compose.yml
 | |
| ```
 | |
| 
 | |
| You can choose any directory instead of
 | |
| `docspell/docker/docker-compose`, of course. It's only this folder to
 | |
| make the rest of the guide work for both ways of obtaining the
 | |
| docker-compose file.
 | |
| 
 | |
| ### 2. Run `docker-compose up`
 | |
| 
 | |
| Change into the new `docker-compose` directory, for example:
 | |
| 
 | |
| ```bash
 | |
| $ cd docspell/docker/docker-compose
 | |
| ```
 | |
| 
 | |
| Then run `docker-compose`:
 | |
| 
 | |
| ```bash
 | |
| $ docker-compose up -d
 | |
| ```
 | |
| 
 | |
| If you look at `docker-compose.yml`, there are several environment
 | |
| variables defined. A few that you should change, i.e. all "secrets":
 | |
| 
 | |
| - `DOCSPELL_SERVER_ADMIN__ENDPOINT_SECRET`
 | |
| - `DOCSPELL_SERVER_AUTH_SERVER__SECRET`
 | |
| - `DOCSPELL_SERVER_INTEGRATION__ENDPOINT_HTTP__HEADER_HEADER__VALUE`
 | |
| 
 | |
| Then, the value for
 | |
| `DOCSPELL_SERVER_INTEGRATION__ENDPOINT_HTTP__HEADER_HEADER__VALUE`
 | |
| must be duplicated in the consumedir command (both values must match).
 | |
| It is the header defined for the [integration
 | |
| endpoint](@/docs/api/upload.md#integration-endpoint). You can use
 | |
| whatever you like, best something random. Please see the help to the
 | |
| [dsc tool](@/docs/tools/cli.md) docs for additional info.
 | |
| 
 | |
| Goto `http://localhost:7880`, signup and login. When signing up,
 | |
| choose the same name for collective and user. Then login with this
 | |
| name and the password.
 | |
| 
 | |
| (Optional) Create a folder `./docs/<collective-name>` (the name you
 | |
| chose for the collective at registration) and place files in there for
 | |
| importing them.
 | |
| 
 | |
| Docspell can be configured via environment variables or a config file.
 | |
| Please see the [configuration](@/docs/configure/_index.md) for more
 | |
| details and possible values/variables. You can create a config file
 | |
| and mount it into the container. Then specify the config file as the
 | |
| an argument to the command, i.e. add a
 | |
| 
 | |
| ``` yml
 | |
| command:
 | |
|   - /path/to/config.conf
 | |
| ```
 | |
| 
 | |
| to the service definition (or add it to an existing `command:`
 | |
| section).
 | |
| 
 | |
| ## Override this setup
 | |
| 
 | |
| If you want to change this setup, you can simply use your own compose
 | |
| file or add a `docker-compose.override.yml` that allows to amend
 | |
| certain configs. Look [here](https://docs.docker.com/compose/extends/)
 | |
| to find more about it.
 | |
| 
 | |
| As an example, here is a `docker-compose.override.yml`:
 | |
| 
 | |
| ``` yaml
 | |
| version: '3.7'
 | |
| 
 | |
| services:
 | |
|   consumedir:
 | |
|     volumes:
 | |
|       - importdocs:/opt/docs
 | |
| 
 | |
| volumes:
 | |
|   docspell-postgres_data:
 | |
|     driver: local
 | |
|     driver_opts:
 | |
|       type: nfs4
 | |
|       o: addr=192.168.x.y,rw,noatime,rsize=8192,wsize=8192,tcp,timeo=14
 | |
|       device: ":/mnt/FreeNas/docker_vol1/docspell/postgres_data"
 | |
| 
 | |
|   docspell-solr_data:
 | |
|     driver: local
 | |
|     driver_opts:
 | |
|       type: nfs4
 | |
|       o: addr=192.168.x.y,rw,noatime,rsize=8192,wsize=8192,tcp,timeo=14
 | |
|       device: ":/mnt/FreeNas/docker_vol1/docspell/solr_data"
 | |
| 
 | |
|   importdocs:
 | |
|     driver: local
 | |
|     driver_opts:
 | |
|       type: nfs4
 | |
|       o: addr=192.168.x.y,rw,noatime,rsize=8192,wsize=8192,tcp,timeo=14
 | |
|       device: ":/mnt/FreeNas/archiv/gescannt/output"
 | |
| ```
 | |
| 
 | |
| 
 | |
| ## Upgrading
 | |
| 
 | |
| Since [downgrading](@/docs/install/downgrading.md) is not supported,
 | |
| it is recommended to backup your database before upgrading. Should
 | |
| something not work as expected, restore the database backup and go
 | |
| back to the previous version.
 | |
| 
 | |
| The latest release is always tagged with `latest`. Should you use this
 | |
| tag, then run these commands to upgrade to newer images:
 | |
| 
 | |
| ``` bash
 | |
| $ docker-compose down
 | |
| $ docker-compose pull
 | |
| $ docker-compose up --force-recreate --build -d
 | |
| ```
 | |
| 
 | |
| ## Backups
 | |
| 
 | |
| When running the docker compose setup, you can use the following to
 | |
| backup the database.
 | |
| 
 | |
| 1. (Optionally) Stop docspell, for example with `docker-compose down`.
 | |
|    It is preferred to stop, i.e. should you upgrade versions.
 | |
| 2. Add a new file `docker-compose.override.yml` (next to your
 | |
|    `docker-compose.yml`) with this content:
 | |
| 
 | |
|    ```yml
 | |
|    version: '3.8'
 | |
|    services:
 | |
| 
 | |
|    db:
 | |
|      volumes:
 | |
|        - /some/backupdir:/opt/backup
 | |
|    ```
 | |
| 
 | |
|    The `/some/backupdir` is the directory where the backup should be stored on the host.
 | |
| 3. If you stopped the containers in step 1, start now  **only** the db service via `docker-compose up -d -- db`
 | |
| 4. Run the dump command:
 | |
|    ```
 | |
|    docker exec -it postgres_db pg_dump -d dbname -U dbuser -Fc -f /opt/backup/docspell.sqlc
 | |
|    ```
 | |
| 
 | |
| The `docker-compose.override.yml` file is only to mount a local
 | |
| directory into the db container. You can also add these lines directly
 | |
| into the `docker-compose.yml`. Now you have the dump in your local
 | |
| `/some/backupdir` directory.
 | |
| 
 | |
| This dump can be restored almost the same way. Mount your backup
 | |
| directory into the db container as before in steps 1-3. Then run this
 | |
| command in step 4 instead:
 | |
| 
 | |
| ```
 | |
| docker exec -it postgres_db pg_restore -d dbname -U dbuser -Fc /opt/backup/docspell.sqlc
 | |
| ```
 | |
| 
 | |
| So, before the upgrade run steps 1 to 4. Then you have a dump of your
 | |
| current database (everything, files and all other data). When creating
 | |
| and restoring a dump, do not start the docspell containers - make sure
 | |
| to start the db container only.
 |