docspell/website/site/content/docs/install/rpi.md

53 lines
1.9 KiB
Markdown
Raw Normal View History

2020-07-27 20:13:22 +00:00
+++
2022-01-27 19:23:15 +00:00
title = "Raspberry-Pi"
2020-07-27 20:13:22 +00:00
weight = 40
+++
2022-01-27 19:23:15 +00:00
# Raspberry Pi
2021-06-16 21:57:58 +00:00
Both components can run next to each other on a raspberry pi or
2020-07-27 20:13:22 +00:00
similiar device.
2020-11-14 20:48:52 +00:00
There is a [project on
github](https://github.com/docspell/rpi-scripts) that can help with
setting up a raspberry pi with docspell.
2021-06-16 21:57:58 +00:00
The docker images are built for arm64 and armv7, these can also be
used on the pi.
2020-07-27 20:13:22 +00:00
## REST Server
The REST server component runs very well on the Raspberry Pi and
similiar devices. It doesn't require much resources, because the heavy
work is done by the joex components.
## Joex
Running the joex component on the Raspberry Pi is possible, but will
result in long processing times for OCR and text analysis. The board
should provide 4G of RAM (like the current RPi4), especially if also a
2021-01-20 20:35:54 +00:00
database and solr are running next to it. The memory required by joex
depends on the config and document language. Please pick a value that
2022-03-21 13:41:39 +00:00
suits your setup from
[here](@/docs/configure/file-processing.md#memory-usage). For boards
like the RPi, it might be necessary to use `nlp.mode=basic`, rather
than `nlp.mode=full`. You should also set the joex pool size to 1.
An example: on this [UP
board](https://up-board.org/up/specifications/) with an Intel Atom
2021-01-20 20:35:54 +00:00
x5-Z8350 CPU (@1.44Ghz) and 4G RAM, a scanned (300dpi, in German) pdf
file with 6 pages took *3:20 min* to process. This board also runs the
SOLR and a postgresql database.
The same file was processed in 55s on a qemu virtual machine on my i7
notebook, using 1 CPU and 4G RAM (and identical config for joex). The
virtual machine only had to host docspell (joex and restserver, but
the restserver is very lightweight).
The learning task for text classification can also use high amount of
memory, but this depends on the amount of data you have in docspell.
If you encounter problems here, you can set the maximum amount of
items to consider in the collective settings page.