---
layout: docs
title: Documentation
permalink: doc
---

# {{page.title}}

This is the documentation for Docspell @VERSION@.

Docspell helps organize large numbers of PDF files, which are
typically scanned paper documents. You can attach tags, a
correspondent, what the document is concerned with, a name, a date
and more. If your documents carry this meta data, you should be able
to find them quickly later using the search feature. But adding it
manually to each document is a tedious task. What if most of it could
be attached automatically?

## How it works

Documents have two main properties: a correspondent (sender or
receiver that is not you) and something the document is about. Usually
it is about a person or a thing: maybe your car, or contracts
concerning a family member.

1. You maintain a kind of address book. It should list all possible
   correspondents and the concerning people/things. This grows
   incrementally with each new unknown document.
2. When docspell analyzes a document, it tries to find matches within
   your address book. It can detect the correspondent and a concerning
   person or thing. It then associates this data with the document.
3. You can inspect what docspell has done and correct it. If docspell
   has found multiple suggestions, they will be shown for you to
   select one. If it is not correctly associated, very often the
   correct one is just one click away.
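
The matching step can be pictured with a small sketch. This is not
docspell's actual analysis, only a naive illustration that looks for
address book names as substrings of the extracted text. The names
`Entry`, `suggest` and the sample entries are made up for this example.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    """One address book entry (hypothetical structure)."""
    name: str
    kind: str  # "correspondent" or "concerning"


def suggest(text: str, address_book: list[Entry]) -> dict[str, list[str]]:
    """Return address book names that occur in the text, grouped by kind.

    Uses a case-insensitive substring search, which is an assumption made
    for illustration and far simpler than a real text analysis.
    """
    haystack = text.lower()
    found: dict[str, list[str]] = {"correspondent": [], "concerning": []}
    for entry in address_book:
        if entry.name.lower() in haystack:
            found.setdefault(entry.kind, []).append(entry.name)
    return found


address_book = [
    Entry("Acme Insurance", "correspondent"),
    Entry("City Utilities", "correspondent"),
    Entry("Car", "concerning"),
]

suggestions = suggest("Acme Insurance: your car policy renewal", address_book)
```

When a kind has more than one match, all of them are returned, which
corresponds to the list of suggestions you can choose from in step 3.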

The set of meta data that docspell draws suggestions from must be
maintained manually. But usually this data doesn't grow as fast as
the documents. After a while the address book is fairly complete and
only needs to be revisited occasionally.

Besides extracting the text from documents for analysis, docspell also
converts all files into PDF files. This unifies the different formats
your documents may originally be in and makes them more accessible
to other systems and in the future.

## Terms

In order to better understand these pages, some terms should be
explained first.

### Item

An **Item** is roughly your (PDF) document, except that an item may
span multiple files, which are called **attachments**. An item also
has **meta data** associated:

- a **correspondent**: the other side of the communication. It can be
  an organization or a person.
- a **concerning person** or **equipment**: a person or thing that
  this item is about. Maybe it is an insurance contract about your
  car.
- **tag**: an item can be tagged with custom tags. A tag can have a
  *category*. This is intended for grouping tags, for example a
  category `doctype` could be used to group tags like `bill`,
  `contract`, `receipt` etc. Usually an item is not tagged with more
  than one tag of a category.
- an **item date**: this is the date of the document. If it is not
  set, the created date of the item is used.
- a **due date**: an optional date indicating that something has to be
  done about this item (e.g. paying a bill, submitting it) by this
  date
- a **direction**: one of "incoming" or "outgoing"
- a **name**: some item name, defaults to the file name of the
  attachments
- some **notes**: arbitrary descriptive text. You can use markdown
  here, which is appropriately formatted in the web application.
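
To make the list above more concrete, here is a rough sketch of an item
as a data structure. It is not docspell's real data model; all class
and field names are invented for illustration. It also assumes that the
default name is taken from the first attachment, since the list does not
say how several attachments are handled.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class Tag:
    name: str
    category: Optional[str] = None  # e.g. "doctype"


@dataclass
class Item:
    attachments: list[str]               # file names of the attachments
    created: date                        # when the item was created
    name: Optional[str] = None
    item_date: Optional[date] = None
    due_date: Optional[date] = None
    direction: Optional[str] = None      # "incoming" or "outgoing"
    correspondent: Optional[str] = None
    concerning: Optional[str] = None     # a person or equipment
    tags: list[Tag] = field(default_factory=list)
    notes: str = ""                      # markdown text

    def effective_date(self) -> date:
        """The item date, falling back to the created date."""
        return self.item_date or self.created

    def display_name(self) -> str:
        """The item name, falling back to the first attachment's file name."""
        return self.name or self.attachments[0]


item = Item(
    attachments=["invoice-2020-01.pdf"],
    created=date(2020, 1, 15),
    tags=[Tag("bill", category="doctype")],
)
```

Here `item.effective_date()` falls back to the created date and
`item.display_name()` to the file name, because neither an item date nor
a name was given.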

### Collective

The users of the application are part of a **collective**. A
**collective** is a group of users that share access to the same
items. The account name is therefore composed of a *collective name*
and a *user name*.

All users of a collective are equal; they have the same permissions
to access all items. The items don't belong to a user, but to the
collective.

That means that to identify yourself when signing in, you have to give
the collective name and your user name. By default the two are
separated by a slash `/`, for example `smith/john`. If your user name
is the same as the collective name, you can omit one of them; so
`smith/smith` can be abbreviated to just `smith`.
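
The account name rule can be written down in a few lines. The function
`parse_account` below is a hypothetical helper for illustration, not
part of docspell; it only assumes the default `/` separator described
above.

```python
from __future__ import annotations


def parse_account(account: str, sep: str = "/") -> tuple[str, str]:
    """Split an account name into (collective name, user name).

    If the separator is missing, the single name is used for both parts.
    """
    collective, found, user = account.partition(sep)
    if not found:
        return account, account
    return collective, user


full = parse_account("smith/john")   # ("smith", "john")
short = parse_account("smith")       # ("smith", "smith")
```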