branch: externals/scanner
commit 1439335484257935bb29cef31f1fa3daa9376440
Author: Raffael Stocker <[email protected]>
Commit: Raffael Stocker <[email protected]>
update documentation
---
Readme.org | 19 ++++++++++++++-----
scanner.el | 18 +++++++++++++-----
2 files changed, 27 insertions(+), 10 deletions(-)
diff --git a/Readme.org b/Readme.org
index be5416d..2b750e1 100644
--- a/Readme.org
+++ b/Readme.org
@@ -10,7 +10,8 @@
provided by =scanimage(1)= for image mode and =tesseract(1)= for document
mode. The scanner package uses =tesseract(1)= to provide optical character
recognition (OCR). You can select the language plugins with
- =scanner-tesseract-languages=.
+ =scanner-tesseract-languages=. See also the remark about the data
+ directories below.
In document mode, you can scan one or multiple pages that are then written
in a customizable output format, e.g. (searchable) PDF or text, or whatever
@@ -32,10 +33,18 @@
For both images and documents, you can customize the scan mode (e.g. "Color"
or "Gray") if your scanning device supports it.
- Finally, you can pass additional options to the backends using the
- customization variables =scanner-scanimage-switches= and
- =scanner-tesseract-switches=. The former variable is helpful for tuning
- brightness and contrast, for instance.
+ You can pass additional options to the backends using the customization
+ variables =scanner-scanimage-switches= and =scanner-tesseract-switches=.
+ The former variable is helpful for tuning brightness and contrast, for
+ instance.
+
+ Finally, the customization options =scanner-tessdata-dir= and
+ =scanner-tessdata-configdir= must be set to point to tesseract's data
+ directory containing the language definitions (usually something like
+ =/usr/share/tessdata/=) and tesseract's configs directory containing the
+ output configurations (usually something like
+ =/usr/share/tessdata/configs/=).
+
* Bugs
diff --git a/scanner.el b/scanner.el
index b1f6f0f..65c9557 100644
--- a/scanner.el
+++ b/scanner.el
@@ -35,7 +35,8 @@
;; provided by scanimage(1) for image mode and tesseract(1) for document mode.
;; The scanner package uses tesseract(1) to provide optical character
;; recognition (OCR). You can select the language plugins with
-;; ‘scanner-tesseract-languages’.
+;; ‘scanner-tesseract-languages’. See also the remark about the data
+;; directories below.
;;
;; In document mode, you can scan one or multiple pages that are then written
;; in a customizable output format, e.g. (searchable) PDF or text, or whatever
@@ -57,10 +58,17 @@
;; For both images and documents, you can customize the scan mode
;; (e.g. "Color" or "Gray") if your scanning device supports it.
;;
-;; Finally, you can pass additional options to the backends using the
-;; customization variables ‘scanner-scanimage-switches’ and
-;; ‘scanner-tesseract-switches’. The former variable is helpful for tuning
-;; brightness and contrast, for instance.
+;; You can pass additional options to the backends using the customization
+;; variables ‘scanner-scanimage-switches’ and ‘scanner-tesseract-switches’.
+;; The former variable is helpful for tuning brightness and contrast, for
+;; instance.
+;;
+;; Finally, the customization options ‘scanner-tessdata-dir’ and
+;; ‘scanner-tessdata-configdir’ must be set to point to tesseract's data
+;; directory containing the language definitions (usually something like
+;; /usr/share/tessdata/) and tesseract's configs directory containing the
+;; output configurations (usually something like
+;; /usr/share/tessdata/configs/).
;;; Code: