branch: externals/doc-toc
commit 5314cd0b2a75325a996e70343db66a419afb898e
Author: Daniel Nicolai <dalanico...@gmail.com>
Commit: Daniel Nicolai <dalanico...@gmail.com>

    Update README
---
 README.org | 6 ++++--
 1 file changed, 4 insertions(+), 2 deletions(-)

diff --git a/README.org b/README.org
index 9b67521bed..f4c9a0c20e 100644
--- a/README.org
+++ b/README.org
@@ -34,10 +34,12 @@ Extraction and adding contents to a document is done in 4 steps:
 
 ** 1. Extraction
 Open some pdf or djvu file in Emacs (pdf-tools and djvu package recommended).
-Find the pagenumbers for the TOC. Then type =M-x toc-extract-pages,= or =M-x
+Find the pagenumbers for the TOC. Then type =M-x toc-extract-pages=, or =M-x
 toc-extract-pages-ocr= if doc has no text layer or text layer is bad, and answer
 the subsequent prompts by entering the pagenumbers for the first and the last
-page each followed by =RET=.
+page each followed by =RET=. *For PDF extraction with OCR, currently it is required*
+*to view all contents pages once before extraction* (toc-mode uses the cached file
+data).
 
 A buffer with the, somewhat cleaned up, extracted text will open in TOC-cleanup
 mode. Prefix command with the universal argument (=C-u=) to omit clean and get the