branch: externals/doc-toc
commit b869d9c88825b8d0fba8682b3406e856ad9d9d0e
Author: Daniel Nicolai <dalanico...@gmail.com>
Commit: Daniel Nicolai <dalanico...@gmail.com>

    Update README with tesseract ocr info
---
 README.org | 7 ++++---
 1 file changed, 4 insertions(+), 3 deletions(-)

diff --git a/README.org b/README.org
index 598540dabf..75096d6ec9 100644
--- a/README.org
+++ b/README.org
@@ -33,9 +33,10 @@ Extraction and adding contents to a document is done in 4 steps:
 ** 1. Extraction
 Open some pdf or djvu file in Emacs (pdf-tools and djvu package recommended).
-Find the pagenumbers for the TOC. Then type =M-x toc-extract-pages= and answer the
-subsequent prompts by entering the pagenumbers for the first and the last page
-each followed by =RET=.
+Find the pagenumbers for the TOC. Then type =M-x toc-extract-pages,= or =M-x
+toc-extract-pages-ocr= if doc has no text layer or text layer is bad, and answer
+the subsequent prompts by entering the pagenumbers for the first and the last
+page each followed by =RET=.
 A buffer with the, somewhat cleaned up, extracted text will open in TOC-cleanup
 mode. Prefix command with the universal argument (=C-u=) to omit clean and get the