Hi there. I believe that I have some sollutions to your problems. First of all, you need to see whether or not your documentts are in some kind of structured format. if they are, say DocBookXML, or something similar, you may be able to find a quick solution to the searching problem. if the documents are structured, you can probably parce them by entety type. of course, this depends on how well they are marked up. Like I've stated earlier, they key item here is to generate rapidly searchable indexes that can be quaried against. I'm assuming that since you deal with highly technical data, it is more or less in a structured form. You could even generate SQL statements and possibly use SQLLite if you don't want a full DB as overhead. Anyways, I'm more than willing to help in any way with this project of yours. Let me know what you think. --Erik
On 5/29/10, Ron Johnson <ron.l.john...@cox.net> wrote: > On 05/29/2010 02:34 PM, Merciadri Luca wrote: >> Ron Johnson wrote: > [snip] >>> >>> Have you tried other PDF readers? Searched for Linux-based PDF indexers? >> As I said in another topic, I am totally okay for free stuff (if it was >> not the case, I would not be using Debian: thinking unfree but using >> free is cowardice), but the fact is that I have not found a reader whose >> range of compatibility with the PDF standard is as high as in acroread. >> Acroread is slow, boring, sometimes buggy, but I need to use it as long >> as I do not find a PDF reader which has such a big compatibility range. > > Nothing says that you must only use one reader at a time. ;) > > If poppler, for example, doesn't render *exactly* but searches > /rapidly/, then you could search using poppler and "read" using > Acroread. > > Alternatively, install poppler-utils for it's pdftohtml. Certainly > it won't be perfect, but a browser might be faster than Acroread. > > -- > Dissent is patriotic, remember? > > > -- > To UNSUBSCRIBE, email to debian-user-requ...@lists.debian.org > with a subject of "unsubscribe". Trouble? Contact > listmas...@lists.debian.org > Archive: http://lists.debian.org/4c01afdd.7050...@cox.net > > -- To UNSUBSCRIBE, email to debian-user-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org Archive: http://lists.debian.org/aanlktil7rpw7qqyqmpmbbkzvwea2a3ns8fqczfa43...@mail.gmail.com