Ok, sent you a read-only access invitation for now.  Thanks for your offer to 
help.  Here is my bigger issues list to get a flavor – a lot of fun things to 
do.  Let me know what you want to do with pdftohtml!


 1.  Translate drawing operations into canvas with SVG
 2.  Find better way to calculate vertical positioning, by looking at browser 
source code
 3.  z-index handling -- currently text is never masked by graphics
 4.  Algorithmic extraction of TOC
 5.  Algorithmic extraction of page numbering (Alec may be working on this)
 6.  Algorithmic identification of chapters
 7.  Right-to-left text, proper display (e.g. Arabic, Hebrew)
 8.  Algorithmic detection of text flow (Stephen may be working on this)
 9.  Detection / removal of duplicate images
 10. Jpg vs. png selection; automatically choose the best format for each image

--josh

From: Clément Wehrung <[email protected]<mailto:[email protected]>>
Date: Mon, 24 Oct 2011 15:27:23 -0700
To: Josh Richardson <[email protected]<mailto:[email protected]>>
Cc: "[email protected]<mailto:[email protected]>" 
<[email protected]<mailto:[email protected]>>, Alec 
Taylor <[email protected]<mailto:[email protected]>>
Subject: Re: [poppler] pdftohtml does not preserve fonts

Sure ! Do you have a link for the repo so that I can already have a look (I 
didn't figure out which one it is right now) ? I'm really interested in helping 
you, if you need something on any specific topic don't hesitate. Many thanks 
again,

Clément


On Mon, Oct 24, 2011 at 8:01 PM, Josh Richardson 
<[email protected]<mailto:[email protected]>> wrote:
Can you give me a couple of days?  I want to try to get a repo hosted on,
e.g. bitbucket, which is connected to my repo, so that it's easier to keep
everything in synch.  Alec Taylor set up a repo there already, which you
can use to get an immediate snapshot if needed.

Best, --josh

On 10/24/11 10:45 AM, "iclems" 
<[email protected]<mailto:[email protected]>> wrote:

>
>Dear Josh,
>
>Being working on a pdftohtml project which requires font preservation, I'd
>be really interested in getting this too. Do you think it's possible ?
>
>Thanks,
>
>Clement
>[email protected]<mailto:[email protected]>
>
>
>Josh Richardson wrote:
>>
>> Preserving fonts is not integrated into the master repository yet.  If
>>you
>> like, I can send you a patched version of Poppler which will do it.
>> You'll still have to run your own process (like Fontforge) to convert
>>the
>> fonts into a web-usable format, but it's straightforward as long as the
>> fonts have mapping to unicode, and doable even without.
>>
>> --josh
>>
>> From: M Naveed Akram 
>> <[email protected]<mailto:[email protected]><mailto:[email protected]<mailto:[email protected]>>>
>> Date: Fri, 30 Sep 2011 06:52:14 -0700
>> To:
>>"[email protected]<mailto:[email protected]><mailto:[email protected]<mailto:[email protected]>>"
>> <[email protected]<mailto:[email protected]><mailto:[email protected]<mailto:[email protected]>>>
>> Subject: [poppler] pdftohtml does not preserve fonts
>>
>> Hi,
>>
>> I have been using 0.16 release of poppler-utils, but I am facing a
>> problem. When converting pdf to html using pdftohtml it does not
>>preserve
>> fonts in the output html. How can I solve this issue. Please help
>>
>>
>> _______________________________________________
>> poppler mailing list
>> [email protected]<mailto:[email protected]>
>> http://lists.freedesktop.org/mailman/listinfo/poppler
>>
>>
>
>--
>View this message in context:
>http://old.nabble.com/pdftohtml-does-not-preserve-fonts-tp32569116p3271208
>4.html
>Sent from the Free Desktop - poppler mailing list archive at Nabble.com.
>
>_______________________________________________
>poppler mailing list
>[email protected]<mailto:[email protected]>
>http://lists.freedesktop.org/mailman/listinfo/poppler
>


_______________________________________________
poppler mailing list
[email protected]
http://lists.freedesktop.org/mailman/listinfo/poppler

Reply via email to