Re: convert html to xml

2025-09-03 Thread Roy J. Tellason, Sr.
On Sunday 31 August 2025 11:09:12 pm Max Nikulin wrote: > I have no idea if search engines parse links in PDFs. They apparently do. A while back I was using some of the tools offered by google on my website, and got errors reported that were caused by links that were inside of pdf files, unti

Re: convert html to xml

2025-09-02 Thread Russell L. Harris
On Sat, Aug 30, 2025 at 03:22:58PM +0700, Max Nikulin wrote: For me it is not uncommon to get PDF files in search results. That is why I suspect that something is wrong with your PDF's. Are they generated to be sent to printer or to be published on a web site? Does "pdftotext FILE.PDF -" is abl

Re: convert html to xml

2025-09-02 Thread Max Nikulin
On 01/09/2025 11:03, Russell L. Harris wrote: I do not see in dvipdfm a way to add metadata. It seems, you have found a recipe how to add metadata. In addition, the hyperref manual may be used as a reference. I am unaware if XMP PDF/A metadata may give additional benefits related to rank a

Re: convert html to xml

2025-08-31 Thread Russell L. Harris
On Mon, Sep 01, 2025 at 10:09:12AM +0700, Max Nikulin wrote: It was a hint that you need to add metadata. Help search engines to create meaningful entries when they are presenting search results. I do not see in dvipdfm a way to add metadata. I suppose there must LaTeX commands (such as \pdfau

Re: convert html to xml

2025-08-31 Thread Max Nikulin
On 31/08/2025 14:51, Russell L. Harris wrote: On Sat, Aug 30, 2025 at 03:22:58PM +0700, Max Nikulin wrote: Does "pdftotext FILE.PDF -" is able to extract readable text? Yes. OK, I was afraid that TeX-specific encoding may be an obstacle for crawlers. Does "pdfinfo FILE.PDF" list author, ti

Re: convert html to xml

2025-08-31 Thread Greg
On 2025-08-30, jeremy ardley wrote: > > On 30/8/25 13:06, Russell L. Harris wrote: >> >> This is a Bible teaching ministry. www.gospelbroadcasting.org > > > Congratulations! you have just made your first step in SEO. You have  a > link from a respected (ahem) mailing list to your website! > > I d

Re: convert html to xml

2025-08-31 Thread Russell L. Harris
at latexml converts LaTeX into XML. I was not aware of latexml until a day or two ago. My plan of attack was to convert LaTeX to HTML, then convert HTML to XML, and finally, import XML into WordPress. My original thought was to create a "quick-and-dirty" WordPress blog site with Word

Re: convert html to xml

2025-08-30 Thread Šarūnas Burdulis
If your source is in TeX, you're in good hands. Conversion to almost anything is possible. Have look at `apt show pandoc`, for example. However I do not think conversion and or using different online format will change the online visibility. As long as the site is reachable, includes sensible

Re: convert html to xml

2025-08-30 Thread Max Nikulin
On 30/08/2025 11:17, Russell L. Harris wrote: I have a growing bunch of studies which I compose in LaTeX markup.  I currently these post in PDF format, on-line and freely-accessible.  I created the web site with the Debian package make4ht. But in their present form, the studies are not readily f

Re: convert html to xml

2025-08-30 Thread jeremy ardley
On 30/8/25 13:06, Russell L. Harris wrote: This is a Bible teaching ministry. www.gospelbroadcasting.org Congratulations! you have just made your first step in SEO. You have  a link from a respected (ahem) mailing list to your website! I did some research and making xml copies of html te

Re: convert html to xml

2025-08-29 Thread Russell L. Harris
On Sat, Aug 30, 2025 at 12:26:32PM +0800, jeremy ardley wrote: I seriously doubt that wordpress will improve your visibility, or your wallet if you pay for SEO This is a Bible teaching ministry. www.gospelbroadcasting.org I charge for nothing. Nothing is for sale. Conversion to XML is

Re: convert html to xml

2025-08-29 Thread jeremy ardley
On 30/8/25 12:17, Russell L. Harris wrote: WordPress is touted as the best platform for search engine optimization (S.E.O.).  I plan to post my studies on a WordPress web site and have someone knowledgeable apply S.E.O. techniques. I seriously doubt that wordpress will improve your visibil

convert html to xml

2025-08-29 Thread Russell L. Harris
version of WordPress. The next promising solution is XML; it appears that WordPress is able to read XML. But I have not yet found a Debian package which is able to convert HTML to XML. RLH