On 30/08/2025 11:17, Russell L. Harris wrote:
I have a growing bunch of studies which I compose in LaTeX markup. I
currently these post in PDF format, on-line and freely-accessible. I
created the web site with the Debian package make4ht.
But in their present form, the studies are not readily findable by
search engines.
For me it is not uncommon to get PDF files in search results. That is
why I suspect that something is wrong with your PDF's. Are they
generated to be sent to printer or to be published on a web site? Does
"pdftotext FILE.PDF -" is able to extract readable text? Does "pdfinfo
FILE.PDF" list author, title, etc.? Are links to these files have
descriptive context?
WordPress is touted as the best platform for search engine
optimization (S.E.O.).
I often get in search results pages with poor metadata. I admit there
are other aspects like markup and CSS suitable for smartphones, but I
suspect other issues with your documents again. Reading Google
recommendations might provide some insights. Tuning a bit TeX4ht output
may be enough.
The problem is finding a way to import the studies into WordPress.
You have asked it earlier. It seems, active subscribers on this list do
not have this specific experience. I expect, there are enough ways to
import content into WordPress. You may ask your question in some LaTeX
community. You may ask in some WordPress community what formats are
suitable for import (perhaps there are not so much participants familiar
with LaTeX there).
The next promising solution is XML;
Are you realizing that XML is a rather generic data format? You need
some specific format *based* on XML.