On 2/1/22 22:54, George Rawlinson via arch-dev-public wrote:
On 22-02-01 08:21, Morten Linderud via arch-dev-public wrote:
At this stage, the following [community] packages that I maintain
require massaging of HTML sources:

* html-xml-utils
* oil
* parallel
* libmilter (bundled with sendmail source)
* time

I suppose if a nvchecker plugin existed that utilised bs4 (beautiful
soup), that would work. But I assume that would still fit your
definition of "arbitrary script". :p

There is a regex plugin and a htmlparser plugin for this.

The htmlparser plugin accepts XPath, but if you want to process it further the regex plugin may just work better.

Examples for your packages:

[html-xml-utils]
source = "regex"
url = "https://www.w3.org/Tools/HTML-XML-utils/";
regex = "html-xml-utils-(.*?).tar.gz"

[oil]
source = "htmlparser"
url = "https://www.oilshell.org/release/latest/";
xpath = "//h1/text()"
prefix = "Oil "

--
Regards,
Felix Yan

Attachment: OpenPGP_signature
Description: OpenPGP digital signature

Reply via email to