dependabot[bot] opened a new pull request, #110: URL: https://github.com/apache/maven-dist-tool/pull/110
Bumps [org.jsoup:jsoup](https://github.com/jhy/jsoup) from 1.19.1 to 1.20.1. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/jhy/jsoup/releases">org.jsoup:jsoup's releases</a>.</em></p> <blockquote> <h2>jsoup 1.20.1</h2> <h2>Changes</h2> <ul> <li>To better follow the HTML5 spec and current browsers, the HTML parser no longer allows self-closing tags (<code><foo /></code>) to close HTML elements by default. Foreign content (SVG, MathML), and content parsed with the XML parser, still supports self-closing tags. If you need specific HTML tags to support self-closing, you can register a custom tag via the <code>TagSet</code> configured in <code>Parser.tagSet()</code>, using <code>Tag#set(Tag.SelfClose)</code>. Standard void tags (such as <code><img></code>, <code><br></code>, etc.) continue to behave as usual and are not affected by this change. <a href="https://redirect.github.com/jhy/jsoup/issues/2300">#2300</a>.</li> <li>The following internal components have been <strong>deprecated</strong>. If you do happen to be using any of these, please take the opportunity now to migrate away from them, as they will be removed in jsoup 1.21.1. <ul> <li><code>ChangeNotifyingArrayList</code>, <code>Document.updateMetaCharsetElement()</code>, <code>Document.updateMetaCharsetElement(boolean)</code>, <code>HtmlTreeBuilder.isContentForTagData(String)</code>, <code>Parser.isContentForTagData(String)</code>, <code>Parser.setTreeBuilder(TreeBuilder)</code>, <code>Tag.formatAsBlock()</code>, <code>Tag.isFormListed()</code>, <code>TokenQueue.addFirst(String)</code>, <code>TokenQueue.chompTo(String)</code>, <code>TokenQueue.chompToIgnoreCase(String)</code>, <code>TokenQueue.consumeToIgnoreCase(String)</code>, <code>TokenQueue.consumeWord()</code>, <code>TokenQueue.matchesAny(String...)</code></li> </ul> </li> </ul> <h2>Functional Improvements</h2> <ul> <li>Rebuilt the HTML pretty-printer, to simplify and consolidate the implementation, improve consistency, support custom Tags, and provide a cleaner path for ongoing improvements. The specific HTML produced by the pretty-printer may be different from previous versions. <a href="https://redirect.github.com/jhy/jsoup/issues/2286">#2286</a>.</li> <li>Added the ability to define custom tags, and to modify properties of known tags, via the <code>TagSet</code> tag collection. Their properties can impact both the parse and how content is serialized (output as HTML or XML). <a href="https://redirect.github.com/jhy/jsoup/issues/2285">#2285</a>.</li> <li><code>Element.cssSelector()</code> will prefer to return shorter selectors by using ancestor IDs when available and unique. E.g. <code>#id > div > p</code> instead of <code>html > body > div > div > p</code> <a href="https://redirect.github.com/jhy/jsoup/pull/2283">#2283</a>.</li> <li>Added <code>Elements.deselect(int index)</code>, <code>Elements.deselect(Object o)</code>, and <code>Elements.deselectAll()</code> methods to remove elements from the <code>Elements</code> list without removing them from the underlying DOM. Also added <code>Elements.asList()</code> method to get a modifiable list of elements without affecting the DOM. (Individual Elements remain linked to the DOM.) <a href="https://redirect.github.com/jhy/jsoup/issues/2100">#2100</a>.</li> <li>Added support for sending a request body from an InputStream with <code>Connection.requestBodyStream(InputStream stream)</code>. <a href="https://redirect.github.com/jhy/jsoup/issues/1122">#1122</a>.</li> <li>The XML parser now supports scoped xmlns: prefix namespace declarations, and applies the correct namespace to Tags and Attributes. Also, added <code>Tag#prefix()</code>, <code>Tag#localName()</code>, <code>Attribute#prefix()</code>, <code>Attribute#localName()</code>, and <code>Attribute#namespace()</code> to retrieve these. <a href="https://redirect.github.com/jhy/jsoup/issues/2299">#2299</a>.</li> <li>CSS identifiers are now escaped and unescaped correctly to the CSS spec. <code>Element#cssSelector()</code> will emit appropriately escaped selectors, and the QueryParser supports those. Added <code>Selector.escapeCssIdentifier()</code> and ` Selector.unescapeCssIdentifier(). <a href="https://redirect.github.com/jhy/jsoup/pull/2297">#2297</a>, <a href="https://redirect.github.com/jhy/jsoup/pull/2305">#2305</a></li> </ul> <h2>Structure and Performance Improvements</h2> <ul> <li>Refactored the CSS <code>QueryParser</code> into a clearer recursive descent parser. <a href="https://redirect.github.com/jhy/jsoup/pull/2310">#2310</a>.</li> <li>CSS selectors with consecutive combinators (e.g. <code>div >> p</code>) will throw an explicit parse exception. <a href="https://redirect.github.com/jhy/jsoup/pull/2311">#2311</a>.</li> <li>Performance: reduced the shallow size of an Element from 40 to 32 bytes, and the NodeList from 32 to 24. <a href="https://redirect.github.com/jhy/jsoup/pull/2307">#2307</a>.</li> <li>Performance: reduced GC load of new StringBuilders when tokenizing input HTML. <a href="https://redirect.github.com/jhy/jsoup/pull/2304">#2304</a>.</li> <li>Made <code>Parser</code> instances threadsafe, so that inadvertent use of the same instance across threads will not lead to errors. For actual concurrency, use <code>Parser#newInstance()</code> per thread. <a href="https://redirect.github.com/jhy/jsoup/pull/2314">#2314</a>.</li> </ul> <h2>Bug Fixes</h2> <ul> <li>Element names containing characters invalid in XML are now normalized to valid XML names when serializing. <a href="https://redirect.github.com/jhy/jsoup/issues/1496">#1496</a>.</li> <li>When serializing to XML, characters that are invalid in XML 1.0 should be removed (not encoded). <a href="https://redirect.github.com/jhy/jsoup/issues/1743">#1743</a>.</li> <li>When converting a <code>Document</code> to the W3C DOM in <code>W3CDom</code>, elements with an attribute in an undeclared namespace now get a declaration of <code>xmlns:prefix="undefined"</code>. This allows subsequent serialization to XML via <code>W3CDom.asString()</code> to succeed. <a href="https://redirect.github.com/jhy/jsoup/issues/2087">#2087</a>.</li> <li>The <code>StreamParser</code> could emit the final elements of a document twice, due to how <code>onNodeCompleted</code> was fired when closing out the stack. <a href="https://redirect.github.com/jhy/jsoup/issues/2295">#2295</a>.</li> <li>When parsing with the XML parser and error tracking enabled, the trailing <code>?</code> in <code><?xml version="1.0"?></code> would incorrectly emit an error. <a href="https://redirect.github.com/jhy/jsoup/issues/2298">#2298</a>.</li> <li>Calling <code>Element#cssSelector()</code> on an element with combining characters in the class or ID now produces the correct output. <a href="https://redirect.github.com/jhy/jsoup/issues/1984">#1984</a>.</li> </ul> </blockquote> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/jhy/jsoup/blob/master/CHANGES.md">org.jsoup:jsoup's changelog</a>.</em></p> <blockquote> <h2>1.20.1 (2025-04-29)</h2> <h3>Changes</h3> <ul> <li>To better follow the HTML5 spec and current browsers, the HTML parser no longer allows self-closing tags (<code><foo /></code>) to close HTML elements by default. Foreign content (SVG, MathML), and content parsed with the XML parser, still supports self-closing tags. If you need specific HTML tags to support self-closing, you can register a custom tag via the <code>TagSet</code> configured in <code>Parser.tagSet()</code>, using <code>Tag#set(Tag.SelfClose)</code>. Standard void tags (such as <code><img></code>, <code><br></code>, etc.) continue to behave as usual and are not affected by this change. <a href="https://redirect.github.com/jhy/jsoup/issues/2300">#2300</a>.</li> <li>The following internal components have been <strong>deprecated</strong>. If you do happen to be using any of these, please take the opportunity now to migrate away from them, as they will be removed in jsoup 1.21.1. <ul> <li><code>ChangeNotifyingArrayList</code>, <code>Document.updateMetaCharsetElement()</code>, <code>Document.updateMetaCharsetElement(boolean)</code>, <code>HtmlTreeBuilder.isContentForTagData(String)</code>, <code>Parser.isContentForTagData(String)</code>, <code>Parser.setTreeBuilder(TreeBuilder)</code>, <code>Tag.formatAsBlock()</code>, <code>Tag.isFormListed()</code>, <code>TokenQueue.addFirst(String)</code>, <code>TokenQueue.chompTo(String)</code>, <code>TokenQueue.chompToIgnoreCase(String)</code>, <code>TokenQueue.consumeToIgnoreCase(String)</code>, <code>TokenQueue.consumeWord()</code>, <code>TokenQueue.matchesAny(String...)</code></li> </ul> </li> </ul> <h3>Functional Improvements</h3> <ul> <li>Rebuilt the HTML pretty-printer, to simplify and consolidate the implementation, improve consistency, support custom Tags, and provide a cleaner path for ongoing improvements. The specific HTML produced by the pretty-printer may be different from previous versions. <a href="https://redirect.github.com/jhy/jsoup/issues/2286">#2286</a>.</li> <li>Added the ability to define custom tags, and to modify properties of known tags, via the <code>TagSet</code> tag collection. Their properties can impact both the parse and how content is serialized (output as HTML or XML). <a href="https://redirect.github.com/jhy/jsoup/issues/2285">#2285</a>.</li> <li><code>Element.cssSelector()</code> will prefer to return shorter selectors by using ancestor IDs when available and unique. E.g. <code>#id > div > p</code> instead of <code>html > body > div > div > p</code> <a href="https://redirect.github.com/jhy/jsoup/pull/2283">#2283</a>.</li> <li>Added <code>Elements.deselect(int index)</code>, <code>Elements.deselect(Object o)</code>, and <code>Elements.deselectAll()</code> methods to remove elements from the <code>Elements</code> list without removing them from the underlying DOM. Also added <code>Elements.asList()</code> method to get a modifiable list of elements without affecting the DOM. (Individual Elements remain linked to the DOM.) <a href="https://redirect.github.com/jhy/jsoup/issues/2100">#2100</a>.</li> <li>Added support for sending a request body from an InputStream with <code>Connection.requestBodyStream(InputStream stream)</code>. <a href="https://redirect.github.com/jhy/jsoup/issues/1122">#1122</a>.</li> <li>The XML parser now supports scoped xmlns: prefix namespace declarations, and applies the correct namespace to Tags and Attributes. Also, added <code>Tag#prefix()</code>, <code>Tag#localName()</code>, <code>Attribute#prefix()</code>, <code>Attribute#localName()</code>, and <code>Attribute#namespace()</code> to retrieve these. <a href="https://redirect.github.com/jhy/jsoup/issues/2299">#2299</a>.</li> <li>CSS identifiers are now escaped and unescaped correctly to the CSS spec. <code>Element#cssSelector()</code> will emit appropriately escaped selectors, and the QueryParser supports those. Added <code>Selector.escapeCssIdentifier()</code> and ` Selector.unescapeCssIdentifier(). <a href="https://redirect.github.com/jhy/jsoup/pull/2297">#2297</a>, <a href="https://redirect.github.com/jhy/jsoup/pull/2305">#2305</a></li> </ul> <h3>Structure and Performance Improvements</h3> <ul> <li>Refactored the CSS <code>QueryParser</code> into a clearer recursive descent parser. <a href="https://redirect.github.com/jhy/jsoup/pull/2310">#2310</a>.</li> <li>CSS selectors with consecutive combinators (e.g. <code>div >> p</code>) will throw an explicit parse exception. <a href="https://redirect.github.com/jhy/jsoup/pull/2311">#2311</a>.</li> <li>Performance: reduced the shallow size of an Element from 40 to 32 bytes, and the NodeList from 32 to 24. <a href="https://redirect.github.com/jhy/jsoup/pull/2307">#2307</a>.</li> <li>Performance: reduced GC load of new StringBuilders when tokenizing input HTML. <a href="https://redirect.github.com/jhy/jsoup/pull/2304">#2304</a>.</li> <li>Made <code>Parser</code> instances threadsafe, so that inadvertent use of the same instance across threads will not lead to errors. For actual concurrency, use <code>Parser#newInstance()</code> per thread. <a href="https://redirect.github.com/jhy/jsoup/pull/2314">#2314</a>.</li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="https://github.com/jhy/jsoup/commit/8f0dc293c9c34ef9b7715493a423df88a06bed1d"><code>8f0dc29</code></a> [maven-release-plugin] prepare release jsoup-1.20.1</li> <li><a href="https://github.com/jhy/jsoup/commit/6b96c742ef84bd0347d3338aff13ed6da8278b1f"><code>6b96c74</code></a> Don't log InterruptedException</li> <li><a href="https://github.com/jhy/jsoup/commit/dec21ef066af58e694b48ff483079f346ba5c217"><code>dec21ef</code></a> Bump com.google.code.gson:gson from 2.13.0 to 2.13.1 (<a href="https://redirect.github.com/jhy/jsoup/issues/2316">#2316</a>)</li> <li><a href="https://github.com/jhy/jsoup/commit/fccfaaed2ac4ebaddfb4371ed2f0e6a2ed469273"><code>fccfaae</code></a> Updating changelog</li> <li><a href="https://github.com/jhy/jsoup/commit/603261dd12a6703a9e81c59bf73a894edb1f93f3"><code>603261d</code></a> Perf optimize these deep IT tests</li> <li><a href="https://github.com/jhy/jsoup/commit/f0d878210fb83c58c3a1a99a138dec86ec542a08"><code>f0d8782</code></a> Use a ReentrantLock in HttpConnection</li> <li><a href="https://github.com/jhy/jsoup/commit/01c823e35033c86a3247c451891d13df1ed2ee11"><code>01c823e</code></a> Make Parser instances threadsafe (<a href="https://redirect.github.com/jhy/jsoup/issues/2314">#2314</a>)</li> <li><a href="https://github.com/jhy/jsoup/commit/dcb4f799123febff991e64ff24b7d20538e34ff1"><code>dcb4f79</code></a> Only match ASCII letters and digits (<a href="https://redirect.github.com/jhy/jsoup/issues/2312">#2312</a>)</li> <li><a href="https://github.com/jhy/jsoup/commit/c4dd25ee7257748cbf1f9e54ee0885d462d18694"><code>c4dd25e</code></a> Disallow consecutive combinators in CSS selectors (<a href="https://redirect.github.com/jhy/jsoup/issues/2311">#2311</a>)</li> <li><a href="https://github.com/jhy/jsoup/commit/f70a7bc0e596651f81d4e47726d104d9f6db3acd"><code>f70a7bc</code></a> Revamp QueryParser (<a href="https://redirect.github.com/jhy/jsoup/issues/2310">#2310</a>)</li> <li>Additional commits viewable in <a href="https://github.com/jhy/jsoup/compare/jsoup-1.19.1...jsoup-1.20.1">compare view</a></li> </ul> </details> <br /> [](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. [//]: # (dependabot-automerge-start) [//]: # (dependabot-automerge-end) --- <details> <summary>Dependabot commands and options</summary> <br /> You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show <dependency name> ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) </details> -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@maven.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org