[
http://jira.codehaus.org/browse/DOXIA-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=133147#action_133147
]
Lukas Theussl commented on DOXIA-236:
-------------------------------------
Thanks for the graphic illustration :)
However, I most definitely disagree with your conclusion. Curiously, I had to
defend my point several times already, so let me just direct you to some
issues: DOXIA-152, DOXIA-138 (lower part of the discussion). In short: a parser
doesn't know yet where it's output will go, some feature that might be
acceptable for one Sink may lead to errors in others. Only a Sink knows what
output is legal for its format, a Parser should therefore never insert anything
that was not explicitly there in the original input format. Otherwise you would
not be able to produce eg a pdf and a html from the same set of source
documents.
bq. restricting the parsers is equivalent to restricting the input format
I consider it a fundamental design flaw if an input format defines implicit
anchors for section titles. We have modified the original apt format (as
documentet in the doxia-apt.apt document on the doxia site) for these reasons.
> Clarify Sink API
> ----------------
>
> Key: DOXIA-236
> URL: http://jira.codehaus.org/browse/DOXIA-236
> Project: Maven Doxia
> Issue Type: Task
> Components: Sink API
> Affects Versions: 1.0-alpha-2
> Reporter: Benjamin Bentmann
>
> If the idea with extensibility and interchangeable input/output formats
> should be more than a nice dream, the Sink API needs a thorough specification
> (e.g. by means of more javadoc at {{Sink}}) because that's were everything
> meets. It should define
> # what rules parsers must obey when generating events and
> # what events a sink needs to be prepared to handle
> Currently, all of this is left to assumptions. Some example issues that need
> to be clarified:
> - What characters may constitute an anchor reported by {{anchor()}}?
> Arbitrary, ASCII-only, ...?
> - What format applies to the {{name}} parameter of {{link()}}? How are
> internal and external links to be distinguished (DOXIA-208)?
> - What character chunks are reported by {{text()}}? Longest consecutive
> sequence, line-by-line, arbitrary, ... (DOXIA-222)?
> - What exactly is a figure's source as reported by {{figureGraphics()}}?
> Relative/absolute path, relative to which directory? What about file
> extensions (DOXIA-99)?
> - What order of events is "reasonable" (DOXIA-132)? May parsers report table
> body and caption in a specific or arbitrary order? Must the document head
> always be reported before body or may it be postponed?
> - Is closing a sink twice acceptable or an error?
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://jira.codehaus.org/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira