Added XMLTokenizer language to Gitbook
Project: http://git-wip-us.apache.org/repos/asf/camel/repo Commit: http://git-wip-us.apache.org/repos/asf/camel/commit/a13f062f Tree: http://git-wip-us.apache.org/repos/asf/camel/tree/a13f062f Diff: http://git-wip-us.apache.org/repos/asf/camel/diff/a13f062f Branch: refs/heads/master Commit: a13f062f8ff70de904ee2d89c67be109db213047 Parents: 35ea8de Author: Andrea Cosentino <anco...@gmail.com> Authored: Wed Aug 24 13:04:42 2016 +0200 Committer: Andrea Cosentino <anco...@gmail.com> Committed: Wed Aug 24 13:04:42 2016 +0200 ---------------------------------------------------------------------- .../src/main/docs/xtokenize-language.adoc | 34 ++++++++++++++++++++ 1 file changed, 34 insertions(+) ---------------------------------------------------------------------- http://git-wip-us.apache.org/repos/asf/camel/blob/a13f062f/camel-core/src/main/docs/xtokenize-language.adoc ---------------------------------------------------------------------- diff --git a/camel-core/src/main/docs/xtokenize-language.adoc b/camel-core/src/main/docs/xtokenize-language.adoc new file mode 100644 index 0000000..2171407 --- /dev/null +++ b/camel-core/src/main/docs/xtokenize-language.adoc @@ -0,0 +1,34 @@ +[[XMLTokenizer-XMLTokenizer]] +XMLTokenizer +~~~~~~~~~~~~ + +The xml tokenizer language is a built-in language in camel-core, which +is a truly XML-aware tokenizer that can be used with the Splitter as the +conventional link:tokenizer.html[Tokenizer] to efficiently and +effectively tokenize XML documents. XMLTokenizer is capable of not only +recognizing XML namespaces and hierarchical structures of the document +but also more efficiently tokenizing XML documents than the conventional +Tokenizer. + +For more details see link:splitter.html[Splitter]. + +[[XMLTokenizer-Options]] +XML Tokenizer Options +^^^^^^^^^^^^^^^^^^^^^ + +// language options: START +The XML Tokenize language supports 4 options which are listed below. + + + +{% raw %} +[width="100%",cols="2,1m,1m,6",options="header"] +|======================================================================= +| Name | Default | Java Type | Description +| headerName | | String | Name of header to tokenize instead of using the message body. +| mode | | String | The extraction mode. The available extraction modes are: i - injecting the contextual namespace bindings into the extracted token (default) w - wrapping the extracted token in its ancestor context u - unwrapping the extracted token to its child content t - extracting the text content of the specified element +| group | | Integer | To group N parts together +| trim | true | Boolean | Whether to trim the value to remove leading and trailing whitespaces and line breaks +|======================================================================= +{% endraw %} +// language options: END