The parser token takes one parameter. The parameter is a regular expression. The first matching section is what is assigned
to chunk name. A matching section, as defined in RegEx is surrounded with parenthisis
If we wish to capture the price '9.99' excluding the dollar sign from this HTML:
<div id="interestingContent>
<a href="somerandomlink.txt">
$9.99 on sale!
</a>
</div>
we could use the following token parser configuration.
link
<|>regex:<div id="interestingContent"><a*>\$([0-9]+\.[0-9]+)<||>
Also available are two flavours of this parser. regexmulti - which captures repeating
elements on a page. and regexmulticount - which captures repeating elements and indexes them on the output.
When these tokens are used your vertical export file will contain multiple instances of these chunks
regexmulti - will find multiple matches of the expression on a page.
<|>regexmulti:(<RegEx>)<||>
regexmulticount - identical to regexmulti except that the export file will have a zero indexed suffix appended
to the chunk name
<|>regexmulticount:(<RegEx>)<||>