Keep the comments when parsing HTML? - BaseX-Talk - mailman.uni-konstanz.de

18 Aug 2017


Hello --
So I have a pile of near-XML HTML with semantically significant comments to
deal with.  (I must have been sinning much more than I realized!)
Using BaseX866-20170818.124137, BaseX will parse the content, but all the
comments go away.  This is with the "lexical" option passed on the parser
tab where it asks for TagSoup options; as I understand from
https://github.com/orbeon/tagsoup/blob/master/trunk/README, that option
should pass through comments (and DOCTYPE declarations).
How do I parse HTML and keep the comments?
Thanks!
Graydon