HI Christian --
There's no query! This is about loading the files into a DB with the GUI.
I've attached two files.
If I load them as Database->New with "input format" HTML, the comments go away.
If I load them the same way but with "lexical" as a TagSoup parser option, the comments go away. I expect "lexical" is the TagSoup option that keeps comments from going away. (And for the DOCTYPE in the example that has it to be retained.)
If I use
java -jar /usr/share/java/tagsoup.jar --lexical --files *html
from the
command
line, the comments do NOT go away,
so I don't think it's a TagSoup problem, at least not with 1.2.1