Hello!
On Sat, Jun 12, 2021 at 07:11:00PM +0200, Christian Grün scripsit:
Could you share exemplary and minimized input documents with us?
I have created some text documents and attached them. (The "remember to throw away some metadata markup, etc." step on the way to getting text to compare from the before and after vocabularies is believed to work reliably.)
Can the structure of the documents (hierarchy of nodes, element names, etc.) be completely ignored?
It can! This test is meant to test only that no words have been lost or re-ordered; that the transformation is semantically correct is out of scope for it.
Thank you! Graydon