On Mon, 2020-05-11 at 22:29 +0200, Christian Grün wrote:
Providing access to the starts and ends may be difficult due to all the logical operators that can be used
A way to go from ($input, $phrases) to a $input autmented with db:milestone elements each containing starts="0 7 23" ends="2 6 18" attributes (where the numbers are positional in the sequene of phrases) might be good. Or the mileston element could iclude the phrase,
I saw his db:milestone <db:start ref="3">naked hooves</db:start> <db:start ref="6">unshod</db:start> </db:milestonr>baredb:milestone <db:end ref="6" /></db:mileston> feet....
as two problems are (1) overlapping results, and (2) query expansion using a thesaurus and/or stemming.
Liam
(ftor, ftand, ftnot, not in). A simple example:
let $xml := <_>a b c d</_> update {} return ft:mark($xml[text() contains text 'b c' ftand 'c d'])
We could possibly make the full data structures available that need to be internally generated. I fear people wouldn’t really work with it as they are fairly complex (a look into the specification may give you an impression of that [1]).
But thanks for your thoughts, I’ll let them grow.
[1] https://www.w3.org/TR/xpath-full-text-10/#FTOperatorsSemanticsSec