Hi,

I am seeing strange behavior with Full Text retrieval. The following query fails for a number of words that are in the XML document (see attached):

for $trial in db:open('CTGovDebug') (: [clinical_study/id_info/nct_id='NCT00473512'] :)
return $trial contains text { 'neoplasms' }

It fails on a good number of words including neoplasms, cougar, industry, yes, completed, november, 2005, interventional, single, male, female, assignment, none, research, principal, primary, secondary, age, years, gender, etc. But it matches most of the words in the file.

Observation: The words that fail are located at the beginning and/or end of the text and do not occur anywhere else in the middle of any text.

The document is the only one in the database. It does not make a difference whether full text indexing is on or off. My BaseX version is 8.6.4.

Thanks,
Ron


Ron Katriel, Ph.D. | Principal Data Scientist | Medidata Solutions
350 Hudson Street, 7th Floor, New York, NY 10014
rkatriel@mdsol.com | direct: +1 201 337 3622 | mobile: +1 201 675 5598 | main: +1 212 918 1800