On Wednesday, April 5, 2023 at 03:28:57 AM EDT, <jorge@vionta.net> wrote:

Hi Owen,

You may want to check the full-text configuration capabilities (https://docs.basex.org/wiki/Full-Text), such as positional filters and fuzzy querying. It may be a bug, but I would rule out configuration first.

I can see that you are making good progress, and I love that you have chosen the BaseX option. I think that you are on the right path.

Love to see the progress.

Kind regards.

On 08/03/2023 17:31, Owen Ambur wrote:

Christian, do you know if this has been identified as a bug in BaseX's full-text query capability and, if so, if there are any plans to do anything about it?

If memory serves me correctly, I subscribed to the BaseX listserv for a while to try to enlist a developer(s) for a StratML-enabled query service, like the one on which Naval is now working for me for hosting at https://aboutthem.info/.

When the query service is in relatively good shape, I may wish to resubscribe to the listserv to announce it there as well as on LinkedIn and perhaps elsewhere. However, do you think it might be worthwhile to raise this issue on the listserv in the meantime?

Owen Ambur
https://www.linkedin.com/in/owenambur/

On Tuesday, March 7, 2023 at 03:16:49 PM EST, Naval Sarda <nsarda@epicomm.net> wrote:

Hi Owen.

The built-in search provided by BaseX combines the text from the next file and then searches.

So if a line ends with the word "end." and the next line starts with "less", it will match the search term "endless".

This is a false positive match. There is not much we can do about it, as replacing the built-in search with a custom search would be slow.
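
One possible middle ground, sketched here only as an assumption and not something tested against BaseX: keep the built-in search as the fast first pass and re-check just the returned hits for a whole-word match. The `confirm_hits` helper and the sample data below are hypothetical and written in Python purely for illustration.

```python
import re

def confirm_hits(candidates, keyword):
    """Keep only candidates whose text contains the keyword as a whole word.

    `candidates` maps a document name to its text, as returned by a
    (hypothetical) first-pass search that may include false positives.
    """
    pattern = re.compile(r"\b" + re.escape(keyword) + r"\b", re.IGNORECASE)
    return [name for name, text in candidates.items() if pattern.search(text)]

# Hypothetical first-pass results: b.xml matched only across a line break.
candidates = {
    "a.xml": "An endless list of goals",
    "b.xml": "This is the end.\nless is more",
}
print(confirm_hits(candidates, "endless"))  # ['a.xml']
```

Because only the hits are re-checked, the extra cost grows with the number of results rather than with the size of the collection.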

Naval

On 07/03/23 6:38 am, Owen Ambur wrote:

What can we do about it?

Owen Ambur
https://www.linkedin.com/in/owenambur/

On Monday, March 6, 2023 at 07:16:17 PM EST, Naval Sarda <nsarda@epicomm.net> wrote:

Please see below

-------- Forwarded Message --------
Subject: Re: Fwd: False Positives
Date: Mon, 6 Mar 2023 21:38:43 +0530
From: Sudarshana <sudarshanac@epicomm.net>
To: Naval Sarda <nsarda@epicomm.net>, jitendras@epicomm.net

Owen,

This is a known issue that we informed you about.

In full-text search, a match is still returned when a whitespace character (tab, space, or newline) falls inside it.

In the file APQC.xml, Board of Governors of the Federal Reserve System is one organization, and Bombardier Aerospace Inc. is the next, adjacent organization.

So in "Board of Governors of the Federal Reserve System Bombardier Aerospace Inc.", the highlighted letters ("tem" at the end of "System" and "Bom" at the start of "Bombardier") are read together as "tembom", which matches the keyword.

That is why those files appear in the results.
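
The effect described above can be reproduced outside BaseX. The following is a minimal Python sketch, not BaseX's actual implementation: the function names and the two-item list are assumptions made for illustration. It shows that joining adjacent text values with the whitespace dropped lets "tembo" match across "System" and "Bombardier", while comparing whole words does not.

```python
import re

# Two adjacent organization names, as described for APQC.xml.
names = [
    "Board of Governors of the Federal Reserve System",
    "Bombardier Aerospace Inc.",
]

def substring_hit(texts, keyword):
    """Naive search: drop all whitespace, join everything, look for a substring."""
    joined = "".join(re.sub(r"\s+", "", t) for t in texts).lower()
    return keyword.lower() in joined

def whole_word_hit(texts, keyword):
    """Token search: split each text into words and compare whole words only."""
    words = [w.lower() for t in texts for w in re.findall(r"\w+", t)]
    return keyword.lower() in words

print(substring_hit(names, "tembo"))   # True
print(whole_word_hit(names, "tembo"))  # False
```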

-Sudarshana

On 3/6/2023 10:18 AM, Naval Sarda wrote:

From: Owen Ambur <owen.ambur@verizon.net>
Sent: Monday, March 6, 2023 6:35 AM
To: Naval Sarda <nsarda@epicomm.net>
Cc: abouttheminfoplan@googlegroups.com <abouttheminfoplan@googlegroups.com>
Subject: False Positives

Naval, Ken Holman's LinkedIn posting about his health issue prompted me to query to confirm that Project TEMBO's about statement is in the StratML collection.

However, the full-text query also revealed a couple of apparent false positives, as shown in the screenshot below. They are:

https://stratml.us/docs/APQC.xml

https://stratml.us/docs/DOSAID2022.xml

Since the latter is tangentially related in terms of foreign aid, it might be logical for an AI-enhanced query service to reveal it as such. However, ours isn't that "intelligent," is it?

What do you suppose might account for the false positives? This isn't the first time I've encountered them.

Owen Ambur
https://www.linkedin.com/in/owenambur/

--
Thanks & Regards
Sudarshana
