Hello Chris,
as you correctly observed, string comparisons within BaseX are based on Unicode codepoints. However, you can define other collations; see https://docs.basex.org/wiki/Full-Text#Collations for more details.
So, for example, a query like

    for $e in ("Äa", "Ab") order by $e collation "?lang=de" return $e

will return ("Äa", "Ab"), whereas the default Unicode codepoint ordering would return ("Ab", "Äa").
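If it helps to see the difference outside of BaseX, here is a minimal sketch in plain Java that contrasts the two orderings. It only illustrates the general idea: the class and method names are made up for this example, and it uses the JDK's java.text.Collator, which I am assuming behaves comparably to a "?lang=de" collation for these two words. It is not a description of how BaseX resolves collations internally.

```java
import java.text.Collator;
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Locale;

// Hypothetical demo class; not part of BaseX.
public class CollationSketch {

    // Sorts with String.compareTo, which compares UTF-16 code units.
    // For characters in the Basic Multilingual Plane this matches codepoint order.
    static List<String> sortByCodepoint(List<String> words) {
        List<String> out = new ArrayList<>(words);
        out.sort(Comparator.naturalOrder());
        return out;
    }

    // Sorts with a locale-sensitive collator, which treats the diaeresis
    // as a lower-priority difference than the base letters.
    static List<String> sortWithCollator(List<String> words, Locale locale) {
        Collator collator = Collator.getInstance(locale);
        List<String> out = new ArrayList<>(words);
        out.sort(collator);
        return out;
    }

    public static void main(String[] args) {
        // "\u00C4" is the letter Ä, escaped to avoid source-encoding issues.
        List<String> words = List.of("Ab", "\u00C4a");
        System.out.println("codepoint: " + sortByCodepoint(words));
        System.out.println("collated:  " + sortWithCollator(words, Locale.GERMAN));
    }
}
```

With the German collator, "Äa" sorts before "Ab" because the base letters "Aa" and "Ab" are compared first and the diacritic only acts as a tie-breaker. The plain comparison puts "Äa" last because Ä (U+00C4) has a higher codepoint than A (U+0041).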
Cheers, Dirk
On 22/05/14 17:09, Chris Yocum wrote:
Hi Everyone,
Thank you for your help last time. I was hoping you might be able to answer another question for me. I have a database with many words where the first letter has a diacritic. When I apply "order by" to these, the words with a leading diacritic are sorted at the bottom instead of in the place of their non-diacritic letter. I could use fn:normalize-unicode, but I was wondering what a "normal" sort order would be.
All the best, Chris
-- 
Dirk Kirsten, BaseX GmbH, http://basex.org
|-- Firmensitz: Blarerstrasse 56, 78462 Konstanz
|-- Registergericht Freiburg, HRB: 708285, Geschäftsführer:
|   Dr. Christian Grün, Dr. Alexander Holupirek, Michael Seiferle
`-- Phone: 0049 7531 28 28 676, Fax: 0049 7531 20 05 22