Re: [basex-talk] normalize-unicode function

2 Jul 2012


      Hi Christian,
Thanks for your help.
I confirmed it about the relevant part of a program.
And, I confirmed that java.text.Normalizer.normalize() returned it
without converting a full-width double quotation mark that we usually
used (U+201D).
At present, As my hope,
When U+201C, U+201D, returns U+0022.
When U+2018, U+2019, returns U+0027.
However, these will not be enough for users of the whole world
(U+201A, U+201E... Other punctuation mark... etc...).
Furthermore, I cannot be convinced whether you should implement it its
changing the standard specifications of the Java.
Best regards,
Toshio
2012/6/28 Christian Grün christian.gruen@gmail.com:
...
Toshio san,
thanks for your e-mail.
...
FULLWIDTH QUOTATION MARK, FULLWIDTH APOSTROPHE:
 fn:normalize-unicode("”’", "NFKC")
 returns ”’
As far as I can judge, the result is actually correct; it’s returned
by Java’s standard Unicode algorithms, and also returned by other
XQuery  (Saxon, Zorba, XMLPrime, etc). I may need to do more research
on how to normalize quotes the way you’d like them to have, though.
Hope this helps (at least a little),
Christian

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

Re: [basex-talk] normalize-unicode function