On September 1, 2017 at 11:13:58 AM, Alexander Holupirek (alex@holupirek.de) wrote:
> On 1. Sep 2017, at 17:06, Ron Katriel <rkatriel@mdsol.com> wrote:
>
> Hi,
>
> I ran into a peculiar issue with importing an XML file from DrugBank into BaseX. It involves the file header, which looks like this:
>
> <?xml version="1.0" encoding="UTF-8"?>
> <drugbank xmlns="https://urldefense.proofpoint.com/v2/url?u=http-3A__www.drugbank.ca&d=DwIFAg&c=fi2D4-9xMzmjyjREwHYlAw&r=44jDQvzmnB_-ovfO6Iusj0ItciJrcWMOQQwd2peEBBE&m=4gh7MwX9kgAiuOIjM8N3S_zHFOutxVbKozwC0xz90BA&s=JdGaEkE3BkQuY6eLHh9eY-oEZ9HB2ZW9J3k6Hft-eaU&e= " xmlns:xsi="https://urldefense.proofpoint.com/v2/url?u=http-3A__www.w3.org_2001_XMLSchema-2Dinstance&d=DwIFAg&c=fi2D4-9xMzmjyjREwHYlAw&r=44jDQvzmnB_-ovfO6Iusj0ItciJrcWMOQQwd2peEBBE&m=4gh7MwX9kgAiuOIjM8N3S_zHFOutxVbKozwC0xz90BA&s=Tepbq62aOfIniXVpKLDYbXjyTd36uY8BK1aN4VG-UkQ&e= " xsi:schemaLocation="https://urldefense.proofpoint.com/v2/url?u=http-3A__www.drugbank.ca&d=DwIFAg&c=fi2D4-9xMzmjyjREwHYlAw&r=44jDQvzmnB_-ovfO6Iusj0ItciJrcWMOQQwd2peEBBE&m=4gh7MwX9kgAiuOIjM8N3S_zHFOutxVbKozwC0xz90BA&s=JdGaEkE3BkQuY6eLHh9eY-oEZ9HB2ZW9J3k6Hft-eaU&e= https://urldefense.proofpoint.com/v2/url?u=http-3A__www.drugbank.ca_docs_drugbank.xsd&d=DwIFAg&c=fi2D4-9xMzmjyjREwHYlAw&r=44jDQvzmnB_-ovfO6Iusj0ItciJrcWMOQQwd2peEBBE&m=4gh7MwX9kgAiuOIjM8N3S_zHFOutxVbKozwC0xz90BA&s=e9xaYijF4iXv_DD0kkUyk2njpqeFntG6_tYvv5rwZNQ&e= " version="5.0" exported-on="2017-08-31">
>
> The XML file (which is quite large ~3GB) loads in cleanly but queries executed against it produce no output (no errors reported).
>
> When I change the header to the following and re-import the file everything works fine:
>
> <?xml version="1.0" encoding="UTF-8"?>
> <drugbank>
>
> I had this issue for a while (go thought this every month) so thought it might be good to share. I am currently using BaseX 8.6.4.
>
> Any idea why this is happening? Attached is the console trace for both cases (they show subtle differences in the query plans).
>
> Thanks,
> Ron
Hi Ron,
is it possible to reproduce the error on a reduced, smaller dataset and a simplified query that you can post to the list? Maybe a namespace issue?
Thanks
Alex