On 11.07.2020 14:41, Giuseppe G. A. Celano wrote:
I am trying to perform a join operation between two large XML files (~490 MB and ~40 MB), which are the result of the automatic conversion of old sql dumps into XML files. I created two databases for the files. The query I wrote to join them is correct because it works when I limit the join to just a few items, but it never ends if I apply it to all items:
here is the xquery: https://git.informatik.uni-leipzig.de/celano/perseus_morpheus/-/blob/master/... here is the first file:
Saxon EE seems to be capable of handling it (loading the files with the doc function of course instead of from a db), although needing more than 2GB of memory.
Not sure where BaseX struggles, I am sure someone of the BaseX team can tell you soon.
Saxon HE also struggles so it must be some of the advanced join optimizations in EE that allow it to run that query in a reasonable time.