Hi Ronny,
Hi Johannes & Charles, thanks for joining the conversation.
In my opinion, and speaking officially for BaseX, I'd suppose that XML processing with BaseX databases should almost always[1] be faster than processing the XML sequentially via lxml.
However, performance may vary depending on the actual queries and/or the python glue code.
I think Charles' approach of having as much logic in XQuery as possible will be the best option to pick here.
Maybe some of your Python code could as well be rewritten in XQuery, on the other hand this might not even be necessary due to XQuery rewrites as Johannes suggested.
@Ronny, maybe you could provide us with some sample code? In case it is not intended for the general public feel free to send it to
support@basex.org.
Looking forward to seeing your code!
Viele Grüße vom Bodensee
Michael
[1] I can sure think of examples that prove me wrong ;-)
Am 28.03.2012 um 23:19 schrieb Johannes.Lichtenberger:
Thus I suppose it
would be the best to write the queries in a reply, such that the BaseX
team can make suggestions for similar queries which better utilize
index-structures and the query optimizations from the query processor.