Hi,
I have a requirement to build XML repository where I will store millions of XML documents (size may vary) based on some unique Ids (one level path like - \10). For another operation, I need to compare the incoming XML document with the stored XML document for any changes.
1. Can any of you please let me know on how to implement this in BaseX? Apart from inserting the XMLs and comparing them with the incoming XML, there is no other query required. 2. Performance is the major criteria for this. Do you think BaseX will be able to handle this as the repository size grows? Also, is there a way to do batch processing of the above operations to improve performance?
Thanks in advance,
~ VK
Hi Vaibhav,
I have a requirement to build XML repository where I will store millions of XML documents (size may vary) based on some unique Ids (one level path like
- \10). For another operation, I need to compare the incoming XML document
with the stored XML document for any changes.
One popular way to do this is to have two databases, which are merged on a regular basis. The first one contains old documents, which are well-indexed and can be quickly searched. The second, incremental one contains the daily or weekly updates. With XQuery, you can access both databases in a single query and merge all results of a search request,
Hope this helps, Christian
basex-talk@mailman.uni-konstanz.de