Hello,
First, I'd like to thank you guys for all your great work on BaseX. I am fairly familiar with XML databases and have done a significant amount of development on top of MarkLogic. I would like to ask some questions about capacity and scalability. I have reviewed the documentation and see that the largest store listed is for SDMX, at approximately 8000 GB. I am trying to better understand what this means and would appreciate your expert advice on the questions below:
1. Is the expectation that you can query against 8 TB of XML data efficiently?
2. My requirements will be to query across probably 24 TB of XML data. Do you guys feel this is possible?
3. What is the method to scale horizontally and vertically? I.e., would I be adding more servers, or starting more instances, etc.?
4. How does high availability work? I.e., can I have multiple active-active nodes, or should it be active-passive, etc.?
Any help anyone can render is greatly appreciated.
Thanks
Raj