Once again forwarding to the list, as I am apparently unable to use this Gmail client correctly. For the sake of completeness, see the forwarded mails below.
---------- Forwarded message ----------
From: Dirk Kirsten dk@basex.org
Date: Wed, Jul 11, 2012 at 12:20 PM
Subject: Re: [basex-talk] Replication with BaseX
To: Usama Dar munir.usama@gmail.com
Oh, yeah, sure. You are absolutely right, sorry for misguiding you there. Yes, the indexes are on disk and will be transferred, so using rsync should be pretty performant.
Cheers, Dirk
On Wed, Jul 11, 2012 at 9:31 AM, Usama Dar munir.usama@gmail.com wrote:
Thanks Dirk. I noticed that all data structures are stored in the data directory, so if I copy the full data directory to a remote machine and then continuously rsync it, do you think some index rebuilding would still be required? I thought the indexes, like everything else, are already on disk.
Best Regards, Usama Dar
On Wed, Jul 11, 2012 at 9:02 AM, Dirk Kirsten dk@basex.org wrote:
Dear Usama,
Currently there is no recommended strategy for how to do this, so there is no single right answer to your question.
I think your strategy of using rsync sounds very reasonable. Rsync has the nice advantage that it transfers only the incremental changes, so you don't have to transfer all 500K documents every time. However, you would then still have to update the indexes, so depending on your data and your update interval this could take some time.
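For illustration, the rsync approach could be scripted roughly as follows. This is only a minimal sketch in Python: the paths and the host name `standby` are hypothetical, and it leaves open the question raised in this thread of whether files can safely be copied while a BaseX instance is running.

```python
import subprocess
from typing import List


def build_rsync_command(data_dir: str, remote: str) -> List[str]:
    """Build an rsync invocation that mirrors data_dir to remote."""
    # A trailing slash makes rsync copy the directory's contents
    # rather than the directory itself.
    source = data_dir.rstrip("/") + "/"
    # -a: archive mode, -z: compress in transit,
    # --delete: remove files on the target that no longer exist locally.
    return ["rsync", "-az", "--delete", source, remote]


def sync(data_dir: str, remote: str) -> None:
    """Run one synchronization pass; raises if rsync exits with an error."""
    subprocess.run(build_rsync_command(data_dir, remote), check=True)


if __name__ == "__main__":
    # Hypothetical locations; adjust to the actual BaseX data directory.
    print(build_rsync_command("/srv/basex/data", "standby:/srv/basex/data/"))
```

Calling `sync(...)` from a cron job or a loop would give the periodic replication described above; only changed data is sent on each pass.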
Another very simple approach could be to use the BaseX client/server infrastructure to send all commands and queries to both servers, trying to keep them synchronized. This could be more efficient, but it depends heavily on your data and your queries (i.e. your use case).
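The idea of sending every command to both servers can be sketched as below. This is a rough illustration, not a BaseX API: the session objects are assumed to expose an `execute(command)` method, and `RecordingSession` is a hypothetical stand-in for a real client connection so that the sketch runs on its own.

```python
class RecordingSession:
    """Hypothetical stand-in for a client session to one server."""

    def __init__(self, name):
        self.name = name
        self.log = []

    def execute(self, command):
        # A real session would send the command over the network;
        # here it is only recorded.
        self.log.append(command)
        return "%s: ok" % self.name


def execute_on_all(sessions, command):
    """Send the same command to every session and collect the results."""
    results = []
    for session in sessions:
        results.append(session.execute(command))
    return results


if __name__ == "__main__":
    primary = RecordingSession("primary")
    standby = RecordingSession("standby")
    print(execute_on_all([primary, standby], "OPEN documents"))
```

Note that this sketch has no error handling: if one server fails partway through, the two databases diverge, so a real setup would need some way to detect and repair that.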
Cheers, Dirk
On Tue, Jul 10, 2012 at 8:12 PM, Usama Dar munir.usama@gmail.com wrote:
Hi Guys,
I am new to BaseX and I am going to use it to store 500K XML documents, which will be edited and queried. I have already built the main application. I need to be able to run BaseX in a high-availability environment. For that I will initially have two servers, and I plan to use rsync on the BaseX data directory to replicate the data.
My question is basically: are there any recommended strategies for doing this? Is rsync going to do the job? Can I copy files into the data directory of a running instance, or should the other server be shut down while I copy files? Is there any tool or strategy you know of?
Thanks for your help
Usama Dar.
BaseX-Talk mailing list BaseX-Talk@mailman.uni-konstanz.de https://mailman.uni-konstanz.de/mailman/listinfo/basex-talk