Hi Christian,

Thank you so much for your response, Christian. Please find my responses to your queries inline in the mail below, and please reply to them.
Thanks & Regards,
Sateesh A.
________________________________________
From: Christian Grün <christian.gruen@gmail.com>
Sent: Wednesday, October 8, 2014 7:15 PM
To: Sateesh
Cc: basex-talk@mailman.uni-konstanz.de
Subject: Re: [basex-talk] Fw: Suggestion for a requirement
Dear Sateesh,
We have started building a web application in which we need to create XML databases per user; that is, for each user who logs in, we need to create databases from his XML files in the backend.
This should be no problem. New databases can easily be created and dropped via the existing APIs or even XQuery.
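As a minimal sketch of this, using BaseX's standalone command line (the database name and input path below are illustrative placeholders, and a BaseX installation is assumed):

```
# Create a per-user database from that user's XML file:
basex -c "CREATE DB userdb_alice /data/users/alice.xml"

# Drop it again when the account is removed:
basex -c "DROP DB userdb_alice"
```

The same operations are available from XQuery via the `db:create()` and `db:drop()` functions of BaseX's Database Module.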
For this, we don't want to use BaseX's client/server architecture, as we don't want the extra time of sending an HTTP request to the BaseX server.
Have you encountered the client/server architecture to be a serious bottleneck, or is this a theoretical assumption? If you want to distribute your data anyway, it could make sense to do some manual sharding and experiment with multiple BaseX instances on different servers. If you believe that this makes no sense in your scenario, you can surely use BaseX in an embedded way.
Also note that the XML files are very large; they may grow to gigabytes.
Please note that the creation of a database from gigabytes of data might take some seconds or even minutes. Do you want to create these databases every time a user logs in, or only once (the first time)?
#Sateesh: This would be a one-time activity only. One problem with this approach: suppose I have a 1 GB XML file; creating a database for it results in approximately 2 GB of database storage, so I would need to maintain terabytes of disk space, which we cannot justify to our clients. Is there a way to keep the database size from growing that large without losing performance?
The query output will also be huge, and we must make sure that these large result volumes do not eat up all the heap memory,
If you use the APIs in the right way, all data will be streamed, so you can expect constant memory consumption (as long as you do not buffer the received results in your client).
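As a sketch of this with the standalone CLI (the database name and query are illustrative, and a BaseX installation is assumed), a large result can be written straight to disk instead of being collected in the client:

```
# Stream a potentially huge result directly to a file; the client
# does not accumulate it in memory ("userdb_alice" is a placeholder):
basex -q "db:open('userdb_alice')//record" > result.xml
```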
as the number of parallel users on this site is very high.
What does "very high" mean? Do you expect 1000 parallel requests per day, hour, minute, second (ms, ns, ...)? Will you have thousands or millions of users?
#Sateesh: We can expect 100 parallel hits per minute; some of these will be complex queries, and some will return huge volumes of data in response.
We cannot afford to run out of memory. We would also need to configure the database paths, as these are huge volumes that we cannot store in the default locations.
One approach to distribute database directories is to create symlinks to other drives. If a new database is created, it will always be added to the default folder, but you could create empty databases in advance, which are then populated as soon as new users are registered.
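The symlink idea above can be sketched as follows (all paths are temporary stand-ins created with `mktemp`, not BaseX's actual defaults; BaseX keeps each database in its own subdirectory of the database path, so that subdirectory is what gets linked to another drive):

```shell
# Stand-ins for the BaseX database directory and a large secondary drive:
DBPATH=$(mktemp -d)
BIGDISK=$(mktemp -d)

# Real directory on the big drive, symlinked into the database path:
mkdir "$BIGDISK/userdb_alice"
ln -s "$BIGDISK/userdb_alice" "$DBPATH/userdb_alice"

# Anything written under $DBPATH/userdb_alice now lands on the big drive:
touch "$DBPATH/userdb_alice/tbl.basex"
ls "$BIGDISK/userdb_alice"
```

As noted, the empty directories (or empty databases) can be prepared and linked in advance, so that a newly registered user's database is populated on the larger drive from the start.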
Hope this helps,
Christian