We have set up a system in which we have about 17 million BaseX databases, but my operating system does not allow 17 million subdirectories in one directory ( the one set in the .basex file). In order to resolve this, we had to set up four (4) BaseX servers on four different machines so we could resolve this. Depending on the name of the BaseX database, we know on which BaseX server we have to look. As this is a work around, it would be nice to have either the possibility to have a hierarchy of databases, so we can store them into more manageable directories, or to allow to run more than one BaseX server onto a single machine (for instance, listening on different ports). For more info on the reasons why we did this: Vandeghinste and Augustinus (2014). Making Large Treebanks Searchable. The SONAR case. In Marc Kupietz, Hanno Biber, Harald Lüngen, Piotr Bański, Evelyn Breiteneder, Karlheinz Mörth, Andreas Witt, Jani Takhsha (eds.), Proceedings of the 2nd workshop on Challenges in the management of large corpora (CMLC-2) at the Ninth International Conference on Language Resources and Evaluation (LREC). Reykjavik, Iceland. pp. 15-20.
Is there another way to solve this?
thanks, v.