Because of the size, is there a way to store xml-data as archive
If you want to compress your XML documents with XQuery, you can have a look at the Archive or ZIP Module of BaseX. I am not sure, however, if this is what you are looking for?
(with possibility to search) ?
The BaseX implementation of fn:doc() and fn:collection() supports zipped files as arguments. The contents will be unzipped automatically:
collection('my-xml-files.zip')
If you create databases, you can specify ZIP files as input, too.
Hi Vladimir,
If your question is : is there a compression option to reduce database files and use built-in index features ? the answer is no. I can remember that BaseX use custom compression for text nodes. [1]
Christian, I hope you are doing well ? Did I loose my mind or Did BaseX have a long time ago a compressed database option ?
Best regards,
[1] http://basex.org/2018/03/23/basex-9.0--the-spring-edition/
BaseX
-----Message d'origine----- De : BaseX-Talk [mailto:basex-talk-bounces@mailman.uni-konstanz.de] De la part de Christian Grün Envoyé : mercredi 22 août 2018 17:08 À : Ветошкин Владимир Cc : BaseX Objet : Re: [basex-talk] archive
I can compress xml-data and store it in db. But then how can I search inside that db using index? Is it possible?
I am not sure if I understand. How do you proceed? Could you possibly give us a step-by-step explanation?
Hi Fabrice,
Thanks for your response to Vladimir.
You are completely right, BaseX has its own compression algorithms (without compression, databases would be much larger). Due to the query update facilities and additional index structures, a database will always cosume more space than zipped XML documents.
Cheers Christian
On Wed, Aug 22, 2018 at 5:24 PM Fabrice ETANCHAUD fetanchaud@pch.cerfrance.fr wrote:
Hi Vladimir,
If your question is : is there a compression option to reduce database files and use built-in index features ? the answer is no. I can remember that BaseX use custom compression for text nodes. [1]
Christian, I hope you are doing well ? Did I loose my mind or Did BaseX have a long time ago a compressed database option ?
Best regards,
[1] http://basex.org/2018/03/23/basex-9.0--the-spring-edition/
BaseX
-----Message d'origine----- De : BaseX-Talk [mailto:basex-talk-bounces@mailman.uni-konstanz.de] De la part de Christian Grün Envoyé : mercredi 22 août 2018 17:08 À : Ветошкин Владимир Cc : BaseX Objet : Re: [basex-talk] archive
I can compress xml-data and store it in db. But then how can I search inside that db using index? Is it possible?
I am not sure if I understand. How do you proceed? Could you possibly give us a step-by-step explanation?
This possibility to open zipped XMLs via doc() is awesome.
Universität Leipzig Institute of Computer Science, NLP Augustusplatz 10 04109 Leipzig Deutschland E-mail: celano@informatik.uni-leipzig.de E-mail: giuseppegacelano@gmail.com Web site 1: http://asv.informatik.uni-leipzig.de/en/staff/Giuseppe_Celano Web site 2: https://sites.google.com/site/giuseppegacelano/
On Aug 22, 2018, at 4:28 PM, Christian Grün christian.gruen@gmail.com wrote:
Because of the size, is there a way to store xml-data as archive
If you want to compress your XML documents with XQuery, you can have a look at the Archive or ZIP Module of BaseX. I am not sure, however, if this is what you are looking for?
(with possibility to search) ?
The BaseX implementation of fn:doc() and fn:collection() supports zipped files as arguments. The contents will be unzipped automatically:
collection('my-xml-files.zip')
If you create databases, you can specify ZIP files as input, too.
basex-talk@mailman.uni-konstanz.de