Dear users,
Can I remove the file txtl.basex and still get an access to my data ? BaseX of course will be stopped. My HDD is full and I can't optimize all because BaseX times out. Then I would like to deactivate the UPDINDEX option on my already existing DB. I already read the thread https://mailman.uni-konstanz.de/pipermail/basex-talk/2014-July/006942.html ( UPDINDEX and ever growing index size ) and was well aware of the problem but I didn't know that it would grow so fast.
total 214335012 -rw-r--r-- 1 root root 1620720 Dec 23 02:54 atv.basex -rw-r--r-- 1 root root 2300192644 Dec 23 02:54 atvl.basex -rw-r--r-- 1 root root 337605 Dec 23 02:54 atvr.basex -rw-r--r-- 1 root root 15 Dec 23 02:54 idp.basex -rw-r--r-- 1 root root 137256 Dec 23 02:54 inf.basex -rw-r--r-- 1 root root 631959552 Dec 23 02:54 tbl.basex -rw-r--r-- 1 root root 1220641 Dec 23 02:54 tbli.basex -rw-r--r-- 1 root root 146626967 Dec 23 02:54 txt.basex -rw-r--r-- 1 root root 216396026742 Dec 23 02:54 txtl.basex -rw-r--r-- 1 root root 838900 Dec 23 02:54 txtr.basex
TIA,
Ludovic Kuty
BTW here is the output of the INFO INDEX command. We can see that the text index has a size of 202GB.
Elements - Structure: Hash - Entries: 4 value 2075397x, strings, leaf name 2075397x, 47 values, leaf data 2075397x, strings archive 47354x, strings
Attributes - Structure: Hash - Entries: 2 time 47354x, strings, leaf site 47354x, 1 values, leaf
Text Index - Structure: Sorted List - Size: 202 GB - Entries: 167780 false 1409387x true 1384561x 0.0 162432x 0 116279x 0.000 113438x 9 106765x 1 77692x debit turbinable auto SPW quand 67530x debit turbinable auto SPW 67530x debit meuse OP 67530x debit meuse FTP SPW quand 67530x debit meuse FTP SPW 67530x choix acquisition 67530x acquisition automate SPW ok 67530x acquisition FTP SPW ok 67530x pression gen4 66632x pression gen3 66632x pression gen2 66632x pression gen1 66632x p_active_gen6 66632x p_active_gen5 66632x p_active_gen4 66632x p_active_gen3 66632x p_active_gen2 66632x p_active_gen1 66632x niveau aval 66632x niveau amont 66632x nb turbines demarrees 66632x e produite 66632x P_ACTIVE 66632x ... -109.9 1x -109.5 1x -108.9 1x -108.6 1x -108.5 1x -108.4 1x -108.1 1x -108.0 1x -107.7 1x -107.3 1x -107.2 1x -105.8 1x -105.1 1x -105.0 1x -103.8 1x -102.9 1x -102.3 1x -101.7 1x -101.5 1x -10.7 1x -10.4 1x -1.8 1x -1.7 1x -1.5 1x -1.3 1x -1.0 1x -0.9 1x -0.6 1x -0.5 1x -0.48 1x
Attribute Index - Structure: Sorted List - Size: 2 GB - Entries: 67521 hun 67530x 2014-12-04T15:28:55 2x 2014-11-16T13:31:15 2x 2014-11-11T16:12:50 2x 2014-11-05T07:25:44 2x 2014-10-27T17:24:14 2x 2014-10-10T14:10:39 2x 2014-08-21T11:21:35 2x 2014-05-08T09:42:39 2x 2014-04-11T13:27:36 2x 2014-02-12T13:45:21 2x 2014-01-01T05:00:03 1x 2014-01-01T04:50:01 1x 2014-01-01T04:39:59 1x 2014-01-01T04:29:57 1x 2014-01-01T04:19:55 1x 2014-01-01T04:09:54 1x 2014-01-01T03:59:52 1x 2014-01-01T03:49:50 1x 2014-01-01T03:39:48 1x 2014-01-01T03:29:46 1x 2014-01-01T03:19:41 1x 2014-01-01T03:09:42 1x 2014-01-01T02:59:40 1x 2014-01-01T02:49:38 1x 2014-01-01T02:39:36 1x 2014-01-01T02:29:34 1x 2014-01-01T02:19:32 1x 2014-01-01T02:09:28 1x 2014-01-01T01:59:28 1x ... 2014-01-01T10:17:34 1x 2014-01-01T10:07:05 1x 2014-01-01T09:57:02 1x 2014-01-01T09:46:58 1x 2014-01-01T09:36:56 1x 2014-01-01T09:26:54 1x 2014-01-01T09:16:52 1x 2014-01-01T09:06:50 1x 2014-01-01T08:56:49 1x 2014-01-01T08:46:47 1x 2014-01-01T08:36:45 1x 2014-01-01T08:26:43 1x 2014-01-01T08:16:41 1x 2014-01-01T08:06:39 1x 2014-01-01T07:56:37 1x 2014-01-01T07:46:36 1x 2014-01-01T07:36:33 1x 2014-01-01T07:26:31 1x 2014-01-01T07:16:31 1x 2014-01-01T07:06:29 1x 2014-01-01T06:56:27 1x 2014-01-01T06:34:53 1x 2014-01-01T06:24:51 1x 2014-01-01T06:14:49 1x 2014-01-01T06:04:47 1x 2014-01-01T05:54:45 1x 2014-01-01T05:44:43 1x 2014-01-01T05:34:41 1x 2014-01-01T05:24:38 1x 2014-01-01T05:10:05 1x
Full-Text Index - Not available
Path Summary doc(): 47354x, strings archive: 47354x, strings @site: 47354x, 1 values, leaf @time: 47354x, strings, leaf data: 2075397x, strings name: 2075397x, leaf text(): 2075397x, 47 values, leaf value: 2075397x, leaf text(): 2075397x, strings, leaf
On 23 déc. 2014, at 10:32, Ludovic Kuty mailing@kuty.be wrote:
Dear users,
Can I remove the file txtl.basex and still get an access to my data ? BaseX of course will be stopped. My HDD is full and I can't optimize all because BaseX times out. Then I would like to deactivate the UPDINDEX option on my already existing DB. I already read the thread https://mailman.uni-konstanz.de/pipermail/basex-talk/2014-July/006942.html ( UPDINDEX and ever growing index size ) and was well aware of the problem but I didn't know that it would grow so fast.
total 214335012 -rw-r--r-- 1 root root 1620720 Dec 23 02:54 atv.basex -rw-r--r-- 1 root root 2300192644 Dec 23 02:54 atvl.basex -rw-r--r-- 1 root root 337605 Dec 23 02:54 atvr.basex -rw-r--r-- 1 root root 15 Dec 23 02:54 idp.basex -rw-r--r-- 1 root root 137256 Dec 23 02:54 inf.basex -rw-r--r-- 1 root root 631959552 Dec 23 02:54 tbl.basex -rw-r--r-- 1 root root 1220641 Dec 23 02:54 tbli.basex -rw-r--r-- 1 root root 146626967 Dec 23 02:54 txt.basex -rw-r--r-- 1 root root 216396026742 Dec 23 02:54 txtl.basex -rw-r--r-- 1 root root 838900 Dec 23 02:54 txtr.basex
TIA,
Ludovic Kuty
I made a drop index and it worked and was fast. I took the risk :)
On 23 déc. 2014, at 10:34, Ludovic Kuty mailing@kuty.be wrote:
BTW here is the output of the INFO INDEX command. We can see that the text index has a size of 202GB.
Elements
- Structure: Hash
- Entries: 4
value 2075397x, strings, leaf name 2075397x, 47 values, leaf data 2075397x, strings archive 47354x, strings
Attributes
- Structure: Hash
- Entries: 2
time 47354x, strings, leaf site 47354x, 1 values, leaf
Text Index
- Structure: Sorted List
- Size: 202 GB
- Entries: 167780
false 1409387x true 1384561x 0.0 162432x 0 116279x 0.000 113438x 9 106765x 1 77692x debit turbinable auto SPW quand 67530x debit turbinable auto SPW 67530x debit meuse OP 67530x debit meuse FTP SPW quand 67530x debit meuse FTP SPW 67530x choix acquisition 67530x acquisition automate SPW ok 67530x acquisition FTP SPW ok 67530x pression gen4 66632x pression gen3 66632x pression gen2 66632x pression gen1 66632x p_active_gen6 66632x p_active_gen5 66632x p_active_gen4 66632x p_active_gen3 66632x p_active_gen2 66632x p_active_gen1 66632x niveau aval 66632x niveau amont 66632x nb turbines demarrees 66632x e produite 66632x P_ACTIVE 66632x ... -109.9 1x -109.5 1x -108.9 1x -108.6 1x -108.5 1x -108.4 1x -108.1 1x -108.0 1x -107.7 1x -107.3 1x -107.2 1x -105.8 1x -105.1 1x -105.0 1x -103.8 1x -102.9 1x -102.3 1x -101.7 1x -101.5 1x -10.7 1x -10.4 1x -1.8 1x -1.7 1x -1.5 1x -1.3 1x -1.0 1x -0.9 1x -0.6 1x -0.5 1x -0.48 1x
Attribute Index
- Structure: Sorted List
- Size: 2 GB
- Entries: 67521
hun 67530x 2014-12-04T15:28:55 2x 2014-11-16T13:31:15 2x 2014-11-11T16:12:50 2x 2014-11-05T07:25:44 2x 2014-10-27T17:24:14 2x 2014-10-10T14:10:39 2x 2014-08-21T11:21:35 2x 2014-05-08T09:42:39 2x 2014-04-11T13:27:36 2x 2014-02-12T13:45:21 2x 2014-01-01T05:00:03 1x 2014-01-01T04:50:01 1x 2014-01-01T04:39:59 1x 2014-01-01T04:29:57 1x 2014-01-01T04:19:55 1x 2014-01-01T04:09:54 1x 2014-01-01T03:59:52 1x 2014-01-01T03:49:50 1x 2014-01-01T03:39:48 1x 2014-01-01T03:29:46 1x 2014-01-01T03:19:41 1x 2014-01-01T03:09:42 1x 2014-01-01T02:59:40 1x 2014-01-01T02:49:38 1x 2014-01-01T02:39:36 1x 2014-01-01T02:29:34 1x 2014-01-01T02:19:32 1x 2014-01-01T02:09:28 1x 2014-01-01T01:59:28 1x ... 2014-01-01T10:17:34 1x 2014-01-01T10:07:05 1x 2014-01-01T09:57:02 1x 2014-01-01T09:46:58 1x 2014-01-01T09:36:56 1x 2014-01-01T09:26:54 1x 2014-01-01T09:16:52 1x 2014-01-01T09:06:50 1x 2014-01-01T08:56:49 1x 2014-01-01T08:46:47 1x 2014-01-01T08:36:45 1x 2014-01-01T08:26:43 1x 2014-01-01T08:16:41 1x 2014-01-01T08:06:39 1x 2014-01-01T07:56:37 1x 2014-01-01T07:46:36 1x 2014-01-01T07:36:33 1x 2014-01-01T07:26:31 1x 2014-01-01T07:16:31 1x 2014-01-01T07:06:29 1x 2014-01-01T06:56:27 1x 2014-01-01T06:34:53 1x 2014-01-01T06:24:51 1x 2014-01-01T06:14:49 1x 2014-01-01T06:04:47 1x 2014-01-01T05:54:45 1x 2014-01-01T05:44:43 1x 2014-01-01T05:34:41 1x 2014-01-01T05:24:38 1x 2014-01-01T05:10:05 1x
Full-Text Index
- Not available
Path Summary doc(): 47354x, strings archive: 47354x, strings @site: 47354x, 1 values, leaf @time: 47354x, strings, leaf data: 2075397x, strings name: 2075397x, leaf text(): 2075397x, 47 values, leaf value: 2075397x, leaf text(): 2075397x, strings, leaf
On 23 déc. 2014, at 10:32, Ludovic Kuty mailing@kuty.be wrote:
Dear users,
Can I remove the file txtl.basex and still get an access to my data ? BaseX of course will be stopped. My HDD is full and I can't optimize all because BaseX times out. Then I would like to deactivate the UPDINDEX option on my already existing DB. I already read the thread https://mailman.uni-konstanz.de/pipermail/basex-talk/2014-July/006942.html ( UPDINDEX and ever growing index size ) and was well aware of the problem but I didn't know that it would grow so fast.
total 214335012 -rw-r--r-- 1 root root 1620720 Dec 23 02:54 atv.basex -rw-r--r-- 1 root root 2300192644 Dec 23 02:54 atvl.basex -rw-r--r-- 1 root root 337605 Dec 23 02:54 atvr.basex -rw-r--r-- 1 root root 15 Dec 23 02:54 idp.basex -rw-r--r-- 1 root root 137256 Dec 23 02:54 inf.basex -rw-r--r-- 1 root root 631959552 Dec 23 02:54 tbl.basex -rw-r--r-- 1 root root 1220641 Dec 23 02:54 tbli.basex -rw-r--r-- 1 root root 146626967 Dec 23 02:54 txt.basex -rw-r--r-- 1 root root 216396026742 Dec 23 02:54 txtl.basex -rw-r--r-- 1 root root 838900 Dec 23 02:54 txtr.basex
TIA,
Ludovic Kuty
Hi Ludovic,
Thanks for your report. Yes, dropping the index, or calling "optimize all" are both completely viable solutions to reduce the database size.
In Summer, we have spent lots of efforts to get around this restriction, so I would be glad if you could try the beta version of BaseX 8.0 [1] and tell us about your experience!
Thanks in advance, Christian
[1] http://files.basex.org/releases/latest
On Tue, Dec 23, 2014 at 10:56 AM, Ludovic Kuty mailing@kuty.be wrote:
I made a drop index and it worked and was fast. I took the risk :)
On 23 déc. 2014, at 10:34, Ludovic Kuty mailing@kuty.be wrote:
BTW here is the output of the INFO INDEX command. We can see that the text index has a size of 202GB.
Elements
- Structure: Hash
- Entries: 4
value 2075397x, strings, leaf name 2075397x, 47 values, leaf data 2075397x, strings archive 47354x, strings
Attributes
- Structure: Hash
- Entries: 2
time 47354x, strings, leaf site 47354x, 1 values, leaf
Text Index
- Structure: Sorted List
- Size: 202 GB
- Entries: 167780
false 1409387x true 1384561x 0.0 162432x 0 116279x 0.000 113438x 9 106765x 1 77692x debit turbinable auto SPW quand 67530x debit turbinable auto SPW 67530x debit meuse OP 67530x debit meuse FTP SPW quand 67530x debit meuse FTP SPW 67530x choix acquisition 67530x acquisition automate SPW ok 67530x acquisition FTP SPW ok 67530x pression gen4 66632x pression gen3 66632x pression gen2 66632x pression gen1 66632x p_active_gen6 66632x p_active_gen5 66632x p_active_gen4 66632x p_active_gen3 66632x p_active_gen2 66632x p_active_gen1 66632x niveau aval 66632x niveau amont 66632x nb turbines demarrees 66632x e produite 66632x P_ACTIVE 66632x ... -109.9 1x -109.5 1x -108.9 1x -108.6 1x -108.5 1x -108.4 1x -108.1 1x -108.0 1x -107.7 1x -107.3 1x -107.2 1x -105.8 1x -105.1 1x -105.0 1x -103.8 1x -102.9 1x -102.3 1x -101.7 1x -101.5 1x -10.7 1x -10.4 1x -1.8 1x -1.7 1x -1.5 1x -1.3 1x -1.0 1x -0.9 1x -0.6 1x -0.5 1x -0.48 1x
Attribute Index
- Structure: Sorted List
- Size: 2 GB
- Entries: 67521
hun 67530x 2014-12-04T15:28:55 2x 2014-11-16T13:31:15 2x 2014-11-11T16:12:50 2x 2014-11-05T07:25:44 2x 2014-10-27T17:24:14 2x 2014-10-10T14:10:39 2x 2014-08-21T11:21:35 2x 2014-05-08T09:42:39 2x 2014-04-11T13:27:36 2x 2014-02-12T13:45:21 2x 2014-01-01T05:00:03 1x 2014-01-01T04:50:01 1x 2014-01-01T04:39:59 1x 2014-01-01T04:29:57 1x 2014-01-01T04:19:55 1x 2014-01-01T04:09:54 1x 2014-01-01T03:59:52 1x 2014-01-01T03:49:50 1x 2014-01-01T03:39:48 1x 2014-01-01T03:29:46 1x 2014-01-01T03:19:41 1x 2014-01-01T03:09:42 1x 2014-01-01T02:59:40 1x 2014-01-01T02:49:38 1x 2014-01-01T02:39:36 1x 2014-01-01T02:29:34 1x 2014-01-01T02:19:32 1x 2014-01-01T02:09:28 1x 2014-01-01T01:59:28 1x ... 2014-01-01T10:17:34 1x 2014-01-01T10:07:05 1x 2014-01-01T09:57:02 1x 2014-01-01T09:46:58 1x 2014-01-01T09:36:56 1x 2014-01-01T09:26:54 1x 2014-01-01T09:16:52 1x 2014-01-01T09:06:50 1x 2014-01-01T08:56:49 1x 2014-01-01T08:46:47 1x 2014-01-01T08:36:45 1x 2014-01-01T08:26:43 1x 2014-01-01T08:16:41 1x 2014-01-01T08:06:39 1x 2014-01-01T07:56:37 1x 2014-01-01T07:46:36 1x 2014-01-01T07:36:33 1x 2014-01-01T07:26:31 1x 2014-01-01T07:16:31 1x 2014-01-01T07:06:29 1x 2014-01-01T06:56:27 1x 2014-01-01T06:34:53 1x 2014-01-01T06:24:51 1x 2014-01-01T06:14:49 1x 2014-01-01T06:04:47 1x 2014-01-01T05:54:45 1x 2014-01-01T05:44:43 1x 2014-01-01T05:34:41 1x 2014-01-01T05:24:38 1x 2014-01-01T05:10:05 1x
Full-Text Index
- Not available
Path Summary doc(): 47354x, strings archive: 47354x, strings @site: 47354x, 1 values, leaf @time: 47354x, strings, leaf data: 2075397x, strings name: 2075397x, leaf text(): 2075397x, 47 values, leaf value: 2075397x, leaf text(): 2075397x, strings, leaf
On 23 déc. 2014, at 10:32, Ludovic Kuty mailing@kuty.be wrote:
Dear users,
Can I remove the file txtl.basex and still get an access to my data ? BaseX of course will be stopped. My HDD is full and I can't optimize all because BaseX times out. Then I would like to deactivate the UPDINDEX option on my already existing DB. I already read the thread https://mailman.uni-konstanz.de/pipermail/basex-talk/2014-July/006942.html ( UPDINDEX and ever growing index size ) and was well aware of the problem but I didn't know that it would grow so fast.
total 214335012 -rw-r--r-- 1 root root 1620720 Dec 23 02:54 atv.basex -rw-r--r-- 1 root root 2300192644 Dec 23 02:54 atvl.basex -rw-r--r-- 1 root root 337605 Dec 23 02:54 atvr.basex -rw-r--r-- 1 root root 15 Dec 23 02:54 idp.basex -rw-r--r-- 1 root root 137256 Dec 23 02:54 inf.basex -rw-r--r-- 1 root root 631959552 Dec 23 02:54 tbl.basex -rw-r--r-- 1 root root 1220641 Dec 23 02:54 tbli.basex -rw-r--r-- 1 root root 146626967 Dec 23 02:54 txt.basex -rw-r--r-- 1 root root 216396026742 Dec 23 02:54 txtl.basex -rw-r--r-- 1 root root 838900 Dec 23 02:54 txtr.basex
TIA,
Ludovic Kuty
basex-talk@mailman.uni-konstanz.de