Hi,
Sorry for all the questions of late first of all...
I've been using basex for a while now and wrote some nice xquery that gather metrics from datasets ~7000 files querying the whole corpus to create statistics and it quite fast at that but were thinking this query below feels slow at 34 seconds so we thought we should ask for you thoughts on it's duration.
Query: let $content := db:open('F-DDEX')//MessageHeader/MessageThreadId[text() eq '8937478'] return $content
Content: So every file has a message header with a MessageThreadId.
<MessageHeader xmlns:ern="http://ddex.net/xml/2011/ern-main/33" xmlns:xsi=" http://www.w3.org/2001/XMLSchema-instance"> <MessageThreadId>8937478</MessageThreadId> <MessageId>C2C977FDFDHF98DHF9D8FHEURYX</MessageId> <MessageSender> <PartyId>PA47F93H54HU93HJSFDINF</PartyId> <PartyName> <FullName>Warner Music Group</FullName> </PartyName> </MessageSender> <MessageRecipient> <PartyId>3G3E</PartyId> <PartyName> <FullName>3G3E-YADS</FullName> </PartyName> </MessageRecipient> <MessageCreatedDateTime>2012-06-18T05:35:54Z</MessageCreatedDateTime> </MessageHeader>
Given the following database with text, attribute and fulltext indexes on.
Database Properties Name: F-DDEX Size: 5251 MB Nodes: 239945615 Documents: 7954
Specs Server:
$ cat /proc/meminfo | grep MemTotal MemTotal: 7633876 kB
$ cat /proc/cpuinfo | grep name model name : Intel(R) Xeon(R) CPU E5430 @ 2.66GHz model name : Intel(R) Xeon(R) CPU E5430 @ 2.66GHz
Thoughts?
Thanks