Hi Christian,
Thank you for looking at the query. You are always a great help! The result I am looking for is the total number of occurrences, because the repository of letters is so large.
With this number I would be able to say, for example, that the word 'zonnebloemen' occurs 30 times in all documents. I thought this could be accomplished with a small alteration of this query.
If you see a solution, that would be great; otherwise I will just count them by hand.
Regards,
Wiard
2011/5/26 Christian Grün christian.gruen@gmail.com
Hi Wiard,
finally a quick answer; the result seems correct to me, as the term "zonnebloemen" occurs in three of the documents that you sent to me. I've attached the result below; which result would you have expected?
Christian
<document uri="file:/G:/XML/brievenvangogh/let001.xml"/>
<document uri="file:/G:/XML/brievenvangogh/let002.xml"/>
<document uri="file:/G:/XML/brievenvangogh/let003.xml"/>
<document uri="file:/G:/XML/brievenvangogh/let004.xml"/>
<document uri="file:/G:/XML/brievenvangogh/let266.xml">
  <hit score="0.2953945291203477" count="1">
    <ab xmlns="http://www.tei-c.org/ns/1.0" rend="indent">Verder maakte ik nog een studie van een<lb n="28" xml:id="l-28"/>hofje met een bleek &amp; zonnebloemen.<anchor n="4" xml:id="note-o-4"/> </ab>
  </hit>
</document>
<document uri="file:/G:/XML/brievenvangogh/let653.xml">
  <hit score="0.2953945291203477" count="1">
    <ab xmlns="http://www.tei-c.org/ns/1.0" rend="indent">Ik heb een studie van een tuin<c type="kk">/</c>haast een meter breed.<lb n="18" xml:id="l-18"/>op den voorgrond klaprozen &amp; andere roode bloemen in groen.<lb n="19" xml:id="l-19"/>dan een vak<hi rend="ital">blaauwe</hi>klokken<c type="kp">_</c>Dan een vak<lb n="20" xml:id="l-20"/>oranje en geele afrikanen<c type="kk">/</c>dan witte en geele<lb n="21" xml:id="l-21"/>bloemen en eindelijk op den achtergrond rose en lilas<lb n="22" xml:id="l-22"/>en nog scabiosas<c type="kk">/</c>donkerviolet<c type="kk">/</c>en roode geraniums en zonnebloemen<lb n="23" xml:id="l-23"/>en een vijgeboom en laurier rose en een wingerd<c type="kp">_</c>Aan ’t eind zwarte<lb n="24" xml:id="l-24"/>cypressen tegen lage witte huisjes met oranje dak – en een fijn<lb n="25" xml:id="l-25"/>groenblaauwe strook lucht.<anchor n="1" xml:id="note-o-1"/> </ab>
  </hit>
</document>
<document uri="file:/G:/XML/brievenvangogh/let740.xml">
  <hit score="1" count="1">
    <hi xmlns="http://www.tei-c.org/ns/1.0" rend="ital">Zonnebloemen</hi>
  </hit>
</document>
On Wed, May 25, 2011 at 11:21 PM, Christian Grün christian.gruen@gmail.com wrote:
Hi Wiard,
sorry for the delay; my todo list is long today, but I'll try to give you some feedback soon (unless someone else is faster..).
Christian
On Wed, May 25, 2011 at 8:56 PM, Wiard Vasen wiard.vasen@gmail.com wrote:
Hi Christian,
Did I throw too much over the fence?
Regards,
Wiard
2011/5/25 Wiard Vasen wiard.vasen@gmail.com
Hi Christian,
I hereby send you several XML files. The first four documents (266, 653, 667, 740) contain the term 'zonnebloemen'; the last four (1, 2, 3, 4) do not.
Thanks for looking at my problem!
Regards,
Wiard
2011/5/25 Christian Grün christian.gruen@gmail.com
Hi Wiard,
> I get the number of all documents as the number of hits.
> And what I want is the number of documents containing the term
> 'zonnebloemen'.
> Last time you had a solution for the number of hits in one specific
> document.
> I hope you have a solution for this problem.
Could you provide us with a little document that allows us to reproduce the problem?
Thanks, Christian
Hi Wiard,
thanks for your answer. Just to be sure: what should the result look like for the documents in question? ;)
Christian
Hi Christian,
The result should be: hits: 3.
In that case I can use the same query for all kinds of terms, for example: how many occurrences are there in the repository of the term "gauguin", or "money"?
That way I can both find the total number of occurrences as well as the files containing the term.
I hope this answers your question.
Regards,
Wiard
Hi Christian,
The best would be if it were possible to enter several terms in the query. That way I could see in how many documents this combination of terms occurs.
Regards,
Wiard
Hi Wiard,
the following query might give you the requested result:
ft:count ( db:open('brievenvangogh')//*[text() contains text 'zonnebloemen'] )
Hope this helps,
Christian
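The thread switches between two different numbers: the number of documents that contain the term and the total number of occurrences. The following is only a sketch of how the two could be computed side by side. It assumes the database name 'brievenvangogh' and the `db:open` and `ft:count` functions exactly as they are used in the query above, and it assumes that every letter is stored as a separate document in that database.

```xquery
(: Sketch only. Assumptions: a BaseX database named 'brievenvangogh',
   one document per letter, and the function names used in this thread. :)
let $docs := db:open('brievenvangogh')
return (
  (: number of documents that have at least one matching text node :)
  count($docs[.//text() contains text 'zonnebloemen']),
  (: total number of occurrences, same expression as in the query above :)
  ft:count(db:open('brievenvangogh')//*[text() contains text 'zonnebloemen'])
)
```

The first number answers "in how many letters does the word appear", the second "how often does it appear overall". For the sample files in this thread every hit has count="1", so both numbers would coincide there.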
Wow, exactly what I was looking for!
Thanks a lot Christian.
Hi Christian,
With an earlier solution of yours I can also find combinations of words in the repository.
ft:count ( db:open('brievenvangogh')//*[text() contains text {'van','waarde','de','theo'} all] )
I am very happy with your help!
Regards,
Wiard
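The query above counts term occurrences in elements whose text nodes match all four terms. The earlier wish, seeing in how many documents a combination of terms occurs, could be approached along the following lines. This is a hedged sketch, not a tested solution: it reuses the database name and `db:open` from the thread, assumes one document per letter, and compares against the string value of each whole document so that the terms may be spread over different elements.

```xquery
(: Sketch only. Assumptions: database 'brievenvangogh', one document per
   letter; the term list is the one from the query above. :)
let $terms := ('van', 'waarde', 'de', 'theo')
return count(
  for $doc in db:open('brievenvangogh')
  (: the text of the whole document must contain every term :)
  where $doc contains text { $terms } all
  return $doc
)
```

Replacing `all` with `any` would instead count the documents that contain at least one of the terms.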
basex-talk@mailman.uni-konstanz.de