Am 01.08.2018 um 09:56 schrieb Christian Grün:
Just for fun, I wrote a little crawler in XQuery (see the attached files).
Very interesting, indeed! Nice to see an example of lazy:cache and prof:dump. I did not use them, so far, and that is some good news to see them in action.
it should surely be used decently, otherwise the remote server might block further access.
Sure! That is one reason I am grinding my teeth on some link analysation right now, so the crawl can be limited to URIs of (a) certain kind(s).
P.S. Resending, since my MUA filled the wrong address (Christian's) instead of the list's into the TO: field and I forgot about it.