Hi all,

 

I would really like to be able to query a large corpus of documents to get names and counts of the DTDs which are declared in the (somewhat old-fashioned now) DOCTYPE declaration:

 

<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE converted-article PUBLIC "-//ES//DTD journal article DTD version 4.5.2//EN//XML" "art452.dtd" [
]>
<converted-article> <!-- etc -->

 

Is there any way to get BaseX to preserve this information? Can I rewrite the doctype declaration into some sort of element node as the DB is being created so that this info can be queried?

 

Thanks for any tips,

Constantine.





Elsevier B.V. Registered Office: Radarweg 29, 1043 NX Amsterdam, The Netherlands, Registration No. 33156677, Registered in The Netherlands.