Hi Steve,
The following options may be helpful for importing EPUB files in one go:
set parser html set createfilter *.htm,*.html,*.xhtml,*.xml,*.opf set addraw on create db db doc.epub
They can all be set via the GUI as well (provided you find the corresponding checkboxes ;·).
Cheers, Christian
On Thu, Aug 22, 2019 at 12:17 AM Majewski, Steven Dennis (sdm7g) sdm7g@virginia.edu wrote:
However, I neglected to notice that all of the html files in the EPUB container were all added as raw files. So it’s really just unpacking the EPUB zip into raw/ . Serving them up still works, but I’m guessing that will limit some of the search features on the unparsed files.
— Steve.
On Aug 21, 2019, at 5:01 PM, Majewski, Steven Dennis (sdm7g) sdm7g@virginia.edu wrote:
BTW: I was pleasantly surprised to discover that when I made a BaseX database with an ePub zip file, and selected the appropriate options ( Parse files in archives, add other files as raw files ) I was able to serve and view the ePub files thru the rest interface, and links from the TOC to other document parts, as well as images in the ePUB all worked.