Tue Dec 12 01:04:12 2023, original submission:
I have been working with Michal on this via private email but thought to enter a bug report on this just for tracking and documentation.
I have one large file (57,000 PDF pages) that when compiled with tex4ht (takes 14 hrs), and at about 10% when generating the final HTML pages, it gets XML error and stops.
i.e. the 90% rest of the sections are missing from the final web pages.
-------------------------------------------------------
[INFO] make4ht-lib: parse_lg process file: reportsubsection1100.htm
[WARNING] domfilter: DOM parsing of reportsubsection1100.htm failed:
[WARNING] domfilter: ...ive/2023/texmf-dist/tex/luatex/luaxml/luaxml-mod-xml.lua:175: Incomplete XML Document [char=33675]
[INFO] make4ht-lib: parse_lg process file: reportsubsection1100.htm
[WARNING] domfilter: DOM parsing of reportsubsection1100.htm failed:
[WARNING] domfilter: ...ive/2023/texmf-dist/tex/luatex/luaxml/luaxml-mod-xml.lua:175: Incomplete XML Document [char=33675]
[INFO] make4ht-lib: parse_lg process file: reportsubsection1100.htm
----------------------------------
I've just send Michal a link to complete self contained ZIP file (450 MB) with instructions how to run as standalone in order to see these errors on his end.
I tried this on latest texlive 2023 on new Linux installation.
I will work with Michal to provide any additional information he needs from me, to hopefully find the cause of this problem.
This happens only on this file. I think may be due to the large size, since the Latex code is all generated by same program and only this file gives this error.
--Nasser
|