<div class="gmail_quote">On Thu, Jan 20, 2011 at 6:06 PM, <span dir="ltr">&lt;<a href="mailto:exarkun@twistedmatrix.com">exarkun@twistedmatrix.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">
<div class="im">On 20 Jan, 11:57 pm, <a href="mailto:screwtape@froup.com">screwtape@froup.com</a> wrote:<br>
>On Thu, Jan 20, 2011 at 05:22:55PM -0600, Kevin Horn wrote:<br>
>>Hmmm...yes, I think it does. It looks like the l2s_builder script is<br>
>>choking on fetching the DTD for some reason. lore2sphinx itself is supposed<br>
>>to cache the DTD between runs, but I'm betting it doesn't work in the<br>
>>buildbot due to the (hopefully) clean environment it has every time it<br>
>>runs.<br>
>><br>
>>I can change it to not care about that, but if I do, then it won't be able<br>
>>to resolve HTML entities (of which there are a few in the Lore sources,<br>
>>mostly em-dashes IIRC).<br>
><br>
>You mean these DTDs?<br>
><br>
> twisted/lore/xhtml1-strict.dtd<br>
> twisted/lore/xhtml1-transitional.dtd<br>
><br>
>They reference the xhtml-*.ent entity definitions which are also in the<br>
>same directory. It would be neat if lore2sphinx could be taught to use<br>
>the DTDs packaged with lore instead of having to download them from the<br>
>Internet every time.<br>
<br>
</div>It could even use Lore's parser, twisted.lore.tree.parseFileAndReport ;)<br>
<br>
Jean-Paul<br>
<br></blockquote><div> </div></div>Yes, but:<br><br>- The docs (well, docstrings) aren't very clear about exactly what that does, or what quirks it might or might not have. <br>- Is the DOM document it returns a typical XHTML DOM document? Is there anything special/different about it since it's a Lore XHTML doc?<br>
- There are a number of open tickets involving Lore's parser (or at least there were several when I started this whole business).<br>- The whole point of this project is to stop using (and by extension supporting/maintaining) Lore, so it seems kind of backwards to me to depend on it.<br>
- I'd rather deal with an etree-a-like than a straight DOM document (admittedly, a personal preference).<br>- I certainly don't want to rewrite everything at this point ;)<br><br>Kevin Horn<br>