[Twisted-Python] Unable to write to "stuck" TCP client connections

Glyph glyph at twistedmatrix.com
Tue Feb 26 00:36:59 MST 2013


On Feb 25, 2013, at 10:24 PM, Wenxiang Wu <wenxiang at zopim.com> wrote:

> From what I can tell, there isn't a specific pattern that. But from this small sample size, users that run into this problem are those with higher levels of usage, i.e. connections with more traffic.
> 
> Also, this *seems* to affect users in the same LAN more. The 2 groups of affected users I worked closely with regarding this issue were both connected to the same WiFi router.

So, it sounds like it's happening with connections that are (A) faster and (B) shipping around more traffic.  Smells like a possible race condition - and maybe not in Twisted.  Is it possible for you to test with any other OS / kernel variations, to see if it behaves differently on other Linux versions, ideally some with changes to epoll?

I don't know epoll quite well enough to read the strace and tell what's going on.  If those are the only calls being traced... there's no gettimeofday or send or recv calls?  That suggests it's just thrashing, but if your workaround works, other stuff must be going on.  Hmm.  I guess I don't have a clear enough picture for any useful conjecture yet :).  Can you open a bug, and attach a more complete strace, assuming that the one with just the epoll calls was filtered?

-glyph

-------------- next part --------------
An HTML attachment was scrubbed...
URL: </pipermail/twisted-python/attachments/20130225/3b05c0ba/attachment-0001.html>


More information about the Twisted-Python mailing list