[Dxspider-support] Followup - Hang hang and some more hang...

Kelly Jones kjones at sullivan1.com
Sun Nov 13 17:49:43 GMT 2005


As a followup -

I am able to make this happen on demand.  I have a "load tester" program 
that was written when I was having issues with CLX several years ago (my 
reason for adopting DXSpider).  If I load up about 8 "robot" users in 
addition to the already connected nodes/users, I can make it so nobody else 
can log in.  As soon as I stop the robots, I am able to initiate new logins.
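
For anyone who wants to try reproducing this, a load test of this kind could 
be sketched roughly as below.  This is not the actual load tester program, 
only a minimal illustration.  It assumes the node accepts plain telnet-style 
TCP logins and sends a callsign prompt first.  The function names, the robot 
callsigns and the "sh/dx" command used to generate traffic are all 
assumptions made for the example.

```python
import socket
import threading
import time


def robot(host, port, callsign, stop, interval=1.0):
    """One hypothetical robot user: log in, then keep sending a command."""
    try:
        with socket.create_connection((host, port), timeout=10) as sock:
            sock.sendall(callsign.encode("ascii") + b"\r\n")  # answer the prompt
            sock.settimeout(0.2)
            while not stop.is_set():
                sock.sendall(b"sh/dx\r\n")  # assumed command; any traffic will do
                try:
                    if not sock.recv(4096):  # empty read: node closed the link
                        break
                except socket.timeout:
                    pass  # nothing to read right now
                stop.wait(interval)
    except OSError:
        pass  # connection refused or dropped; this robot simply ends


def start_robots(host, port, count, stop):
    """Start `count` robot threads and return them."""
    threads = []
    for i in range(count):
        call = f"R0BOT-{i + 1}"  # made-up callsigns for the sketch
        t = threading.Thread(target=robot, args=(host, port, call, stop),
                             daemon=True)
        t.start()
        threads.append(t)
    return threads


def probe_login(host, port, callsign, timeout=60.0):
    """Seconds until the node answers a new login, or None if it never does."""
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            sock.recv(4096)  # wait for the callsign prompt
            start = time.monotonic()
            sock.sendall(callsign.encode("ascii") + b"\r\n")
            reply = sock.recv(4096)  # times out if the node has hung
            return (time.monotonic() - start) if reply else None
    except OSError:
        return None
```

To use it, start about 8 robots with start_robots(host, port, 8, stop), 
where stop is a threading.Event, then call probe_login() with a spare 
callsign.  A result of None would match the symptom described here, and 
calling stop.set() ends the robots so the probe can be repeated.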

This is obviously a load issue.  Because I typically have 50-75 
users and 25 node connections, I suspect I see a bit more load/traffic than 
the average cluster.  The unusual thing is that the box's CPU is hardly 
breathing, so it's not a maxed-out CPU problem.  It's as though something 
within Perl/Spider can't keep up with the amount of data being 
sent/received.  If anybody is game, I can try to "load up" your node with 
"users" and see how your cluster handles things.

I'm not sure where to go from here.  Dirk, ideas?

Kelly - N0VD



At 08:14 AM 11/13/2005, you wrote:
>As I came down to the shack this morning, I was greeted with the same 
>symptoms described below.  In less than 12 hours I was no longer able to 
>log into my node.  As a test, last night I started the node from the 
>command line instead of as a Windows service.  As soon as I hit Ctrl-C, 
>my echo was printed on the screen, then the node proceeded to shut down - I 
>found this interesting.  It's as though some buffer somewhere fills up 
>then just stops responding.
>
>I received a private email from another sysop that said he started seeing 
>the exact same symptoms during the CQWW SSB contest.  Something *is* going 
>on here, but I don't know what it is.  I do know I can't keep recycling my 
>node every 12 hours.....
>
>Anybody else seeing this?
>
>Kelly
>
>
>
>At 06:21 PM 11/12/2005, you wrote:
>>This is starting to irritate me....
>>
>>It seems that once I get above 40 or so users something stops 
>>working.  The scenario is that I connect to the node, it asks for my 
>>call, I type it in, hit enter... nothing.  It just sits there for the 60 
>>seconds then disconnects me.
>>
>>If I am a user that is already connected, the system is as snappy as can 
>>be.  It only happens to new, incoming connections.  At this moment I have 
>>41 users and 19 nodes.  If I try to connect either as a node or user, I 
>>get the above scenario.
>>
>>This started happening a couple of weeks ago while still running an older 
>>version of Spider on Linux.  I was hoping that it would go away after I 
>>switched to Windows with the latest and greatest of Spider and ActiveState 
>>Perl, but apparently not.  Has anybody seen this before or have a 
>>suggestion as to where to look for the problem?  Something that is 
>>interesting is that when it happens, I no longer get an echo back from 
>>the system.  When it's working, the echo returns every keystroke.  The 
>>only way I have found to "fix" it is to recycle the cluster and wait for 
>>it to happen again...
>>
>>Kelly - N0VD
>>
>>
>>
>>
>>_______________________________________________
>>Dxspider-support mailing list
>>Dxspider-support at dxcluster.org
>>http://mailman.tobit.co.uk/mailman/listinfo/dxspider-support
>
>
>
>
