[Dxspider-support] Cluster Hang

Mike Lewis mlewis at digitalglobe.com
Wed Oct 31 21:23:37 CET 2007


Thanks to everyone who sent me a reply. This has been a NAGGING issue for a while. I FINALLY solved it on the eve of the CQDX WW SSB contest, and was able to stay connected to my cluster throughout the contest. I am posting my findings just in case someone else has an issue like this. It was NOT a Spider issue.

The problem was some software that was installed on both of the PC laptops I traditionally use to remote connect to my Linux system. I sometimes log into our work network from home using my laptops. Therefore I have some Cisco VPN software installed. It turns out this software was the problem. It would drop the connection to the spider computer after a certain amount of inactivity. This would happen even if I was not using the VPN software to connect to my workplace. Turns out the Cisco VPN software runs a service at boot time on the Windows PC that is the culprit. Once I disabled that, the link was completely bulletproof. Thinking back after the fact, the time period during which I first had this installed coincides with the start of my troubles.

So if anyone is experiencing problems keeping a remote session alive to a spider system from another Windows box, it just might be your VPN software!



----------------------------------------------------
This mailbox protected from junk email by MailFrontier Desktop
from MailFrontier, Inc. http://info.mailfrontier.com
 

> -----Original Message-----
> From: dxspider-support-bounces at dxcluster.org 
> [mailto:dxspider-support-bounces at dxcluster.org] On Behalf Of 
> Bela Markus
> Sent: Thursday, October 25, 2007 12:33 AM
> To: The DXSpider Support list
> Subject: Re: [Dxspider-support] Cluster Hang
> 
> Hi Mike,
> 
> the leading number is the standard UNIX time interpreted as 
> seconds elapsed from January 1st, 1970.
> 
> I don't think your issue is related to SUSE. The very high 
> CPU load usually caused by corrupted dupe and/or user file, 
> same happened to me also after a migration to new hardware. 
> Delete dupe first and restart spider. If you are changing 
> distribution, for server I strongly advice CentOS.
> 
> Yes, a chat.
> 
> Regards... Béla
> 
> 
> Mike Lewis írta:
> >
> > I am still having all kinds of problems getting my clkuster to run 
> > reliably, mostly since switching to running on a SuSE 10.2 
> distro. I 
> > have had problems with my local telnet sessions hanging, 
> but now have 
> > an even bigger problem where the cluster itself seems to hang. I 
> > recently (within the last week) loaded a new build of DXSpider (it 
> > shows as V1.54 build 0.172).
> >
> > Here is an excerpt from the log showing the last entries 
> prior to the 
> > hang. I had logged on, composed a message to a friend, 
> logged out and 
> > then later back on, and then left myself connected. I had been 
> > expanding the usdbraw file to add state info, but had not 
> yet run the 
> > load/usdb command. When I came back to the system, the 
> terminal that I 
> > had run the client program in was not returning any prompts. top 
> > showed the cluster.pl using a consistent 95% or more of the cpu.
> > killing the client and re-running did not connect to the cluster. I 
> > had to manually kill the cluster.pl instance.
> >
> > Log file:
> >
> > 1193281900^ann^ALL^IZ7AUH-6^IZ7AUH-6 DX CLUSTER -> 
> dx.iz7auh.net port 
> > 8000 1193282956^msg^msg 1 from KE0MF to KB0TVH stored 
> > 1193283206^DXCommand^KE0MF disconnected 1193283284^DXCommand^KE0MF 
> > connected from 127.0.0.1 1193284847^ann^ALL^PA4JJ-2^dx 
> cluster telnet 
> > pa4jj-no-ip.org port 8000
> > 1193285786^chat^MW^SM3BEI^#49 vaken!
> >
> > A few questions:
> >
> > What is the way to interpret the leading number (I am 
> assuming it is a 
> > time stamp of some kind?) on each line of the log. Is it 
> possible to 
> > determine from the log the amount of time between entries?
> >
> > What is the last entry telling me? is this some sort of a 
> chat request 
> > to my node?
> >
> >
> > If there is anyone out there with DXSpider experience on SuSE 
> > (preferably with a relatively new version) maybe they can 
> help me. I 
> > had this running on a Debian distro on an older box with less 
> > problems. I guess I could scrap SuSE and try going back to a Debian 
> > (or some other Linux) install, but I have other unrelated 
> reasons for 
> > wanting to keep this system running SuSE.
> >
> > ML
> >
> >
> >
> >
> > --
> > This message has been scanned for viruses and dangerous content by 
> > *MailScanner* <http://www.mailscanner.info/>, and is believed to be 
> > clean.
> > 
> ----------------------------------------------------------------------
> > --
> >
> > _______________________________________________
> > Dxspider-support mailing list
> > Dxspider-support at dxcluster.org
> > http://mailman.tobit.co.uk/mailman/listinfo/dxspider-support
> >   
> 
> 
> _______________________________________________
> Dxspider-support mailing list
> Dxspider-support at dxcluster.org
> http://mailman.tobit.co.uk/mailman/listinfo/dxspider-support
> 



More information about the Dxspider-support mailing list