[Dxspider-support] Weird node issue that prevents user logins

Dirk Koopman G1TLH gb7tlh at dxcluster.org
Thu Oct 8 22:55:46 BST 2009


Brendan

What version of perl are you using and is it the same on both boxes? 
What does the other box say was passing through at the time? You have 
done the 'perl user_asc' thing and regenerated your user file?

Could it be hardware?

Google has recently published a rather worrying picture of high DRAM 
error rates, which seem to be related to motherboard design.

It is a source of considerable concern to me that many machines are 
totally reliable (GB7DJK springs to mind) while a few fall over rather 
frequently, usually corrupting the user file on the way. It appears that 
with perl 5.10 things have become noticeably worse, and I can't say that 
I recommend it at the moment. But some people don't have a choice.

DB_File together with Berkeley DB is, frankly, a pile of sh*t*, but the 
alternatives are either just as bad (but in different ways) or require 
extra packages which are usually difficult to find for Windows users.

It is starting to become a bigger worry.

Dirk

Brendan Minish wrote:
> Well the issue continues. 
> 
> Firstly, the 'stuck' cp processes I reported a few days ago were
> down to an error in a cron job that has now been rectified, and they
> have no bearing on the underlying issue.
> 
> However the node EI7SDX still hangs, preventing user logins and freezing
> existing connections.
> 
> The perl process is still running and is not consuming excessive
> resources; it is just not doing any network I/O and will not process
> connections, even from the local console.
> 
> The time that the error occurs does not coincide with any scheduled cron
> job, any activity at all on the web server or any issues logged in the
> main system log  or in /var/log/secure (which I was tailing) 
> 
> I have some debugging turned on and managed to capture the following
> just as it froze.
> 
> The base machine configuration is nearly identical to the setup I am
> running with no issues at EI7MRE  
> 
> 
> Any ideas? This is getting a tad tiresome ;-) 
> 
> 1254934441^<- I CX2SA-6 PC41^ZS1A^2^Cape Town^H25^~
> 1254934441^<- I CX2SA-6 PC41^ZS1A^3^33 51 S 18 38 E^H26^~
> 1254934441^<- I CX2SA-6 PC41^ZS1A^3^33 51 S 18 37 E^H24^~
> 1254934441^<- I CX2SA-6 PC41^ZS1A^3^33 51 S 18 38 E^H24^~
> 1254934441^<- I CX2SA-6 PC41^ZS1A^1^Johan^H24^~
> 1254934441^<- I CX2SA-6 PC41^ZS1A^2^Cape Town^H25^~
> 1254934442^<- I CX2SA-6 PC41^ZS1A^3^33 51 S 18 37 E^H24^~
> 1254934442^<- I EI7MRE PC11^7070.0^HB0/OE9SDV^ 7-Oct-2009^1653Z^ ^EA5BRE^EA5URA-5^H46^~
> 1254934442^-> D OE3GCU DX de EA5BRE:     7070.0  HB0/OE9SDV                                  1653Z IM98%07%07
> 1254934442^-> D EI3GU DX de EA5BRE:     7070.0  HB0/OE9SDV                                  1653Z IM98%07%07
> 1254934442^-> D EI6FR DX de EA5BRE:     7070.0  HB0/OE9SDV                                  1653Z IM98
> 1254934442^-> D F5MZN-3 PC11^7070.0^HB0/OE9SDV^ 7-Oct-2009^1653Z^ ^EA5BRE^EA5URA-5^H45^~
> 1254934442^-> D EI7WDX PC11^7070.0^HB0/OE9SDV^ 7-Oct-2009^1653Z^ ^EA5BRE^EA5URA-5^H45^~
> 1254934442^-> D OZ5BBS-7 PC11^7070.0^HB0/OE9SDV^ 7-Oct-2009^1653Z^ ^EA5BRE^EA5URA-5^H45^~
> 1254934442^-> D ON4KST DX de EA5BRE:     7070.0  HB0/OE9SDV                                  1653Z IM98%07%07
> 1254934442^-> D GI0KOW DX de EA5BRE:     7070.0  HB0/OE9SDV                                  1653Z IM98%07%07
> 1254934442^-> D EI6IZ DX de EA5BRE:     7070.0  HB0/OE9SDV                                 1653Z IM98
> 1254934442^-> D K2UT PC11^7070.0^HB0/OE9SDV^ 7-Oct-2009^1653Z^ ^EA5BRE^EA5URA-5^H1^~
> 1254934442^-> D CX2SA-6 PC11^7070.0^HB0/OE9SDV^ 7-Oct-2009^1653Z^ ^EA5BRE^EA5URA-5^H45^~
> 1254934442^-> D EI9JF DX de EA5BRE:     7070.0  HB0/OE9SDV                                  1653Z%07%07
> 1254934442^-> D 5B4FL DX de EA5BRE:     7070.0  HB0/OE9SDV                                  1653Z%07%07
> 1254934442^-> D GI4FUE DX de EA5BRE:     7070.0  HB0/OE9SDV                                  1653Z IM98%07%07
> 1254934442^<- I CX2SA-6 PC41^ZS1A^1^Johan^H21^~
> 1254934442^<- I EI7WDX PC11^7070.0^HB0/OE9SDV^ 7-Oct-2009^1653Z^ ^EA5BRE^EA5URA-5^H10^~
> 1254934442^<- I CX2SA-6 PC41^ZS1A^3^33 51 S 18 38 E^H24^~
> 1254934443^<- I CX2SA-6 PC41^ZS1A^2^Cape Town^H21^~
> 1254934443^<- I CX2SA-6 PC41^ZS1A^1^Johan^H20^~
> 
> 
> DX de DL5EAG: 10368100.0  CQ           RS JO21 CQ RS                 1653Z JO31
> DX de PA0DX:      1834.3  IK4WMA                                     1652Z
> DX de EA7HMC:     7075.0  EA7HMC       DIPLOMA 60AÑOS URE SEVILLA09 1653Z
> DX de K1NVY:     14001.5  E51NOU                                     1653Z
> DX de PA0DX:      1834.0  S50A                                       1653Z
> DX de EA5BRE:     7070.0  HB0/OE9SDV                                 1653Z IM98
> Connection closed by foreign host.
> 
> 
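To line the debug capture quoted above up against cron schedules and the system logs, it helps to turn the leading epoch seconds into wall-clock UTC. What follows is only a rough sketch in Python, not anything that ships with DXSpider. The field layout (epoch, '^', direction arrow, one-letter tag, callsign, then the rest) is inferred purely from the quoted lines, and the names DebugLine and parse_debug_line are made up for illustration.

```python
from datetime import datetime, timezone
from typing import NamedTuple, Optional


class DebugLine(NamedTuple):
    when: datetime   # leading epoch seconds, converted to UTC
    direction: str   # "<-" or "->", exactly as written in the capture
    kind: str        # one-letter tag seen in the capture ("I" or "D")
    peer: str        # callsign the line is attributed to, e.g. "CX2SA-6"
    payload: str     # everything after the callsign, left unparsed


def parse_debug_line(line: str) -> Optional[DebugLine]:
    """Parse one captured line; return None if it does not fit the assumed layout."""
    # Drop any mail quoting ("> ") and the trailing newline.
    line = line.lstrip("> ").rstrip("\n")
    stamp, sep, rest = line.partition("^")
    if not sep or not stamp.isdigit():
        return None
    parts = rest.split(" ", 3)
    if len(parts) < 4 or parts[0] not in ("<-", "->"):
        return None
    direction, kind, peer, payload = parts
    when = datetime.fromtimestamp(int(stamp), tz=timezone.utc)
    return DebugLine(when, direction, kind, peer, payload)


if __name__ == "__main__":
    sample = "> 1254934443^<- I CX2SA-6 PC41^ZS1A^1^Johan^H20^~"
    rec = parse_debug_line(sample)
    if rec is not None:
        print(rec.when.isoformat(), rec.direction, rec.peer, rec.payload)
```

Run over the capture, the first timestamp (1254934441) comes out as 16:54:01 UTC on 7 October 2009, which agrees with the 1653Z spot being relayed at the time. Plain telnet output such as the "DX de ..." lines has no leading epoch and is skipped.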



