[Dxspider-support] Weird node issue that prevents user logins
Dirk Koopman G1TLH
gb7tlh at dxcluster.org
Thu Oct 8 22:55:46 BST 2009
Brendan
What version of perl are you using and is it the same on both boxes?
What does the other box say was passing through at the time? You have
done the 'perl user_asc' thing and regenerated your user file?
Could it be hardware?
Google has lately come up with a rather worrying picture of rather high
DRAM error rates which seem to be related to M/B design.
It is a source of considerable concern to me that many machines are
totally reliable (GB7DJK springs to mind) and a few fall over rather
frequently. Usually corrupting the user file on the way. It appears that
with 5.10, things have become noticeably worse and I can't say that I
recommend it at the moment. But some people don't have a choice.
DB_File together with Berkeley DB is, frankly, a pile of sh*t*, but the
alternatives are either just as bad (but in different ways) or require
extra packages which usually difficult to find for Windows users.
It is starting to become a bigger worry.
Dirk
Brendan Minish wrote:
> Well the issue continues.
>
> firstly the 'stuck' cp processes issue I reported a few days ago were
> down to a an error with a cron job that has now been rectified and have
> no bearing on the underlying issue
>
> However the node EI7SDX still hangs, preventing user logins and freezing
> existing connections.
>
> the perl process is still running, is not consuming excessive resources,
> just not doing any network I/O and will not process connections, even
> the local console
>
> The time that the error occurs does not coincide with any scheduled cron
> job, any activity at all on the web server or any issues logged in the
> main system log or in /var/log/secure (which I was tailing)
>
> I have some debugging turned on and managed to capture the following
> just as it froze.
>
> The base machine configuration is nearly identical to the setup I am
> running with no issues at EI7MRE
>
>
> Any ideas? this is getting a tad tiresome ;-)
>
> 1254934441^<- I CX2SA-6 PC41^ZS1A^2^Cape Town^H25^~
> 1254934441^<- I CX2SA-6 PC41^ZS1A^3^33 51 S 18 38 E^H26^~
> 1254934441^<- I CX2SA-6 PC41^ZS1A^3^33 51 S 18 37 E^H24^~
> 1254934441^<- I CX2SA-6 PC41^ZS1A^3^33 51 S 18 38 E^H24^~
> 1254934441^<- I CX2SA-6 PC41^ZS1A^1^Johan^H24^~
> 1254934441^<- I CX2SA-6 PC41^ZS1A^2^Cape Town^H25^~
> 1254934442^<- I CX2SA-6 PC41^ZS1A^3^33 51 S 18 37 E^H24^~
> 1254934442^<- I EI7MRE PC11^7070.0^HB0/OE9SDV^ 7-Oct-2009^1653Z^ ^EA5BRE^EA5URA-5^H46^~
> 1254934442^-> D OE3GCU DX de EA5BRE: 7070.0 HB0/OE9SDV 1653Z IM98%07%07
> 1254934442^-> D EI3GU DX de EA5BRE: 7070.0 HB0/OE9SDV 1653Z IM98%07%07
> 1254934442^-> D EI6FR DX de EA5BRE: 7070.0 HB0/OE9SDV 1653Z IM98
> 1254934442^-> D F5MZN-3 PC11^7070.0^HB0/OE9SDV^ 7-Oct-2009^1653Z^ ^EA5BRE^EA5URA-5^H45^~
> 1254934442^-> D EI7WDX PC11^7070.0^HB0/OE9SDV^ 7-Oct-2009^1653Z^ ^EA5BRE^EA5URA-5^H45^~
> 1254934442^-> D OZ5BBS-7 PC11^7070.0^HB0/OE9SDV^ 7-Oct-2009^1653Z^ ^EA5BRE^EA5URA-5^H45^~
> 1254934442^-> D ON4KST DX de EA5BRE: 7070.0 HB0/OE9SDV 1653Z IM98%07%07
> 1254934442^-> D GI0KOW DX de EA5BRE: 7070.0 HB0/OE9SDV 1653Z IM98%07%07
> 1254934442^-> D EI6IZ DX de EA5BRE: 7070.0 HB0/OE9SDV 1653Z IM98
> 1254934442^-> D K2UT PC11^7070.0^HB0/OE9SDV^ 7-Oct-2009^1653Z^ ^EA5BRE^EA5URA-5^H1^~
> 1254934442^-> D CX2SA-6 PC11^7070.0^HB0/OE9SDV^ 7-Oct-2009^1653Z^ ^EA5BRE^EA5URA-5^H45^~
> 1254934442^-> D EI9JF DX de EA5BRE: 7070.0 HB0/OE9SDV 1653Z%07%07
> 1254934442^-> D 5B4FL DX de EA5BRE: 7070.0 HB0/OE9SDV 1653Z%07%07
> 1254934442^-> D GI4FUE DX de EA5BRE: 7070.0 HB0/OE9SDV 1653Z IM98%07%07
> 1254934442^<- I CX2SA-6 PC41^ZS1A^1^Johan^H21^~
> 1254934442^<- I EI7WDX PC11^7070.0^HB0/OE9SDV^ 7-Oct-2009^1653Z^ ^EA5BRE^EA5URA-5^H10^~
> 1254934442^<- I CX2SA-6 PC41^ZS1A^3^33 51 S 18 38 E^H24^~
> 1254934443^<- I CX2SA-6 PC41^ZS1A^2^Cape Town^H21^~
> 1254934443^<- I CX2SA-6 PC41^ZS1A^1^Johan^H20^~
>
>
> DX de DL5EAG: 10368100.0 CQ RS JO21 CQ RS 1653Z JO31
> DX de PA0DX: 1834.3 IK4WMA 1652Z
> DX de EA7HMC: 7075.0 EA7HMC DIPLOMA 60AÑOS URE SEVILLA09 1653Z
> DX de K1NVY: 14001.5 E51NOU 1653Z
> DX de PA0DX: 1834.0 S50A 1653Z
> DX de EA5BRE: 7070.0 HB0/OE9SDV 1653Z IM98
> Connection closed by foreign host.
>
>
More information about the Dxspider-support
mailing list