[Dxspider-support] Node crashed at 18:04
djk
djk at tobit.co.uk
Sun Mar 23 23:07:56 GMT 2025
There is a standard limit of 1024 files open at once per process. Every
connected user's socket counts against that limit, so 900+ users plus the
node's own files will get you there. You can change this in a shell with
'ulimit -n 2048' (for example). There is also a way of changing it system
wide in systemd (<spit>) but you'll have to research that yourself, or
start the node in a shell script like:
#!/bin/sh
ulimit -n 2048
/spider/perl/cluster.pl
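For the systemd route, the directive that corresponds to 'ulimit -n' is
LimitNOFILE= in the [Service] section of the unit. The following is only a
sketch under assumptions: the unit name dxspider.service and the drop-in
file name limits.conf are hypothetical, so substitute whatever your node is
actually started by. It just prints the drop-in text; the install steps are
left as comments.

```shell
#!/bin/sh
# Sketch only: print a systemd drop-in that raises the open-file limit.
# UNIT is a hypothetical unit name, not something DXSpider is known to ship.
UNIT="${UNIT:-dxspider.service}"
LIMIT="${LIMIT:-2048}"

make_dropin() {
    # LimitNOFILE= is systemd's equivalent of 'ulimit -n' for a service
    printf '[Service]\nLimitNOFILE=%s\n' "$1"
}

make_dropin "$LIMIT"

# To install it (as root), something along these lines:
#   mkdir -p "/etc/systemd/system/$UNIT.d"
#   make_dropin "$LIMIT" > "/etc/systemd/system/$UNIT.d/limits.conf"
#   systemctl daemon-reload && systemctl restart "$UNIT"
```

The wrapper script above does the same job without touching systemd, which
may be simpler if the node is started by hand or from cron.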
Personally, I think 900+ users on a 4GB RPi x is going some, especially
considering the power required to run some Windows cluster software (which
then still doesn't keep up).
What does your 'top' say when you are running it at this sort of usage?
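Alongside 'top', it may help to see how close the process is to its
open-file limit. A minimal sketch, assuming Linux with /proc mounted; the
function names fd_count and fd_limit are made up for illustration:

```shell
#!/bin/sh
# Sketch: compare a process's open file descriptors with its soft limit.
# Relies on the Linux /proc filesystem. Usage: sh fdcheck.sh [PID]

fd_count() {
    # each entry in /proc/PID/fd is one open descriptor (files and sockets)
    ls "/proc/$1/fd" 2>/dev/null | wc -l
}

fd_limit() {
    # the soft limit is the 4th field of the "Max open files" line
    awk '/^Max open files/ { print $4 }' "/proc/$1/limits"
}

# default to the current shell if no PID is given
pid="${1:-$$}"
echo "pid $pid: $(fd_count "$pid") open of $(fd_limit "$pid") allowed"
```

Pass it the PID of the running cluster.pl, found with something like
'pgrep -f cluster.pl' (assuming that pattern matches only the node). You
will need to run it as the same user as the node, or as root, to read
another process's fd directory.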
Dirk G1TLH
On 23/03/2025 18:54, Keith, G6NHU via Dxspider-support wrote:
> I suppose this really is for Dirk.
>
> This has never happened before - I came into the shack with a freshly
> poured shackbeer and noticed my ssh session had closed so I logged
> back in and saw my uptime was just 21 minutes.
>
> Checking the debug log (attached as a .zip), this is what happened in
> the same timestamp with the actual error that caused the crash being
> at the end.
>
> My cluster is running on a Pi5 with 4GB RAM and an external Samsung
> SSD. I don’t know the exact number of connected users but when I
> logged back on, there were 938 so I’d imagine the number prior to the
> crash was around the same. The node had been up for about a month.
>
> “Too many open files” ?
>
> 73 Keith
>
> 1742753087^###
> 1742753087^### RINGBUFFER END 501 debug lines written
> 1742753087^###
> 1742753087^(trace) writing /spider/local_data/rbn_cache Too many open
> files
> 1742753087^(trace) Stack (2): RBN::DXDebug::confess in
> /spider/perl/RBN.pm line: 926
> 1742753087^(trace) Stack (3): RBN::RBN::write_cache in
> /spider/perl/RBN.pm line: 871
> 1742753087^(trace) Stack (4): main::RBN::per_minute in
> /spider/perl/cluster.pl line: 892
> 1742753087^(*) DXSpider Ceasing
> 1742753087^(*) DXQSL finished
> 1742753087^(*) RBN:WRITE_CACHE size: 423.804KB time to write: 26 mS
> 1742753087^(*) DXDupe finishing
> 1742753087^(*) DXUser finished
> 1742753087^(cluster) DXSpider v1.57 build 568 (git: mojo/0920a333[r])
> using perl v5.36.0 on linux ended
> 1742753087^(*) bye bye everyone - bye bye
> 1742753087^###
> 1742753087^### RINGBUFFER START at line 0 (zero base)
> 1742753087^###
>
> Then it repeats
>
> 1742753087^RING: 18:04:47^(trace) writing /spider/local_data/rbn_cache
> Too many open files
> 1742753087^RING: 18:04:47^(trace) Stack (2): RBN::DXDebug::confess in
> /spider/perl/RBN.pm line: 926
> 1742753087^RING: 18:04:47^(trace) Stack (3): RBN::RBN::write_cache in
> /spider/perl/RBN.pm line: 871
> 1742753087^RING: 18:04:47^(trace) Stack (4): main::RBN::per_minute in
> /spider/perl/cluster.pl line: 892
> 1742753087^RING: 18:04:47^(*) DXSpider Ceasing
> 1742753087^RING: 18:04:47^(*) DXQSL finished
> 1742753087^RING: 18:04:47^(*) RBN:WRITE_CACHE size: 423.804KB time to
> write: 26 mS
> 1742753087^RING: 18:04:47^(*) DXDupe finishing
> 1742753087^RING: 18:04:47^(*) DXUser finished
> 1742753087^RING: 18:04:47^(cluster) DXSpider v1.57 build 568 (git:
> mojo/0920a333[r]) using perl v5.36.0 on linux ended
> 1742753087^RING: 18:04:47^(*) bye bye everyone - bye bye
> 1742753087^###
> 1742753087^### RINGBUFFER END 501 debug lines written
> 1742753087^###
>