new small memory argus and clients

Peter Van Epp vanepp at sfu.ca
Mon Aug 27 17:40:40 EDT 2007


On Mon, Aug 27, 2007 at 03:34:42PM -0400, Carter Bullard wrote:
> Well, lets go back to small.rc.2 and I'll incrementally add back  
> functionality, now
> that I have it integrated.  Keep running small.rc.2 as you can't have  
> enough testing.
> 
> Carter
> 

	Maybe we can have too much testing :-). Looks like link busyness has
started to grow as has small.rc.2 memory usage:

hcids:/scratch # !psps auxwwww | grep argus
root     24056  8.0 23.2 1029332 913984 pts/1  SLl  12:54   4:46 argus -P 560 -i eth0 -i eth1 -m -F /scratch/argus.conf
root     24234  0.0  0.0   3132   832 pts/1    S+   13:54   0:00 grep argus

	An clients .rc.49 listening to it became unhappy a while back:

ra3[25417.01804400]: 07-08-27 13:33:24 ArgusCalloc (1, 716) returning 0x180f000
ra3[25417.01804400]: 07-08-27 13:33:24 ArgusCopyRecordStruct (0x3091803) seq 106689
ra3[25417.01804400]: 07-08-27 13:33:24 ArgusCalloc (1, 716) returning 0x180f400
ra3[25417.01804400]: 07-08-27 13:33:24 ArgusCopyRecordStruct (0x3091803) seq 106729
ra3[25417.01804400]: 07-08-27 13:33:24 ArgusCalloc (1, 716) returning 0x180f800
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusClientTimeout()
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1812c00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1813000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1813400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1813800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1813c00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1814000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1814400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1814800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1814c00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1815000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1815400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1815800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1815c00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1816000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1816400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1816800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1816c00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1817000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1817400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1817800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1817c00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1818000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1818400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1818800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1818c00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1819000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1819400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1819800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1819c00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181a000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181a400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181a800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181ac00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181b000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181b400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181b800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181bc00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181c000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181c400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181c800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181cc00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181d000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181d400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181d800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181dc00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181e000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181e400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181e800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181ec00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181f000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181f400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181f800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x181fc00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1820000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1820400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1820800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1820c00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1821000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1821400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1821800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1821c00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1822000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1822400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1822800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1822c00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1823000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1823400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1823800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1823c00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1824000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1824400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1824800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1824c00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1825000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1825400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1825800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1825c00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1826000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1826400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1826800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1826c00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1827000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1827400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1827800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1827c00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1828000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x1828400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x180e400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x180e800)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x180ec00)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x180f000)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x180f400)
ra3[25417.01804000]: 07-08-27 13:33:24 ArgusFree (0x180f800)
ra3[25417.01804000]: 07-08-27 13:33:25 ArgusClientTimeout()
ra3[25417.01804000]: 07-08-27 13:33:26 ArgusClientTimeout()
...
ra3[25417.01804000]: 07-08-27 13:35:24 ArgusClientTimeout()
ra3[25417]: 07-08-27 13:35:25 ArgusReadStream 192.75.244.191: idle stream: closing
ra3[25417.01804400]: 07-08-27 13:35:25 ArgusCloseInput(0x405000) closing
ra3[25417.01804400]: 07-08-27 13:35:25 ArgusWriteConnection: write(4, 0x7f7d0, 6)
ra3[25417.01804400]: 07-08-27 13:35:25 ArgusWriteConnection(0x405000, 0x7f7d0, 6) returning 6
ra3[25417.01804400]: 07-08-27 13:35:25 ArgusFree (0x605000)
ra3[25417.01804400]: 07-08-27 13:35:25 ArgusFree (0x2ba000)
ra3[25417.01804400]: 07-08-27 13:35:25 ArgusCloseInput(0x405000) done
ra3[25417.01804400]: 07-08-27 13:35:25 ArgusConnectRemote(0x405000) ArgusReadStream returned -1
ra3[25417.01804400]: 07-08-27 13:35:25 ArgusConnectRemote() done!
ra3[25417.a000ed88]: 07-08-27 13:35:25 ArgusShutDown (0)
ra3[25417.a000ed88]: 07-08-27 13:35:25 main () exiting
ra3[25417.a000ed88]: 07-08-27 13:35:25 ArgusFree (0x500100)
ra3[25417.a000ed88]: 07-08-27 13:35:25 ArgusDeleteList (0x500100, 4) returning
ra3[25417.a000ed88]: 07-08-27 13:35:25 ArgusFree (0x500170)
ra3[25417.a000ed88]: 07-08-27 13:35:25 ArgusDeleteList (0x500170, 4) returning
ra3[25417.a000ed88]: 07-08-27 13:35:25 ArgusFree (0x5001e0)
ra3[25417.a000ed88]: 07-08-27 13:35:25 ArgusDeleteQueue (0x5001e0) returning
ra3[25417.a000ed88]: 07-08-27 13:35:25 ArgusFree (0x500220)
ra3[25417.a000ed88]: 07-08-27 13:35:25 ArgusDeleteQueue (0x500220) returning
[25417.a000ed88]:  ArgusFree (0x500260)

	Doing a restart on the client hasn't worked either so it looks like
argus side is in trouble:

#!/bin/sh

(/usr/local/bin/ra3 -S 192.75.244.191:560 -n -D4 -w /var/log/argus/com_argus  >> /var/log/argus.logs/com_ra3.log 2>&1) &


ra3[25521.a000ed88]: 07-08-27 14:00:48 ArgusCalloc (1, 100) returning 0x5003c0
ra3[25521.a000ed88]: 07-08-27 14:00:48 ArgusNewList () returning 0x5003c0
ra3[25521.a000ed88]: 07-08-27 14:00:48 ArgusCalloc (1, 156) returning 0x500460
ra3[25521.a000ed88]: 07-08-27 14:00:48 main: reading files completed
ra3[25521.01804400]: 07-08-27 14:00:48 ArgusConnectRemote(0x405000) starting
ra3[25521.01804400]: 07-08-27 14:00:48 ArgusCalloc (1, 60) returning 0x500540
ra3[25521.01804400]: 07-08-27 14:00:48 ArgusNewQueue () returning 0x500540
ra3[25521.01804400]: 07-08-27 14:00:48 Trying 192.75.244.191 port 560 Expecting Argus records
ra3[25521.01804400]: 07-08-27 14:00:48 connected
ra3[25521.01804400]: 07-08-27 14:00:48 ArgusGetServerSocket (0x405000) returning 4
ra3[25521.01804000]: 07-08-27 14:00:49 ArgusClientTimeout()
ra3[25521.01804000]: 07-08-27 14:00:50 ArgusClientTimeout()
ra3[25521.01804000]: 07-08-27 14:00:51 ArgusClientTimeout()
ra3[25521.01804000]: 07-08-27 14:00:52 ArgusClientTimeout()
ra3[25521.01804000]: 07-08-27 14:00:53 ArgusClientTimeout()
...

	No problem indication in the server side /var/log/messages:

Aug 27 12:54:35 hcids kernel: RING: succesfully allocated 0 KB [tot_mem=12664896][order=12]
Aug 27 12:54:35 hcids kernel: RING: allocated 10851 slots [slot_len=1546][tot_mem=16777216]
Aug 27 12:54:35 hcids kernel: RING: succesfully allocated 0 KB [tot_mem=12664896][order=12]
Aug 27 12:54:35 hcids kernel: RING: allocated 10851 slots [slot_len=1546][tot_mem=16777216]
Aug 27 13:30:19 hcids syslog-ng[3231]: STATS: dropped 0

	but the argus task is hung and not killable at the moment:

hcids:/scratch # kill -HUP 24056
hcids:/scratch # !ps
ps auxwwww | grep argus
root     24056  7.7 35.4 1488084 1394468 pts/1 SLl  12:54   5:37 argus -P 560 -i eth0 -i eth1 -m -F /scratch/argus.conf
root     24294  0.0  0.0   3132   832 pts/1    S+   14:07   0:00 grep argus

...
hcids:/scratch # kill -INT 24056 
hcids:/scratch # !ps
ps auxwwww | grep argus
root     24056  5.4 35.4 1488084 1394468 pts/1 SLl  12:54   5:37 argus -P 560 -i eth0 -i eth1 -m -F /scratch/argus.conf
root     24422  0.0  0.0   3132   832 pts/1    S+   14:38   0:00 grep argus

	And nothing in the log file either:

cat debug.log
  ArgusWarning: argus[24056.0000040000026f50]: (null) started
  ArgusWarning: argus[24056.0000040000026f50]: 09 Jul 78 11:04:32.210396 ArgusGetInterfaceStatus: interface eth1 is up
  ArgusWarning: argus[24056.0000040000026f50]: 09 Jul 78 11:04:32.459432 ArgusGetInterfaceStatus: interface eth0 is up
     ArgusInfo: argus[24056.0000040002b98230]: 31 Dec 69 20:33:04.314032 connect from test4.ucs.sfu.ca

Peter Van Epp / Operations and Technical Support 
Simon Fraser University, Burnaby, B.C. Canada



More information about the argus mailing list