[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [FW-1] Solaris 2.7, Sun420R cpu panic




Petra,

I saw similar problems on our 420R with dual CPU's.  We do not have HA configured.
In my instance this problem was isolated to a Solaris Issue.  I upgraded the PROM and loaded the most current OS fixes and the problem went away.

Thanks,
Phillip

Phillip Blatzheim
ATOFINA Petrochemical, Inc.
Information Technology -- Network Services

P:
F:
Email: [email protected]

My candle burns at both ends;
 It will not last the night;
But ah, my foes, and oh, my friend --
 It gives a lovely light!
         Edna St. Vincent Millay



Petra Klein <[email protected]>

12/09/2002 09:48 AM
Please respond to Mailing list for discussion of Firewall-1        

       
        To:        [email protected]
        cc:        
        Subject:        [FW-1] Solaris 2.7, Sun420R cpu panic



Hi,

We have a pair of Sun420R DualCPU who has started to coredump and reboots:

Sun Enterprise 420R (2 X UltraSPARC-II 450MHz)
Sun Solaris 5.7 106541-22
Checkpoint Firewall-1 4.1 SP-6
Stonebeat Fullcluster 2.0 (Build 2035) revision 05-03

unix: BAD TRAP: cpu=2 type=0x31 rp=0x4029f5f0 addr=0x8 mmu_fsr=0x0
unix: BAD TRAP occurred in module "ip" due to a NULL pointer dereference.
unix: trap type = 0x31
unix: addr=0x8
unix: pid=379, pc=0x1015e838, sp=0x4029f680, tstate=0x80001e02, context=0x8a7
unix: g1-g7: 0, 0, 10000, 360, 0, 0, 71a65ce0
unix: Begin traceback... sp = 4029f680
unix: Called from 10037db0, fp=4029f6e0, args=71af86f8 71db89a0 71cffa20 71b0c2c8 e0 70044b78
unix: Called from 101f4874, fp=4029f740, args=71b0c2c8 71db89a00 1014dd74 700e1f88
unix: Called from 10037d20, fp=4029f7a8, args=71b0d978 71db89a0 5490 10001 0 71b0d9d8
unix: Called from 100bf404, fp=4029f830, args=21e6 71af86f8 0 719321e4 0
unix: Called from 101dc714, fp=4029fad0, args=cfc7c 703b9f80 719321dc 4029fc7c 100003
unix: Called from 10086de0, fp=4029fbb0, args=71b032ec 5000 5303 4029fc7c 71b032a8 c0086914
unix: Called from 10036038, fp=4029fc80, args=b c0086914 ffbef7e0 1 700e85a0 0
unix: Called from 25740, fp=ffbee6d8, args=b c0086914 ffbef7e0 1 0 0
unix: End traceback...
unix: panic[cpu2]/thread=71a65ce0:
unix: trap
unix:
unix: syncing file systems...
unix:  83
unix:  done
unix: dumping to /dev/dsk/c0t0d0s1, offset 107741184
unix: ^M100% done: 10775 pages dumped, compression ratio 2.86,
unix: dump succeeded


We changed the CPU2 on both machines and now it happened again so it does not
seem to be a hardwareproblem....

This happened today;
--------------------------------

unix: kernel memory allocator:
unix: invalid free: buffer not in cache
unix: buffer=71583b20  bufctl=0  cache: streams_dblk_1064
unix: panic[cpu2]/thread=40007e60:
unix: kernel heap corruption detected
unix:
unix: syncing file systems...
unix:  4
unix:  1
unix:  done
unix: dumping to /dev/dsk/c0t0d0s1, offset 107741184

----------------------------------

There has been different reasons;
recursive mutex_enter, trap and kernel heap corruption detected.

We have another machine set up exactly the same as these ones and that
mahine has no problems with this.

There is a known issue with Firewall-1 before SP-6 but we have upgraded
to Firewall-1 SP-6

From Firewall-1 SP-6 release notes;

Fixes and Improvements
Firewall-1

"A Stonebeat configuration on a dual CPU machine may cause a system panic"

Has anyone experienced similar problems?


Regards
Petra Klein

=================================================
To set vacation, Out Of Office, or away messages,
send an email to [email protected]
in the BODY of the email add:
set fw-1-mailinglist nomail
=================================================
To unsubscribe from this mailing list,
please see the instructions at
http://www.checkpoint.com/services/mailing.html
=================================================
If you have any questions on how to change your
subscription options, email
[email protected]
=================================================