Boot archive corruption with solaris on v490

We are running our Sybase production database on SunFire v490 with Solaris 10. We recently applied the cluster patch 141414 on the system. The root disk is encapsulated in raid1 with SVM with 3 metadb replicas on each disk.

After running for a week the system ran out of max number of processes that a user can fork out. We were unable to log into the system from the RSC console also. The system was frozen and as the last option the system had to be hard rebooted. Eventually, this led to the boot archive corruption and the system did not allowed us to boot from the default disk.

We had to boot failsafe mode, to recover from the boot archive failure. This also did not worked out, so we booted of from the cdrom and mounted the first disk in /a.

We commented out the rootdev section in /etc/system and also removing the svm disk from vfstab and replacing it with ctds format disk.

Also, we put a new parameter for maxusers in /etc/system as 4096, as the default value of 2048 was not sufficient.

echo "set maxusers = 4096" >> /etc/system

After that we booted of from the first disk and it worked. Then, we again encapsulated the disk with svm.

Comments

Popular posts from this blog

zpool and power path partial compatibility

Recoverpoint replication reporting with vmax splitter

Weird problem post power maintenance, no Powerpath pseudo name for Luns