Solaris Troubleshooting : recover hung System Controller [ Sun Fire v1280, E2900, 3800, 4800, 4810, 6800, E4900, E6900, and Netra 1280, 1290 Server ]

When a system controller (SC) is hung, try a few steps before pressing the Reset button on the SC.

Step 1 . Try to Telnet or directly connect to the serial port of the “hung” SC, TIP into the platform shell, and use the “reboot” command.

Step 2.  If the “reboot” command does not work, or you cannot enter anything, log in to the spare SC and try to force a failover by using the “setfailover force” command.

Please note below points:

This step is not available on Sun Fire v1280, E2900, and Netra 1280, 1290 servers.
This step will probably not work if the primary SC is completely hung.
If this step does work, it will reboot the hung SC and make the spare SC the primary SC.

Step 3.  If failover does not complete, the LAST RESORT is to use the Reset button on the SC.

BEFORE YOU PRESS THIS BUTTON,  you must bring down the domains. Bringing down the domains is critical because there is a possibility that the domain will crash if the Reset button is pressed and the domains are up and running.

NOTE:- Make sure that connections setting are proper on SC.

Use a tip session onto the serial port of the SC:

6800a-sc0:SC> showplatform -p network
The system controller is configured to be on a network.
Network settings: static
Hostname: 6800a-sc0
IP Address: 129.156.xx.xx
Netmask: 255.255.255.0
Gateway: 129.156.xx.1
DNS Domain: UK.Sun.COM
Primary DNS Server: 129.156.xx.xx
Secondary DNS Server: 129.156.xx.xx
***Connection type: none    <—– No remote access enabled
Idle connection timeout : Notimeout
Sun Fire Link Enabled: no
*** This shows remote access via telnet or ssh is not enabled.

Running the command below, changes Connection type :

6800a-sc0:SC> setupplatform -p network

Network Configuration
Is the system controller on a network? [yes]:
Use DHCP or static network settings? [static]:
Hostname [6800a-sc0]:
IP Address [129.156.xx.xx]:
Netmask [255.255.255.0]:
Gateway [129.156.xx.1]:
DNS Domain [UK.Sun.COM]:
Primary DNS Server [129.156.xx.xx]:
Secondary DNS Server [129.156.xx.xx]:
**To enable remote access to the system controller, select “ssh” or “telnet”.
**Connection type (ssh, telnet, none) [telnet]:
Idle connection timeout (in minutes; 0 means no timeout) [0]:
Enable Sun Fire Link? [no]:

To enable remote access to the system controller, select either:

* ssh
* telnet

Rebooting the SC is required, for changes in the above network settings to take effect.


Ramdev

Ramdev

I have started unixadminschool.com ( aka gurkulindia.com) in 2009 as my own personal reference blog, and later sometime i have realized that my leanings might be helpful for other unixadmins if I manage my knowledge-base in more user friendly format. And the result is today's' unixadminschool.com. You can connect me at - https://www.linkedin.com/in/unixadminschool/

4 Responses

  1. Santosh says:

    Hi Ramdev,
    can you provide few examples for DR (Disaster recovery)? And also steps when node down, or server down, or system hang’s at OK prompt.
    Thank you!!!

  1. September 25, 2011

    IT Questions…

    […]gurkulindia.com » Solaris Troubleshooting : recover hung System Controller [ Sun Fire v1280, E2900, 3800, 4800, 4810, 6800, E4900, E6900, and Netra 1280, 1290 Server ][…]…

  2. September 16, 2015

    […] Read– System Controller Hung Issues […]

What is in your mind, about this post ? Leave a Reply

Close
  Our next learning article is ready, subscribe it in your email

What is your Learning Goal for Next Six Months ? Talk to us