Professional Documents
Culture Documents
failover SC
/opt/SUNWSMS/SMS1.5/bin/failover force
restart sms
/opt/SUNWSMS/SMS1.5/bin/failover stop
/etc/init.d/sms stop
/etc/init.d/sms start
/opt/SUNWSMS/SMS1.5/bin/failover start
setfailover off
/etc/init.d/sms stop
smsconfig -m
/etc/init.d/sms start
setfailover on
setfailover force
#must reboot each SC when this is completed
#may have to change /etc/hostname.dman0 on each domain & reconfig that NIC
#repeat on other SC
delete a board:
add a board:
see if a board cannot be moved (kernel)- 'cfgadm -alv | grep for permanent'
Can check number of cpus and memory via "psrinfo" and "prtconf | head -5" commands from each OS
instance. Can see all cpu boards on a 6800 via "showboards -p cpu" , can see all cpus in a
domain via "showboards -d a -p cpu" for domain a. Can see all memory by using "memory" in place
of "cpu" in the last commands.
Component Description
--------- -----------
/N0/SB0/P0 UltraSPARC-III, 750MHz, 8M ECache
/N0/SB0/P1 UltraSPARC-III, 750MHz, 8M ECache
/N0/SB0/P2 UltraSPARC-III, 750MHz, 8M ECache
/N0/SB0/P3 UltraSPARC-III, 750MHz, 8M ECache
/N0/SB2/P0 UltraSPARC-III, 750MHz, 8M ECache
/N0/SB2/P1 UltraSPARC-III, 750MHz, 8M ECache
/N0/SB2/P2 UltraSPARC-III, 750MHz, 8M ECache
/N0/SB2/P3 UltraSPARC-III, 750MHz, 8M ECache
Component Description
--------- -----------
/N0/SB1/P0 UltraSPARC-III, 750MHz, 8M ECache
/N0/SB1/P1 UltraSPARC-III, 750MHz, 8M ECache
/N0/SB1/P2 UltraSPARC-III, 750MHz, 8M ECache
/N0/SB1/P3 UltraSPARC-III, 750MHz, 8M ECache
/N0/SB3/P0 UltraSPARC-III, 750MHz, 8M ECache
/N0/SB3/P1 UltraSPARC-III, 750MHz, 8M ECache
/N0/SB3/P2 UltraSPARC-III, 750MHz, 8M ECache
/N0/SB3/P3 UltraSPARC-III, 750MHz, 8M ECache
showplatform
showboards -v -d A
showboards -d A
--------------------------------------------------------------------------------
poweron EX0 (May already be on)
poweron SB0
flashupdate -f /opt/SUNWSMS/hostobjs/sgcpu.flash SB0
You can run multiple flashes at the same time from multi windows to reduce
the amount of time it take( avg. 7Min/board ) - but they will all load the
system controller - the SC is the choke point - do not do too many at once.
Reconfigure server:
As the boards are replaced reconfigure server to only have 4 SB and bring
that domain up first.
On the SC run.
deleteboard SB4 SB5 SB6 SB7 SB8 SB9 SB10 SB11 SB12 SB13 SB14 SB15 SB16 SB17
This will leave SB0 SB1 SB2 and SB3 in the server domain.
Then:
cd ~/extended Note: This directory has a .postrc file set to level 32.
setkeyswitch -d server on
This will run a level 32 POST on the boards and boot the domain.
Reconfigure server:
On the SC run the following command after the SB have been Flashed.
addboard -d server -y -c configure SB4 SB5 SB6 SB7 SB8 SB9 SB10
cd ~/extended
setkeyswitch -d server on
Reconfigure server:
On the SC run the following command after the SB have been Flashed.
addboard -d server -y -c configure SB11 SB12 SB13 SB14 SB15 SB16 SB17
cd ~/extended
setkeyswitch -d server on
su - sms-svc
console -d server
To shutdown a domain:
init 0
poweron SB0
Note: you may have to power the extender board on before the SB.( poweron EX0)
================================================================================
----------------------------------------------------------------------
MAN Configuration Files
----------------------------------------------------------------------
/etc/opt/SUNWSMS/config/MAN.cf
1.
/etc/opt/SUNWSMS/config/MAN.cf
#
15k
C SC-FLOATER-C1 sc 192.68.66.172
C SC-TEST-hme0-C1 sc0-hme0
C SC-TEST-eri1-C1 sc0-eri1
I1 NM-I1 netmask-i1 255.255.255.224
I1 SC-I1 15k-sc-i1 13.1.1.1
I1 DA-I1 15k-a 13.1.1.2
I1 DB-I1 15k-b 13.1.1.3
I1 DC-I1 15k-c 13.1.1.4
I1 DD-I1 15k-d 13.1.1.5
I1 DE-I1 15k-e 13.1.1.6
I1 DF-I1 15k-f 13.1.1.7
I1 DG-I1 15k-g 13.1.1.8
I1 DH-I1 15k-h 13.1.1.9
I1 DI-I1 15k-i 13.1.1.10
I1 DJ-I1 15k-j 13.1.1.11
I1 DK-I1 15k-k 13.1.1.12
I1 DL-I1 15k-l 13.1.1.13
I1 DM-I1 15k-m 13.1.1.14
I1 DN-I1 15k-n 13.1.1.15
I1 DO-I1 15k-o 13.1.1.16
I1 DP-I1 15k-p 13.1.1.17
I1 DQ-I1 15k-q 13.1.1.18
I1 DR-I1 15k-r 13.1.1.19
I2 NM-I2 netmask-i2 255.255.255.252
I2 SC0-I2 sc0-i2 10.2.1.1
I2 SC1-I2 sc1-i2 10.2.1.2
/var/opt/SUNWSMS/data/<domain>/idprom.image
Example:
1.
sysid -d A
IDPROM in /var/opt/SUNWSMS/data/A/idprom.image for domain A
Format = 0x01
Machine Type = 0x82
Ethernet Address = 0:0:ae:a6:4a:42
Manufacturing Date = Wed Jun 27 14:58:00 MDT 2001
Serial number (Host ID) = 0xa64a42 (11029056)
Checksum = 0xac
/var/opt/SUNWSMS/doors/mand
Example:
# file /var/opt/SUNWSMS/doors/mand
/var/opt/SUNWSMS/doors/mand: door to mand[18356]
/etc/hosts
cat /etc/hosts
#
# Internet host table
#
127.0.0.1 localhost
192.68.66.148 sc0 loghost
13.1.1.2 15k-a #smsconfig-entry#
13.1.1.3 15k-b #smsconfig-entry#
13.1.1.4 15k-c #smsconfig-entry#
13.1.1.5 15k-d #smsconfig-entry#
13.1.1.6 15k-e #smsconfig-entry#
13.1.1.7 15k-f #smsconfig-entry#
13.1.1.8 15k-g #smsconfig-entry#
13.1.1.9 15k-h #smsconfig-entry#
13.1.1.10 15k-i #smsconfig-entry#
13.1.1.11 15k-j #smsconfig-entry#
13.1.1.12 15k-k #smsconfig-entry#
13.1.1.13 15k-l #smsconfig-entry#
13.1.1.14 15k-m #smsconfig-entry#
13.1.1.15 15k-n #smsconfig-entry#
13.1.1.16 15k-o #smsconfig-entry#
13.1.1.17 15k-p #smsconfig-entry#
13.1.1.18 15k-q #smsconfig-entry#
13.1.1.19 15k-r #smsconfig-entry#
192.68.66.172 sc #smsconfig-entry#
13.1.1.1 sc-i1 #smsconfig-entry#
192.68.66.250 sc0-C1-failover #smsconfig-entry# 192.68.66.170 sc0-eri1
#smsconfig-entry#
192.68.66.171 sc0-hme0 #smsconfig-entry#
10.2.1.1 sc0-i2 #smsconfig-entry#
10.2.1.2 sc1-i2 #smsconfig-entry#
/etc/hostname.*
/etc/ipnodes
-This is the standard Solaris hosts listing which contains relevant IPv6
hosts (if used).
-Located on SCs and domains where appropriate.
/etc/netmasks
Example:
1.
cat netmasks
#
2.
network-number netmask
#
5.
guidelines.
#
10.
128.32.0.0 255.255.255.0
#
192.168.66.0 255.255.255.0
192.168.103.0 255.255.255.224
192.168.103.32 255.255.255.252
13.1.1.0 255.255.255.224
10.2.1.0 255.255.255.252
Worth noting also it that if you do not have a default router for your
settings, you will notice it will take longer time for IPMP to start up
properly. This may have a negative effect in the essense that the IPMP
negotiation might take longer time and SMS might experience difficulties
during startup.
Internal Only: Top
References:
- Infodoc 82102 "Sun Fire[TM] 12K/15K/E20K/E25K: Supported Ethernet settings
for System Controllers"
- Infodoc 73002 "Sun Fire[TM] 12K/15K: MAN Interface Mapping"
- SRDB 48123 "Sun Fire[TM] 12K/15K: Troubleshooting the I1 MAN Network"
- Troubleshooting Article 72578 "Sun Fire[TM] 12K/15K: Troubleshooting MAN I2
Network."
- SRDB 48144 "Sun Fire[TM] 15K: IDPROM layout for OpenBoot[TM] PROM failed"
This shows how to recreate lost/corrupted idprom.image files, as does the
site,
http://has.central.sun.com/starcat/15kinfo/faq/recreate_idprom.image.html.
================================================================================
15/25k domain commands
frame1-sc0:sms-svc:3> help
usage:
addboard -d domain_indicator [-c function] [-r retry_count [-t timeout]]
[-q] [-f] [-y|-n] location...
addboard -h
usage:
addcodlicense <license-signature>
addcodlicense -h
usage:
addtag -d domain_indicator [-q] [-y|-n] new_tag
addtag -h
usage:
cancelcmdsync cmdsync_descriptor
cancelcmdsync -h
usage:
/opt/SUNWSMS/SMS1.5/bin/console -d domain_indicator [[-f] | [-l] | [-g] | [-r]] [-e
escapeChar]
/opt/SUNWSMS/SMS1.5/bin/console -h
usage:
deleteboard [-c function] [-r retry_count [-t timeout]] [-q] [-f] [-y|-n] location...
deleteboard -h
usage:
deletecodlicense [-f] <license-signature>
deletecodlicense -h
usage:
deletetag -d domain_indicator [-q] [-y|-n]
deletetag -h
usage:
disablecomponent [-d domain_indicator] [-i "reason"]
location...
disablecomponent [-h]
usage:
enablecodboard [-y|-n] [location]
enablecodboard -h
usage:
enablecomponent [-a | -d domain_indicator] location...
enablecomponent [-h]
usage:
flashupdate -d domain_indicator -f path [-q|-v] [-y|-n]
flashupdate -f path [-q|-v] [-y|-n] location...
flashupdate -h
usage:
help [command_name]
help -h
usage:
initcmdsync script_name [parameters]
initcmdsync -h
usage:
moveboard -d domain_indicator [-c function] [-r retry_count [-t timeout]]
[-q] [-f] [-y|-n] location
moveboard -h
usage:
poweroff [-q] [-y|-n] [location]
poweroff -h
usage:
poweron [-q] [-y|-n] [location]
poweron -h
Usage:
rcfgadm -d domain_indicator [-f] [-y|-n] [-v] [-o hardware_options]
-c function [-r retry_count [-T timeout] ] ap_id...
rcfgadm -d domain_indicator [-f] [-y|-n] [-v] [-o hardware_options]
-x hardware_function ap_id...
rcfgadm -d domain_indicator [-v] [-a] [-s listing_options]
[-o hardware_options] [-l [ap_id|ap_type...]]
rcfgadm -d domain_indicator [-v] [-o hardware_options] -t ap_id...
rcfgadm -d domain_indicator [-v] [-o hardware_options] -h
[ap_id|ap_type...]
usage:
reset -d domain_indicator[,domain_indicator]...
[-d domain_indicator[,domain_indicator]...]... [-q] [-y | -n] [-x]
reset -h
usage:
resetsc [-q] [-y|-n]
resetsc -h
usage:
runcmdsync script_name [parameters]
runcmdsync -h
usage:
savecmdsync -M identifier cmdsync_descriptor
savecmdsync -h
usage:
setbus [-q] [-y|-n] -c cs0|cs1|cs0,cs1 [-b buses] [location...]
setbus -h
usage:
setdatasync [-i interval] schedule filename
setdatasync push filename
setdatasync cancel filename
setdatasync backup
setdatasync -h
usage:
setdate [-d domain_indicator] [-u] [-q] [mmdd]HHMM|mmddHHMM[cc]yy[.SS]
setdate -h
usage:
setdefaults [-d domain_indicator [-p]] [-y|n]
setdefaults -h
usage:
setfailover on|off|force
setfailover -h
usage:
setkeyswitch -d domain_indicator [-q] [-y|-n]
position
setkeyswitch -h
usage:
setobpparams -d domain_indicator param=value...
setobpparams -h
usage:
setupplatform
setupplatform -p available [-d domain_indicator [-a|-r] location...]
setupplatform -p cod [headroom | -d domain_indicator domainRTU]
setupplatform [-d domain_indicator -]
setupplatform -h
usage:
showboards [-d domain_indicator] [-v]
showboards [-d domain_indicator] -c
showboards -h
usage:
showbus [-v]
showbus -h
usage:
showcmdsync [-v]
showcmdsync -h
usage:
showcodlicense [-r] [-v]
showcodlicense -h
usage:
showcodusage [-v] [-p <resource|domains>]
showcodusage -h
usage:
showcomponent [-a | -d domain_indicator] [-v] [location...]
showcomponent [-h]
usage:
showdatasync [-l|-Q] [-v]
showdatasync -h
usage:
showdate [-d domain_indicator] [-u] [-v]
showdate -h
usage:
showdevices [-v] [-p bydevice|byboard|query|force] location...
showdevices [-v] [-p bydevice|byboard]
-d domain_indicator
showdevices -h
usage:
showenvironment [-d domain_indicator[,domain_indicator]...]... [-p
temps|volts|currents|fans|powers[,temps|volts|currents|fans|powers]...]...
[-v]
showenvironment [-d domain_indicator[,domain_indicator]...]... [-p faults]
[-v]
showenvironment -h
usage:
showfailover [-r] [-v]
showfailover -h
usage:
showkeyswitch -d domain_indicator [-v]
showkeyswitch -h
usage:
showlogs [ -F ] [ -f filename ] [ -d domain_indicator ] [ -p m|c|s ] [ -v
]
showlogs [ -F ] [ -f filename ] [ -d domain_indicator ] [ -E ]
[ -p e [ <event_class> | list | ereport | ena0x<ena_value>
| uuid<uuid_value> | <event_code> ] [ <number> ] ]
showlogs -h
usage:
showobpparams -d domain_indicator [-v]
showobpparams -h
Usage:
showplatform [-d domain_indicator] [-p report] [-v]
showplatform -h
usage:
showxirstate -d domain_indicator [-v]
showxirstate -f filename [-v]
showxirstate -h
usage:
smsconnectsc [-y|-n]
smsconnectsc -h
Restore up the SMS environment
Usage:
smsrestore filename
smsrestore -h
================================================================================
F6800 Configuration Recommendations
--------------------------------------------------------------------------------
Purpose
The purpose of this document is to examine Sun Fire Midframe servers and present
recommendations for exploiting their capabilities and avoiding issues that cause down time.
Assumptions
The first of these is to evaluate alternatives and make decisions that improve the Reliability,
Availability, and Serviceability of your platforms.
The primary goal of system administration is to provide stable, available platforms and services
so users can accomplish work in a predictable manner. It is necessary to choose options that
improve availability, thereby avoiding or reducing periods of non-availability due to planned or
unplanned events.
The second alternative is to optimize performance. Only after achieving a high degree of RAS
(Reliability, Availability, Serviceability) should considerations turn to improving system and
application performance.
Dynamic Reconfiguration
A core function of the Solaris Operating Environment, Dynamic Reconfiguration (DR) enables
you to safely add and remove CPU/Memory boards and I/O assemblies while the system is
operating. DR controls the software aspects of dynamically changing the hardware used by a
domain with minimal disruption to user processes running in the domain.
� Disable a failing device by removing it from the logical configuration before its
failure
can cause an operating system outage.
� Initiate testing of a board within a domain while the domain continues to run.
In mission or business critical environments where availability is one of the most important
criteria, the real value of DR is the ability to perform numerous maintenance activities without
the need for downtime.
When populating a Uniboard or server with memory, the following rules and recommendations
should be noted:
� Do not fill the second bank of a CPU until all of the other CPU's on that board have had
their first banks filled.
� Within a domain, do not fill the second bank of a CPU until all of the other system
boards
have had all of their first banks filled.
� Wherever possible, use only the same size DIMM to fill all of the banks on a system
board.
� Wherever possible, populate all system boards within a domain with the same amount of
memory.
� When constructing domains using system boards of unequal memory size, always define
the domain from smallest installed memory to largest memory.
For example, if SB0 contains 8G of memory and SB2 and SB4 each contain 4G, issue the
following commands to define the domain (addboard -d A SB2 SB4 SB0). This will
ensure that SB2 is assigned as slice 0 and will contain the Solaris Operating
Environment when the domain is booted until the slice is reassigned to another board as
a result of a DR operation.
Permanent memory is memory which is non-pageable, such as the kernel or OpenBoot PROM.
All other memory is referred to as non-permanent memory.
During DR operations involving memory, all memory must be unconfigured from the affected
system board. In the case of non-permanent memory, the memory can simply be flushed back to
disk, moved to another memory location, or swapped out. Permanent memory, on the other
hand, must be moved to another board of the same total memory size or larger using a copy-
rename technique. The reason for the different mechanism is that the permanent memory cannot
be swapped out since it contains critical kernel structures that control the operation of the
Solaris
Operating Environment.
Frame Configuration
The Sun Fire 6800 Midframe server is capable of being configured into multiple Segments
(partitions) and Domains. A domain is the logical subset of system resources running an
independent copy of the Solaris Operating Environment. Additionally, the power distribution
within the platform lends itself to another form of partitioning.
A segment or partition is established by the system controller for the purpose of logically
isolating connections between segments such that the failure of a domain in one segment should
not normally affect a domain running in the other segment.
While it is possible for each segment to contain two domains, the isolation of errors is the
primary reason that segmentation is recommended for multiple domain configurations. An
exception to this rule is the use of a temporary domain to facilitate DR operations.
Segmentation is not, however, capable of preventing all failures in one segment from affecting
domains in another segment. For example, the failure of a centerplane may not be contained to a
single segment.
When segmenting a 6800, it should also be noted that each segment is assigned a pair of
Fireplane Repeater Boards. A minimum of two Fireplane Repeater Boards are required for the
operation of a domain. Failure of a repeater board will cause failure of any domain within that
segment until it is replaced.
If DR operations are to be supported on a 6800 with one domain, our recommendation is the
system be kept in single-segment mode and that the Solaris Operating Environment be loaded on
domain A. This leaves domain B available for performing DR and POST testing as required for
the DR of I/O assemblies. If two domains are required, configure the box into a dual-segment
mode and load Solaris onto Domains A and C. This will leave domains B and D available for
use in DR operations.
The Sun Fire 6800 Midframe server differs from all other Sun Fire models in that it has two
separate internal power grids, each supplied from a different RTU.
To ensure the loss of a power feed does not cause the loss of a domain, it is essential that
domains be constructed with all of the devices powered from the same grid (RTU). The boards
within the 6800 are separated as follows:
================================================================================
Document Audience: SPECTRUM
Document ID: 51772
Title: Sun Fire[TM] 12K/15K/E20K/E25K: Remote Dynamic Reconfiguration (DR)
generates "DCA/DCS Communication Error" and showdevices is Unable to get
device information from domain.
Copyright Notice: Copyright � 2006 Sun Microsystems, Inc. All Rights
Reserved
Update Date: Tue Sep 19 00:00:00 MDT 2006
Products: Sun Fire 15K Server, Sun Fire 12K Server, Sun Fire E25K
Server, Sun Fire E20K Server
Technical Areas: System Management
# showdevices -v -d x
Unable to get device information from domain x
This showdevices error could also be seen in Explorer data from the Main
System Controller (SC). The file, showdevices_-v_-d_x.out, which is in the
/explorer/sf15k/<Domain_ID> directory of Explorer will show the same "Unable
to get device information from domain x" error message.
Also the following messages might be logged in the platform log file on the
SC ( $SMSVAR/adm/platform/messages )
Resolution Top
ensure that the network interfaces are properly configured and running,
*
verify that relevant parameters are not commented out of key files,
*
The dca <> dcs handshaking takes place over the I1 network. This means that
scman0 on the SC and dman0 on the domain must be configured and running
properly. This is often overlooked, so be sure to verify this information
with the following command:
# ifconfig -a
On SC:
scman0: flags=1008843<UP,BROADCAST,RUNNING,MULTICAST,PRIVATE,IPv4>
mtu 1500 index 3
inet 10.10.1.1 netmask ffffffe0 broadcast 10.10.1.31
On domain:
dman0: flags=1008843<UP,BROADCAST,RUNNING,MULTICAST,PRIVATE,IPv4>
mtu 1500 index 3
inet 10.10.1.3 netmask ffffffe0 broadcast 10.10.1.31
ether 0:0:be:a8:17:57
Note that the IP addresses and netmasks on the dman0 and scman0 interfaces
should match the information stored in the /etc/SUNWMSMS/config/MAN.cf file
on the SC.
manc_magic = 0x4d414e43
manc_version = 01
manc_csum = 0x0
manc_ip_type = AF_INET
manc_dom_ipaddr = 10.10.1.3
manc_dom_ip_netmask = 255.255.255.224
manc_dom_ip_netnum = 10.10.1.0
manc_sc_ipaddr = 10.10.1.1
manc_dom_eaddr = 0:0:be:a8:17:57
manc_sc_eaddr = 8:0:20:fa:5f:1a
manc_iob_bitmap = 0xa0 io boards = 5.1, 7.1,
manc_golden_iob = 5
The Domain Configuration Agent (DCA) daemon runs on the SC, one per domain.
Similar to a netcon session on a Sun Enterprise[TM] 10000 server, the DCA
provides communication between the DCA on the SC and the Domain Configuration
Server (DCS) on the specified domain. If DCA is not running, the showdevices
and the rcfgadm commands fail.
To verify that DCA is running, issue the following command on the SC:
If either command fails, check the domain for the following lines in the
/etc/inetd.conf file:
These lines must be in the /etc/inetd.conf file for the rcfgadm and
showdevices commands to work properly. If the lines are not in the file, and
showdevices fails from the SC, add the indicated lines above and restart the
inetd process as follows:
# inetadm
ENABLED STATE FMRI
enabled online svc:/application/font/stfsloader:default
[output omitted]
disabled disabled svc:/network/talk:default
enabled online svc:/platform/sun4u/dcs:default
[output omitted]
The /platform/sun4u/dcs service must be enabled/online.
You can now get more information from the svc:/platform/sun4u/dcs service and
list its properties via the svccfg command :
Note that, on domains running Solaris 10 w/o 120253-02, the dcs process will
not be running if the SC has not recently communicated with the domain. It's
forked by inetd upon request (Remote DR request started from the SC). Hence,
the PPID for dcs is the inetd PID.
Ex :
# ptree 304
159 /usr/sbin/inetd -s
304 dcs
introduced in patch 120253-02, dcs does not belong to inetd any longer.
Since inetd does not support per-socket IPsec, dcs will be changed to run
standalone. Both dcs and cvcd will be controlled by SMF and use SMF
properties to define command line options.
Hence, running inetadm | grep dcs will not return information about dcs any
longer.
Use the following command to get the status from the dcs service :
# svcs dcs
Ex :
# ptree 220
Note that the dcs process might not be running if the SC has not recently
communicated with the domain.
To check to see if any process is actually listening on the sun-dr port (port
665), run
This verifies that there is indeed some process listening on the sun-dr port,
665. If there is nothing listening on port 665, then the showdevices and
addboard / deleteboard commands on the SC can never work properly.
The /etc/services file must also have the following entry on the domain for
remote Dynamic Reconfiguration (DR):
If you are using the NIS+, make sure that above entry is present in the
/etc/services file of NIS+ server. You can check this using the following
command:
# ipsecconf -a /etc/inet/ipsecinit.conf
Use the following command to check that the system is now running with these
settings:
# ipsecconf
The console command uses DXS. It is similar to the netcon_server on the Sun
Enterprise[TM] 10000 server. DXS runs on the SC, one per domain.
To verify that DXS is running, issue the following command on the SC:
Console commands take place over the console bus but can be toggled between
the console bus and I1 network using the ~= command.
The sckmd daemon must be running on the domain in order for the "showdevices"
or "rcfgadm" commands to work on the domain.
To verify that the sckmd daemon is running, issue the following command on
the domain:
After this installation or patch update, at boot time, dcs has problem to get
online; staying in maintenance mode :
# svcs dcs
Check the reason why dcs never got online via the
/etc/svc/volatile/platform-sun4u-dcs:default.log log file.
# svcs -xv
See: http://sun.com/msg/SMF-8000-KS
See: /etc/svc/volatile/platform-sun4u-dcs:default.log
# svcs dcs
STATE STIME FMRI
For a workaround and a more detailed explanation on this issue, please see CR
#6453706.
# svcs -xv
svc:/platform/sun4u/dcs:default (domain configuration server)
State: maintenance since Tue Sep 19 10:57:55 2006
Reason: Restarter svc:/network/inetd:default gave no explanation.
See: http://sun.com/msg/SMF-8000-9C
See: man -M /usr/share/man -s 1M dcs
Impact: This service is not running.
# svcprop dcs
general/enabled boolean true
general/entity_stability astring Unstable
general/restarter fmri svc:/network/inetd:default
[...]
To fix this problem, the new manifest must be imported using the following
procedure :
# svcs dcs
STATE STIME FMRI
maintenance 10:57:55 svc:/platform/sun4u/dcs:default
# svcadm disable dcs
# Sep 19 11:02:13 v4u-15ka-c-epar02 inetd[250]: Property 'name' of instance
# svc:/platform/sun4u/dcs:default is missing, inconsistent or invalid
Sep 19 11:02:13 v4u-15ka-c-epar02 inetd[250]: Property 'endpoint_type' of
instance svc:/platform/sun4u/dcs:default is missing, inconsistent or invalid
Sep 19 11:02:13 v4u-15ka-c-epar02 inetd[250]: Property 'isrpc' of instance
svc:/platform/sun4u/dcs:default is missing, inconsistent or invalid
Sep 19 11:02:13 v4u-15ka-c-epar02 inetd[250]: Property 'wait' of instance
svc:/platform/sun4u/dcs:default is missing, inconsistent or invalid
Sep 19 11:02:13 v4u-15ka-c-epar02 inetd[250]: Unspecified inetd_start method
for instance svc:/platform/sun4u/dcs:default
# svcs dcs
STATE STIME FMRI
disabled 11:02:13 svc:/platform/sun4u/dcs:default
# svccfg -v delete dcs
# svcs dcs
svcs: Pattern 'dcs' doesn't match any instances
STATE STIME FMRI
# svccfg -v import /var/svc/manifest/platform/sun4u/dcs.xml
svccfg: Taking "initial" snapshot for svc:/platform/sun4u/dcs:default.
svccfg: Taking "last-import" snapshot for svc:/platform/sun4u/dcs:default.
svccfg: Refreshed svc:/platform/sun4u/dcs:default.
svccfg: Successful import.
# svcs dcs
STATE STIME FMRI
disabled 11:03:04 svc:/platform/sun4u/dcs:default
# svcadm enable dcs
# svcs dcs
STATE STIME FMRI
online 11:03:20 svc:/platform/sun4u/dcs:default
# svcs -p dcs
STATE STIME FMRI
online 11:03:20 svc:/platform/sun4u/dcs:default
11:03:20 717 dcs
# svcprop dcs
[...]
-Method 3 uses the rcfgadm (remote cfgadm) command on the System Controller.
Note: When working on a security hardened system "Method 2" must be used.
Document Body
Method 1
This method uses the System Controller to replace the system board.
For this method, the System Controller issues all the commands.
1) Logically remove the board (unconfigures, disconnects, and powers off the
board).
/opt/SUNWSMS/bin/poweron SB#
/opt/SUNWSMS/bin/flashupdate -f
/opt/SUNWSMS/hostobjs/sgcpu.flash -v -y SB#
/opt/SUNWSMS/bin/poweroff SB#
The board is now back in the system, and running the Solaris Operating
System.
From the domain, verify with prtdiag.
__________________________________________
Method 2
Method 2 replaces the system board from within the domain. For this method,
the
domain issues most of the commands. However, the System Controller issues
two
of the commands.
/opt/SUNWSMS/bin/poweron SB#
6) Issue the command on the System Controller to power off the new board:
/opt/SUNWSMS/bin/poweroff SB#
The board is now back in the system, and running the Solaris Operating
System.
From the domain, verify with prtdiag.
__________________________________________
Method 3
Method 3 uses the System Controller, which issues the cfgadm command, to
replace
the system board. This method is similar to Method 2, except the domain
needs to
be specified with the -d option.
/opt/SUNWSMS/bin/poweron SB#
/opt/SUNWSMS/bin/flashupdate -f
/opt/SUNWSMS/hostobjs/sgcpu.flash -v -y SB#
/opt/SUNWSMS/bin/poweroff SB#
The board is now back in the system, and running Solaris Operating System.
From
the domain, verify with prtdiag.
NOTE: For all three methods above, be aware of kernel cage changes affecting
DR
that were introduced with Solaris 9 KU patch 118558-05.
--------------------------------------------------------------------------------