1、惠普设备维护培训日常检查命令 中国惠普有限公司支持服务事业部 QIANYun2004 4 日常维护检查项目 系统日志syslog log ccerrlog dmesg系统运行状态cmviewcl bdf ioscan vgdisplay top sar swapinfo netstat磁盘阵列状态armdsp avaarraydsp aautoraidamdsp afc60 ccerrlog 283PM0 608 17 200319 16 58LogEntry283 08 17 200319 16 58AlertLevel6 Systemcouldfail attentionrequired
2、Keyword Bulkpowersupply BPS 2failed Status 15Loggedbypowermonitor0duringmonitoringoflowvoltagepowersupply0 x0020016a4402404f0 x00000000000000000 x5820096a4402404f0 x000067071113103a 执行cclogview var stm logs os ccerrlog 可以通过telnet检查GSP MP里的告警日志情况 应注意检查是否有AlertLevel大于等于2的新条目 dmesg Revision vmunix vw p
3、rojselectors CUPI80 BL2000 1108 c VwforCUPI80 BL2000 1108build cupi80 bl2000 1108 CUPI80 BL2000 1108 WedNov819 24 56PST2000 MemoryInformation physicalpagesize 4096bytes logicalpagesize 4096bytesPhysical 4177920Kbytes lockable 3859368Kbytes available 3859944KbytesUsing3162bufferscontaining24576Kbytes
4、ofmemory 驻留在内存中的系统最近一段时间的日志信息 常见的异常信息 SCSIResetDetectedLPMCI CacheerrorFileSystemFull发现后应及时察看syslog log中的相应条目 cmviewcl CLUSTERSTATUShpclusterupNODESTATUSSTATEGMS STATEbjscp1auprunninghaltedNetwork Parameters INTERFACESTATUSPATHNAMEPRIMARYup0 5 0 0lan1PRIMARYup0 0 0 0lan0STANDBYup1 12 0 0lan2PACKAGES
5、TATUSSTATEAUTO RUNNODEscppkguprunningenabledbjscp1a NODESTATUSSTATEGMS STATEbjscp1buprunninghaltedNetwork Parameters INTERFACESTATUSPATHNAMEPRIMARYup0 5 0 0lan1STANDBYup1 12 0 0lan2PRIMARYup0 0 0 0lan0 观察双机状态 执行cmviewcl v 确认STATUS和STATE为up和running 同时包自动切换 AUTO RUN 属性为enable bdf Filesystemkbytesuseda
6、vail usedMountedon dev vg00 lvol32048004816815542424 dev vg00 lvol12950243885622666415 stand dev vg00 lvol847063041523976315759233 var dev vg00 lvol7116326470830445146461 usr dev vg00 lvol42048009640810756847 tmp dev vg00 lvol6104857676602428036073 opt dev vg00 lvol51048576445610360240 home 检查文件系统的使
7、用率 应检查有无使用率大于90 的文件系统 ioscan fn ClassIH WPathDriverS WStateH WTypeDescription root0rootCLAIMEDBUS NEXUSioa00sbaCLAIMEDBUS NEXUSSystemBusAdapter 803 ba00 0lbaCLAIMEDBUS NEXUSLocalPCIBusAdapter 782 lan00 0 0 0btlan3CLAIMEDINTERFACEHPPCI10 100Base TXCore dev diag lan0 dev ether0ext bus00 0 1 0c720CLAIM
8、EDINTERFACESCSIC895UltraWideSingle Endedtarget00 0 1 0 1tgtCLAIMEDDEVICEdisk00 0 1 0 1 0sdiskNO HWDEVICEHPDVD ROM305 dev dsk c0t1d0 dev rdsk c0t1d0 检察IO设备是否正常 应检查有无状态为NO HW的设备 vgdisplay Volumegroups VGName dev vg00VGWriteAccessread writeVGStatusavailableMaxLV255 Logicalvolumes LVName dev vg00 lvol1L
9、VStatusavailable syncdLVSize Mbytes 100CurrentLE25AllocatedPE50UsedPV2 Physicalvolumes PVName dev dsk c4t0d0PVName dev dsk c6t0d0AlternateLinkPVStatusavailableTotalPE12992FreePE0AutoswitchOff 显示卷组状态 重点检查vg00 执行vgdisplay vvg00 检查各项status值为available sync 不是stale top CPULOADUSERNICESYSIDLEBLOCKSWAITINT
10、RSSYS00 2820 2 0 0 2 6 77 2 0 0 0 0 0 0 0 0 10 1714 6 0 0 3 4 82 0 0 0 0 0 0 0 0 0 20 3318 6 0 0 3 0 78 4 0 0 0 0 0 0 0 0 30 2013 0 0 0 4 2 82 8 0 0 0 0 0 0 0 0 40 1114 4 0 0 2 0 83 6 0 0 0 0 0 0 0 0 50 4419 8 0 0 4 2 76 0 0 0 0 0 0 0 0 0 60 2813 2 0 0 11 2 75 6 0 0 0 0 0 0 0 0 70 1714 8 0 0 1 8 83
11、4 0 0 0 0 0 0 0 0 avg0 250 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 Memory 1106604K 999800K real 1527608K 1362680K virtual 1987924KfreePage 1 6CPUTTYPIDUSERNAMEPRINISIZERESSTATETIME WCPU CPUCOMMAND2 18777informix156207404K5052Ksleep9233 0230 4930 43oninit6 19002tellin1542029248K22572Ksleep5256 0317 0517 02ma
12、nager0 18779informix156207404K4784Ksleep1681 279 629 60oninit 观察CPU和内存使用情况 重点检查有无占用CPU过大的进程 并检查freememory是否足够 sar u 10 02 18cpu usr sys wio idle10 02 2103721601185175215102723942854213175523247061043837155179system195275 观察CPU使用情况 sar u M310 重点检查 idle是否足够 一般不小于25 sar v HP UXbjscp1aB 11 00U9000 80007
13、 07 0310 02 48text szovproc szovinod szovfile szov10 02 51N AN A189 66402119 736001127 12018010 02 54N AN A188 66402102 736001121 12018010 02 57N AN A187 66402067 736001114 12018010 03 00N AN A187 66402037 736001108 12018010 03 03N AN A187 66402033 736001108 12018010 03 06N AN A187 66402036 73600110
14、8 12018010 03 09N AN A187 66402033 736001108 12018010 03 12N AN A188 66402032 736001113 12018010 03 15N AN A187 66402032 736001108 12018010 03 18N AN A187 66402032 736001108 120180 观察文件线程资源使用情况 sar v310 重点检查有无即将达到上限的值 sar d HP UXbjscp1aB 11 00U9000 80007 07 0310 03 18device busyavquer w sblks savwai
15、tavserv10 03 21c1t6d04 330 507495 975 57c2t6d03 670 506435 864 78c4t0d01 000 5010515 112 94c4t0d11 670 5011534 493 27c4t0d21 670 5010525 162 63c4t0d31 670 5016755 012 97 观察IO使用情况 sar d310 重点检查有无 busy过大的设备 swapinfo MbMbMbPCTSTART MbTYPEAVAILUSEDFREEUSEDLIMITRESERVEPRINAMEdev3072030720 0 1 dev vg00 lv
16、ol2dev3000030000 0 0 dev vg00 lv swapreserve 2161 2161total60722161391136 0 观察交换区使用情况 通常swap区的使用率为0 如有0以上数值 需进行进一步检查 netstat in NameMtuNetworkAddressIpktsOpktslan1 1500192 9 200 0192 9 200 100lan0150015 79 48 015 79 48 170745893334436lo04136127 0 0 0127 0 0 12654026540 观察网络连接情况 检查有无网络连接中断 执行netstat
17、in 如在网卡后带 号则表示网络不通 AutoRAID状态观察1 执行 arraydsp i确认VA的别名 如auto1 执行 arraydsp aauto1 auto1 dsp执行vi检查生成的信息文件auto1 dsp 确认ArrayState为Ready VendorID HPProductID C5447AArrayserialnumber 000000261E8B ArrayState READYServername mscp1Arraytype 3Mfg ProductCode IJMTU00004 Diskspaceusage Totalphysical 121565MB All
18、ocatedtoLUNs 28700MB UsedasActiveHotspare 17366MB Usedbynon includeddisks 0MB UsedforRedundancy 23727MB Unallocated availforLUNs 51772MB AutoRAID状态观察2 VendorID HPProductID C5447AArrayserialnumber 000000261E8B FanF1 GOODFanF2 GOODFanF3 GOODPowersupplyPS1 GOODPowersupplyPS2 GOODPowersupplyPS3 GOODCont
19、rollerX Overallstate GOODBattery 0state GOODBattery 1state GOODDRAM 0state GOODNVRAM 0state GOODNVRAM 1state GOODControllerY Overallstate GOODBattery 0state GOODBattery 1state GOODDRAM 0state GOODNVRAM 0state GOODNVRAM 1state GOOD FC60状态观察1 执行 amdsp i确认VA的别名 如fc1 执行 amdsp afc1 fc1 dsp执行vi检查生成的信息文件fc
20、1 dsp 确认ArrayState为Ready VendorID HPProductID A5277AArrayID 000400A0B80942BCArrayAlias fc1 ArrayState READYServerName bjscp1aArrayType 3Mfg ProductCode 348 0040789Date TueApr2223 32 52EAT2003PathtoCtlrAVSB0 0 4 0 0 8 0 5 0 0 0PathtoCtlrBVSB0 1 10 0 0 8 0 4 0 0 0 Diskspaceusage TotalPhysical 440 9GBA
21、llocatedtoLUNs 220 0GBUsedasHotSpare 0 0GBUnallocated availforLUNs 0 0GB FC60状态观察2 VendorID HPProductID A5277AArrayID 000400A0B80942BCArrayAlias fc1 LUNStatusCapacityCtrlRAIDSegmentDisks 0OPTIMAL50 8GBA1161 02 03 04 05 06 01DEGRADE WAITREPAIR50 8GBA1161 12 13 14 15 16 1 FC60状态观察3 ArrayControllerSubsystem ControllerA GOODControllerB GOODPS1 GOODPS2 GOODFan1 GOODFan2 GOODTempSensor GOODBattery GOODDiskSystem1 USSA05103199 BCCControllerA GOODBCCControllerB GOODPSA GOODPSB GOODFanA GOODFanB GOODTempSensor GOOD NoLUNsarecurrentlyrebuilding Q A