- 論壇徽章:
- 3
|
本帖最后由 有機天使 于 2016-05-30 11:01 編輯
各位大神,求助個問題
系統(tǒng)環(huán)境:兩臺SUN M8000,每臺SUN M8000兩個域,總共四個域,兩個磁盤(一個1713.69GB,一個1142.46GB)全部掛載在四臺主機上做了oracle rac。
現(xiàn)在的問題是:
域1主機: iostat -E 查看顯示
ssd1 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: HITACHI Product: OPEN-V*6 -SUN Revision: 6008 Serial No:
Size: 1713.69GB <1713691033600 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 2 Predictive Failure Analysis: 0
ssd28 Soft Errors: 0 Hard Errors: 17 Transport Errors: 17
Vendor: HITACHI Product: OPEN-V*4 -SUN Revision: 6008 Serial No:
Size: 1142.46GB <1142461300736 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 2 Predictive Failure Analysis: 0
域2主機:iostat -E 查看顯示
ssd1 Soft Errors: 0 Hard Errors: 2 Transport Errors: 2
Vendor: HITACHI Product: OPEN-V*4 -SUN Revision: 6008 Serial No:
Size: 1142.46GB <1142461300736 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 2 Predictive Failure Analysis: 0
ssd24 Soft Errors: 0 Hard Errors: 5 Transport Errors: 5
Vendor: HITACHI Product: OPEN-V*6 -SUN Revision: 6008 Serial No:
Size: 1713.69GB <1713691033600 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 2 Predictive Failure Analysis: 0
域3主機:iostat -E之前每報錯,目前也開始報錯
ssd1 Soft Errors: 0 Hard Errors: 3 Transport Errors: 3
Vendor: HITACHI Product: OPEN-V*6 -SUN Revision: 6008 Serial No:
Size: 1713.69GB <1713691033600 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 2 Predictive Failure Analysis: 0
ssd27 Soft Errors: 0 Hard Errors: 3 Transport Errors: 3
Vendor: HITACHI Product: OPEN-V*4 -SUN Revision: 6008 Serial No:
Size: 1142.46GB <1142461300736 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 2 Predictive Failure Analysis: 0
域4主機:iostat -E
ssd1 Soft Errors: 0 Hard Errors: 2 Transport Errors: 1
Vendor: HITACHI Product: OPEN-V*6 -SUN Revision: 6008 Serial No:
Size: 1713.69GB <1713691033600 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 2 Predictive Failure Analysis: 0
ssd28 Soft Errors: 0 Hard Errors: 4 Transport Errors: 3
Vendor: HITACHI Product: OPEN-V*4 -SUN Revision: 6008 Serial No:
Size: 1142.46GB <1142461300736 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 2 Predictive Failure Analysis: 0
這四臺主機掛載一樣的磁盤,怎么有的主機上報錯,有的主機上又沒報錯呢?該如何處理呢?
另外在磁盤報錯的主機上,有過如下的錯誤日志:
May 12 20:31:14 rdms02b /scsi_vhci/ssd@g60060e80056389000000638900000000 (ssd22): Command failed to complete (3) on path fp3/ssd@w50060e8005638924,0
May 12 20:31:14 rdms02b SCSI transport failed: reason 'tran_err': retrying command
May 12 20:31:14 rdms02b scsi: [ID 107833 kern.warning] WARNING: /scsi_vhci/ssd@g60060e80056389000000638900000000 (ssd22):
請問大家該如何處理?
更新:
最近又開始報錯了,但這次只是2個域(域1與域2)主機增多報錯,另外兩臺卻沒變化,請教下是為什么呢
數(shù)字已經(jīng)增長到3了:
ssd28 Soft Errors: 0 Hard Errors: 3 Transport Errors: 3
警告日志也又增多了:
May 12 15:52:51 rdms01a /scsi_vhci/ssd@g60060e80056389000000638900000008 (ssd2: Command failed to complete (3) on path fp3/ssd@w50060e8005638900,1
May 12 15:52:51 rdms01a SCSI transport failed: reason 'tran_err': retrying command
May 12 15:52:51 rdms01a scsi: [ID 107833 kern.warning] WARNING: /scsi_vhci/ssd@g60060e80056389000000638900000008 (ssd2:
May 20 10:40:33 rdms01a /scsi_vhci/ssd@g60060e80056389000000638900000008 (ssd2: Command failed to complete (3) on path fp3/ssd@w50060e8005638900,1
May 20 10:40:35 rdms01a /scsi_vhci/ssd@g60060e80056389000000638900000008 (ssd2: Command failed to complete (3) on path fp3/ssd@w50060e8005638900,1
May 21 11:28:20 rdms01a /scsi_vhci/ssd@g60060e80056389000000638900000000 (ssd1): Command failed to complete (3) on path fp3/ssd@w50060e8005638900,0
May 21 11:28:20 rdms01a SCSI transport failed: reason 'tran_err': retrying command
May 21 11:28:20 rdms01a scsi: [ID 107833 kern.warning] WARNING: /scsi_vhci/ssd@g60060e80056389000000638900000000 (ssd1):
現(xiàn)在幾乎每天都開始報錯了,持續(xù)增長,每天就這么1-2條,這是鏈路故障碼?我檢查存儲交換機也沒發(fā)現(xiàn)交換機出問題~~檢查存儲設(shè)備,也沒有存儲告警,這能什么問題呢?要是HBA卡的問題這4臺系統(tǒng) 四個HBA卡不能都出問題吧,而且檢查主機底層硬件也沒發(fā)現(xiàn)告警日志~~
有沒有過來人給個指點啊~~ |
|