Hi all,

I ran into a strange problem over the last couple of days.

Hardware: Sun Fire V250, 4 SCSI hard disks (73 GB), root file system mirrored across two of the disks.
OS: Solaris 9 with Solaris Volume Manager.

The messages log shows read/write errors on disk 0 (only part of the output is shown):

Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.warning] WARNING: /pci@1d,700000/scsi@4/sd@0,0 (sd0):
Jan 1 11:04:01 sun2 Error for Command: write(10) Error Level: Retryable
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] Requested Block: 139704496 Error Block: 139704496
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] Vendor: SEAGATE Serial Number: 0402B6RQM8
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] Sense Key: Unit Attention
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] ASC: 0x29 (<vendor unique code 0x29>), ASCQ: 0x3, FRU: 0x4
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.warning] WARNING: /pci@1d,700000/scsi@4/sd@0,0 (sd0):
Jan 1 11:04:01 sun2 Error for Command: write(10) Error Level: Informational
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] Requested Block: 142820212 Error Block: 142820212
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] Vendor: SEAGATE Serial Number: 0402B6RQM8
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] Sense Key: Soft Error
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] ASC: 0x5d (drive operation marginal, service immediately (failure prediction threshold exceeded)), ASCQ: 0x0, FRU: 0x5
Jan 1 11:06:41 sun2 scsi: [ID 107833 kern.warning] WARNING: /pci@1d,700000/scsi@4/sd@0,0 (sd0):
Jan 1 11:06:41 sun2 Error for Command: write(10) Error Level: Retryable
Jan 1 11:06:41 sun2 scsi: [ID 107833 kern.notice] Requested Block: 142820212 Error Block: 142820212
Jan 1 11:06:41 sun2 scsi: [ID 107833 kern.notice] Vendor: SEAGATE Serial Number: 0402B6RQM8
Jan 1 11:06:41 sun2 scsi: [ID 107833 kern.notice] Sense Key: Hardware Error
Jan 1 11:06:41 sun2 scsi: [ID 107833 kern.notice] ASC: 0x32 (no defect spare location available), ASCQ: 0x0, FRU: 0x4
Jan 1 11:06:42 sun2 scsi: [ID 107833 kern.warning] WARNING: /pci@1d,700000/scsi@4/sd@0,0 (sd0):
Jan 1 11:06:42 sun2 Error for Command: write(10) Error Level: Retryable
Jan 1 11:06:42 sun2 scsi: [ID 107833 kern.notice] Requested Block: 142820212 Error Block: 142820212
Jan 1 11:06:42 sun2 scsi: [ID 107833 kern.notice] Vendor: SEAGATE Serial Number: 0402B6RQM8
Jan 1 11:06:42 sun2 scsi: [ID 107833 kern.notice] Sense Key: Hardware Error
Jan 1 11:06:42 sun2 scsi: [ID 107833 kern.notice] ASC: 0x32 (no defect spare location available), ASCQ: 0x0, FRU: 0x4

iostat -En reports the same errors. However, metastat shows every partition as OK, and the users do not recall hitting any read/write errors. The Oracle database is running normally (some of its data files live on the root file system).

I booted from the CD-ROM and ran fsck on the disk's partitions. It found bad data blocks (only one or two) plus minor issues such as incorrect reference counts, and I answered "Y" to fix them. After that, the repaired disk could not be brought back in with metareplace, which reported that some blocks could not be read. So I took the original disk 0 (the "bad" disk I had intended to swap out) and rebuilt the mirror with a new disk. To my surprise there were no problems at all, and the whole system recovered completely.

Now I am confused: is disk 0 actually faulty or not?

Thanks in advance!
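For anyone who wants to tally entries like the ones quoted above, here is a minimal Python sketch that groups the messages into one record per WARNING and counts how often each sense key and ASC code appears per device. The function names `parse_scsi_errors` and `summarize` are made up for illustration, and the regular expressions are assumptions based only on the line layout shown in this post, so other driver versions may need different patterns.

```python
import re
from collections import Counter

# Patterns assumed from the line layout quoted in this post.
DEVICE_RE = re.compile(r"WARNING:\s*(\S+)\s+\((\w+)\):")
FIELD_PATTERNS = (
    ("level", re.compile(r"Error Level:\s*(\w+)")),
    ("block", re.compile(r"Error Block:\s*(\d+)")),
    ("sense_key", re.compile(r"Sense Key:\s*(.+?)\s*$")),
    ("asc", re.compile(r"ASC:\s*(0x[0-9a-fA-F]+)")),
)


def parse_scsi_errors(text):
    """Return one dict per WARNING record found in the log text."""
    records = []
    current = None
    for line in text.splitlines():
        header = DEVICE_RE.search(line)
        if header:
            # A WARNING line starts a new record.
            current = {"path": header.group(1), "device": header.group(2)}
            records.append(current)
            continue
        if current is None:
            continue  # ignore lines before the first WARNING
        for key, pattern in FIELD_PATTERNS:
            match = pattern.search(line)
            if match:
                current[key] = match.group(1)
    return records


def summarize(records):
    """Count records by (device, sense key, ASC code)."""
    return Counter(
        (r.get("device"), r.get("sense_key"), r.get("asc")) for r in records
    )
```

Calling `summarize(parse_scsi_errors(open(path).read()))` on a saved copy of the log (with `path` pointing at wherever that copy lives) gives a quick view of whether the same device keeps reporting the same codes.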