
RAID degraded problem; everyone who drops in gets diameter_cm += 2, length_cm += 3

    qwerthhusn · 2019-06-05 09:51:07 +08:00 · 832 views

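    Dell server. MegaCli reports the array as Degraded, and one disk on the front of the machine is blinking amber while all the others are green.

    I ran smartctl -H /dev/bus/0 -d megaraid,N against every disk (including the blinking one) to check the health status, and all of them report OK.

    Is this a disk problem or a RAID configuration problem? How do I track it down? Please give me some pointers. I'm just a code monkey who writes CRUD, and I have no idea how to deal with this kind of thing.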
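One way to narrow down which physical disk the controller is unhappy with is to look at the per-drive records from `MegaCli -PDList -aALL` rather than only the array state. The sketch below is a rough Python helper, not anything posted in this thread: it assumes each drive record starts with an `Enclosure Device ID` line and carries `Slot Number`, `Firmware state` and the three error counters, which is how MegaCli output usually looks. The `SAMPLE` text is made up for illustration and does not come from the OP's server.

```python
def parse_pdlist(text):
    """Split `MegaCli -PDList -aALL` style output into one dict per drive."""
    drives, current = [], None
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if not sep:
            continue  # not a "Key: value" line
        key, value = key.strip(), value.strip()
        if key == "Enclosure Device ID":  # assumed to open each drive record
            current = {}
            drives.append(current)
        if current is not None:
            current[key] = value
    return drives


def suspicious(drive):
    """Flag a drive that is not Online or has a nonzero error counter."""
    state = drive.get("Firmware state", "")
    counters = ("Media Error Count", "Other Error Count",
                "Predictive Failure Count")
    has_errors = any(
        drive.get(name, "0").isdigit() and int(drive[name]) > 0
        for name in counters
    )
    return not state.startswith("Online") or has_errors


# Hypothetical sample, invented for this example only.
SAMPLE = """\
Enclosure Device ID: 32
Slot Number: 0
Media Error Count: 0
Other Error Count: 0
Predictive Failure Count: 0
Firmware state: Online, Spun Up

Enclosure Device ID: 32
Slot Number: 1
Media Error Count: 12
Other Error Count: 0
Predictive Failure Count: 0
Firmware state: Failed
"""

for d in parse_pdlist(SAMPLE):
    if suspicious(d):
        print("check slot", d["Slot Number"], "-", d["Firmware state"])
```

On a real box you would feed it the actual command output. The `Device Id` field in the same listing is normally the number to use as N in `-d megaraid,N`, though that is worth double-checking against your controller.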

    4 replies    2019-06-05 17:55:37 +08:00
    onehao28 · 2019-06-05 10:11:56 +08:00 · #1
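    Going by the smartctl health status alone is not reliable. Run smartctl -a and look at the detailed parameters. Most likely there are too many bad sectors.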
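To make "look at the detailed parameters" a bit more concrete, here is a minimal Python sketch that scans `smartctl -a` output for a SAS/SCSI disk and collects a few warning signs: a non-empty grown defect list, uncorrected errors in the error counter log, and failed self-tests. The function name and the choice of checks are my own assumptions, not an official smartmontools feature, and the `EXCERPT` lines are copied from the output the OP posts later in this thread.

```python
import re


def scan_smartctl_scsi(text):
    """Collect warning signs from `smartctl -a` output of a SAS/SCSI disk."""
    findings = []

    m = re.search(r"Elements in grown defect list:\s*(\d+)", text)
    if m and int(m.group(1)) > 0:
        findings.append(f"grown defect list has {m.group(1)} entries")

    # Error counter rows: the last column is "Total uncorrected errors".
    for op in ("read", "write", "verify"):
        m = re.search(rf"^[ \t]*{op}:[ \t]+(\S.*)$", text, re.MULTILINE)
        if m:
            uncorrected = m.group(1).split()[-1]
            if uncorrected.isdigit() and int(uncorrected) > 0:
                findings.append(f"{op}: {uncorrected} uncorrected errors")

    # Self-test log rows start with "#"; report any that mention "Failed".
    for line in text.splitlines():
        if line.lstrip().startswith("#") and "Failed" in line:
            findings.append("self-test failed: " + " ".join(line.split()))

    return findings


# Lines taken from the smartctl output posted in reply #3.
EXCERPT = """\
SMART Health Status: OK
Elements in grown defect list: 39
read: 2180959242 1 0 2180959243 0 1134.149 0
write: 0 0 0 0 0 1.352 0
verify: 1237 0 0 1237 26 0.000 0
# 1 Background long Failed in first segment 96 5 - [0x4 0x44 0x0]
"""

for finding in scan_smartctl_scsi(EXCERPT):
    print(finding)
```

Note that the health line still says OK while the scan reports grown defects and a failed long self-test, which is why the one-line summary should not be trusted on its own.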
    lucifer9 · 2019-06-05 10:42:33 +08:00 · #2
    Get a Dell engineer to handle it. You've paid for support, so why make trouble for yourself?
    qwerthhusn (OP) · 2019-06-05 17:13:06 +08:00 · #3
    @onehao28 Could you help take a look at this?
    ···
    smartctl 6.7 2018-08-04 r4747 [x86_64-linux-3.10.0-957.12.2.el7.x86_64] (daily-20180804)
    Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org

    === START OF INFORMATION SECTION ===
    Vendor: SEAGATE
    Product: ST1200MM0099
    Revision: ST31
    Compliance: SPC-4
    User Capacity: 1,200,243,695,616 bytes [1.20 TB]
    Logical block size: 512 bytes
    Formatted with type 2 protection
    8 bytes of protection information per logical block
    LU is fully provisioned
    Rotation Rate: 10000 rpm
    Form Factor: 2.5 inches
    Logical Unit id: 0x5000c500bca87617
    Serial number: WFK2RM60
    Device type: disk
    Transport protocol: SAS (SPL-3)
    Local Time is: Wed Jun 5 09:10:55 2019 UTC
    SMART support is: Available - device has SMART capability.
    SMART support is: Enabled
    Temperature Warning: Disabled or Not Supported
    Read Cache is: Enabled
    Writeback Cache is: Disabled

    === START OF READ SMART DATA SECTION ===
    SMART Health Status: OK

    Grown defects during certification <not available>
    Total blocks reassigned during format <not available>
    Total new blocks reassigned = 1
    Power on minutes since format <not available>
    Current Drive Temperature: 34 C
    Drive Trip Temperature: 60 C

    Manufactured in week 49 of year 2018
    Specified cycle count over device lifetime: 10000
    Accumulated start-stop cycles: 20
    Specified load-unload count over device lifetime: 300000
    Accumulated load-unload cycles: 225
    Elements in grown defect list: 39

    Vendor (Seagate Cache) information
    Blocks sent to initiator = 2181055139
    Blocks received from initiator = 2453981
    Blocks read from cache and sent to initiator = 13569935
    Number of read and write commands whose size <= segment size = 35257
    Number of read and write commands whose size > segment size = 0

    Vendor (Seagate/Hitachi) factory information
    number of hours powered up = 12.98
    number of minutes until next internal SMART test = 51

    Error counter log:
               Errors Corrected by           Total   Correction     Gigabytes    Total
                   ECC          rereads/    errors   algorithm      processed    uncorrected
               fast | delayed   rewrites  corrected  invocations   [10^9 bytes]  errors
    read:   2180959242        1         0  2180959243          0       1134.149           0
    write:           0        0         0           0          0          1.352           0
    verify:       1237        0         0        1237         26          0.000           0

    Non-medium error count: 0

    SMART Self-test log
    Num  Test              Status                   segment  LifeTime  LBA_first_err [SK ASC ASQ]
         Description                                number   (hours)
    # 1  Background long   Failed in first segment       96         5              - [0x4 0x44 0x0]

    Long (extended) Self-test duration: 6723 seconds [112.0 minutes]

    scsiPrintBackgroundResults Failed [Input/output error]
    Protocol Specific port log page for SAS SSP
    relative target port id = 1
    generation code = 2
    number of phys = 1
    phy identifier = 0
    attached device type: SAS or SATA device
    attached reason: unknown
    reason: loss of dword synchronization
    negotiated logical link rate: phy enabled; 12 Gbps
    attached initiator port: ssp=1 stp=1 smp=1
    attached target port: ssp=0 stp=0 smp=0
    SAS address = 0x5000c500bca87615
    attached SAS address = 0x54cd98f0469eb107
    attached phy identifier = 7
    Invalid DWORD count = 0
    Running disparity error count = 0
    Loss of DWORD synchronization = 6
    Phy reset problem = 0
    Phy event descriptors:
    Invalid word count: 0
    Running disparity error count: 0
    Loss of dword synchronization count: 6
    Phy reset problem count: 0
    relative target port id = 2
    generation code = 2
    number of phys = 1
    phy identifier = 1
    attached device type: no device attached
    attached reason: unknown
    reason: unknown
    negotiated logical link rate: phy enabled; unknown
    attached initiator port: ssp=0 stp=0 smp=0
    attached target port: ssp=0 stp=0 smp=0
    SAS address = 0x5000c500bca87616
    attached SAS address = 0x0
    attached phy identifier = 0
    Invalid DWORD count = 0
    Running disparity error count = 0
    Loss of DWORD synchronization = 0
    Phy reset problem = 0
    Phy event descriptors:
    Invalid word count: 0
    Running disparity error count: 0
    Loss of dword synchronization count: 0
    Phy reset problem count: 0
    ···
    crc8 · 2019-06-05 17:55:37 +08:00 · #4
    It's a disk problem. A status of OK only means it hasn't yet reached the point where it reports not-OK.
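One piece of the posted output that points toward the disk itself is the failed long self-test with `[0x4 0x44 0x0]` in the `[SK ASC ASQ]` column. The small Python sketch below decodes that triple using a hand-picked subset of the standard SCSI sense key and ASC/ASCQ codes. The lookup tables are deliberately incomplete and the function name is made up for this example.

```python
import re

# Partial table of SCSI sense keys (standard values, not exhaustive).
SENSE_KEYS = {
    0x0: "NO SENSE",
    0x1: "RECOVERED ERROR",
    0x2: "NOT READY",
    0x3: "MEDIUM ERROR",
    0x4: "HARDWARE ERROR",
    0x5: "ILLEGAL REQUEST",
    0x6: "UNIT ATTENTION",
    0xB: "ABORTED COMMAND",
}

# Only two additional sense code pairs are listed here, as examples.
ASC_ASCQ = {
    (0x11, 0x00): "UNRECOVERED READ ERROR",
    (0x44, 0x00): "INTERNAL TARGET FAILURE",
}


def decode_sense(triple):
    """Decode a '[SK ASC ASQ]' string as printed in smartctl's self-test log."""
    m = re.fullmatch(
        r"\[\s*(0x[0-9a-fA-F]+)\s+(0x[0-9a-fA-F]+)\s+(0x[0-9a-fA-F]+)\s*\]",
        triple.strip(),
    )
    if not m:
        raise ValueError(f"not a sense triple: {triple!r}")
    sk, asc, ascq = (int(part, 16) for part in m.groups())
    return (
        SENSE_KEYS.get(sk, "unknown sense key"),
        ASC_ASCQ.get((asc, ascq), "unknown ASC/ASCQ"),
    )


print(decode_sense("[0x4 0x44 0x0]"))
```

A hardware error reported by the drive's own self-test, together with 39 entries in the grown defect list on a disk with about 13 powered-on hours, fits the "disk problem" reading better than a RAID configuration issue, though the controller's event log would be the place to confirm it.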