• Interpreting a smartctl -a report

    From bp@bp@www.zefox.net to comp.sys.ibm.pc.hardware.storage on Tue May 7 15:32:27 2024
    From Newsgroup: comp.sys.ibm.pc.hardware.storage

    A raspberry pi running 64-bit RasPiOS froze up and failed
    to reboot reliably, apparently unable to find the hard disk.

    The USB-serial bridge was replaced with a known-good unit,
    which allowed repeated successful reboots. After a long self-test,
    sysctl -a reported a raw_read_error_rate that looks rather bad but
    nonetheless claims the disk "passed":

    bob@raspberrypi:~ $ sudo smartctl -a /dev/sda | more
    smartctl 7.2 2020-12-30 r5155 [aarch64-linux-6.1.21-v8+] (local build) Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

    === START OF INFORMATION SECTION ===
    Model Family: Seagate Barracuda 2.5 5400
    Device Model: ST1000LM048-2E7172
    Serial Number: ZKP1CTK1
    LU WWN Device Id: 5 000c50 0dc14483b
    Firmware Version: 0001
    User Capacity: 1,000,204,886,016 bytes [1.00 TB]
    Sector Sizes: 512 bytes logical, 4096 bytes physical
    Rotation Rate: 5400 rpm
    Form Factor: 2.5 inches
    TRIM Command: Available
    Device is: In smartctl database [for details use: -P show]
    ATA Version is: ACS-3 T13/2161-D revision 3b
    SATA Version is: SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
    Local Time is: Tue May 7 08:04:15 2024 PDT
    SMART support is: Available - device has SMART capability.
    SMART support is: Enabled

    === START OF READ SMART DATA SECTION ===
    SMART overall-health self-assessment test result: PASSED

    General SMART Values:
    Offline data collection status: (0x00) Offline data collection activity
    was never started.
    Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed
    without error or no self-test has ever
    been run.
    Total time to complete Offline
    data collection: ( 0) seconds.
    Offline data collection
    capabilities: (0x71) SMART execute Offline immediate.
    No Auto Offline data collection support.
    Suspend Offline collection upon new
    command.
    No Offline surface scan supported.
    Self-test supported.
    Conveyance Self-test supported.
    Selective Self-test supported.
    SMART capabilities: (0x0003) Saves SMART data before entering
    power-saving mode.
    Supports SMART auto save timer.
    Error logging capability: (0x01) Error logging supported.
    General Purpose Logging supported. Short self-test routine
    recommended polling time: ( 1) minutes.
    Extended self-test routine
    recommended polling time: ( 168) minutes.
    Conveyance self-test routine
    recommended polling time: ( 2) minutes.
    SCT capabilities: (0x3035) SCT Status supported.
    SCT Feature Control supported.
    SCT Data Table supported.

    SMART Attributes Data Structure revision number: 10
    Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
    1 Raw_Read_Error_Rate 0x000f 100 064 006 Pre-fail Always - 1769744
    3 Spin_Up_Time 0x0003 100 099 000 Pre-fail Always - 0
    4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 108
    5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0
    7 Seek_Error_Rate 0x000f 081 060 045 Pre-fail Always - 113998668
    9 Power_On_Hours 0x0032 087 087 000 Old_age Always - 11911 (91 132 0)
    10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0
    12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 96
    184 End-to-End_Error 0x0032 100 100 099 Old_age Always - 0
    187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0
    188 Command_Timeout 0x0032 100 100 000 Old_age Always - 0
    189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0
    190 Airflow_Temperature_Cel 0x0022 073 058 040 Old_age Always - 27 (Min/Max 19/39)
    191 G-Sense_Error_Rate 0x0032 100 100 000 Old_age Always - 7
    192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 52
    193 Load_Cycle_Count 0x0032 001 001 000 Old_age Always - 301007
    194 Temperature_Celsius 0x0022 027 042 000 Old_age Always - 27 (0 16 0 0 0)
    197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
    198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0
    199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0
    240 Head_Flying_Hours 0x0000 100 253 000 Old_age Offline - 11074 (49 218 0)
    241 Total_LBAs_Written 0x0000 100 253 000 Old_age Offline - 4402454028
    242 Total_LBAs_Read 0x0000 100 253 000 Old_age Offline - 482619242
    254 Free_Fall_Sensor 0x0032 100 100 000 Old_age Always - 0

    SMART Error Log Version: 1
    No Errors Logged

    SMART Self-test log structure revision number 1
    Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
    # 1 Extended offline Completed without error 00% 11902 - # 2 Short offline Completed without error 00% 1 -

    SMART Selective self-test log data structure revision number 1
    SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
    1 0 0 Not_testing
    2 0 0 Not_testing
    3 0 0 Not_testing
    4 0 0 Not_testing
    5 0 0 Not_testing
    Selective self-test flags (0x0):
    After scanning selected spans, do NOT read-scan remainder of disk.
    If Selective self-test is pending on power-up, resume after 0 minute delay.

    I'm somewhat baffled at the wording, can anybody tell if the test actually ran to completion and the results imply a healthy disk?

    Thanks for reading,

    bob prohaska
    --- Synchronet 3.20a-Linux NewsLink 1.114
  • From Shadow@Sh@dow.br to comp.sys.ibm.pc.hardware.storage on Tue May 7 16:03:08 2024
    From Newsgroup: comp.sys.ibm.pc.hardware.storage

    On Tue, 7 May 2024 15:32:27 -0000 (UTC), <bp@www.zefox.net> wrote:

    A raspberry pi running 64-bit RasPiOS froze up and failed
    to reboot reliably, apparently unable to find the hard disk.

    The USB-serial bridge was replaced with a known-good unit,
    which allowed repeated successful reboots. After a long self-test,
    sysctl -a reported a raw_read_error_rate that looks rather bad but >nonetheless claims the disk "passed":

    bob@raspberrypi:~ $ sudo smartctl -a /dev/sda | more
    smartctl 7.2 2020-12-30 r5155 [aarch64-linux-6.1.21-v8+] (local build) >Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

    === START OF INFORMATION SECTION ===
    Model Family: Seagate Barracuda 2.5 5400
    Device Model: ST1000LM048-2E7172
    Serial Number: ZKP1CTK1
    LU WWN Device Id: 5 000c50 0dc14483b
    Firmware Version: 0001
    User Capacity: 1,000,204,886,016 bytes [1.00 TB]
    Sector Sizes: 512 bytes logical, 4096 bytes physical
    Rotation Rate: 5400 rpm
    Form Factor: 2.5 inches
    TRIM Command: Available
    Device is: In smartctl database [for details use: -P show]
    ATA Version is: ACS-3 T13/2161-D revision 3b
    SATA Version is: SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
    Local Time is: Tue May 7 08:04:15 2024 PDT
    SMART support is: Available - device has SMART capability.
    SMART support is: Enabled

    === START OF READ SMART DATA SECTION ===
    SMART overall-health self-assessment test result: PASSED

    General SMART Values:
    Offline data collection status: (0x00) Offline data collection activity
    was never started.
    Auto Offline Data Collection: Disabled.
    Self-test execution status: ( 0) The previous self-test routine completed
    without error or no self-test has ever
    been run.
    Total time to complete Offline
    data collection: ( 0) seconds.
    Offline data collection
    capabilities: (0x71) SMART execute Offline immediate.
    No Auto Offline data collection support.
    Suspend Offline collection upon new
    command.
    No Offline surface scan supported.
    Self-test supported.
    Conveyance Self-test supported.
    Selective Self-test supported.
    SMART capabilities: (0x0003) Saves SMART data before entering
    power-saving mode.
    Supports SMART auto save timer.
    Error logging capability: (0x01) Error logging supported.
    General Purpose Logging supported.
    Short self-test routine
    recommended polling time: ( 1) minutes.
    Extended self-test routine
    recommended polling time: ( 168) minutes.
    Conveyance self-test routine
    recommended polling time: ( 2) minutes.
    SCT capabilities: (0x3035) SCT Status supported.
    SCT Feature Control supported.
    SCT Data Table supported.

    SMART Attributes Data Structure revision number: 10
    Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
    1 Raw_Read_Error_Rate 0x000f 100 064 006 Pre-fail Always - 1769744
    3 Spin_Up_Time 0x0003 100 099 000 Pre-fail Always - 0
    4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 108
    5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0
    7 Seek_Error_Rate 0x000f 081 060 045 Pre-fail Always - 113998668
    9 Power_On_Hours 0x0032 087 087 000 Old_age Always - 11911 (91 132 0)
    10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0
    12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 96
    184 End-to-End_Error 0x0032 100 100 099 Old_age Always - 0
    187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0
    188 Command_Timeout 0x0032 100 100 000 Old_age Always - 0
    189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0
    190 Airflow_Temperature_Cel 0x0022 073 058 040 Old_age Always - 27 (Min/Max 19/39)
    191 G-Sense_Error_Rate 0x0032 100 100 000 Old_age Always - 7
    192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 52
    193 Load_Cycle_Count 0x0032 001 001 000 Old_age Always - 301007
    194 Temperature_Celsius 0x0022 027 042 000 Old_age Always - 27 (0 16 0 0 0)
    197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
    198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0
    199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0
    240 Head_Flying_Hours 0x0000 100 253 000 Old_age Offline - 11074 (49 218 0)
    241 Total_LBAs_Written 0x0000 100 253 000 Old_age Offline - 4402454028
    242 Total_LBAs_Read 0x0000 100 253 000 Old_age Offline - 482619242
    254 Free_Fall_Sensor 0x0032 100 100 000 Old_age Always - 0

    SMART Error Log Version: 1
    No Errors Logged

    SMART Self-test log structure revision number 1
    Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
    # 1 Extended offline Completed without error 00% 11902 - ># 2 Short offline Completed without error 00% 1 -

    SMART Selective self-test log data structure revision number 1
    SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
    1 0 0 Not_testing
    2 0 0 Not_testing
    3 0 0 Not_testing
    4 0 0 Not_testing
    5 0 0 Not_testing
    Selective self-test flags (0x0):
    After scanning selected spans, do NOT read-scan remainder of disk.
    If Selective self-test is pending on power-up, resume after 0 minute delay.

    I'm somewhat baffled at the wording, can anybody tell if the test actually ran >to completion and the results imply a healthy disk?

    Save any important data you have on that disk, then "retire"
    it.
    If you're interested, HDSentinel has a free utility for Linux:

    <https://www.hdsentinel.com/download.php>

    Unpack it and run it as root (or sudo). It will translate all
    that stuff above to human language.
    []'s

    Thanks for reading,

    bob prohaska
    --
    Don't be evil - Google 2004
    We have a new policy - Google 2012
    Google Fuchsia - 2021
    --- Synchronet 3.20a-Linux NewsLink 1.114
  • From ant@ant@zimage.comANT (Ant) to comp.sys.ibm.pc.hardware.storage on Tue May 7 20:34:51 2024
    From Newsgroup: comp.sys.ibm.pc.hardware.storage

    Shadow <Sh@dow.br> wrote:
    ...
    If you're interested, HDSentinel has a free utility for Linux:

    <https://www.hdsentinel.com/download.php>

    Unpack it and run it as root (or sudo). It will translate all
    that stuff above to human language.
    []'s

    That looks old. :(
    --
    "Cast all your anxiety on [God] because he cares for you." --1 Peter 5:7. Itchy body! ZzzZzz...
    Note: A fixed width font (Courier, Monospace, etc.) is required to see this signature correctly.
    /\___/\ Ant(Dude) @ http://aqfl.net & http://antfarm.home.dhs.org.
    / /\ /\ \ Please nuke ANT if replying by e-mail.
    | |o o| |
    \ _ /
    ( )
    --- Synchronet 3.20a-Linux NewsLink 1.114
  • From bp@bp@www.zefox.net to comp.sys.ibm.pc.hardware.storage on Tue May 7 22:04:18 2024
    From Newsgroup: comp.sys.ibm.pc.hardware.storage

    Ant <ant@zimage.comant> wrote:
    Shadow <Sh@dow.br> wrote:
    ...
    If you're interested, HDSentinel has a free utility for Linux:

    <https://www.hdsentinel.com/download.php>

    Unpack it and run it as root (or sudo). It will translate all
    that stuff above to human language.
    []'s

    That looks old. :(

    Website, or hard disk? The disk was bought and set up in September of 2022, which more-or-less gybes with the 11k hour runtime as it's on 24/7. I wouldn't normally think of a year-and-change as "old" for a hard disk. Am I mistaken?

    What did puzzle me was the substantial Raw_Read_Error_Rate, zero count of uncorrected errors and zero Current_Pending_Sector figure. One is bad, the other two seem good, unless I misunderstand the meaning.

    Thanks for writing,

    bob prohaska

    --- Synchronet 3.20a-Linux NewsLink 1.114
  • From Shadow@Sh@dow.br to comp.sys.ibm.pc.hardware.storage on Tue May 7 20:41:04 2024
    From Newsgroup: comp.sys.ibm.pc.hardware.storage

    On Tue, 07 May 2024 20:34:51 +0000, ant@zimage.comANT (Ant) wrote:

    Shadow <Sh@dow.br> wrote:
    ...
    If you're interested, HDSentinel has a free utility for Linux:

    <https://www.hdsentinel.com/download.php>

    Unpack it and run it as root (or sudo). It will translate all
    that stuff above to human language.
    []'s

    That looks old. :(

    hdsentinel-019b.gz is dated 5 March 2021.
    Probably not as old as the HD the OP is testing...


    PS I use it under Devuan. Works perfectly.
    []'s
    --
    Don't be evil - Google 2004
    We have a new policy - Google 2012
    Google Fuchsia - 2021
    --- Synchronet 3.20a-Linux NewsLink 1.114
  • From bp@bp@www.zefox.net to comp.sys.ibm.pc.hardware.storage on Wed May 8 01:20:36 2024
    From Newsgroup: comp.sys.ibm.pc.hardware.storage

    Shadow <Sh@dow.br> wrote:
    On Tue, 07 May 2024 20:34:51 +0000, ant@zimage.comANT (Ant) wrote:

    Shadow <Sh@dow.br> wrote:
    ...
    If you're interested, HDSentinel has a free utility for Linux:

    <https://www.hdsentinel.com/download.php>

    Unpack it and run it as root (or sudo). It will translate all
    that stuff above to human language.
    []'s

    That looks old. :(

    hdsentinel-019b.gz is dated 5 March 2021.
    Probably not as old as the HD the OP is testing...

    Not quite 8-) Looks like the drive was made in December of 2021.

    Thanks for writing,

    bob prohaska

    --- Synchronet 3.20a-Linux NewsLink 1.114
  • From ant@ant@zimage.comANT (Ant) to comp.sys.ibm.pc.hardware.storage on Wed May 8 16:10:52 2024
    From Newsgroup: comp.sys.ibm.pc.hardware.storage

    bp@www.zefox.net wrote:
    Ant <ant@zimage.comant> wrote:
    Shadow <Sh@dow.br> wrote:
    ...
    If you're interested, HDSentinel has a free utility for Linux:

    <https://www.hdsentinel.com/download.php>

    Unpack it and run it as root (or sudo). It will translate all
    that stuff above to human language.
    []'s

    That looks old. :(

    Website, or hard disk? The disk was bought and set up in September of 2022, which more-or-less gybes with the 11k hour runtime as it's on 24/7. I wouldn't
    normally think of a year-and-change as "old" for a hard disk. Am I mistaken?

    What did puzzle me was the substantial Raw_Read_Error_Rate, zero count of uncorrected errors and zero Current_Pending_Sector figure. One is bad, the other two seem good, unless I misunderstand the meaning.

    Thanks for writing,

    bob prohaska

    The programs.
    --
    "But God demonstrates his own love for us in this: While we were still sinners, Christ died for us." --Romans 5:8. Itchy & sleepy body.
    Note: A fixed width font (Courier, Monospace, etc.) is required to see this signature correctly.
    /\___/\ Ant(Dude) @ http://aqfl.net & http://antfarm.home.dhs.org.
    / /\ /\ \ Please nuke ANT if replying by e-mail.
    | |o o| |
    \ _ /
    ( )
    --- Synchronet 3.20a-Linux NewsLink 1.114