#1501 closed defect (invalid)
Buffer I/O error logical block async page read (TOSHIBA MQ01ABD100)
Reported by: | 4joeyirosh1 | Owned by: | |
---|---|---|---|
Priority: | major | Milestone: | |
Component: | all | Version: | |
Keywords: | Cc: |
Description
Hi! I have Debian 11 64-bit installed on my machine.
When I booted my machine today, I got the errors below during boot. Please find the output of the smartctl command below:
# smartctl -a /dev/sda
smartctl 7.2 2020-12-30 r5155 [x86_64-linux-5.10.0-7-amd64] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Toshiba 2.5" HDD MQ01ABD...
Device Model:     TOSHIBA MQ01ABD100
Serial Number:    87PSPGCKT
LU WWN Device Id: 5 000039 7f2f0a12f
Firmware Version: AX1R4C
User Capacity:    1,000,204,886,016 bytes [1.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Form Factor:      2.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ATA8-ACS (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Sat Jun 12 19:55:38 2021 EAT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x86) Offline data collection activity
                                        was aborted by the device with a fatal error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      ( 121) The previous self-test completed having
                                        the read element of the test failed.
Total time to complete Offline
data collection:                 (  120) seconds.
Offline data collection
capabilities:                    (0x51) SMART execute Offline immediate.
                                        No Auto Offline data collection support.
                                        Suspend Offline collection upon new
                                        command.
                                        No Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 199) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   100   100   050    Pre-fail  Always       -       0
  2 Throughput_Performance  0x0027   100   100   050    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0023   100   100   002    Pre-fail  Always       -       1762
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       3079
  5 Reallocated_Sector_Ct   0x0033   099   099   010    Pre-fail  Always       -       320
  7 Seek_Error_Rate         0x002f   100   100   050    Pre-fail  Always       -       0
  8 Seek_Time_Performance   0x0025   100   100   050    Pre-fail  Offline      -       0
  9 Power_On_Hours          0x0032   058   058   000    Old_age   Always       -       17022
 10 Spin_Retry_Count        0x0033   161   100   030    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       2919
183 Runtime_Bad_Block       0x0032   100   100   001    Old_age   Always       -       0
184 End-to-End_Error        0x0033   100   100   097    Pre-fail  Always       -       0
185 Unknown_Attribute       0x0032   100   100   001    Old_age   Always       -       65535
187 Reported_Uncorrect      0x0032   001   001   000    Old_age   Always       -       1653
188 Command_Timeout         0x0032   100   100   000    Old_age   Always       -       0
189 High_Fly_Writes         0x003a   100   100   001    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   067   052   040    Old_age   Always       -       33 (Min/Max 28/42)
191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age   Always       -       213
192 Power-Off_Retract_Count 0x0022   100   100   000    Old_age   Always       -       5308497
193 Load_Cycle_Count        0x0032   100   100   000    Old_age   Always       -       5737
194 Temperature_Celsius     0x0022   067   052   040    Old_age   Always       -       33 (Min/Max 28/42)
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       36
197 Current_Pending_Sector  0x0032   100   100   000    Old_age   Always       -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0

SMART Error Log Version: 1
ATA Error Count: 1653 (device log contains only the most recent five errors)
        CR = Command Register [HEX]
        FR = Features Register [HEX]
        SC = Sector Count Register [HEX]
        SN = Sector Number Register [HEX]
        CL = Cylinder Low Register [HEX]
        CH = Cylinder High Register [HEX]
        DH = Device/Head Register [HEX]
        DC = Device Command Register [HEX]
        ER = Error register [HEX]
        ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 1653 occurred at disk power-on lifetime: 17011 hours (708 days + 19 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 41 d0 06 f6 7f 40  Error: UNC at LBA = 0x007ff606 = 8386054

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 f8 d8 86 f6 7f 40 00      01:02:19.290  READ FPDMA QUEUED
  60 08 d0 06 f6 7f 40 00      01:02:19.290  READ FPDMA QUEUED
  ef 10 02 00 00 00 a0 00      01:02:19.282  SET FEATURES [Enable SATA feature]
  ec 00 00 00 00 00 a0 00      01:02:19.281  IDENTIFY DEVICE
  ef 03 45 00 00 00 a0 00      01:02:19.281  SET FEATURES [Set transfer mode]

Error 1652 occurred at disk power-on lifetime: 17011 hours (708 days + 19 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 41 28 06 f6 7f 40  Error: UNC at LBA = 0x007ff606 = 8386054

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 f8 30 86 f6 7f 40 00      01:02:19.070  READ FPDMA QUEUED
  60 08 28 06 f6 7f 40 00      01:02:19.070  READ FPDMA QUEUED
  ef 10 03 00 00 00 a0 00      01:02:19.058  SET FEATURES [Enable SATA feature]
  ef 10 02 00 00 00 a0 00      01:02:19.046  SET FEATURES [Enable SATA feature]
  ec 00 00 00 00 00 a0 00      01:02:19.045  IDENTIFY DEVICE

Error 1651 occurred at disk power-on lifetime: 17011 hours (708 days + 19 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 41 f8 06 f6 7f 40  Error: UNC at LBA = 0x007ff606 = 8386054

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 f8 00 86 f6 7f 40 00      01:02:18.597  READ FPDMA QUEUED
  60 78 f8 06 f6 7f 40 00      01:02:18.597  READ FPDMA QUEUED
  60 30 b8 c6 f5 7f 40 00      01:02:18.597  READ FPDMA QUEUED
  60 10 b0 a6 f5 7f 40 00      01:02:18.597  READ FPDMA QUEUED
  60 08 a8 8e f5 7f 40 00      01:02:18.597  READ FPDMA QUEUED

Error 1650 occurred at disk power-on lifetime: 17011 hours (708 days + 19 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 41 08 06 f6 7f 40  Error: UNC at LBA = 0x007ff606 = 8386054

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 20 50 1e a6 94 40 00      01:02:04.069  READ FPDMA QUEUED
  60 02 08 06 f6 7f 40 00      01:02:04.060  READ FPDMA QUEUED
  60 02 18 0c f6 7f 40 00      01:02:04.027  READ FPDMA QUEUED
  60 00 10 1e 89 95 40 00      01:02:04.027  READ FPDMA QUEUED
  60 60 08 1e f6 7f 40 00      01:02:04.027  READ FPDMA QUEUED

Error 1649 occurred at disk power-on lifetime: 17011 hours (708 days + 19 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 41 e0 06 f6 7f 40  Error: UNC at LBA = 0x007ff606 = 8386054

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 00 48 1e 89 95 40 00      01:02:03.896  READ FPDMA QUEUED
  60 02 00 0c f6 7f 40 00      01:02:03.872  READ FPDMA QUEUED
  60 02 f0 0a f6 7f 40 00      01:02:03.872  READ FPDMA QUEUED
  60 02 e8 08 f6 7f 40 00      01:02:03.872  READ FPDMA QUEUED
  60 02 e0 06 f6 7f 40 00      01:02:03.872  READ FPDMA QUEUED

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed: read failure       90%     17008         209712646
# 2  Short offline       Completed: read failure       90%     17008         209712646
# 3  Short offline       Completed without error       00%      6747         -
# 4  Short offline       Completed without error       00%      6747         -
# 5  Short offline       Completed without error       00%      6747         -
# 6  Extended offline    Aborted by host               90%      6747         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
The dmesg output from Linux is below:
# dmesg
[ 8431.341307] sd 0:0:0:0: [sda] tag#13 Sense Key : Medium Error [current]
[ 8431.341316] sd 0:0:0:0: [sda] tag#13 Add. Sense: Unrecovered read error - auto reallocate failed
[ 8431.341327] sd 0:0:0:0: [sda] tag#13 CDB: Read(10) 28 00 0c 7f f6 06 00 00 08 00
[ 8431.341340] blk_update_request: I/O error, dev sda, sector 209712646 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
[ 8431.341358] Buffer I/O error on dev sda6, logical block 17, async page read
[ 8431.341415] ata1: EH complete
[ 8432.438648] ata1.00: exception Emask 0x0 SAct 0x8000000 SErr 0x40000 action 0x0
[ 8432.438653] ata1.00: irq_stat 0x40000008
[ 8432.438656] ata1: SError: { CommWake }
[ 8432.438659] ata1.00: failed command: READ FPDMA QUEUED
[ 8432.438664] ata1.00: cmd 60/08:d8:00:f6:7f/00:00:0c:00:00/40 tag 27 ncq dma 4096 in
                        res 41/40:00:06:f6:7f/00:00:0c:00:00/40 Emask 0x409 (media error) <F>
[ 8432.438666] ata1.00: status: { DRDY ERR }
[ 8432.438668] ata1.00: error: { UNC }
[ 8432.441244] ata1.00: configured for UDMA/100
[ 8432.441258] sd 0:0:0:0: [sda] tag#27 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE cmd_age=0s
[ 8432.441260] sd 0:0:0:0: [sda] tag#27 Sense Key : Medium Error [current]
[ 8432.441262] sd 0:0:0:0: [sda] tag#27 Add. Sense: Unrecovered read error - auto reallocate failed
[ 8432.441264] sd 0:0:0:0: [sda] tag#27 CDB: Read(10) 28 00 0c 7f f6 00 00 00 08 00
What do these errors mean and how can I resolve them? It seems there is a disk I/O error, and I hope I don't have to replace the disk. /dev/sda6 is my /home partition. I have managed to boot my OS and reach the GNOME desktop, and I am able to work normally, but the disk I/O issues concern me.
Please help
Attachments (3)
Change History (9)
by , 3 years ago
Attachment: | TOSHIBA_MQ01ABD100_87PSPGCKT_2021-06-12 statistics.txt added |
---|
by , 3 years ago
Attachment: | TOSHIBA_MQ01ABD100_87PSPGCKT_2021-06-12 self test.txt added |
---|
by , 3 years ago
Attachment: | TOSHIBA_MQ01ABD100_87PSPGCKT_2021-06-12 statistics.2.txt added |
---|
comment:1 by , 3 years ago
comment:2 by , 3 years ago
Priority: | minor → major |
---|
comment:3 by , 3 years ago
Summary: | Buffer I/O error logical block async page read → Buffer I/O error logical block async page read (TOSHIBA MQ01ABD100) |
---|
comment:4 by , 3 years ago
Note: All three attachments are identical.
...
Device Model:     TOSHIBA MQ01ABD100
...
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
...
  5 Reallocated_Sector_Ct   PO--CK   099   099   010    -    320
...
  9 Power_On_Hours          -O--CK   058   058   000    -    17023
...
 12 Power_Cycle_Count       -O--CK   100   100   000    -    2919
...
196 Reallocated_Event_Count -O--CK   100   100   000    -    36
...
SMART Extended Comprehensive Error Log Version: 1 (64 sectors)
...
  40 -- 41 00 d0 00 00 0c 7f f6 06 40 00  Error: UNC at LBA = 0x0c7ff606 = 209712646
...
  40 -- 41 00 f0 00 00 0c cd 35 7e 40 00  Error: UNC at LBA = 0x0ccd357e = 214775166
...
  40 -- 41 00 10 00 00 0c cc 91 7e 40 00  Error: UNC at LBA = 0x0ccc917e = 214733182
...
SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed: read failure       90%     17023         209712646
# 2  Short offline       Completed: read failure       90%     17008         209712646
# 3  Short offline       Completed: read failure       90%     17008         209712646
...
This disk has 320 sectors that have already been reallocated and at least three bad sectors (209712646, 214733182, 214775166) pending reallocation. This number may increase in the near future. The model name, the total power-on time and the power cycle count suggest that this disk is at least six years old. I suggest replacing this disk ASAP.
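As a rough illustration of how the counters quoted in this ticket can be read, the Python sketch below pulls the raw values out of `smartctl -A`-style lines and lists the non-zero ones. The choice of attribute IDs and the "any non-zero raw value deserves a look" rule are assumptions made for this example, not smartmontools policy, and the function names are made up for the sketch. The sample lines are copied from the report.

```python
# Attribute lines copied from the smartctl -a output in this ticket.
SAMPLE = """\
  5 Reallocated_Sector_Ct   0x0033   099   099   010    Pre-fail  Always       -       320
187 Reported_Uncorrect      0x0032   001   001   000    Old_age   Always       -       1653
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       36
197 Current_Pending_Sector  0x0032   100   100   000    Old_age   Always       -       0
"""

# IDs picked for this example only (an assumption, not an official list).
WATCHED = {5, 187, 196, 197}


def raw_values(text):
    """Map attribute ID -> (name, raw value) for lines in smartctl -A layout."""
    result = {}
    for line in text.splitlines():
        fields = line.split()
        # Expected columns:
        # ID NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
        if len(fields) < 10 or not fields[0].isdigit():
            continue
        try:
            raw = int(fields[9])
        except ValueError:
            continue  # raw value is not a plain integer; skip it
        result[int(fields[0])] = (fields[1], raw)
    return result


def suspicious(text):
    """Return the watched attributes whose raw value is non-zero."""
    return {
        attr_id: pair
        for attr_id, pair in raw_values(text).items()
        if attr_id in WATCHED and pair[1] > 0
    }


if __name__ == "__main__":
    for attr_id, (name, raw) in sorted(suspicious(SAMPLE).items()):
        print(f"{attr_id:3d} {name}: raw value {raw}")
```

With the sample lines this lists attributes 5, 187 and 196. Current_Pending_Sector is reported as 0 in the original output, so the unreadable LBAs show up in the error and self-test logs rather than in that counter.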
See the Bad block HOWTO for further info.
PS: This is a bug tracker, not a support forum. For future support questions, please use the smartmontools-support mailing list instead. Thanks.
comment:5 by , 3 years ago
Resolution: | → invalid |
---|---|
Status: | new → closed |
Disk has unreadable sectors which are correctly reported by smartctl.
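The match between the kernel log and the smartctl logs can be checked by hand. The Python sketch below decodes the two Read(10) CDBs printed by dmesg, using the standard Read(10) layout (bytes 2 to 5 hold the big-endian LBA, bytes 7 and 8 the transfer length), and compares the result with the LBAs from the smartctl logs. The function and constant names are made up for this example. The legacy `smartctl -a` error log prints 0x007ff606 for the same error; in this report that value equals the low 24 bits of the full LBA, which, as far as I can tell, fits the smartctl man page note that only the extended error log has room for 48-bit LBA registers.

```python
def decode_read10(cdb_hex):
    """Return (lba, sector_count) from a SCSI Read(10) CDB given as hex bytes."""
    cdb = bytes.fromhex(cdb_hex)
    if len(cdb) != 10 or cdb[0] != 0x28:
        raise ValueError("not a Read(10) CDB")
    lba = int.from_bytes(cdb[2:6], "big")    # bytes 2..5: logical block address
    count = int.from_bytes(cdb[7:9], "big")  # bytes 7..8: transfer length
    return lba, count


# Values copied from the dmesg and smartctl output in this ticket.
CDB_FIRST = "28 00 0c 7f f6 06 00 00 08 00"
CDB_SECOND = "28 00 0c 7f f6 00 00 00 08 00"
KERNEL_SECTOR = 209712646   # blk_update_request: ... sector 209712646
XERROR_LBA = 0x0C7FF606     # extended error log (comment:4)
LEGACY_LBA = 0x007FF606     # legacy error log (smartctl -a)

if __name__ == "__main__":
    lba1, n1 = decode_read10(CDB_FIRST)
    lba2, n2 = decode_read10(CDB_SECOND)
    print(f"first read:  LBA {lba1}, {n1} sectors")
    print(f"second read: LBA {lba2}, {n2} sectors")
    print("first read starts at the failing sector:", lba1 == KERNEL_SECTOR == XERROR_LBA)
    print("second read covers the failing sector:", lba2 <= KERNEL_SECTOR < lba2 + n2)
    print("legacy log shows the low 24 bits:", LEGACY_LBA == KERNEL_SECTOR & 0xFFFFFF)
```

Both reads touch sector 209712646, the same LBA that the self-test log lists as LBA_of_first_error.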
comment:6 by , 3 years ago
The serial number suggests an August 2017 manufacture date (Toshiba serial numbers are closely tied to the date of manufacture), so the drive is not that old. Still, the drive is the problem, not smartctl.
Moreover, I installed GSmartControl, and the logs from Statistics, Short Self Test and Error Log are in the attached files TOSHIBA_MQ01ABD100_87PSPGCKT_2021-06-12 statistics.txt, TOSHIBA_MQ01ABD100_87PSPGCKT_2021-06-12 self test.txt and TOSHIBA_MQ01ABD100_87PSPGCKT_2021-06-12 error log.txt.
Please help