Skip to content

Iostat -E, whats that all about?

September 14, 2005

Another question that seems to come up again and again. What do the errors returned by iostat -E really mean. Well now we have opensolaris it is all in the source. Iostat -E will give output like this:

sd21      Soft Errors: 0 Hard Errors: 0 Transport Errors: 0 Vendor: SEAGATE  Product: ST373307LSUN72G  Revision: 0207 Serial No: 0241B09EN1 Size: 73.40GB <73400057856 bytes> Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0 Illegal Request: 0 Predictive Failure Analysis: 0

These entries are simply all the error kstats for the device and can be printed using the kstat command;


$ kstat -n sd21,err module: sderr                           instance: 21 name:   sd21,err                        class:    device_error         Device Not Ready                0         Hard Errors                     0         Illegal Request                 0         Media Error                     0         No Device                       0         Predictive Failure Analysis     0         Product                         ST373307LSUN72G Revision         Recoverable                     0         Revision                        0207         Serial No                       0241B09EN1         Size                            73400057856         Soft Errors                     0         Transport Errors                0         Vendor                          SEAGATE         crtime                          315.74644992         snaptime                        852020.62705776


So what do they all mean? For scsi disk drives they are set up in the sd_create_errstats() routine and then updated in the sd drvier when ever each of the errors is hit.


Device Not Ready: The drive returned the sense key 0x2 (Not ready).

Media Error: The drive returned the sense key 0x3(Medium Error).

No Device: The drive returned the sense key 0x6 (Unit Attention) or in the case of a removable device it must have happened multiple times.


Hard Errors: All the above conditions are counted as Hard errors with the addition of the SCSI sense key 0x4 (Hardware Error).


Illegal Request: The drive returned the sense key 0x5 (Illegal Request). This also treats as a Soft Error and that kstat is also incremented.


Recoverable: The drive returned the sense key 0x1 (Recovered Error) to indicate that the last command completed successfully but some recovery action had to be taken by the drive. This also treats as a Soft Error and that kstat is also incremented.


Predictive Failure Analysis: The drive returned sense key 0x6 (Unit Attention) with and ASC (Additional Sense Code) of 0x5D indicating that the drive has exceeded it’s predictive failure threshold. This is treated as a soft error.


Transport Error: This error occurs for a number of reasons all related to being unable to transport the command. The command could have been timed out or reset or the host bus adapter unable to put the command onto the SCSI bus. This is neither as soft nor a hard error.


Tags: topic:[opensolaris] topic:[disks] topic:[kstats]

Advertisements

From → Solaris

Iostat -E, whats that all about?

September 14, 2005

Another question that seems to come up again and again. What do the errors returned by iostat -E really mean. Well now we have opensolaris it is all in the source. Iostat -E will give output like this:

sd21      Soft Errors: 0 Hard Errors: 0 Transport Errors: 0 Vendor: SEAGATE  Product: ST373307LSUN72G  Revision: 0207 Serial No: 0241B09EN1 Size: 73.40GB <73400057856 bytes> Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0 Illegal Request: 0 Predictive Failure Analysis: 0

These entries are simply all the error kstats for the device and can be printed using the kstat command;


$ kstat -n sd21,err module: sderr                           instance: 21 name:   sd21,err                        class:    device_error         Device Not Ready                0         Hard Errors                     0         Illegal Request                 0         Media Error                     0         No Device                       0         Predictive Failure Analysis     0         Product                         ST373307LSUN72G Revision         Recoverable                     0         Revision                        0207         Serial No                       0241B09EN1         Size                            73400057856         Soft Errors                     0         Transport Errors                0         Vendor                          SEAGATE         crtime                          315.74644992         snaptime                        852020.62705776


So what do they all mean? For scsi disk drives they are set up in the sd_create_errstats() routine and then updated in the sd drvier when ever each of the errors is hit.


Device Not Ready: The drive returned the sense key 0x2 (Not ready).

Media Error: The drive returned the sense key 0x3(Medium Error).

No Device: The drive returned the sense key 0x6 (Unit Attention) or in the case of a removable device it must have happened multiple times.


Hard Errors: All the above conditions are counted as Hard errors with the addition of the SCSI sense key 0x4 (Hardware Error).


Illegal Request: The drive returned the sense key 0x5 (Illegal Request). This also treats as a Soft Error and that kstat is also incremented.


Recoverable: The drive returned the sense key 0x1 (Recovered Error) to indicate that the last command completed successfully but some recovery action had to be taken by the drive. This also treats as a Soft Error and that kstat is also incremented.


Predictive Failure Analysis: The drive returned sense key 0x6 (Unit Attention) with and ASC (Additional Sense Code) of 0x5D indicating that the drive has exceeded it’s predictive failure threshold. This is treated as a soft error.


Transport Error: This error occurs for a number of reasons all related to being unable to transport the command. The command could have been timed out or reset or the host bus adapter unable to put the command onto the SCSI bus. This is neither as soft nor a hard error.


Tags: topic:[opensolaris] topic:[disks] topic:[kstats]

From → Solaris

2 Comments
  1. Lyle permalink

    There can be lots of other check conditions that can be returned. Here is the best reference out there:
    http://www.t10.org/lists/asc-num.htm

  2. Hi,
    Thanks for your information it was very much helpful to me.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: