Failed Chunklets

In the 3 Par system rather than a whole disk being shown as failed you can just see individual chunklets that have done so.

I saw the following in the GUI, under System and Summary

I investigated further via the command line to see which physical disks contained the failed chunklets. I saw all 3 failed chunklets were on the same disk.

showpdch -fail

Once I had seen this I decided to proactively evacuate all the data from disk 28 so it can be removed.

servicemag start -pdid 28

Double check its running in the background

servicemag status

Interestingly I was told by the second HP engineer a disk is not considered failed until it has 6 failed chunklets. I would say the decision of when to proactively hotspare out a disk is down to personal discretion.

Published by

2 thoughts on “Failed Chunklets

  1. I have 2 degraded disks in different cage and magazine that i want to replace with new ones. Both of them fails to move 1 chanklet and loops trying to move it.

    mc3par-f400-1 cli% servicemag status -d

    Cage 3, magazine 4:
    The magazine is being brought offline due to a servicemag start.
    The last status update was at Wed Aug 30 15:28:15 2017.
    Chunklets relocated: 1006 in 49 minutes and 11 seconds
    Chunklets remaining: 1
    Chunklets marked for moving: 1
    Estimated time for relocation completion based on 2 seconds per chunklet is: 2 seconds
    The cumulative output so far is:
    servicemag start -pdid 26
    … servicing disks in mag: 3 4
    … normal disks:
    … not normal disks: WWN [2000B4525358D703] Id [26] diskpos [0]
    … relocating chunklets to spare space…
    … chunklet 26:2139 – move_error,move_failed, failed move
    … 1 chunklets – move_error,disk_relocating, will retry
    … chunklet 26:2139 – move_error,move_failed, failed move
    … 1 chunklets – move_error,disk_relocating, will retry
    … chunklet 26:2139 – move_error,move_failed, failed move
    … 1 chunklets – move_error,disk_relocating, will retry
    … chunklet 26:2139 – move_error,move_failed, failed move
    … 1 chunklets – move_error,disk_relocating, will retry

    My question is can i just remove the drive without causing any damage on the system? or is there a procedure i can follow to manually move the chunklet and fail the drive so i can remove it safely.

    Once remove is there another procedure to rebuild the replacement drive or the system will just automatically resume rebuilding the new drive.

    Thank you for the assistance

    1. See if this helps:

      Workaround

      1. Verify and ensure the media-fail.. is a single chunklet RAID0 LD and is not part of any VV;

      showld -d media-fail..

      showldmap media-fail..

      2. Remove the LD media-fail..

      removeld media-fail..

      3. Repeat the above steps for all media-fail LDs for the failed PD (if more than one media-fail LDs exists).

      4. After deleting the media-fail LD(s), the servicemag start operation should complete successfully. If the ?servicemag start? is in

      failed state (for the affected PD), unmark and clear the status of servicemag, then start the servicemag again, it should succeed and

      the servicemag resume should start automatically, when replace the failed drive with new drive.

      servicemag unmark

      servicemag clearstatus

      servicemag start

Leave a Reply to Richard T Arnold Cancel reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.