HP 3Par Replacing a Failed Disk

Replacing a failed disk in a 3PAR is pretty simple; you just need to follow a few steps to make sure you do it safely.

1 Check to see if you have any failed or degraded disks in the system. Take a note of the disk ID and cage position.

3PARSAN01 cli% showpd -failed -degraded

                           -Size(MB)-- ----Ports----
Id CagePos Type RPM State   Total Free A     B     Cap(GB)
46 2:8:0?  FC    10 failed 417792    0 ----- -----     450
------------------------------------------------------------
 1 total                   417792    0
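If you capture the CLI output to a text file (for example from an SSH session log), the disk ID and cage position can be pulled out programmatically. The following is a minimal Python sketch, assuming the column layout shown above; the function name parse_failed_disks and the sample text are illustrative and not part of any HP tool.

```python
import re

# Sample text in the layout shown above. In practice you would paste or
# capture the output of `showpd -failed -degraded` yourself.
SAMPLE = """\
                           -Size(MB)-- ----Ports----
Id CagePos Type RPM State   Total Free A     B     Cap(GB)
46 2:8:0?  FC    10 failed 417792    0 ----- -----     450
------------------------------------------------------------
 1 total                   417792    0
"""

# A data row starts with a numeric PD ID followed by cage:magazine:disk.
# The trailing "?" that can follow the position is optional.
ROW = re.compile(r"^\s*(\d+)\s+(\d+):(\d+):(\d+)\??\s+(\S+)\s+(\d+)\s+(\S+)")


def parse_failed_disks(text):
    """Return one dict per disk row; header, rule and total lines are skipped."""
    disks = []
    for line in text.splitlines():
        m = ROW.match(line)
        if m:
            pd_id, cage, mag, disk, dtype, _rpm, state = m.groups()
            disks.append({
                "id": int(pd_id),
                "cage": int(cage),
                "magazine": int(mag),
                "disk": int(disk),
                "type": dtype,
                "state": state,
            })
    return disks


for d in parse_failed_disks(SAMPLE):
    print(f"PD {d['id']} is {d['state']}: cage {d['cage']}, "
          f"magazine {d['magazine']}, disk {d['disk']}")
```

Keeping the cage, magazine and disk numbers as separate fields makes it easier to match the drive against the servicemag status output in the next step.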

 

2 Check whether the servicemag command has run against the drive. The servicemag command tells the system to evacuate all the chunklets from a drive so that it is ready for service. Below we can see that servicemag has succeeded on the drive we identified in step 1.

3PARSAN01 cli% servicemag status

Cage 2, magazine 8:
The magazine was successfully brought offline by a servicemag start command.
The command completed Thu Jul 10 20:07:03 2014.
servicemag start -pdid 46 -- Succeeded

 

3 Next we double check there is no data left on the drive. You can do this by running showpd -space driveID as below. Check that all columns other than Size and Failed are zero.

3PARSAN01 cli% showpd -space 46

 

Id CagePos Type -State-   Size Volume Spare Free Unavail Failed
46 2:8:0?  FC   failed  417792      0     0    0       0 417792
---------------------------------------------------------------
 1 total                417792      0     0    0       0 417792
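The "everything except Size and Failed is zero" check can also be scripted against captured output. This is a rough Python sketch under the assumption that the columns appear as in the output above; is_evacuated is a made-up helper name, not a 3PAR command.

```python
# Sample text in the layout shown above for `showpd -space 46`.
SAMPLE = """\
Id CagePos Type -State-   Size Volume Spare Free Unavail Failed
46 2:8:0?  FC   failed  417792      0     0    0       0 417792
---------------------------------------------------------------
 1 total                417792      0     0    0       0 417792
"""


def is_evacuated(text, pd_id):
    """True if every column except Size and Failed is zero for the given PD."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    header = next(ln.split() for ln in lines if ln.split()[0] == "Id")
    # Strip the decorative dashes from names such as "-State-".
    names = [h.strip("-") for h in header]
    for ln in lines:
        fields = ln.split()
        # The length check skips the "total" line, which has fewer fields.
        if fields[0] == str(pd_id) and len(fields) == len(names):
            row = dict(zip(names, fields))
            return all(int(row[col]) == 0
                       for col in ("Volume", "Spare", "Free", "Unavail"))
    raise ValueError(f"PD {pd_id} not found in output")


print("safe to pull" if is_evacuated(SAMPLE, 46) else "still holds data")
```

Looking columns up by header name rather than by position is a deliberate choice here, since the column set may differ between 3PAR OS versions.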

 

4 Next, replace the physical disk. Make sure you are happy with the above steps, then pop that bad boy out; you will have a note of the location of the failed drive from step 1. If you need to see how to physically remove and reinsert the disk, take a look at the demo provided by HP here.

5 Once the disk is in you can monitor the progress of the rebuild by running servicemag status, which will give you an ETA for completion.

3PARSAN01 cli% servicemag status

Cage 2, magazine 8:
The magazine is being brought online due to a servicemag resume.
The last status update was at Thu Jun 26 12:09:19 2014.
Chunklets relocated: 73 in 50 minutes and 34 seconds
Chunklets remaining: 400
Chunklets marked for moving: 400
Estimated time for relocation completion based on 41 seconds per chunklet is: 4hours, 57 minutes and 39 seconds
servicemag resume 2 8 -- is in Progress
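If you are checking back on the rebuild periodically, a small script can turn the status text into a progress figure. The sketch below is only illustrative: relocation_progress is a hypothetical helper, and it assumes that relocated plus remaining chunklets gives the total amount of work, which the CLI output does not state explicitly. The system's own ETA line remains the figure to trust.

```python
import re

# Sample text in the layout shown above for `servicemag status`.
SAMPLE = """\
Cage 2, magazine 8:
The magazine is being brought online due to a servicemag resume.
The last status update was at Thu Jun 26 12:09:19 2014.
Chunklets relocated: 73 in 50 minutes and 34 seconds
Chunklets remaining: 400
Chunklets marked for moving: 400
Estimated time for relocation completion based on 41 seconds per chunklet is: 4hours, 57 minutes and 39 seconds
servicemag resume 2 8 -- is in Progress
"""


def relocation_progress(text):
    """Extract chunklet counts and a rough percent-complete figure."""
    relocated = int(re.search(r"Chunklets relocated:\s*(\d+)", text).group(1))
    remaining = int(re.search(r"Chunklets remaining:\s*(\d+)", text).group(1))
    # Assumption: total work = relocated + remaining.
    total = relocated + remaining
    percent = 100.0 * relocated / total if total else 100.0
    return {
        "relocated": relocated,
        "remaining": remaining,
        "percent": percent,
        "in_progress": "is in Progress" in text,
    }


p = relocation_progress(SAMPLE)
print(f"{p['relocated']} of {p['relocated'] + p['remaining']} chunklets moved "
      f"({p['percent']:.1f}%), in progress: {p['in_progress']}")
```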

 

6 Once this is complete, you can check that your disks are showing in a normal state with showpd -state.

 

3PARSAN01 cli% showpd -state

Id CagePos Type -State- --------------------Detailed_State---------------------
43 2:5:0   FC   normal  normal
44 2:6:0   FC   normal  normal
45 2:7:0   FC   normal  normal
46 2:8:0   FC   normal  normal
47 2:9:0   FC   normal  normal
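On a system with many disks it is easy to miss one abnormal row by eye. As a hedged example, the Python sketch below lists any disk whose State column is not normal; abnormal_disks is an invented helper name and the sample mirrors the output above.

```python
# Sample text in the layout shown above for `showpd -state`.
SAMPLE = """\
Id CagePos Type -State- --------------------Detailed_State---------------------
43 2:5:0   FC   normal  normal
44 2:6:0   FC   normal  normal
45 2:7:0   FC   normal  normal
46 2:8:0   FC   normal  normal
47 2:9:0   FC   normal  normal
"""


def abnormal_disks(text):
    """Return (id, cage position, state) for every disk not in 'normal' state."""
    bad = []
    for line in text.splitlines():
        fields = line.split()
        # Data rows start with a numeric PD ID followed by a cage position.
        if len(fields) >= 4 and fields[0].isdigit() and ":" in fields[1]:
            if fields[3] != "normal":
                bad.append((int(fields[0]), fields[1], fields[3]))
    return bad


problems = abnormal_disks(SAMPLE)
print("all disks normal" if not problems else f"check these disks: {problems}")
```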

 

That’s it, job done!

 



25 thoughts on “HP 3Par Replacing a Failed Disk”

  1. I believe servicemag resume cageposition is not necessary with 3.1.2 and newer. System will automatically detect new PD and start rebuilding.

  2. Hi Richard,

    Thanks for your article on replacing HDDs.
    I have a requirement to replace 4 HDDs on an 8-cage 7400 3PAR box.
    Is it possible to do so at the same time?

    1. Not sure why you would be replacing 4 disks at the same time. If they have already failed, I see no harm in replacing them with a brief pause in between. If you are removing disks for any other reason, I would do one at a time.

  3. Disk 3 failed on our 7200 node. The disk ID was 3, and when the “relocation” started on the newly replaced drive the new disk was assigned a disk ID of 48. Is there a way to change IDs?

    1. Hi, you are quite right, the ID will change after a disk swap. I have never seen a way to change it back, but will publish your comment here to see if anyone else has come across this. Thanks for reading!

  4. Hi,

    Have a V800, with a failed disk within a 4-disk cage. Its state is failed, and the disk is empty when running showpd -space pdid as you describe. When I run servicemag status I'm not seeing any servicemag operations like you have shown; it just says “no servicemag operations running”. Any thoughts? Do I need to perform something active before replacing the disk, i.e. pulling out the cage?

    1. Hi. The disks on the V800 are held in disk magazines of 4 disks as you describe. The servicemag command will work at the magazine level and log or relocate the data from all 4 disks in the magazine you wish to remove. When the servicemag command has completed you can remove the magazine.

      1. Thanks a bunch, it worked and the drive has been replaced now 🙂

        I have another question:

        I have 2 disks in degraded state, 1 in a 7400 system and 1 in a v800 system. But it remains in degraded state and does not change to failed as I would expect it to do after some time?

        Is there a way to force it into failed mode or will I be able to run the servicemag start on these disks even though they are only in degraded state?

        1. It could remain in a degraded state if it had failed chunklets on it; you can check for this using showpd -c. The disk will only be marked failed automatically if it has 6 failed chunklets on it. Like you say, you can force an evacuation by using servicemag start if you like. The only other thing I can think of is to check that the firmware versions on those degraded disks look correct using showpd -i. Good luck!

      2. Hi,
        In this case we ran the command servicemag start -log -pdid 137
        and all 4 disks now show no data (3 disks are in a degraded state, 1 is failed).

        We didn't replace the disk for 24 hours.

        Does this have any impact?

  5. Hello Richard,
    I’ve got 3Par InServ e200 system, and after replacing bunch of failed disks, I still have this error:
    inserv-e200b cli% showpd -failed -degraded
                                 --Size(MB)-- ----Ports----
    Id CagePos Type Speed(K) State    Total Free A     B
     1 0:12:0? FC         10 failed  380928    0 ----- -----
     3 0:3:0?  FC         10 failed  380928    0 ----- -----
     7 0:7:0?  FC         10 failed  380928    0 ----- -----
    10 0:10:0? FC         10 failed  380928    0 ----- -----
    12 0:12:0? FC         10 failed  380928    0 ----- -----
    15 0:15:0? FC         10 failed  380928    0 ----- -----
    16 1:0:0?  FC         10 failed  380928    0 ----- -----
    20 1:4:0?  FC         10 failed  380928    0 ----- -----
    21 1:5:0?  FC         10 failed  380928    0 ----- -----
    25 1:9:0?  FC         10 failed  380928    0 ----- -----
    30 1:14:0? FC         10 failed  380928    0 ----- -----
    33 0:3:0?  FC         10 failed  380928    0 ----- -----
    ----------------------------------------------------------
    12 total                        4571136    0

    Is there a command to clean all the settings and make the system like new? At this point I don’t care about any data.
    Thanks in advance
    Bohdan

    1. In this scenario, I would first check showpd -p -cg 0 -mg 12, then do the same for the other drives. If more than one drive is listed in a particular slot, check servicemag status; if it shows succeeded, go ahead and run dismisspd 1. Repeat this for all the drives listed above. To clean the system, the steps given by 3pardude are perfect.

  6. Hi
    I am using an HPE 3PAR StoreServ 7200 with 16 disks and 8 empty HDD bays, and I want to add 4 new disks. I inserted the new disks and did every single step right, but somehow the status shows “degraded” for a while and then “failed”. What can I do?
    Thanks a lot, and sorry for bad english.

      1. Yes, my 3PAR OS version was 3.2.1 mu2 and the hard disks were not listed in that version of the OS, so I found out that I had to update the OS at least to 3.2.1 mu21. I updated it to 3.2.2 mu2 and that worked perfectly.

  7. I have replaced a disk on an HP 3PAR 7400; the disk went online and the rebuild completed. But the old disk (degraded) is still showing in the system. I’ve tried dismisspd without any luck. Any suggestions?

  8. At step 3, you may use “showpd -c 46” to check the detailed chunklet information, other than “showpd -space 46”.

    1. You could run the Out Of Box setup again, but this would remove your previous config and volumes. Of course, this would not be the same as a secure wipe, i.e. the data could still be recovered by experts.
