progetti:cloud-areapd:ced-c:operations:resolve_iscsi_alarms_due_to_missing_target
Table of Contents
Resolve iSCSI alarms due to missing target
Problem description
It might happen that a compute node tries to access an iSCSI target that has disappeared for some reason. The cinder driver is still working so the problem is at a lower level, e.g. doesn't impact openstack operations. In this case the iSCSI device logs many alarms, flooding the admins with mail messages containing something like:
http://192.168.40.100/ ----------------------------------------- ERROR event from storage array iSCSIUnipdA subsystem: MgmtExec event: 7.4.3 time: Mon Nov 28 17:56:19 2016 iSCSI login to target '192.168.40.100:3260, iqn.2001-05.com.equallogic:0-fe83b6-aad7200c3-c4d00647e67568e2-volume-5bfac1ad-f884-4f18-ac40-f0423a5af4c1' from initiator '192.168.40.164:52285, iqn.1994-05.com.redhat:881385dab36e' failed for the following reason: Requested target not found. ----------------------------------------- ERROR event from storage array iSCSIUnipdA subsystem: MgmtExec event: 7.4.3 time: Mon Nov 28 17:56:19 2016 iSCSI login to target '192.168.40.100:3260, iqn.2001-05.com.equallogic:0-fe83b6-b097200c3-25200647e73568e2-volume-191a6a71-ea05-44af-86ce-51e895612d72' from initiator '192.168.40.164:52286, iqn.1994-05.com.redhat:881385dab36e' failed for the following reason: Requested target not found. ... ...
Recovery procedure
- login to the affected node; on the above example it will be 192.168.40.164 → cld-blu-15
- list the iscsi volumes effectively used by the node with
# iscsiadm -m session | cut -d' ' -f4 | sort iqn.2001-05.com.equallogic:0-fe83b6-0a2c4c0c5-fa7bc3e56da5a983-volume-43b1aba0-1020-4dc6-a205-ab4d33fed99f iqn.2001-05.com.equallogic:0-fe83b6-a857200c3-8a900647e61568e2-volume-e737884c-b6f7-486d-9c2e-ece3056d271f iqn.2001-05.com.equallogic:0-fe83b6-b2dc4c0c5-918bc32f6d158b6c-volume-3e82eca8-8535-436b-be57-7f37d68d142d iqn.2001-05.com.equallogic:0-fe83b6-ea5c4c0c5-d64bc3353dc58ca5-volume-f0f32dcd-acf7-4932-be0d-d1f06cd487c2
- compare this list with the one generated by
# iscsiadm -m discovery -t st -p 192.168.40.100 | cut -d' ' -f2 | sort iqn.2001-05.com.equallogic:0-fe83b6-0a2c4c0c5-fa7bc3e56da5a983-volume-43b1aba0-1020-4dc6-a205-ab4d33fed99f iqn.2001-05.com.equallogic:0-fe83b6-b2dc4c0c5-918bc32f6d158b6c-volume-3e82eca8-8535-436b-be57-7f37d68d142d iqn.2001-05.com.equallogic:0-fe83b6-ea5c4c0c5-d64bc3353dc58ca5-volume-f0f32dcd-acf7-4932-be0d-d1f06cd487c2
- if there are entries on the first list not showing on the second (like …486d-9c2e-ece3056d271f in this case) this means that the node thinks it's connect to a nonexistent target. In this case issue a logout from the target with
# iscsiadm -m node --target iqn.2001-05.com.equallogic:0-THE-WRONG-TARGET -p 192.168.40.100 --logout
- if you see no differences between the two lists, this means that iscsiadm got confused. You have to connect to the iscsi device and list the volumes from there:
# ssh grpadmin@192.168.40.100 Last login: Wed Dec 7 10:34:48 2016 from 192.168.40.121 on tty?? Welcome to Group Manager Copyright 2001-2016 Dell Inc. CloudUnipdVeneto> show volume Name Size Snapshots Status Permission Connections T --------------- ---------- --------- ------- ---------- ----------- - volume-1fb65656 500GB 2 online read-write 1 Y -f358-4ab8-8b aa-fb0c48a0e1 8d volume-86a39713 16GB 2 online read-write 1 Y -50ba-4885-8f dc-9d7cdf3d16 a8 volume-c930e59f 30GB 2 online read-write 1 Y -35b4-4625-a1 4a-af2ef5792f 5a volume-c85996dd 30GB 2 online read-write 2 Y -b3b7-43d8-ae 35-f15185734b 6e volume-e737884c 8GB 2 online read-write 1 Y -b6f7-486d-9c 2e-ece3056d27 1f volume-f276ae66 4GB 2 online read-write 0 Y -404f-4b0a-99 83-75eb87e7e2 dd volume-dab04b6a 5GB 2 online read-write 1 Y -f7fe-485a-b7 8e-f0927477b1 5f volume-3b8cbb62 20GB 2 online read-write 1 Y -a6d9-4a6c-a1 db-5613438455 59 ... ...
- after all the missing volumes are sorted out restart (just in case…) the openstack-cinder-volume services on cld-blu-01 and cld-blu-02
progetti/cloud-areapd/ced-c/operations/resolve_iscsi_alarms_due_to_missing_target.txt · Last modified: 2018/06/05 08:04 by mazzon@infn.it