Hi All,
I've got a 3-node Hyper-V cluster running Server 2012 R2, and live migration is failing between two of the nodes. Live migrations from Node 2 work fine to and from all other nodes, and live migrations from Node 3 work to all other nodes. Node 1 can live migrate to Node 2 but fails when migrating to Node 3, with the following error:
Log Name: System
Event ID: 21502
Live migration of 'Virtual Machine XXXX' failed.
Virtual machine migration operation for 'XXXX' failed at migration source 'Node 1'.
The Virtual Machine Management Service failed to establish a connection for a Virtual Machine migration with host 'Node 3': A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond. (0x8007274C).
The Virtual Machine Management Service failed to authenticate the connection for a Virtual Machine migration at the source host: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond. (0x8007274C).
Failed to send data for a Virtual Machine migration: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond. (0x8007274C).
Looking around at other support articles, this error code (0x8007274C) seems to point to the Virtual Machine Management Service not listening on port 6600. However, I have confirmed it is listening on all nodes by running netstat -ano | findstr 6600, and live migration does work on all nodes (just not in one direction between these two). Quick migration works fine from Node 1 to Node 3.
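Since netstat only shows that the port is open locally, a connection test from the source side might narrow this down. Below is a rough Python sketch of that idea, not something taken from my environment: the function name can_connect is made up for illustration, and it assumes the optional source address is the IP of the node's migration vEthernet adapter, so the attempt leaves from that adapter's address rather than the management one.

```python
import socket


def can_connect(host, port=6600, source_ip=None, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout.

    source_ip is optional; when given, the outgoing connection is bound to
    that local address (assumed here to be the live-migration adapter's IP).
    """
    # Port 0 lets the OS pick any free local port for the outgoing socket.
    source = (source_ip, 0) if source_ip else None
    try:
        with socket.create_connection((host, port), timeout=timeout,
                                      source_address=source):
            return True
    except OSError:
        # Covers timeouts, refused connections and unreachable hosts.
        return False
```

Run on Node 1, a call such as can_connect("<Node 3 migration IP>", 6600, source_ip="<Node 1 migration IP>") would show whether a TCP session to port 6600 can be opened over the migration subnet at all. Both addresses are placeholders to fill in.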
All nodes are running the exact same hardware with the following NIC configuration:
Broadcom BCM57800 NetXtreme II 10 GigE cards.
Dual 10GbE ports teamed (traffic untagged at the switch).
2 vEthernet adapters configured on the teamed NIC, one for management and one solely for migrations, each on a different subnet.
Obviously this is a tricky one to troubleshoot, but if anyone can give me tips or tools to troubleshoot this further, it would be appreciated.
Cheers