SCVMM 2012 R2 – Error 20506 / 20552 / 2606

Had an interesting issue today, All hosts spread over two clusters were in a state of ‘not responding’ with no clear explanation why.

It looked as though the issue began at 2am, a time when nobody would have been working on the VMM console.

Initial diagnosis showed that the network was up, The Failover Cluster was working just fine, but VMM had lost WinRM connectivity to the hosts.

The usual error 20506 appeared in the host status window indicating that WinRM “could not complete the Windows Remote Management request on computer xxx”

Furthermore each time the Refresh Host Cluster job would fail it would show errors

2606 – unable to perform the job because on or more of the selected objects are locked by another job

20552 – VMM does not have appropriate permissions to access the resource on the xxx server

I spent a fail amount of time looking into the WinRM issue, everything was fine and nothing had changed. WinRM tests showed that the VMM server was able to communicate with the hosts.

I looked for locked jobs or running jobs, even checking the SQL server using the query

SELECT * FROM [VirtualManagerDB].[dbo].[tbl_VMM_Lock] where TaskID=’Task_GUID’ (See http://support.microsoft.com/kb/2795040 )

but that showed no jobs were active at all.

 

The thing that resolved it, was running another job (a non cluster related job) in my case i changed a run-as account. As soon as this went through I was able to refresh the cluster just fine.

I cant completely explain this one, my guess is that a phantom job had gotten stuck somewhere. Thought i would share the results.