Hey,
since today my (Debian9)-VM on an ESXi 6.5.0 (host, no vCenter) freezes permanently every ~ 3 hours. All services running on the VM (SSH, Apache) can no longer be accessed and I can't type anything in the VMware window (via browser or remote console).
The VM itself has been running without problems for over 300 days now, while the ESXi has a much longer runtime with 688 days. There are 3 other VMs running on this ESXi host, and they are running quite well, so I thought the problems were coming directly from the VM(Debian9), but the syslog doesn't really show anything related to the problem.
These are the VM log I got from the ESXi until I powered off the machine:
2020-06-03T09:51:55.419Z| vcpu-2| I125: APIC THERMLVT write: 0x10000
2020-06-03T09:51:55.419Z| vcpu-1| I125: APIC THERMLVT write: 0x10000
2020-06-03T09:51:55.419Z| vcpu-11| I125: APIC THERMLVT write: 0x10000
2020-06-03T09:51:55.419Z| vcpu-6| I125: APIC THERMLVT write: 0x10000
2020-06-03T09:51:55.419Z| vcpu-9| I125: APIC THERMLVT write: 0x10000
2020-06-03T09:51:55.419Z| vcpu-10| I125: APIC THERMLVT write: 0x10000
2020-06-03T09:51:55.419Z| vcpu-12| I125: APIC THERMLVT write: 0x10000
2020-06-03T09:51:55.419Z| vcpu-8| I125: APIC THERMLVT write: 0x10000
2020-06-03T09:51:55.419Z| vcpu-7| I125: APIC THERMLVT write: 0x10000
2020-06-03T09:51:55.419Z| vcpu-4| I125: APIC THERMLVT write: 0x10000
2020-06-03T09:51:55.419Z| vcpu-0| I125: APIC THERMLVT write: 0x10000
2020-06-03T09:51:55.419Z| vcpu-13| I125: APIC THERMLVT write: 0x10000
2020-06-03T09:51:55.419Z| vcpu-0| I125: Vix: [10906353 vmxCommands.c:7212]: VMAutomation_HandleCLIHLTEvent. Do nothing.
2020-06-03T09:51:55.419Z| vcpu-0| I125: MsgHint: msg.monitorevent.halt
2020-06-03T09:51:55.419Z| vcpu-0| I125+ The CPU has been disabled by the guest operating system. Power off or reset the virtual machine.
I could not get much info from the ESXi host logs but, of course, I can post some parts out of them.
And like I said, the Linux syslog is not really helpful either, which is strange because there is a gap between when the VM was first reported as down and when I restarted the VM.
I don't have that much experience in debugging and fixing problems on ESXi hosts, so I appreciate any tip that might lead me to a solution!
Thanks.