VirtualBox

Opened 9 years ago

Closed 7 years ago

#13762 closed defect (obsolete)

"Not syncing: An NMI occurred" kernel panic

Reported by: Fiorangelo Peluso Owned by:
Component: guest control Version: VirtualBox 4.3.20
Keywords: kernel panic vboxdrv Cc:
Guest type: Linux Host type: all

Description

Hi all, we're experiencing several kernel panics with vboxdrv on a brand new HP ProLiant DL560 G8. We currently have Virtualbox 4.3.20 installed with about 15-20 vm running with both Windows and Linux OS. Attached you'll find screenshots of panic errors got from ILO Remote Console. We also tried to reinstall different x86_64 Linux distributions (CentOS 6.6, Oracle Linux 6.6, Red Hat 6.5) and different Virtualbox packages (4.3.10, 4.3.18, 4.3.20) but panics still occurred. The panic event is random and we're unable to reproduce it. It occurs after few days of activity and we would be very afraid to give up on this. Please let us know if any further info is needed. Thanks in advance.

Attachments (15)

vbox_err_3.JPG (88.4 KB ) - added by Fiorangelo Peluso 9 years ago.
Kernel Panic Screenshot
vbox_err_centos.JPG (93.4 KB ) - added by Fiorangelo Peluso 9 years ago.
Kernel Panic Screenshot
vbox_err_oracle.JPG (102.9 KB ) - added by Fiorangelo Peluso 9 years ago.
Kernel Panic Screenshot
vbox_err_oracle_2.JPG (85.2 KB ) - added by Fiorangelo Peluso 9 years ago.
Kernel Panic Screenshot
vbox_err_oracle_3.JPG (98.9 KB ) - added by Fiorangelo Peluso 9 years ago.
Kernel Panic Screenshot
vbox_err_oracle_4.JPG (92.8 KB ) - added by Fiorangelo Peluso 9 years ago.
Kernel Panic Screenshot
vbox_err_oracle_5.JPG (79.4 KB ) - added by Fiorangelo Peluso 9 years ago.
Kernel Panic Screenshot
vbox_err_redhat6.5_1.png (97.0 KB ) - added by Fiorangelo Peluso 9 years ago.
Kernel Panic Screenshot
vbox_err_redhat6.5_2.PNG (104.6 KB ) - added by Fiorangelo Peluso 9 years ago.
Kernel Panic Screenshot
cpuinfo (56.8 KB ) - added by Fiorangelo Peluso 9 years ago.
/proc/cpuinfo file
vbox_logs.tar.gz (48.2 KB ) - added by Fiorangelo Peluso 9 years ago.
VBox.log files of one vm
PD Project messages.guest.tar.gz (58.7 KB ) - added by nuess0r 9 years ago.
Log file of the crashing guest
PD Project messages.host (63.9 KB ) - added by nuess0r 9 years ago.
Log file of the host (DL360 G5)
PD Project VBox.log.1 (108.5 KB ) - added by nuess0r 9 years ago.
publicdomainwiki.vbox (19.8 KB ) - added by nuess0r 9 years ago.

Download all attachments as: .zip

Change History (31)

by Fiorangelo Peluso, 9 years ago

Attachment: vbox_err_3.JPG added

Kernel Panic Screenshot

by Fiorangelo Peluso, 9 years ago

Attachment: vbox_err_centos.JPG added

Kernel Panic Screenshot

by Fiorangelo Peluso, 9 years ago

Attachment: vbox_err_oracle.JPG added

Kernel Panic Screenshot

by Fiorangelo Peluso, 9 years ago

Attachment: vbox_err_oracle_2.JPG added

Kernel Panic Screenshot

by Fiorangelo Peluso, 9 years ago

Attachment: vbox_err_oracle_3.JPG added

Kernel Panic Screenshot

by Fiorangelo Peluso, 9 years ago

Attachment: vbox_err_oracle_4.JPG added

Kernel Panic Screenshot

by Fiorangelo Peluso, 9 years ago

Attachment: vbox_err_oracle_5.JPG added

Kernel Panic Screenshot

by Fiorangelo Peluso, 9 years ago

Attachment: vbox_err_redhat6.5_1.png added

Kernel Panic Screenshot

by Fiorangelo Peluso, 9 years ago

Attachment: vbox_err_redhat6.5_2.PNG added

Kernel Panic Screenshot

comment:1 by Fiorangelo Peluso, 9 years ago

Guest Kernel (currently 2.6.32-431.el6.x86_64 - Red Hat 6.5) boots with "intel_idle.max_cstate=0 idle=poll" kernel parameters in grub.conf file.

comment:2 by Fiorangelo Peluso, 9 years ago

Googling about that, we found that a possible solution was to disable nmi_watchdog ("echo 0 > /proc/sys/kernel/nmi_watchdog" command used) since it could be a possible NMI storm issue (this article is quite old but also interesting: http://x86vmm.blogspot.it/2005/10/linux-nmis-on-intel-64-bit-hardware.html ).

Next steps:

  • use kernel boot parameter "divider=10" since it's suggested in VirtualBox troubleshooting for Red Hat / Fedora guests;
  • Disable Hyper Threading

Any help is truly appreciated anyway.

comment:3 by Klaus Espenlaub, 9 years ago

Normally VBox should disable the NMI watchdog automatically as it interferes with VM execution (guest execution can take quite a while, and during that time the watchdog isn't re-set). No immediate idea why this doesn't work...

comment:4 by Klaus Espenlaub, 9 years ago

Could it be that you enabled some watchdog in the BIOS? Scratching my head why we don't get many reports like yours. Must be a very special config.

comment:5 by Fiorangelo Peluso, 9 years ago

I started with default BIOS configuration.. I really wonder if there is any watchdog enabled in BIOS. Anyway now server is working with nmi_watchdog manually disabled. So far so good.. really hope this is the key..

comment:6 by Frank Mehnert, 9 years ago

Please attach a VBox.log file of such a VM session. What I'm after is the exact CPU specification of your host. Thank you!

comment:7 by Fiorangelo Peluso, 9 years ago

Which VBox.log should I attach? Panic occurs on guest and I ignore the cause (doesn't seem a single vm to be the cause). Attached you'll find /proc/cpuinfo

Thanks for your feedback

by Fiorangelo Peluso, 9 years ago

Attachment: cpuinfo added

/proc/cpuinfo file

comment:8 by Frank Mehnert, 9 years ago

Please add a VBox.log file from such a VM anyway. It doesn't matter which one.

by Fiorangelo Peluso, 9 years ago

Attachment: vbox_logs.tar.gz added

VBox.log files of one vm

comment:9 by Frank Mehnert, 9 years ago

Thanks you for the log. I had an idea what it could be but looking at the log file my idea was proven wrong. Disabling the NMI watchdog on the host might help like in #13506. Of course that's only a workaround.

comment:10 by Fiorangelo Peluso, 9 years ago

Thanks for your feedback. Is it safe for the host disabling NMI watchdog?

comment:11 by Frank Mehnert, 9 years ago

Yes. The watchdog is only there for detecting unusual behavior.

in reply to:  description comment:12 by nuess0r, 9 years ago

I'm having similar issiues with my HP DL360 G5 Server. I got it second hand and installed it in dezember. Before that I was running memtest sucessfully.

Host running Debian 7 Stable, Linux 3.2.0-4-amd64 #1 SMP Debian 3.2.65-1 x86_64 GNU/Linux Guest also running Debian 7 Stable, Linux 3.2.0-4-amd64 #1 SMP Debian 3.2.65-1 x86_64 GNU/Linux

Virtualbox: 4.3.20

In my case I have three VMs which I have migrated from the old system. These VMs are still 32bit and with only one CPU. These three VMs are running absolutely fine, as they did on the old server.

One VM is a newly created one to have more computing power, it has SMP with six CPUs and 24 GByte of Ram configured. This VM crashes often. Sometimes it takes half a day, sometimes it runs for three days.

The symptom is that the guest is not responding to anything. A new VRDP connection keeps black and empty. On the host, six CPUs are running with 100%, the rest of the host is fine and works normal. Also on the host there is no information about the crash of the guest available (VBox.log or syslog).

After the restart of the guest, the entries in the /var/log/messages most of the time look the same, the name of the affected process differs, sometimes the Pid belongs to mysql, sometimes to apache etc.

I attached the messages log from the host and guest (each including boot process), the config of the crashing VM and the Log of the crashing VM.

One thing to add, I don't know if has an effect: Booting Linux on this HP DL360 G5 gives some warning message during boot, that the BIOS is somekind of buggy and I should complain at the vendor.

by nuess0r, 9 years ago

Log file of the crashing guest

by nuess0r, 9 years ago

Attachment: PD Project messages.host added

Log file of the host (DL360 G5)

by nuess0r, 9 years ago

Attachment: PD Project VBox.log.1 added

by nuess0r, 9 years ago

Attachment: publicdomainwiki.vbox added

comment:13 by nuess0r, 9 years ago

Yesterday I upgraded our server to VirtualBox 4.3.22r98236

Today the problematic VM crashed again, so this bug is still valid.

comment:14 by nj, 9 years ago

I raised ticket #14034 which seems to bear some similarity to this ticket

Last edited 9 years ago by Frank Mehnert (previous) (diff)

comment:15 by Frank Mehnert, 9 years ago

Actually I'm not sure if this is a VBox bug at all. See this Ubuntu ticket. Could you try to blacklist the hpwdt module like suggested there and check if this resolves your problem as well?

comment:16 by Frank Mehnert, 7 years ago

Resolution: obsolete
Status: newclosed
Note: See TracTickets for help on using tickets.

© 2023 Oracle
ContactPrivacy policyTerms of Use