Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

General Protection Fault on Dell R720 #677

Open
ikogan opened this issue Feb 28, 2024 · 15 comments
Open

General Protection Fault on Dell R720 #677

ikogan opened this issue Feb 28, 2024 · 15 comments

Comments

@ikogan
Copy link

ikogan commented Feb 28, 2024

Recently I started encountering a GPF in UEFI on at least 2 Dell R720s with BIOS 2.9.0:

image

I'm trying to boot a Proxmox on a Samsung SSD 950 PRO NVMe drive using this Sabrent adapter: https://www.amazon.com/gp/product/B084GDY2PW.

This was working for quite some time and randomly failed on my existing server today. It's also failing the same way on another R720 that wasn't being used until today. This stack trace happens both when attempting to boot from the NVMe and when doing simple things like "Print all UEFI boot options to log". I'm trying Clover 5157.

@k0lja
Copy link

k0lja commented Mar 8, 2024

Encountered the same Problem in UEFI on 2 R640s with BIOS 2.20.1. Also trying to run proxmox with a similar m2 nvme adapter and CloverBootloader 5157. Same Error with 5156.

@dascathea
Copy link

Hi sir,

Did you ever solve this issue? I have Proxmox running on samsung 960 pro for a while without issues. I upgraded some rams and then the system was still booting fine. Then I plugged back my midplane drives, now it doesnt boot.

@ikogan
Copy link
Author

ikogan commented Mar 11, 2024

Nope, I ended up moving my OS to a normal SSD and using my NVMe as just another local-lvm thin pool.

@loop1dev
Copy link

I ran into the same issue with a T620. I found that rEFInd worked after adding the nvme driver.

@Armynator
Copy link

Same problem on a R720xd. Downgrading to some random older version (Clover 5132) worked.

@brandonw62
Copy link

Encountered the same exact error on my dell R620 running BIOS version 2.9.0. Ran Boot Disk Utility (BDU) with clover version 5156 and am back up and running again. Here is a download link for BDU: https://www.softpedia.com/get/System/Boot-Manager-Disk/Bootdisk-Utility.shtml

@titou10titou10
Copy link

titou10titou10 commented Mar 29, 2024

Same problem here. R720 Bios 2.9.0, Clover v5157 NVMe M2 SSD on a PCIe card on slot 6

@SergeySlice
Copy link
Collaborator

I will wait for some essential observation or investigation.

@ikogan
Copy link
Author

ikogan commented Apr 20, 2024

What are some observations or investigation that could be helpful? I have an R720 that hasn't been configured yet that I might be able to do some work with if that one has this issue as well.

@SergeySlice
Copy link
Collaborator

What are some observations or investigation that could be helpful? I have an R720 that hasn't been configured yet that I might be able to do some work with if that one has this issue as well.

I wish you find what to report other than "not working".

@ikogan
Copy link
Author

ikogan commented May 11, 2024

What are some observations or investigation that could be helpful? I have an R720 that hasn't been configured yet that I might be able to do some work with if that one has this issue as well.

I wish you find what to report other than "not working".

I'm a little confused, maybe we have a misunderstanding as I feel like there's a good bit more detail in this thread than that. To summarize:

  • I own a few Dell R720s.
  • I am attempting to boot them off a Samsung SSD 950 PRO NVMe drive using a PCI-e adapter. The BIOS on these Dells does not support NVMe boot.
  • The operating system installed on the NVMe drive was Proxmox, running a Linux 6.5 kernel, I can't remember the exact version. Probably something like 6.5.11.
  • I have installed Clover on to a USB 3.0 flash drive
  • Suddenly, and with no changes to the system, I started getting the error in the screenshot taken at the start of this post, a General Protection Fault (13) with the attached trace when booting that machine.
  • This was happening both on a server that had been working and on a new one I hadn't yet setup.
  • I also get the same error if I try to "Print all UEFI boot options to log" in the Clover UI. This was done with Clover version 5157.

Unfortunately at this time I have found a use for my remaining R720 but may be getting another in the future. Should I get one, is there other information that I could provide that would help this out? Again, for a full stack dump please see the screenshot in the first post.

@ikogan
Copy link
Author

ikogan commented May 22, 2024

It's the option shown in the "Boot Options" screen here: https://github.com/5T33Z0/Clover-Crate/blob/main/GUI/Boot_Menu_Options.md#clover-boot-options:

https://user-images.githubusercontent.com/76865553/181207372-8dd33de4-9932-4ad4-9558-5c2819b7c102.png

@NachtRaben
Copy link

Encountered this problem on an R630 and had to hard power cycle the server. Turned out to be an issue with the UEFI not communicating with the lifecycle controller. Unsure if related or coincidence that I encountered the same exact fault message.

@jcastro
Copy link

jcastro commented Oct 18, 2024

Same here on R720. With Clover 5122 it boots but the partition is hidden (I need to press F3). Not sure what's new on latest clover but the partitions were showing up automatically

@wcb33
Copy link

wcb33 commented Oct 31, 2024

same problem on R720XD

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests