{"id":142,"date":"2026-04-17T19:52:34","date_gmt":"2026-04-17T19:52:34","guid":{"rendered":"https:\/\/www.purple-liquid.com\/?p=142"},"modified":"2026-04-24T16:38:27","modified_gmt":"2026-04-24T16:38:27","slug":"proxmox-is-dying-long-live-proxmox","status":"publish","type":"post","link":"https:\/\/www.purple-liquid.com\/?p=142","title":{"rendered":"Proxmox Is Dying, Long Live Proxmox"},"content":{"rendered":"\n<p>I woke up one morning to notice that this blog, my AdGuard, and everything else hosted on Proxmox was offline. Thankfully the AdGuard sync was working great and the secondary instance was handling all of the traffic as expected.<\/p>\n\n\n\n<p>The Proxmox UI was unresponsive. The power was on on the server but I hard rebooted it to see what would happen. <\/p>\n\n\n\n<p>It came up fine and I started digging into the logs to figure out what had happened.<\/p>\n\n\n\n<p>The proxmox logs didn&#8217;t have anything particularly interesting. But the kernal logs were very helpful:<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>  Apr 12 23:27:43 pve kernel: nvme 0000:02:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer,\n  (Receiver ID)\n  Apr 12 23:27:43 pve kernel: nvme 0000:02:00.0:    &#91; 6] BadTLP\n  Apr 13 00:26:19 pve kernel: nvme 0000:02:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer,\n  (Receiver ID)<\/code><\/pre>\n\n\n\n<p>PCIe errors on the SSD! BadTLP (corrupted PCIe packets) and RxErr<br>(physical layer receive errors). These are signal integrity failures between the SSD and the motherboard.<\/p>\n\n\n\n<p>So either the motherboard was failing or the SSD was failing. This was a cheap PC from a secondhand store in Eugene, OR. so my money (literally and figuratively) was on the SSD. <\/p>\n\n\n\n<p>I tried a few different troubleshooting things first. I installed nvme-cli and ran <code>nvme smart-log \/dev\/nvme0<\/code>:<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>power_cycles: 2901 vs power_on_hours: 2817 \u2014 nearly 1 power cycle per hour, indicating the drive had been\ncausing crashes for a long time\n\nmedia_errors: 0 \u2014 no data corruption yet (lucky)\n\navailable_spare: 100% \u2014 drive not worn out\n\nnum_err_log_entries: 2809 \u2014 extremely high\n\nunsafe_shutdowns: 95 \u2014 95 prior crashes\/power losses<\/code><\/pre>\n\n\n\n<p>I powered off, reseated the drive, and powered back on to see if that would help. Error started back within 6 seconds of powering the PC back on. <\/p>\n\n\n\n<p>Given the current harddrive price nightmare we&#8217;re living through I was apprehensive about buying a new SSD. But for a small homelab PC like this that only had 256GB to start with, I was able to find a replacement for $60.00 USD. <\/p>\n\n\n\n<p>While I waited for it to arrive I backed up all of my VMs and containers to my NAS with vzdump. It was extremely painless. <\/p>\n\n\n\n<p>I seated the new drive, reinstalled Proxmox, and restored my VMs and containers from the backups. It was extremely simple and I was back up in less than an hour.<\/p>\n\n\n\n<p>What I wish I would have done was actually back up the Proxmox settings themselves. The cron jobs I had set to upgrade LXCs &amp; the configurations around VLAN tagging for the bridge were gone and that took me a while to remember how I had set up to start with.<\/p>\n\n\n\n<p>But once that was fixed, I was all set up and back in action.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I woke up one morning to notice that this blog, my AdGuard, and everything else hosted on Proxmox was offline. Thankfully the AdGuard sync was working great and the secondary instance was handling all of the traffic as expected. The Proxmox UI was unresponsive. The power was on on the server but I hard rebooted [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":144,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[43],"tags":[26,6],"class_list":["post-142","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-tech","tag-homelab","tag-proxmox"],"_links":{"self":[{"href":"https:\/\/www.purple-liquid.com\/index.php?rest_route=\/wp\/v2\/posts\/142","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.purple-liquid.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.purple-liquid.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.purple-liquid.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.purple-liquid.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=142"}],"version-history":[{"count":1,"href":"https:\/\/www.purple-liquid.com\/index.php?rest_route=\/wp\/v2\/posts\/142\/revisions"}],"predecessor-version":[{"id":145,"href":"https:\/\/www.purple-liquid.com\/index.php?rest_route=\/wp\/v2\/posts\/142\/revisions\/145"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.purple-liquid.com\/index.php?rest_route=\/wp\/v2\/media\/144"}],"wp:attachment":[{"href":"https:\/\/www.purple-liquid.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=142"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.purple-liquid.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=142"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.purple-liquid.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=142"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}