It has gone straight to kernel bugs now, and the disk is unresponsive.
Message from syslogd@vm906233 at Dec 5 13:31:08 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#0 stuck for 49s! [kworker/0:2:25910]
Message from syslogd@vm906233 at Dec 5 13:37:13 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#0 stuck for 24s! [kworker/0:3:26358]
Top Task Manager Wiki README
0.0% st: the percentage of CPU time stolen from this virtual machine
(steal time), that is, time the guest was ready to run but had to wait
because the host was busy serving other guests.
Ideally the st value stays at 0.0%. Sustained nonzero values
suggest that the host is oversold.
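To check the st value without relying on top, the steal share can be computed from two samples of the aggregate `cpu` line in `/proc/stat`. The following is a minimal sketch, assuming a Linux guest; the function names are made up for illustration and the one-second interval is arbitrary.

```python
import time


def parse_cpu_line(line):
    """Return (total_jiffies, steal_jiffies) from the aggregate 'cpu' line."""
    fields = line.split()
    if not fields or fields[0] != "cpu":
        raise ValueError("expected the aggregate 'cpu' line of /proc/stat")
    values = [int(v) for v in fields[1:]]
    # Field order: user nice system idle iowait irq softirq steal guest guest_nice
    steal = values[7] if len(values) > 7 else 0
    # guest and guest_nice are already counted in user and nice,
    # so only the first eight fields are summed.
    total = sum(values[:8])
    return total, steal


def steal_percent(before, after):
    """Percentage of CPU time reported as steal between two samples."""
    total_b, steal_b = parse_cpu_line(before)
    total_a, steal_a = parse_cpu_line(after)
    delta_total = total_a - total_b
    if delta_total <= 0:
        return 0.0
    return 100.0 * (steal_a - steal_b) / delta_total


def sample_steal_percent(interval=1.0, path="/proc/stat"):
    """Read /proc/stat twice, `interval` seconds apart (Linux only)."""
    with open(path) as f:
        first = f.readline()
    time.sleep(interval)
    with open(path) as f:
        second = f.readline()
    return steal_percent(first, second)
```

Calling `sample_steal_percent()` repeatedly during the slow periods and logging the result gives the provider a timeline to look at rather than a single top screenshot.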
Dear [Service Provider],
I hope this message finds you well.
We have encountered severe performance issues with your service, and after researching the Proxmox VE documentation, we suspect the issue is related to the disk cache settings on the host machine. The current setting, `cache=writeback`, appears to be causing significant performance degradation. We strongly recommend switching all VMs to `cache=writethrough` to improve performance.
In a Proxmox VE environment, cache settings greatly impact storage performance. The `cache=writeback` option caches I/O operations in host memory, making writes appear faster. However, the cached writes must eventually be flushed to disk, which under high load can increase I/O wait times (wa). We believe this is contributing to the high load and the elevated steal time (st) we're experiencing.
According to official performance optimization documentation, the `cache=writethrough` mode is more reliable: a write is acknowledged only after it has reached the storage device. While it may decrease apparent write speed, we expect it to reduce system load over the long term and improve stability and reliability. This is especially important in virtualized environments, as it maximizes data consistency and durability, avoiding the risk of data loss from cached writes that have not yet been committed.
Moreover, our analysis suggests that the `cache=writeback` setting leads to high CPU usage on the host machine, particularly during heavy I/O, which coincides with a significant increase in the st value. This indicates uneven resource consumption, affecting the normal operation of the VPS and the user experience.
To ensure system stability and performance enhancement, we suggest promptly adjusting the VM configurations on the host to `cache=writethrough`. This change should alleviate the current high load issues and ensure reliable future operations. Additionally, we recommend monitoring the system after making these changes to ensure the desired outcomes. Please feel free to reach out if you have further questions.
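For reference, the change requested above amounts to editing the `cache=` option on each disk line of a VM's configuration on the host. The sketch below shows that rewrite on a single line, assuming the usual Proxmox VE form `bus: volume,option=value,...`. The function name, VM ID, and storage names are hypothetical, and only the provider can apply such a change on the host.

```python
# Cache modes accepted for VM disks.
VALID_MODES = {"none", "writethrough", "writeback", "directsync", "unsafe"}


def set_cache_mode(disk_line, mode="writethrough"):
    """Return disk_line with its cache= option set to `mode`.

    disk_line is assumed to look like
    'scsi0: local-lvm:vm-100-disk-0,cache=writeback,size=32G'.
    """
    if mode not in VALID_MODES:
        raise ValueError(f"unknown cache mode: {mode}")
    key, sep, value = disk_line.partition(": ")
    if not sep:
        raise ValueError("expected 'key: value' format")
    parts = value.split(",")
    volume, options = parts[0], parts[1:]
    replaced = False
    new_options = []
    for opt in options:
        if opt.startswith("cache="):
            # Overwrite the existing cache option in place.
            new_options.append(f"cache={mode}")
            replaced = True
        else:
            new_options.append(opt)
    if not replaced:
        # No cache option present: append one.
        new_options.append(f"cache={mode}")
    return f"{key}: " + ",".join([volume] + new_options)
```

For example, `set_cache_mode("scsi0: local-lvm:vm-100-disk-0,cache=writeback,size=32G")` yields the same line with `cache=writethrough`, leaving the volume and size untouched.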