Skip to content

System fails to resume from suspend with NVIDIA GSP heartbeat timeout #1125

@SorenHJohansen

Description

@SorenHJohansen

NVIDIA Open GPU Kernel Modules Version

595.58.03-2 (from package nvidia-open-dkms)

Please confirm this issue does not happen with the proprietary driver (of the same version). This issue tracker is only for bugs specific to the open kernel driver.

  • I confirm that this does not happen with the proprietary driver package.

Operating System and Version

Arch Linux

Kernel Release

6.19.13-arch1-1 SMP PREEMPT_DYNAMIC Tue, 21 Apr 2026 23:38:22 +0000 x86_64 GNU/Linux (stable release)

Please confirm you are running a stable release kernel (e.g. not a -rc). We do not accept bug reports for unreleased kernels.

  • I am running on a stable kernel release.

Hardware: GPU

NVIDIA GeForce RTX 3080 (UUID: GPU-8c983cec-3f59-dd0b-50c0-b0d42da360bb)

Describe the bug

After suspending the system, resume fails with NVIDIA GSP (Graphics System Processor) heartbeat timeout errors, followed by DRM atomic modeset failures. The system becomes unresponsive and requires a hard reset to recover.

To Reproduce

To Reproduce:

  1. Use system normally
  2. Suspend the system
  3. Wait for any duration
  4. Attempt to resume the system
  5. Observe failure to resume properly, requiring hard reset

Bug Incidence

Sometimes

nvidia-bug-report.log.gz

nvidia-bug-report.log.gz

More Info

The problem appears to be that after the GSP heartbeat times out, the driver fails to properly reset the GPU and restore display configuration, leading to DRM atomic modeset failures.

Key error messages from journalctl:

  • apr 27 18:37:30 kernel: NVRM: _kgspIsHeartbeatTimedOut: Heartbeat timed out, currentTimeMs 3486357473 heartbeat 0 heartbeatWithOffsetMs 0 diff 3486357473 timeout 5200
  • apr 27 18:37:30 kernel: NVRM: _kgspRpcRecvPoll: GSP RM heartbeat timed out
  • apr 27 18:37:31 kernel: [drm:nv_drm_atomic_apply_modeset_config [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000a00] Failed to initialize semaphore for plane fence
  • apr 27 18:37:31 kernel: [drm:nv_drm_atomic_commit [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000a00] Failed to apply atomic modeset. Error code: -11

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions