healthcheck

package
v1.0.20 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Jul 29, 2022 License: Apache-2.0 Imports: 6 Imported by: 0

Documentation

Index

Constants

This section is empty.

Variables

This section is empty.

Functions

This section is empty.

Types

type GPUDevice

type GPUDevice struct{}

type GPUHealthChecker

type GPUHealthChecker struct {
	// contains filtered or unexported fields
}

GPUHealthChecker checks the health of nvidia GPUs. Note that with the current device naming pattern in device manager, GPUHealthChecker will not work with MIG devices.

func NewGPUHealthChecker

func NewGPUHealthChecker(devices map[string]pluginapi.Device, health chan pluginapi.Device, codes []int) *GPUHealthChecker

NewGPUHealthChecker returns a GPUHealthChecker object for a given device name

func (*GPUHealthChecker) Start

func (hc *GPUHealthChecker) Start() error

Start registers NVML events and starts listening to them

func (*GPUHealthChecker) Stop

func (hc *GPUHealthChecker) Stop()

Stop deletes the NVML events and stops the listening go routine

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL