The NVIDIA GPU Query Properties

The document provides information on running Docker containers to query NVIDIA GPU properties and run the AlexNet model. It then describes various NVIDIA GPU properties that can be queried, including manufacturer settings, user settings, setup properties, runtime properties, and properties not supported by the OS or device.

Running the Collector Container

docker run --name collector -it -d --runtime=nvidia nvidia/cuda:latest \
  nvidia-smi --query-gpu=uuid,power.draw,pcie.link.gen.current,pcie.link.width.current,memory.used,memory.free,utilization.gpu,clocks.current.graphics,clocks.current.sm,clocks.current.memory \
  --format=csv --loop-ms=2000
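
Because the collector runs detached (-d), its CSV samples go to the container log. A minimal sketch for checking that samples are being produced and for following them into a file, assuming the container name "collector" from the command above (the output path is only illustrative):

docker logs --tail 5 collector
docker logs -f collector > /tmp/gpu-metrics.csv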

Running the AlexNet Container

docker run -it --rm -v /opt/ml-benchmarks/input/mlfull:/tmp/input \
  -v /home/wellington/projects/ml-benchmarks/output:/tmp/output \
  --runtime=nvidia mlbench/apps/alexnet/tensorflow/cuda
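
When the AlexNet run finishes, the detached collector keeps sampling. A sketch of stopping it, saving its samples, and cleaning up (the log file name is only illustrative):

docker stop collector
docker logs collector > /home/wellington/projects/ml-benchmarks/output/gpu-metrics.csv
docker rm collector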

The NVIDIA GPU Query Properties

Manufacturer Setting Properties

name or gpu_name: The official product name of the GPU. This is an alphanumeric
string. For all products.

serial or gpu_serial: This number matches the serial number physically printed on each
board. It is a globally unique immutable alphanumeric value.

uuid or gpu_uuid: This value is the globally unique immutable alphanumeric identifier of
the GPU. It does not correspond to any physical label on the board.

memory.total: Total installed GPU memory.

clocks.default_applications.graphics or clocks.default_applications.gr: Default frequency of applications graphics (shader) clock.

clocks.default_applications.memory or clocks.default_applications.mem: Default frequency of applications memory clock.

clocks.max.graphics or clocks.max.gr: Maximum frequency of graphics (shader) clock.

clocks.max.sm: Maximum frequency of SM (Streaming Multiprocessor) clock.

clocks.max.memory or clocks.max.mem: Maximum frequency of memory clock.


pcie.link.gen.max: The maximum PCI-E link generation possible with this GPU and system configuration. For example, if the GPU supports a higher PCIe generation than the system supports then this reports the system PCIe generation.

pcie.link.width.max: The maximum PCI-E link width possible with this GPU and system configuration. For example, if the GPU supports a wider PCIe link than the system supports then this reports the system PCIe link width.
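
As a sketch, the manufacturer setting properties listed above can be pulled in a single query, either on the host or inside a CUDA container as in the collector example:

nvidia-smi --query-gpu=name,serial,uuid,memory.total,clocks.max.graphics,clocks.max.sm,clocks.max.memory,pcie.link.gen.max,pcie.link.width.max --format=csv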

User Setting Properties

Setup Properties

inforom.oem: Version for the OEM configuration data.

power.default_limit: The default power management algorithm's power ceiling, in watts. Power Limit will be set back to Default Power Limit after driver unload.

power.min_limit: The minimum value in watts that power limit can be set to.

power.max_limit: The maximum value in watts that power limit can be set to.

display_mode: A flag that indicates whether a physical display (e.g. monitor) is currently
connected to any of the GPU's connectors. "Enabled" indicates an attached display.
"Disabled" indicates otherwise.

pci.bus_id or gpu_bus_id: PCI bus id as "domain:bus:device.function", in hex.

pci.domain: PCI domain number, in hex.

pci.bus: PCI bus number, in hex.

pci.device: PCI device number, in hex.

pci.device_id: PCI vendor device id, in hex.

pci.sub_device_id: PCI SubSystem id, in hex.


driver_version: The version of the installed NVIDIA display driver. This is an
alphanumeric string.

count: The number of NVIDIA GPUs in the system.

index: Zero based index of the GPU. Can change at each boot.

display_active: A flag that indicates whether a display is initialized on the GPU (e.g. memory is allocated on the device for display). Display can be active even when no monitor is physically attached. "Enabled" indicates an active display. "Disabled" indicates otherwise.

persistence_mode: A flag that indicates whether persistence mode is enabled for the
GPU. Value is either "Enabled" or "Disabled". When persistence mode is enabled the
NVIDIA driver remains loaded even when no active clients, such as X11 or nvidia-smi,
exist. This minimizes the driver load latency associated with running dependent apps,
such as CUDA programs. Linux only.
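
Persistence mode is toggled with nvidia-smi rather than only queried; a minimal sketch (Linux only, typically requires root):

nvidia-smi -pm 1
nvidia-smi --query-gpu=persistence_mode --format=csv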

accounting.mode: A flag that indicates whether accounting mode is enabled for the GPU. Value is either "Enabled" or "Disabled". When accounting is enabled statistics are calculated for each compute process running on the GPU. Statistics are available for query after the process terminates. See --help-query-accounted-apps for more info.

accounting.buffer_size: The size of the circular buffer that holds a list of processes that can be queried for accounting stats. This is the maximum number of processes that accounting information will be stored for before information about the oldest processes is overwritten by information about new processes.
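
A sketch of enabling accounting and reading back per-process statistics afterwards (the accounted-apps field names below should be verified against --help-query-accounted-apps on the installed driver):

nvidia-smi -am 1
nvidia-smi --query-accounted-apps=pid,gpu_util,mem_util,max_memory_usage,time --format=csv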

vbios_version: The VBIOS version of the GPU board.

inforom.img or inforom.image: Global version of the infoROM image. The image version, like the VBIOS version, uniquely describes the exact version of the infoROM flashed on the board, in contrast to the infoROM object version, which is only an indicator of supported features.

inforom.pwr or inforom.power: Version for the power management data.

gom.current or gpu_operation_mode.current: The GOM currently in use. GOM allows reducing power usage and optimizing GPU throughput by disabling GPU features. Each GOM is designed to meet specific user needs. In "All On" mode everything is enabled and running at full speed. The "Compute" mode is designed for running only compute tasks; graphics operations are not allowed. The "Low Double Precision" mode is designed for running graphics applications that don't require high bandwidth double precision. GOM can be changed with the (--gom) flag.

gom.pending or gpu_operation_mode.pending: The GOM that will be used on the next reboot.
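
On boards that support GPU Operation Mode, the pending GOM can be set and then confirmed; a hedged sketch (the change takes effect after the next reboot):

nvidia-smi --gom=COMPUTE
nvidia-smi --query-gpu=gom.current,gom.pending --format=csv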

clocks_throttle_reasons.supported: Bitmask of supported clock throttle reasons. See nvml.h for more details. Retrieves information about factors that are reducing the frequency of clocks. If all throttle reasons are returned as "Not Active" it means that clocks are running as high as possible.

clocks_throttle_reasons.active: Bitmask of active clock throttle reasons. See nvml.h for more details.

clocks_throttle_reasons.gpu_idle: Nothing is running on the GPU and the clocks are dropping to Idle state. This limiter may be removed in a later release.

clocks_throttle_reasons.applications_clocks_setting: GPU clocks are limited by applications clocks setting. E.g. can be changed by nvidia-smi --applications-clocks=

clocks_throttle_reasons.sw_power_cap: SW Power Scaling algorithm is reducing the clocks below requested clocks because the GPU is consuming too much power. E.g. SW power cap limit can be changed with nvidia-smi --power-limit=

clocks_throttle_reasons.hw_slowdown: HW Slowdown (reducing the core clocks by a factor of 2 or more) is engaged. This is an indicator of:
* temperature being too high
* External Power Brake Assertion is triggered (e.g. by the system power supply)
* Power draw is too high and Fast Trigger protection is reducing the clocks
* May be also reported during PState or clock change
* This behavior may be removed in a later release

clocks_throttle_reasons.unknown: Some other unspecified factor is reducing the clocks.
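
The throttle reasons above are ordinary query fields, so an active power cap or HW slowdown can be spotted with something like:

nvidia-smi --query-gpu=clocks_throttle_reasons.active,clocks_throttle_reasons.sw_power_cap,clocks_throttle_reasons.hw_slowdown --format=csv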

compute_mode: The compute mode flag indicates whether individual or multiple compute applications may run on the GPU.
*"Default" means multiple contexts are allowed per device.
*"Exclusive_Thread" means only one context is allowed per device, usable from one thread at a time.
*"Exclusive_Process" means only one context is allowed per device, usable from multiple threads at a time.
*"Prohibited" means no contexts are allowed per device (no compute apps).

power.management: A flag that indicates whether power management is enabled. Either "Supported" or "[Not Supported]". Requires Inforom PWR object version 3.0 or higher or Kepler device.

power.limit: The power management algorithm's power ceiling, in watts. Total board
power draw is manipulated by the power management algorithm so that it stays under
this value. On Kepler devices Power Limit can be adjusted using [-pl | --power-limit=]
switches.

clocks.applications.graphics or clocks.applications.gr: User specified frequency of graphics (shader) clock; the frequency at which applications will run. Can be changed with [-ac | --applications-clocks] switches.

clocks.applications.memory or clocks.applications.mem: User specified frequency of memory clock.
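
The user setting properties above map onto the switches they mention; a sketch of setting a power limit and application clocks and reading them back (the 200 W and 3004,875 MHz values are only examples and must fall within the limits reported for the board):

nvidia-smi -pl 200
nvidia-smi -ac 3004,875
nvidia-smi --query-gpu=power.limit,clocks.applications.memory,clocks.applications.graphics --format=csv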

Runtime Properties

timestamp: The timestamp of when the query was made, in format "YYYY/MM/DD HH:MM:SS.msec".

pcie.link.gen.current: The current PCI-E link generation. This may be reduced when the GPU is not in use.

pcie.link.width.current: The current PCI-E link width. This may be reduced when the GPU is not in use.

fan.speed: The fan speed value is the percent of maximum speed that the device's fan is
currently intended to run at. It ranges from 0 to 100 %. Note: The reported speed is the
intended fan speed. If the fan is physically blocked and unable to spin, this output will
not match the actual fan speed. Many parts do not report fan speeds because they rely
on cooling via fans in the surrounding enclosure.
pstate: The current performance state for the GPU. States range from P0 (maximum
performance) to P12 (minimum performance).

memory.used: Total memory allocated by active contexts.

memory.free: Total free memory.

utilization.gpu: Percent of time over the past second during which one or more kernels
was executing on the GPU.

utilization.memory: Percent of time over the past second during which global (device)
memory was being read or written.

temperature.gpu: Core GPU temperature, in degrees C.

power.draw: The last measured power draw for the entire board, in watts. Only available
if power management is supported. This reading is accurate to within +/- 5 watts.

clocks.current.graphics or clocks.gr: Current frequency of graphics (shader) clock.

clocks.current.sm or clocks.sm: Current frequency of SM (Streaming Multiprocessor) clock.

clocks.current.memory or clocks.mem: Current frequency of memory clock.
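
These runtime properties are what the collector container samples; the same data can be watched directly on the host, for example once per second:

nvidia-smi --query-gpu=timestamp,pstate,temperature.gpu,fan.speed,utilization.gpu,utilization.memory,memory.used,memory.free,power.draw,clocks.current.graphics,clocks.current.sm,clocks.current.memory --format=csv -l 1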

Not Supported by the OS or Device

inforom.ecc: Version for the ECC recording data.

ecc.mode.current: The Error Correcting Code (ECC) mode that the GPU is currently
operating under.

ecc.mode.pending: The ECC mode that the GPU will operate under after the next reboot.

ecc.errors.corrected.volatile.device_memory: Errors detected in global device memory.

ecc.errors.corrected.volatile.register_file: Errors detected in register file memory.

ecc.errors.corrected.volatile.l1_cache: Errors detected in the L1 cache.


ecc.errors.corrected.volatile.l2_cache: Errors detected in the L2 cache.

ecc.errors.corrected.volatile.texture_memory: Parity errors detected in texture memory.

ecc.errors.corrected.volatile.total: Total errors detected across the entire chip. Sum of device_memory, register_file, l1_cache, l2_cache and texture_memory.

ecc.errors.corrected.aggregate.device_memory: Errors detected in global device memory.

ecc.errors.corrected.aggregate.register_file: Errors detected in register file memory.

ecc.errors.corrected.aggregate.l1_cache: Errors detected in the L1 cache.

ecc.errors.corrected.aggregate.l2_cache: Errors detected in the L2 cache.

ecc.errors.corrected.aggregate.texture_memory: Parity errors detected in texture memory.

ecc.errors.corrected.aggregate.total: Total errors detected across the entire chip. Sum of device_memory, register_file, l1_cache, l2_cache and texture_memory.

ecc.errors.uncorrected.volatile.device_memory: Errors detected in global device memory.

ecc.errors.uncorrected.volatile.register_file: Errors detected in register file memory.

ecc.errors.uncorrected.volatile.l1_cache: Errors detected in the L1 cache.

ecc.errors.uncorrected.volatile.l2_cache: Errors detected in the L2 cache.

ecc.errors.uncorrected.volatile.texture_memory: Parity errors detected in texture memory.

ecc.errors.uncorrected.volatile.total: Total errors detected across the entire chip. Sum of device_memory, register_file, l1_cache, l2_cache and texture_memory.

ecc.errors.uncorrected.aggregate.device_memory: Errors detected in global device memory.

ecc.errors.uncorrected.aggregate.register_file: Errors detected in register file memory.

ecc.errors.uncorrected.aggregate.l1_cache: Errors detected in the L1 cache.

ecc.errors.uncorrected.aggregate.l2_cache: Errors detected in the L2 cache.

ecc.errors.uncorrected.aggregate.texture_memory: Parity errors detected in texture memory.

ecc.errors.uncorrected.aggregate.total: Total errors detected across the entire chip. Sum of device_memory, register_file, l1_cache, l2_cache and texture_memory.

retired_pages.single_bit_ecc.count or retired_pages.sbe: The number of GPU device memory pages that have been retired due to multiple single bit ECC errors.

retired_pages.double_bit.count or retired_pages.dbe: The number of GPU device memory pages that have been retired due to a double bit ECC error.

retired_pages.pending: Checks if any GPU device memory pages are pending retirement
on the next reboot. Pages that are pending retirement can still be allocated, and may
cause further reliability issues.

driver_model.current: The driver model currently in use. Always "N/A" on Linux.

driver_model.pending: The driver model that will be used on the next reboot. Always
"N/A" on Linux.
