AWS – Amazon CloudWatch Agent adds Support for NVIDIA GPU Metrics
Amazon CloudWatch agent now supports the collection of NVIDIA GPU performance metrics from Amazon Elastic Compute Cloud (Amazon EC2) accelerated computing instances running Linux. GPU-based instances provide access to NVIDIA GPUs with thousands of compute cores. You can use these instances to accelerate scientific, engineering, and rendering applications. Customers can install and configure CloudWatch agent to collect system and application metrics from Amazon EC2, on-premises hosts, and containerized applications and send them to CloudWatch. CloudWatch provides you with data and actionable insights to monitor your applications and optimize resource utilization. GPU metrics are intended for users who want to monitor the utilization of GPU co-processors in their EC2 accelerated instances.
Read More for the details.