帮助手册

预测

基于 Zia 的预测是一个强大的工具,利用高级分析和机器学习技术,对监视器的性能提供精确的预测和洞察。通过分析资源生成的各种指标,基于 Zia 的预测可帮助用户做出明智的决策,并识别资源中的潜在异常。

使用场景

设想这样一个场景:您是一位 IT 运营经理,负责维护公司关键基础设施的健康状况和性能,需要确保服务器、数据库和应用程序平稳运行,并主动预防可能影响用户的问题。借助基于 Zia 的预测,您可以为各种性能指标设置自定义阈值,并在预测值超出这些阈值时收到告警。例如,若您的数据库 CPU 使用率预计在未来 24 小时内达到 90%,基于 Zia 的预测将提前通知您,使您能够采取预防措施,如扩展资源或优化查询。这种主动的方式有助于维持高可用性和高性能,最终将宕机时间降至最低。

基于 Zia 的预测:告警

基于 Zia 的预测允许用户为其监视器中的特定属性选择并设置自定义阈值,使用户能够专注于最相关的指标,并在预测指标突破定义的限制时收到告警。这可以通过将监视器状态更改为故障或通知监视器行为变化来实现。从而使用户能够主动监控其资源,并对任何行为变化做出响应。

除阈值监控外,基于 Zia 的预测还提供对资源季节性的洞察。通过分析历史数据,识别资源性能随时间变化的规律和趋势,帮助用户预测特定时期内的潜在问题,并采取主动措施降低风险。

凭借其高级分析和预测能力,基于 Zia 的预测使用户能够做出基于数据的明智决策。通过精准预测监视器性能并突出显示异常或风险,帮助用户优化资源管理并确保以七天为周期的平稳运营。

要启用基于 Zia 的预测功能以获取潜在阈值超限的通知,请:

  1. 登录 Site24x7。
  2. 前往管理 > 配置文件
  3. 选择 Zia 预测支持的任意监视器。请参阅下方列表了解支持的监视器和属性。
  4. 编辑阈值配置文件表单中,点击添加 Zia 预测阈值,为基于 Zia 的预测支持的属性添加 Zia 预测阈值。


    图 1. 添加 Zia 预测阈值
  5. 点击保存

工作原理

通过在基于 Zia 的预测中配置阈值设置,您可以为通知建立特定的触发点。例如,假设您设置 80% 触发故障通知,90% 触发严重通知。在此场景中,若您将 Zia 预测阈值设置为 85%,则当 Zia 预测监视器可能接近该阈值时,您将提前收到通知。这些主动通知使您能够采取预防措施,避免监视器达到严重状态。

您可以在以下位置找到基于 Zia 的预测:

仪表板

前往首页 > 仪表板。选择支持基于 Zia 预测的任意监视器类型。如果您创建的小组件符合基于 Zia 预测支持的时间段和属性,图表将反映预测数据。预测值在面积图中以半透明区域表示,在折线图中以虚线表示。

要启用或禁用基于 Zia 的预测,请点击顶部栏的编辑仪表板。您可以点击图表上的 Zia 图标来禁用此功能。


图 2. 编辑仪表板

 

 


图 3. 预测图表中的 Zia 图标

 

报表

从左侧导航栏前往报表 > 性能报表。启用顶部栏中显示的显示预测值后,您将能够在图表中看到预测数据。预测数据在面积图中以蓝色半透明区域表示,在折线图中以虚线表示。

 


图 4. 性能报表

 

定时报表
  1. 前往管理 > 报表设置 > 定时报表
  2. 点击右侧顶部栏的定时报表
  3. 在表单中,从报表类型字段的下拉菜单中选择性能报表
  4. 接下来,选择支持基于 Zia 预测的监视器类型,之后您将看到启用或禁用报表中 Zia 预测值的选项。此选项默认禁用。
  5. 点击保存


    图 5. 定时报表中的 Zia 预测
注意

在报表表格中,预测值以斜体区分显示。

公开报表
  1. 前往管理 > 共享 > 公开报表
  2. 点击右侧顶部栏的发布报表
  3. 在表单中,从报表类型字段的下拉菜单中选择公开报表
  4. 接下来,选择基于 Zia 预测支持的监视器类型,之后您将看到通过提供的切换按钮启用或禁用报表中基于 Zia 预测值的选项。此选项默认禁用。
  5. 点击保存


    图 6. 公开报表中的 Zia 预测
注意

在报表表格中,预测值以斜体区分显示。

注意

 

  • 资源需要为监视器提供至少五个连续数据点才能支持预测功能。
  • 选择报表周期时,请注意此功能仅适用于以下选项:
    • 本周
    • 过去 7 天(至昨天)
    • 过去 30 天(至昨天)
    • 本月
    • 本季度
    • 本年

 

预测功能适用于以下监视器类型和属性。

Azure

监视器类型 属性
Azure Virtual Machine
  Network Out
  Percentage CPU
  Disk Write Bytes
  Network In
  Disk Read Bytes
NetApp Capacity Pools Pool Allocated to Volume Size
  Pool Consumed Size
  Total Snapshot Size for the Pool
Cloud Services Percentage CPU
  Available Memory Bytes
Virtual Networks Failed Pings to a VM
  Round trip time for Pings to a VM
Bastions Session Count
  Used CPU
  Used Memory
Network Connections BitsInPerSecond
  BitsOutPerSecond
Managed Clusters Disk Used Percentage
  CPU Usage Percentage
  Memory RSS Percentage
App Configuration HTTP Incoming Request Count
  HTTP Incoming Request Duration
  Throttled HTTP Request Count
Network Watcher Connections AverageRoundtripMs
  RoundTripTimeMs
Data Explorer Cluster Instance Count
  CPU
  Total Number of Throttled Commands
Azure App Service Http3xx
  Http101
  Http2xx
  AppConnections
  Http401
  Http5xx
  Http4xx
  Http403
  Handles
  RequestsInApplicationQueue
  CpuTime
  Requests
  BytesReceived
  AverageMemoryWorkingSet
  AverageResponseTime
  Http404
  BytesSent
  Http406
Automation TotalJob
  TotalUpdateDeploymentRuns
  TotalUpdateDeploymentMachineRuns
Automation Accounts Memory percent
  Storage percent
  CPU percent
Server Farms File Storage Used Percent
  File Storage Percent
  File System Storage
NAT Gateway Dropped Packets
  Packets
  Total SNAT Connection Count
Storage Transactions
  Success Server Latency
  Uptime
  Success E2E Latency
Container Apps CPU Usage
  Replica Count
Cosmos DB Memory percent
  CPU percent
  Storage used
String Apps Tomcat Sessions Expired
  Tomcat Sessions Active Current
AzureDB for MYSQL Flexible Server Host Memory Percent
  IO Percent
  Host CPU Percent
Web PubSub Services Outbound Traffic
  Connection Count
Database for MySQL Servers dtu_consumption_percent
  dtu_used
  dtu_limit
Net App Account Volume Percentage Volume Consumed Size
  Volume Allocated Size
Stream Analytics Jobs Runtime Errors
  SU (Memory) Utilization %
  CPU Utilization %
Route Servers Data Processed by the Virtual Hub Router
  Bgp Peer Status
Managed Grafana HTTP Request Count
CDN Profiles ResponseSize
  ByteHitRatio
  RequestCount

Amazon Web Services

监视器类型 属性
Lightsail Database Network Transmit Throughput
  Network Receive Throughput
  Disk Queue Depth
  CPU Utilization (%)
Database Migration Service (DMS) Swap Usage
  Memory Usage
  CPU Utilization
Simple Email Service (SES) Emails Sent in Last 24hrs
  Email Usage in the Last 24 Hours
Transit Gateway Bytes Out
  Packet Drop Count Blackhole
  Bytes Drop Count Blackhole
  Bytes Drop Count No Route
  Packet Drop Count No Route
  Packets Out
  Bytes In
  Packets In
Route 53 Resolver Outbound Query Volume
  Inbound Query Volume
Network load balancer Consumed LCU Sum
Elastic Kubernetes Service (EKS) Memory utilized by pods
  Memory utilized by nodes
  CPU utilized by pods
  CPU units used by nodes
  CPU utilization by nodes
VPC-VPN connection Tunnel Data Received
  Total Tunnel Data Sent
  Tunnel Data Sent
  Total Tunnel Data Received
Route 53 Health Check Time To First Byte
  TCP Connection Time
  Time To First Byte
  Health Percentage
Storage Gateway File Cache Hit Percent
  Cache Percent Used
  Cache Percent Dirty
Elastic MapReduce Total Load
  HDFS Bytes Written
  S3 Bytes Read
  S3 Bytes Written
  HDFS Bytes Read
  Capacity Remaining
  HDFS Utilization
Lightsail Instance Network In
  Network Out
  CPU Utilization (%)
Storage Gateway Cache Hit Percent
  Upload Buffer Percent Used
  Working Storage Percent Used
  Cache Percent Used
  Cache Percent Dirty
  User Cpu Percent
  IO wait percent
Cloud Search Searchable Documents
  Index Utilization
Elastic Container Service (ECS) Running Tasks
  Memory Utilization
  CPU Utilization
EC Memcached Node Evictions Occurred
  Current Connections
  Reclaimed Occurred
  Swap Usage
  Current Items
  CPU Usage
Elastic Kubernetes Service Namespace Memory utilized by pods
  Memory Utilized
  CPU Utilized
  CPU utilized by pods
  Memory utilized by pods
  CPU utilized by pods
Amazon FSX Data Write Bytes
  Data Write Operation
  Data Read Bytes
  Meta Data Operation
  Data Read Operation
Gateway Load Balancer Consumed LCU Sum
Elastic Kubernetes Service (EKS) Memory Utilized
  CPU Utilized
  Network Traffic
Elastic Search Free Storage Space
  JVM GC Old Collection Time
  Cluster Used Space
  JVM GC Old Collection Count
  Read IOPS
  Elasticsearch Requests
  Disk Queue Depth
  System Memory Utilization
  Deleted Documents
  CPU Utilization
  CPU Credit Balance
Relational Database Service (RDS) Free Storage in %
  CPU Surplus Credits Charged
  Volume Write IOPs
  Freeable Memory (%)
  CPU Credit Usage
  CPU Usage
  CPU Credit Balance
  Aurora Bin Log Replica Lag
  Bin Log Disk Usage
  CPU Surplus Credit Balance
  Burst Balance
  Swap Usage
  Transaction Logs Disk Usage
  Volume Read IOPs
  Disk Queue Depth
  Free Local Storage
  Freeable Memory
  Database Connections Sum
Simple Storage Service (S3) Maximum Object Size
  Number of Folders
  Number of Objects in the Folder
  Total number of Objects in the Subfolders
  Minimum Object Size
  Total Number of Subfolders
  Number Of Objects Modified in Last 5 minutes
Step Functions Execution Time
  Execution Throttled
  No. of Executions Timed Out
  No. of Executions Failed
S3 Bucket Bucket Size
  Number of Objects
Amazon MQ Volume Write Ops
  Total Dequeue Count
  Store Percent Usage
  Volume Read Ops
  Broker Heap Usage
  Total Enqueue Count
  CPU Usage (CloudWatch)
Simple Queue Service (SQS) Number Of Messages Sent
  Number Of Messages Received
NAT Gateway Connection Established Count
  Connection Attempt Count
  Active Connection Count
  Packets Drop Count
  Error Port Allocation
Application Load Balancer (ALB) Request Count Per Target
  Rejected Connections
  Requests Count Sum
  Consumed LCU Sum
Route 53 Hosted Zone Request Count
Direct Connect - Virtual Interface Virtual Interface Bps Ingress (Bits)
  VirtualInterfacePpsIngress
  VirtualInterfaceBpsEgress
  VirtualInterfaceBpsIngress
  Virtual Interface Bps Egress (Bits)
  VirtualInterfacePpsEgress
Lambda@Edge Invocations
  Errors
  Throttles
  Duration
  Success Percentage
EC2 Capacity Reservation Instance Utilization
Neptune Cluster Volume Bytes Used
  SPARQL Requests
  SPARQL Errors
  Gremlin Errors
  Gremlin Requests
  CPU Utilization (%)
Redshift Evictions Occurred
  Current Connections
  Reclaimed Occurred
  Swap Usage
  Current Items
  CPU Usage
EC2 instance IOPS Usage
  CPU Surplus Credits Charged
  CPU Surplus Credit Balance
  Burst Balance
  Number of Bytes Sent
  Number of Bytes Received
  CPU Usage (CloudWatch)
  CPU Credit Usage
  CPU Credit Balance
AMQ Queue In Flight Count
Lightsail Load Balancer Request Count
  Instance Response Time
  HTTP 4xx - Load Balancer
  HTTP 4xx - Instance
  Rejected Connection Count
Elastic Container Service (ECS) Container Instances
  CPU Utilization Maximum
  CPU Reservation
  Memory Reservation
  Memory Utilization
  CPU Utilization
Transit Gateway Attachment Bytes Out
  Packet Drop Count Blackhole
  Bytes Drop Count Blackhole
  Bytes Drop Count No Route
  Packet Drop Count NoRoute
  Packets Out
  Bytes In
  Packets In
Web Application Firewall (WAF) Counted Requests
  Blocked Requests
  Passed Requests
Amazon AppStream 2.0 Capacity Utilization
EC2 Auto Scaling Group Number of Bytes Sent
  Number of Bytes Received
  CPU Usage (CloudWatch)
AMQ Topic Memory Usage
  In Flight Count
DDB cluster CPU Surplus Credits Charged
  Maximum Queue Depth For Request Throttled Due To Low Memory
  Volume Write IOPs
  Database Cursors(Max) Sum
  Database Connections(Max) Sum
  CPU Usage
  CPU Credit Balance
  CPU Credit Usage
  Database Cursors TimedOut
  CPU Surplus Credit Balance
  Swap Usage
  Transactions Open Max Sum
  Queue Depth For Request Throttled Due To Low Memory
  Volume Read IOPs
  Disk Queue Depth
  Database Connections Sum
  Number of Operations Throttled Due to Low Memory
  Database Cursors Sum
  Freeable Memory
  Transactions Open Sum
SG Volume Cache Hit Percent
  Cache Percent Used
  Cache Percent Dirty
Elastic Search Memcached Evictions Occurred
  Current Connections
  Reclaimed Occurred
  Swap Usage
  Current Items
  CPU Usage
Amazon DMS Instance Write Operations
  Swap Usage
  Read Operations
  Disk Queue Depth
  Freeable Memory
  CPU Utilization
Elastic Search Node Free Storage Space
  JVM GC Old Collection Time
  JVM GC Old Collection Count
  System Memory Utilization
  CPU Utilization
Elastic File System (EFS) File Metered Size
  Burst Credit Balance
  Permitted Throughput
Neptune Instance SPARQL Requests
  SPARQL Errors
  Gremlin Errors
  Gremlin Requests
  CPU Utilization (%)
Loadbalancer Surge Queue Length
  Spill Over Count
DDB Instance CPU Surplus Credits Charged
  Maximum Queue Depth For Request Throttled Due To Low Memory
  Volume Write IOPs
  Database Cursors(Max) Sum
  Database Connections(Max) Sum
  CPU Usage
  CPU Credit Balance
  CPU Credit Usage
  Database Cursors TimedOut
  CPU Surplus Credit Balance
  Swap Usage
  Transactions Open Max Sum
  Queue Depth For Request Throttled Due To Low Memory
  Volume Read IOPs
  Disk Queue Depth
  Database connections Sum
  Number Of Operations Throttled Due To Low Memory
  Database Cursors Sum
  Freeable Memory
  Transactions Open Sum

服务器

监视器类型 属性
Windows Backups Used Space(GB)
MySQL Database Slow Queries
  InnoDB Tables in Use
  Queries
Server Used Space (%)
  Overall Disk Usage
  Used Space
  Overall Disk Size
  Overall Disk Free Space
  CPU Utilization (%)
  Memory Utilization (%)
  Overall Disk Size
  Overall Disk Used Size (% and MB)
  Partition-level Disk Used (% and MB)
  Overall Disk Free Size (% and MB)
  Partition-level Disk Free Size (% and MB)

虚拟机


监视器类型 属性
VMware Space Occupied by VM(Percentage)
  Partition Used Space (GB)
  Free Space
  Snapshot Space (Percentage)
  Disk File Space
  Snapshot Space
  Disk File Space (Percentage)
  Occupied Space
  Snapshots Size
  Snapshot Space
  Free Space
  Disk File Space

网络

监视器类型 属性
Network Device Tx Utilized

其他

监视器类型 属性
Capacity Total Number of Monitors
  Maximum Downtime
  Availability (%)
MySQL Database Total Errors
  Throughput
Status Check Monitor Total Number of Monitors
  Maximum Downtime
  Availability (%)
GCP Cloudrun Received Bytes
  Outbound Packets Throttled
  Billable Instance Time
  Max Concurrent Requests
  Outbound Bytes Throttled
  Request Count
  Container CPU Utilization
  Container CPU Allocation
  Instance Count
  Container Memory Allocation
  Sent Bytes
  Inbound Bytes Throttled
  Request Latency
  Container Memory Utilization
  Inbound Packets Throttled
  Container Startup Latency

本文档对您有帮助吗?

您愿意帮助我们改进文档吗?请告诉我们哪些方面可以做得更好。


很抱歉本文档未能让您满意。我们希望了解可以从哪些方面改进您的体验。


感谢您抽出时间分享反馈。我们将利用您的反馈来改进在线帮助资源。

短链接已复制!