Configure your Splunk Observability Cloud account to collect GCP VertexAI metrics
Learn how to configure your Splunk Observability Cloud account to collect GCP VertexAI metrics.
Complete the following steps to collect metrics from and monitor GCP VertexAI.
Metrics
Learn about the available metrics for GCP VertexAI.
Metric name | Unit | Description |
---|---|---|
prediction/online /prediction_count | count | Number of online predictions. |
prediction/online /prediction_latencies | ms | Online prediction latency of the deployed model. |
prediction/online /response_count | count | Number of different online prediction response codes. |
prediction/online /prediction_latencies.count | count | Number of online predictions. |
prediction/online /prediction_latencies.sumOfSquareDeviation | ms | The sum of squared deviation for prediction latencies. |
publisher/online_serving /model_invocation_count | count | Number of model invocations (prediction requests). |
publisher/online_serving /model_invocation_latencies.sumOfSquareDeviation | ms | The sum of squared deviation for model invocation latencies. |
publisher/online_serving /model_invocation_latencies.count | count | Number of model invocations (prediction requests). |
publisher/online_serving /model_invocation_latencies | ms | Model invocation latencies (prediction latencies). |
publisher/online_serving /token_count | count | Accumulated input/output token count. |
publisher/online_serving /consumed_token_throughput | count | Overall throughput used (accounting for burndown rate) in terms of tokens. |
publisher/online_serving /consumed_throughput | count | Overall throughput used (accounting for burndown rate) in terms of characters. |
publisher/online_serving /character_count | count | Accumulated input/output character count. |
publisher/online_serving /first_token_latencies | ms | Duration from request received to first token sent back to the client. |
publisher/online_serving /first_token_latencies.count | count | Number of first token latencies. |
publisher/online_serving /first_token_latencies.sumOfSquareDeviation | ms | The sum of squared deviation for first token latencies. |
Attributes
Learn about the available resource attributes for GCP VertexAI.
gcp_project_status
gcp_project_name
gcp_project_label_last_revalidated_by
model_user_id
gcp_project_number
request_type
gcp_id
gcp_project_label_cloud_registration_id
gcp_project_creation_time
gcp_project_label_last_revalidated_at
input_token_size
output_token_size
project_id
metricTypeDomain
gcp_project_label_environment
publisher
monitored_resource
gcp_project_label_account_type
gcp_project_label_owner_group
service
Location
In addition, the type
resource attribute is available for the publisher/online_serving /token_count
and publisher/online_serving /character_count
metrics.
Troubleshoot
Learn how to get help if you can't see your data in Splunk Observability Cloud.
If you are a Splunk Observability Cloud customer and are not able to see your data in Splunk Observability Cloud, you can get help in the following ways:
Splunk Observability Cloud customers can submit a case in the Splunk Support Portal or contact Splunk Support.
Prospective customers and free trial users can ask a question and get answers through community support in the Splunk Community.