Route data to multiple destinations
Enrich data events with business or service context
Search and analyze data directly at its source, an S3 bucket, or Cribl Lake
Reduce the size of data
Shape data to optimize its value
Store data in S3 buckets or Cribl Lake
Replay data from low-cost storage
Collect logs and metrics from host devices
Centrally receive and route telemetry to all your tools
Redact or mask sensitive data
Optimize data for better threat detection and response
Streamline infrastructure to reduce complexity and cost
Simplify Kubernetes data collection
Optimize logs for value
Control how telemetry is stored
Easily handle new cloud telemetry
Ensure freedom in your tech stack
Accelerate the value of AIOps
Effortlessly search, collect, process, route and store telemetry from every corner of your infrastructure—in the cloud, on-premises, or both—with Cribl. Try the Cribl Suite of products today.
Learn moreGet telemetry data from anywhere to anywhere
Get started quickly without managing infrastructure
Streamline collection with a scalable, vendor-neutral agent
AI-powered tools designed to maximize productivity
Easily access and explore telemetry from anywhere, anytime
Instrument, collect, observe
Store, access, and replay telemetry.
Get hands-on support from Cribl experts to quickly deploy and optimize Cribl solutions for your unique data environment.
Work with certified partners to get up and running fast. Access expert-level support and get guidance on your data strategy.
Get inspired by how our customers are innovating IT, security, and observability. They inspire us daily!
Read customer storiesFREE training and certs for data pros
Log in or sign up to start learning
Step-by-step guidance and best practices
Tutorials for Sandboxes & Cribl.Cloud
Ask questions and share user experiences
Troubleshooting tips, and Q&A archive
The latest software features and updates
Get older versions of Cribl software
For registered licensed customers
Advice throughout your Cribl journey
Connect with Cribl partners to transform your data and drive real results.
Join the Cribl Partner Program for resources to boost success.
Log in to the Cribl Partner Portal for the latest resources, tools, and updates.
Our Criblpedia glossary pages provide explanations to technical and industry-specific terms, offering valuable high-level introduction to these concepts.
Monitoring specific APM metrics ensures a comprehensive understanding of your application’s health. These metrics provide a holistic view, ensuring you can pinpoint issues whether they originate from the application itself or the hardware and cloud services that host it.
Below are the key metrics to track:
Average Response Time
Average response time refers to the time it takes for an application to respond to a user request. This metric is crucial for understanding how quickly your application can handle interactions. A higher-than-expected response time often signals performance issues that need immediate attention, such as network latency or inefficient code execution.
CPU Usage
CPU usage measures the percentage of CPU resources that an application consumes. High CPU usage can lead to slow processing and application crashes. Monitoring this metric ensures that applications are running efficiently and helps IT teams make necessary adjustments to maintain performance under heavy loads.
Throughput
Throughput measures the number of transactions or user requests processed by the application over a specific period. High throughput indicates that the application can handle a significant volume of traffic. However, if throughput decreases, it may point to bottlenecks or resource limitations that need to be addressed.
Apdex Scores
Apdex (Application Performance Index) is a standard for measuring user satisfaction based on application performance. It scores how well an application meets performance expectations, helping teams prioritize efforts to improve the user experience. A low Apdex score suggests that users are experiencing frustration due to slow load times or unresponsive features.
Error Rate
Error rate measures the frequency of errors in an application. A high error rate can directly impact user experience, leading to frustration and potentially causing users to abandon the application. By tracking and reducing the error rate, teams can enhance reliability and prevent critical failures.
Memory Usage
Memory usage tracks the amount of memory an application consumes. Inefficient memory management can lead to memory leaks, slowdowns, or crashes, especially in resource-intensive applications. Monitoring memory usage helps prevent these issues and ensures that applications can handle larger workloads over time.
While the above metrics focus on the application layer, monitoring infrastructure is equally important. Node availability and instance count are key infrastructure performance metrics.
Node availability
Node availability tracks the uptime of the servers or virtual machines (nodes) hosting your applications. High availability is critical for ensuring that users can always access the application. Any downtime or unavailability of nodes can lead to significant service disruptions.
Instance count
The instance count metric monitors the number of active application instances. It’s important for scaling applications based on demand. A low instance count may cause slow performance during traffic spikes, while an unnecessarily high count could lead to wasted resources.
Note: Some metrics, like CPU usage, throughput, and uptime, apply to both the application and infrastructure levels. These are critical to understanding both how your application performs and how the underlying infrastructure supports it.
CPU Usage
CPU usage at the infrastructure level measures how much processing power is being consumed by the servers or virtual machines (VMs) that host your applications. High CPU usage indicates that your infrastructure is working hard to meet the demands of the application. However, consistently high usage can strain resources, leading to slower performance, increased latency, and potential system crashes.
Monitoring CPU usage across your infrastructure helps ensure that you have adequate processing power to handle current workloads. It also allows IT teams to detect when additional resources are needed, whether through scaling out to additional instances or optimizing existing workloads. Key benefits of tracking CPU usage at the infrastructure level include:
Proactively monitoring CPU usage can prevent system failures and improve overall application reliability, especially during traffic spikes or peak usage times.
Throughput
Throughput at the infrastructure level refers to the amount of data or number of transactions processed by your servers, VMs, or cloud instances over a specific period. High throughput indicates that your infrastructure can efficiently handle multiple requests, while low throughput may point to performance bottlenecks, insufficient resources, or misconfigurations.
By tracking throughput at the infrastructure level, IT teams can gain a clearer understanding of how well the system is managing overall application load. If throughput starts to decline while traffic remains consistent, it can signal that the infrastructure is struggling to keep up, leading to increased latency or failed requests. Monitoring this metric helps ensure that your system can manage the application’s demand without compromising performance.
Key considerations for monitoring throughput include:
Keeping a close eye on throughput enables teams to optimize the underlying infrastructure, ensuring the application remains responsive even as demand fluctuates.
Cribl enables organizations to efficiently manage, transform, and route their APM metrics data, surfacing signals that ensure they can improve application performance and infrastructure health. With Cribl’s flexible solutions, IT teams can ingest, normalize, and analyze APM data from various sources, and be empowered to take action on the insights provided by APM metrics. Whether you’re tracking memory usage or CPU performance, Cribl streamlines data collection and helps reduce the load on monitoring systems while maintaining comprehensive visibility.
APM metrics play a vital role in the smooth operation of modern applications. As applications grow more complex and user expectations for speed and reliability increase, the need for real-time monitoring becomes essential. APM metrics provide insight into system health, highlight areas that need attention, and enable faster troubleshooting. By leveraging these metrics, organizations can reduce downtime, improve user satisfaction, and optimize performance.
Join Cribl’s Nick Heudecker and Art Chavez for this exclusive on-demand webinar titled Building an Options-Based Observability Strategy, where you’ll learn how an options-based approach can help you.
Classic choice. Sadly, our website is designed for all modern supported browsers like Edge, Chrome, Firefox, and Safari
Got one of those handy?