Cribl Stream’s Replay vs Cribl Search’s Send: Understanding the Differences

January 10, 2024

Written by

Perry Correll, Principal Technical Content Manager at Cribl, is passionate about the power of observability and how, when done right, it can deliver operational insights into network performance. He has 30+ years of networking experience, from early Ethernet to today's observability, and has held positions from SE to product management with leading organizations.

Categories: Cribl Search, Cribl Stream, Learn


‘Play it Again Sam’

Organizations produce more data than ever, and that data needs to be collected, stored, analyzed, and retained, though not necessarily in that order. Historically, most vendors’ analysis tools were also the retention point for that data. While this may at first appear to be the best option for performance, it quickly creates significant problems. First, those systems were never designed for the scale of today’s data volumes, which are growing at a 28% CAGR. Second, analysis systems are priced on the volume of ingested data; the costs are already prohibitive and will continue to climb.

The simple answer is to separate your retention system from your analysis system. Put your data in a separate, cost-effective repository (like Amazon S3) and optimize the transfer of only specific datasets from storage into your analysis system instead of dumping everything into analysis. This is where Cribl comes in.

Cribl built its reputation on providing more innovative ways to manage observability and security data. We engineered Cribl Stream, a vendor-agnostic platform that gives customers the flexibility to route, shape, restructure, and enrich data from any source to any destination in the format required. Its Replay capability lets customers route, or “replay,” only the essential data they need from low-cost storage to existing analytics tools.

Then, last year, we added Cribl Search, which lets you perform federated “search-in-place” queries on any data in any format at any location, eliminating the complexity and costs associated with first shipping, ingesting, and storing the data before being able to search it.

Both Cribl Stream and Search enable administrators to retain their data in their chosen data stores and then retrieve only the specific datasets required for the task (investigative query, etc.). We pride ourselves on giving customers the flexibility to leverage whatever Cribl tools work best with their stack, so you may wonder, which do I use and when? Let’s dive into the answer!

Data Collection Strategies

Data analysis requires collecting the data and routing it through some processing to glean specific information. There are multiple ways to collect data for analysis. The traditional method has been to ‘collect it all,’ like a fishing trawl that catches the targets along with a lot of unneeded catch (image 1), and then store and process the data in your analysis system (e.g., a SIEM) to sort it out. This has been highly effective and has been the cornerstone of data analysis for a long time. However, data volume growth requires a better option, and Cribl offers two new ways to collect data for analysis: Cribl Stream Replay (image 2), which targets specific subsets of data, and Cribl Search (image 3), which targets data with surgical precision. Here’s where they differ:

Cribl Stream Replay

Cribl Stream is a universal receiver designed to collect from almost any machine data source, whether streaming or scheduled batch collection. As data transits Cribl Stream, it is ‘shaped’ (reduced, enriched, reformatted, summarized, aggregated, etc.) before being forwarded to its final destination(s). This helps control costs by eliminating digital noise and, in turn, helps retain more valuable data for longer without blowing out budgets. At the same time, Cribl Stream enables customers to route a full-fidelity copy of raw data to low-cost storage, such as Amazon S3, Google Cloud Platform, Azure Blob, and other compatible systems, for long-term retention for compliance, audit, or investigative purposes, and to “replay” it to analytics tools when required. With Replay, you can efficiently retrieve data from object storage and then “replay” it through a pipeline and into your destinations, giving you an affordable way to retain more data for extended periods while still having it accessible for investigations.

A typical use case is an organization that stores data in Amazon S3, even petabytes of it, before sending it to its security and operations teams for analysis. When those teams get requests to review data for investigations, they are no longer overwhelmed by the volume, because they can retrieve just the specific datasets they need.
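
To make the "shape one copy, archive the other" idea concrete, here is a minimal Python sketch. It is not Cribl Stream code or configuration: the `shape` and `route` functions, the rule that drops `debug` events, and the list of kept fields are all assumptions made up for illustration.

```python
import json

# Hypothetical shaping rule: drop debug noise and keep only a few fields.
KEEP_FIELDS = ("time", "host", "level", "message")

def shape(event):
    """Return a reduced copy of the event, or None to drop it entirely."""
    if event.get("level") == "debug":  # assumed definition of "digital noise"
        return None
    return {k: event[k] for k in KEEP_FIELDS if k in event}

def route(events):
    """Send a full-fidelity copy to the archive and a shaped copy to analytics."""
    archive, analytics = [], []
    for event in events:
        # The archive always receives the raw, unmodified event.
        archive.append(json.dumps(event, sort_keys=True))
        shaped = shape(event)
        if shaped is not None:
            analytics.append(shaped)
    return archive, analytics

events = [
    {"time": 1, "host": "web01", "level": "debug", "message": "cache hit", "trace": "abc"},
    {"time": 2, "host": "web01", "level": "error", "message": "timeout", "trace": "def"},
]
archive, analytics = route(events)
print(len(archive), "archived;", len(analytics), "sent to analytics")
```

In this sketch the archive holds both raw events, while the analytics side receives only the shaped error event. That is the property that makes a later replay possible.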

How It Works

Cribl Stream’s Replay option for AWS S3 offers organizations fundamentally new ways to manage data by providing an easy way to selectively ingest and re‑ingest data into systems of analysis. Let’s walk through how to use this feature at a high level.

Your analytics systems continue to do their thing, as usual; no changes are required.
Meanwhile, as raw data flows through Cribl Stream, a copy of that data (all those critical logs, metrics, and traces) is routed to less expensive object storage destination(s).
1. The settings for the Destination allow administrators to define how the uploaded files are partitioned: year, month, date, time, host, index, sourcetype, …
Whenever required by investigations, audits, or security events, Stream can re-ingest the relevant data right back into your analytics system(s).
1. When replaying the data, partitions make your replay searches faster by mapping segments of the path back to variables, including time, that you can use to zero in on the exact logs you need.
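
The idea of mapping path segments back to variables can be sketched in a few lines of Python. This is not how Cribl Stream implements Replay. The partition layout (`<year>/<month>/<day>/<host>/<sourcetype>/<file>`), the object keys, and the `parse_key` and `select_keys` helpers are all hypothetical.

```python
from datetime import date

# Hypothetical partition layout: <year>/<month>/<day>/<host>/<sourcetype>/<file>
FIELDS = ("year", "month", "day", "host", "sourcetype")

def parse_key(key):
    """Map the path segments of an object key back to named variables."""
    parts = key.split("/")
    if len(parts) != len(FIELDS) + 1:  # partition fields plus the file name
        return None
    fields = dict(zip(FIELDS, parts))
    fields["date"] = date(int(fields["year"]), int(fields["month"]), int(fields["day"]))
    return fields

def select_keys(keys, start, end, host=None):
    """Keep only the objects whose partition values match the replay filter."""
    selected = []
    for key in keys:
        fields = parse_key(key)
        if fields is None:
            continue
        if not (start <= fields["date"] <= end):
            continue
        if host is not None and fields["host"] != host:
            continue
        selected.append(key)
    return selected

keys = [
    "2024/01/08/web01/syslog/a.json.gz",
    "2024/01/09/web01/syslog/b.json.gz",
    "2024/01/09/db01/syslog/c.json.gz",
    "2024/01/10/web01/syslog/d.json.gz",
]
hits = select_keys(keys, date(2024, 1, 9), date(2024, 1, 10), host="web01")
print(hits)
```

Because the filter runs on object paths alone, files outside the time range or from other hosts never need to be opened. This is why a good partitioning scheme speeds up replay.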

Replay allows you to retrieve only a subset of the dataset based on the partitioning scheme you defined, improving the quality and speed of your analytics environment.

Think about it: when (not if) there is a data breach, you’ll need access to the raw data to analyze it in ways you haven’t before (otherwise, you would have caught the breach sooner). Also, if you need to prove compliance with certain security standards back to a specific date, you will want your raw data to help.

Cribl Search

Cribl Search is reshaping the data search paradigm, empowering users to query data directly at its source. Effortlessly sift through data in major object stores like AWS S3, Amazon Security Lake, Azure Blob, and Google Cloud Storage, and enrich your insights by querying dozens of live API endpoints from various SaaS providers. The power of Cribl Search lies in its strategic approach: discover and forward only the critical data to your systems of analysis, thus avoiding the cost of expensive storage.

Like Cribl Stream, Cribl Search allows administrators to identify and forward only a subset of the raw data for analysis, but with surgical precision to target specific data. Once a query is defined, it is a simple matter to add the ‘send’ operator, which leverages the native integration with Cribl Stream to send the results to the appropriate destinations.

How It Works

Cribl Search gives administrators a single search tool to query all their observability data without first collecting it: search for any terms, patterns, or key/value pairs; search any data type; search anywhere you can reach; and forward the results to the analysis system.

Write your Search query, for example:
1. dataset="cribl_internal_logs" status=200 response_time>2 | limit 1000
2. The query returns up to 1000 events where status = 200 and response time > 2
Append ‘send’ to your query
1. dataset="cribl_internal_logs" status=200 response_time>2 | limit 1000 | send
2. Results of the query are forwarded to your Stream cloud instance
3. Note: results can go to Stream only, or to both Stream and the local display
Results received in Stream can be shaped and/or routed to one or more destinations
1. The query results will automatically be routed to your Cribl Stream instance. Without any additional configuration, they go to your default group, or you can configure them to go to a specific group. The received data can then be routed to any Destination you choose (see Stream docs for more details on how to accomplish this).
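
For readers who want to reason about what the example query selects, the following Python sketch imitates its filter and limit on an in-memory list. It does not call Cribl Search. The sample events are made up, `run_query` and `send` are hypothetical helpers, and the `send` step is reduced to appending results to a list that stands in for a Stream instance.

```python
from itertools import islice

def run_query(events, limit=1000):
    """Imitate: status=200 response_time>2 | limit 1000."""
    matches = (
        e for e in events
        if e.get("status") == 200 and e.get("response_time", 0) > 2
    )
    return list(islice(matches, limit))  # stop after `limit` matching events

def send(results, stream_inbox):
    """Stand-in for the 'send' operator: hand the results to Stream."""
    stream_inbox.extend(results)
    return len(results)

# Made-up sample events for illustration only.
events = [
    {"status": 200, "response_time": 3.5},
    {"status": 200, "response_time": 1.0},
    {"status": 500, "response_time": 9.0},
    {"status": 200, "response_time": 2.0},
]
stream_inbox = []
results = run_query(events)
sent = send(results, stream_inbox)
print(sent, "event(s) forwarded")
```

Only the first sample event passes both conditions. The event with a response time of exactly 2 is excluded because the comparison is strictly greater than.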

While Cribl Stream Replay retrieves a subset of the data, Search lets you be even more surgical, locating and retrieving very specific datasets, such as results from a single IP or from a single user or range of users. Of key importance is the ability to query data from multiple data stores simultaneously. Additionally, if Cribl Stream was used to write the data into the S3 bucket (or other store), its robust partitioning capabilities make retrieving specific datasets much easier; even without that pre-partitioning, Cribl Search provides a more granular retrieval capability.

Finally, Cribl Search enables administrators to query and collect logs, metrics, application data, etc., directly from a host (via Cribl Edge) and forward to Cribl Stream for additional shaping and routing. Once again, Cribl’s products improve the quality and speed of your analytics.

Stream/Replay & Search/Send Use Cases:

Incident Analysis: Store a full copy of event log data from various sources in AWS S3, with the ability to send specific data to a SIEM system for further analysis.
Aging Data: Search for events that have aged out of your system of analysis.
Compliance audit: Find and report which devices are accurately sending logs and identify any devices that may be missing.
Handling Massive Cardinality: When dealing with high cardinality data, use Cribl Search to decide what data is useful and then move it to an analytics tool.
AWS Security Logs Processing and Forwarding: Store AWS Logs in S3 and send a focused subset to another tool for further analysis.
AWS S3 to S3 Hairpin: Find specific data in one S3 bucket and then transfer the distilled search results back into a file (or set of files) in the S3 bucket.
Cost Savings: Optimize the data ingested to expand the licensing room for ingesting more critical data.
Optimize Ingest: Bring original data back from S3 object storage so you don’t need to suppress or drop specific data entirely.
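
The compliance audit use case above comes down to comparing the devices you expect to report with the devices that actually appear in the retrieved logs. Here is a minimal Python sketch of that comparison. The `audit` function, the `host` field name, and the sample inventory and events are assumptions for illustration, not part of any Cribl product.

```python
def audit(expected_hosts, events):
    """Compare an inventory of expected devices with hosts seen in the logs."""
    seen = {e["host"] for e in events if "host" in e}
    expected = set(expected_hosts)
    return {
        "reporting": sorted(expected & seen),   # expected and sending logs
        "missing": sorted(expected - seen),     # expected but silent
        "unexpected": sorted(seen - expected),  # sending logs but not in inventory
    }

# Made-up inventory and search results for illustration only.
inventory = ["fw01", "fw02", "web01"]
events = [
    {"host": "fw01", "message": "accept"},
    {"host": "web01", "message": "GET /"},
    {"host": "lab99", "message": "boot"},
]
report = audit(inventory, events)
print(report)
```

In this sample, `fw02` shows up as missing and `lab99` as unexpected. Those are the two lists an auditor would typically follow up on.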

Conclusion

As data volumes grow, the percentage of data being analyzed will continue to drop due to licensing costs. There are only two options to address this: get a bigger budget or be smarter about processing data before ingesting it into the analysis system. The Cribl Stream Replay and Search Send features are game changers; you can now effortlessly collect specific datasets and forward them to different systems for advanced analysis, audit, and compliance—a tremendous value for anyone managing digital exhaust data at scale. Suppose you always had a full-fidelity copy of your logs, metrics, and traces in S3. Ask yourself this: Would you still bring every event into your analytics systems? Would you truly need to keep terabytes of noisy, verbose, hard-to-search logs in your expensive analysis tools daily?

TL;DR: Cribl Solution Benefits

Enable separation of the system of analysis from the system of retention
- Store raw data in low-cost data stores, not in expensive analysis systems (<$$)
- Query data in place (in the data stores), then route only relevant data (wheat from the chaff)
- Forward just the relevant data to the system of analysis (lower the ingest license)
- Improve the quality and speed of your analytics environment by saving older data somewhere else
Use Cribl Search to front-end and complement your existing analysis tooling
Keep more data for more extended retention periods and pay a lot less
Replay data to any analytics tools for unexpected investigations

Want to know more about Cribl Stream, Cribl Search, and Cribl Edge? Check out Cribl University, where training is always free.

