Cribl puts your IT and Security data at the center of your data management strategy and provides a one-stop shop for analyzing, collecting, processing, and routing it all at any scale. Try the Cribl suite of products and start building your data engine today!
Learn more ›Evolving demands placed on IT and Security teams are driving a new architecture for how observability data is captured, curated, and queried. This new architecture provides flexibility and control while managing the costs of increasing data volumes.
Read white paper ›Cribl Stream is a vendor-agnostic observability pipeline that gives you the flexibility to collect, reduce, enrich, normalize, and route data from any source to any destination within your existing data infrastructure.
Learn more ›Cribl Edge provides an intelligent, highly scalable edge-based data collection system for logs, metrics, and application data.
Learn more ›Cribl Search turns the traditional search process on its head, allowing users to search data in place without having to collect/store first.
Learn more ›Cribl Lake is a turnkey data lake solution that takes just minutes to get up and running — no data expertise needed. Leverage open formats, unified security with rich access controls, and centralize access to all IT and security data.
Learn more ›The Cribl.Cloud platform gets you up and running fast without the hassle of running infrastructure.
Learn more ›Cribl.Cloud Solution Brief
The fastest and easiest way to realize the value of an observability ecosystem.
Read Solution Brief ›AppScope gives operators the visibility they need into application behavior, metrics and events with no configuration and no agent required.
Learn more ›Explore Cribl’s Solutions by Use Cases:
Explore Cribl’s Solutions by Integrations:
Explore Cribl’s Solutions by Industry:
Watch On-Demand
3 ways to fast-track your data lake strategy without being a data expert
Watch On-Demand ›Try Your Own Cribl Sandbox
Experience a full version of Cribl Stream and Cribl Edge in the cloud.
Launch Now ›Get inspired by how our customers are innovating IT, security and observability. They inspire us daily!
Read Customer Stories ›Sally Beauty Holdings
Sally Beauty Swaps LogStash and Syslog-ng with Cribl.Cloud for a Resilient Security and Observability Pipeline
Read Case Study ›Experience a full version of Cribl Stream and Cribl Edge in the cloud.
Launch Now ›Transform data management with Cribl, the Data Engine for IT and Security
Learn More ›Cribl Corporate Overview
Cribl makes open observability a reality, giving you the freedom and flexibility to make choices instead of compromises.
Get the Guide ›Stay up to date on all things Cribl and observability.
Visit the Newsroom ›Cribl’s leadership team has built and launched category-defining products for some of the most innovative companies in the technology sector, and is supported by the world’s most elite investors.
Meet our Leaders ›Join the Cribl herd! The smartest, funniest, most passionate goats you’ll ever meet.
Learn More ›Whether you’re just getting started or scaling up, the Cribl for Startups program gives you the tools and resources your company needs to be successful at every stage.
Learn More ›Want to learn more about Cribl from our sales experts? Send us your contact information and we’ll be in touch.
Talk to an Expert ›April 4, 2022
Source-side queueing is a fancy way of saying: You can configure Cribl products to make sure data isn’t lost in the event of downstream backpressure, again. Those familiar with Cribl Stream might be aware of destination queuing or persistent queuing, wherein Stream can write data to the local disk in the event of an issue reaching the destination. Maybe your SIEM is suffering from disk I/O latency. Maybe there is a DNS problem with your load balancer (Hint: It’s always DNS). Maybe the destination actually went down due to a zombie apocalypse. Whatever the reason, Stream (with persistent queuing configured) would sense the destination’s having an issue, and would start writing data to disk as Stream waited for the destination to come back online.
Destination queueing could fill up quickly if myriad sources are configured. To solve this, sources are usually alerted to backpressure and asked to simmer down for a sec while Stream waits for the destination to catch its breath. Unfortunately, alerting sources to backpressure only works when they are TCP sources. UDP sources just yeet their data at Stream and don’t care what happens afterward. We can’t tell a UDP source that we need some data again, what with UDP being connectionless and all. What we need, then, is a way to backpressure sources when we can (on TCP) and to queue data from sources that don’t return our calls (on UDP).
This is where source-side queuing comes in as a new Stream feature that adds to our existing queuing capabilities. Instead of a singular destination queueing option, now administrators can configure Stream to queue in a source as well. Let’s look at how to configure a source-side queue.
Before we continue, a word of warning: All queuing in Stream is written to the local disk, meaning that if disk space runs out, data will be dropped.
Now that we’ve dispensed with the small talk, let’s look at the actual UX for source-side queuing in Stream.
First off, we’ve set the Mode to Smart. This means that Stream uses heuristics to determine when the destinations associated with this source are actually down, meaning that we need to stop writing to memory and start writing to disk (read: queuing). The other option on this drop-down is Always On, meaning that all events are buffered to disk – but remember the warning earlier about filling up the disk, and use this at your own risk.
Next to cover are two new settings added for source-side queuing: Max buffer size and Commit frequency. (These, and other settings in the screenshot above, are covered in the docs page linked earlier.)
Max buffer size refers to how many events Stream will hold in memory for this source, before they start spilling over to disk (due to destination backpressure).
Commit frequency refers to how often Stream creates checkpoints (or markers to remember where it was) when reading from queue files. By default, this is set to 42 events, meaning that Stream puts a bookmark in the queue file every time it flushes 42 events. If Stream were to go down for some reason while emptying the queue, it could then go back and start at the last bookmark / checkpoint. Note: Decreasing the frequency will improve performance (since we aren’t writing checkpoints as often), yet it increases the possibility of duplicate events downstream in the event of a downed Stream.
So, yeah – the Cribl suite now includes source-side queuing for all your data integrity needs. If anyone asks, “You down with UDP?” you can now sleep soundly, knowing that you can reply, “Yeah! You know me!”
The fastest way to get started with Cribl Stream and Cribl Edge is to try the Free Cloud Sandboxes.
Josh Biggley Apr 25, 2024
Ryan Conway Apr 9, 2024
Classic choice. Sadly, our website is designed for all modern supported browsers like Edge, Chrome, Firefox, and Safari
Got one of those handy?