How iHerb Optimizes Analytics, Security with Cribl Stream

Highlights

Seamless management of 5TB/day of data with selective routing to appropriate destinations
Reduced load on Splunk software, Elastic , and Loki via intelligent volume reduction and optimized data
Increased uptime by allowing engineers to focus on tasks more critical than building tools

A few years ago, iHerb set out to build a real-time stream processing system for their logging data. However, developing an in-house observability pipeline consumed a lot of engineering resources and left them with a lot of technical debt, making the solution costly and unmanageable for the long term.

As an online retailer iHerb was processing 2-3TB of weblogs daily, so configuring sources and keeping systems up to date was eating up valuable engineering time. Bob Chen, the organization’s Director of Infrastructure Engineering, mentioned the other factors that led to the switch from building their own tool to using Cribl Stream:

“We wanted an easy-to-use tool without having to tap into a UX team. A good API interface was critical, as was support for multiple logging sources and destinations. It turned out Cribl Stream could provide all that, and it was easy to implement and deploy, so we made the choice to put our build on hold.”
Bob Chen
Director of Infrastructure Engineering

Using Selective Routing to Manage Doubling Data Volumes

The decision to switch from open-source to Cribl Stream came at just the right time, as the amount of data iHerb processed daily, doubled. Their data now flows seamlessly from sources like Kafka and Fluentd to destinations like S3, Loki, Elastic Stack (Elasticsearch, Logstash, Kibana), and Splunk software.

All that data goes to S3 for long-term storage, with most logs going to Elastic for short-term (<3 months) storage. Some selected logs get sent to Loki for retention periods between 3-6 months. iHerb’s Security department provides guidelines to Bob and his team regarding which data gets sent to Splunk software for security use cases.

“We process a lot of data each day, and we can’t afford to skip even a few KB of it — we need every log entry to troubleshoot incidents and identify other issues. Using Cribl Stream helps us avoid losing any of the critical data we need.”
Bob Chen
Director of Infrastructure Engineering

Reduced Load on Analysis Tools

Cribl Stream’s ability to help iHerb be surgical about their data–selective dropping, sampling, suppression of whole events, and routing data to the best tool for analysis–makes Splunk software, Elastic, and Loki return searches faster while reducing required infrastructure and processing power. The ability to simply configure pipelines allows for easy reformatting, removal of redundant or unnecessary fields, stripping out null JSON values, and more.

Stream also offers native data transformation functionality, simplifying data management and reducing storage of surplus data. Annotations in Kubernetes metadata can vary in size, and the data is often unstructured. iHerb uses Stream to trim out useless fields and clean up, redact or transform these events.

Improved Security With Masking and Replay Features

With the increase in cybersecurity incidents in recent years, securing sensitive data is more important than ever. iHerb leverages Cribl Stream to mask sensitive patterns using redaction, hashing, or randomization. This feature allows Bob and his team to mask PII for the security team.

If a security incident does occur, Cribl Stream’s replay feature allows them to selectively re-ingest data from S3 back into their systems of analysis.

Observability, Metrics and Beyond

Many teams leverage Elastic for log analysis, but it’s also a popular choice for handling metrics. iHerb uses Cribl Stream to query and aggregate log counts and other statistics based on parameters like cluster, namespace, and source. The results are then seamlessly integrated from Elastic into an intuitive, user-friendly Grafana dashboard, enabling them to gain valuable insights into system performance, identify trends, and troubleshoot issues effectively.

Since successfully implementing Cribl Stream, Bob and team have also used Cribl Edge to implement a couple thousand edge nodes. Kubernetes, an integral part of their infrastructure, is notoriously difficult to monitor and often limited by the observability of the system. iHerb deploys Kubernetes with Edge already bootstrapped to collect application logs and system metrics, giving them unparalleled visibility into Kubernetes microservices.

“The combination of Cribl Stream and Edge is a lifesaver. The speed, accuracy, and ability to manipulate logs is unparalleled.”
Aaron Wilson
Senior Site Reliability Engineer

By using Cribl Stream and Edge instead of building their own infrastructure, iHerb has been able to save on infrastructure, network bandwidth, engineering, and outage costs — and the setup was even easier than Bob and his team anticipated.

“We got our Cribl Stream POC up and running within a week. We tested as many scenarios as we could, pushed a bunch of our logs through a test environment, then made the purchase and got our production environment going remarkably quickly.”
Bob Chen
Director of Infrastructure Engineering

TL;DR

iHerb switched from building their own observability infrastructure to using Cribl Stream
Reduced volume of data stored via selective dropping, sampling, and suppression of whole events
Secured sensitive data using redaction, hashing, or randomization with Cribl Stream’s Mask function
Reduced load on Splunk software, Elastic , and Loki by intelligently filtering which logs end up in each destination
Aggregated logs and metrics to create user-friendly dashboards in Grafana

From Open Source to Optimized: iHerb’s Data Journey with Cribl

Highlights

Using Selective Routing to Manage Doubling Data Volumes

Reduced Load on Analysis Tools

Improved Security With Masking and Replay Features

Observability, Metrics and Beyond

TL;DR

About Cribl

Products & Services

Learning & Resources

Company

Get Started

NewsLetter

4.7