What Is Cribl Lake?
In the beginning, there was object storage. And AWS architects saw that it was good. Until the Fire Nation attacked.... Sorry, getting all my references mixed up. Let's start with what a data lake is. That will help us explain why we made Cribl Lake.
What is a data lake?
A data lake is a centralized repository that stores large amounts of structured, semi-structured, and unstructured data. It allows for flexible storage and analysis of diverse data types. Three use cases for data lakes are:
- Data archiving and long-term storage
- Data exploration and analysis
- Machine learning and AI model training
Data lakes offer user-friendly ways to interact with object storage, which shortens the time to value when you are storing massive amounts of data.
It can't be that hard... Can it?
Let's run through a quick thought experiment describing the process for configuring your own data lake and connecting it to your Cribl.Cloud environment. I'll let you fill in how long each step takes for your own company.
What it takes to make / maintain a data lake:
- Set up S3 storage
  - Set up security access
    - Create roles
    - Create policies
    - Create permissions
    - Validate with security team
  - Build a bucket strategy
  - Build a partitioning strategy
  - Test, validate, reformat
- Connect to Cribl
  - Connect to Stream / Edge
    - Set up Destination(s)
    - Set up Collector(s)
  - Connect to Search
    - Create Dataset Providers
    - Create Datasets
- Manage over time
  - Monitor costs
  - Monitor retention
  - Change management
  - Any time something changes, start over at the top
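To make just one of those steps concrete, here is a minimal Python sketch of what a "partitioning strategy" might boil down to. Everything in it is an assumption for illustration: the `build_object_key` helper, the Hive-style `dataset=/year=/month=/day=/hour=` layout, and the sample file name are all hypothetical, and none of it describes how Cribl Lake or any AWS tooling lays out data.

```python
from datetime import datetime, timezone


def build_object_key(dataset: str, event_time: datetime, file_name: str) -> str:
    """Return a Hive-style partitioned key for an object-store bucket.

    Hypothetical layout: dataset=<name>/year=YYYY/month=MM/day=DD/hour=HH/<file>
    """
    # Normalize to UTC so events from different time zones share partitions.
    ts = event_time.astimezone(timezone.utc)
    return (
        f"dataset={dataset}/"
        f"year={ts:%Y}/month={ts:%m}/day={ts:%d}/hour={ts:%H}/"
        f"{file_name}"
    )


if __name__ == "__main__":
    when = datetime(2024, 5, 17, 13, 45, tzinfo=timezone.utc)
    print(build_object_key("firewall", when, "events-0001.json.gz"))
    # dataset=firewall/year=2024/month=05/day=17/hour=13/events-0001.json.gz
```

And that's the easy part. If you pick a layout like this and later change your mind, the objects you already wrote stay under their old keys until you copy them somewhere new, which is a big part of what "test, validate, reformat" tends to mean in practice.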
Bleh. "Ain't nobody got time for that!"
Welcome to Camp Cribl Lake!
Cribl Lake is a fully managed data lake service. It lets admins securely receive and store data from anywhere, then easily search or replay that data using our signature replay experience.
We built Cribl Lake to provide customers with a long-term storage option for their IT and Security data. You won't need to worry about normalizing or shaping your data in advance. Cribl Lake enables you to store data in either a raw or normalized vendor-neutral format. This ensures your data can be easily analyzed by any tool at any time. Cribl Lake handles all the schema management, so you don't need to learn complex data mapping.
With Cribl Lake, you can have a data lake up and running in minutes by simply enabling it in Cribl.Cloud. Cribl Lake integrates seamlessly with our suite of products and comes with all the operational tools you need to manage your data lake, pipelines, and search capabilities.
Let's quit reading and start clicking!