Introduction to RudderStack

Build flexible, secure, end-to-end data pipelines on your warehouse.

RudderStack is a warehouse native customer data platform that helps data teams deliver value across the entire data activation lifecycle - from collection to unification and activation.

With RudderStack, you can:

  • Deploy enterprise-grade SDKs and streaming pipelines to collect behavioral events from all of your websites, apps, and backend systems.
  • Forward event data to your entire stack using 200+ destination integrations.
  • Solve identity resolution and model a comprehensive customer 360 data set in your data warehouse.
  • Sync customer 360 data from your warehouse to business tools.
  • Build powerful use cases like attribution, propensity scoring, and real-time personalization on top of your customer 360 data.
success
RudderStack is warehouse-native, meaning it does not store or persist any data. All the data loads and modeling happen with full transparency in your data warehouse or data lake.

Overview

The following self-guided tour gives you a detailed overview of the RudderStack platform:

Collect

  • Gather event stream data using the state-of-the art SDKs and cloud app sources.
  • Enforce strict data quality and compliance controls at the source, eliminating expensive cleanup and simplifying regulatory compliance.
  • Transform and enrich your event data by implementing custom logic in JavaScript or Python using RudderStack’s robust Transformations framework.
  • Sync data to over 200 destinations including data warehouses, data lakes, and all kinds of business tools.

Unify

  • Use Profiles to centralize and resolve customer identities and build features on top of it to create a 360 view of the customer.
  • Leverage Predictions to build custom predictive features in your warehouse without the complicated data wrangling and MLOps.

Activate

  • Sync enriched warehouse data to various business tools using Reverse ETL.
  • Leverage the Activation API to make your customer 360 available in real-time via API and drive real-time personalization in your websites and apps.

Platform features

The RudderStack platform comes in two flavors:

  • RudderStack Cloud: RudderStack Cloud is a managed platform, including APIs and an intuitive UI, that enables your team to take advantage of all RudderStack features across the collection, unification, and activation products.
success
See the features walkthrough, sign up for free, and check out our pricing page to select the RudderStack Cloud plan that best suits your needs.
  • RudderStack Open Source: RudderStack Open Source includes only our event streaming and integrations features, with limited access to features like Transformations. Running RudderStack Open Source for event streaming requires you to set up your own data plane and is ideal for highly technical data teams with platform engineering resources.

Some other platform-specific features are:

  • Production-ready: Many companies like StatPearls, Wyze, Joybird, and more, use RudderStack for their data collection and activation use cases.
  • No event cap per MTU: RudderStack is fully pay-as-you-use with no cap on events per MTU. You can send billions of events without worrying about overrunning your event budget.
  • High Availability: With a sophisticated error handling and retry system, RudderStack ensures your data is delivered even during network or destination downtime.
  • Segment-compatible: RudderStack is fully compatible with Segment, making it easy for you to integrate the RudderStack SDKs into your app without any complex instrumentation. Your events will keep flowing to the destinations (including data warehouses) as before. See the Segment Migration Guide for more information.


Questions? Contact us by email or on Slack