
RudderStack: An Open-Source Customer Data Platform

Soumyadeb Mitra
Founder and CEO of RudderStack

RudderStack is an open-source customer data pipeline tool. It collects, routes, and processes data from your websites, apps, cloud tools, and data warehouse. With RudderStack, you can build efficient data pipelines that connect your entire customer data stack and use your warehoused data to power analytics and other activation use cases.

Some of the key features of RudderStack include:

  • Complete Flexibility: Unlike most commercial systems that charge you based on the event volume, RudderStack lets you collect all of your event data without worrying about overrunning your budget.
  • Warehouse-first Architecture: Most modern companies are building their CDP on top of a data warehouse. RudderStack treats your data warehouse as a first-class citizen among your destinations. It offers advanced features and configurable, real-time sync to safely collect and route your events to your data warehouse.
  • Built for Developers: RudderStack is built API-first and easily integrates with the tools that you already use and love.
  • User-specified Event Transformation: RudderStack offers a powerful JavaScript-based transformation framework that lets you transform and enhance your in-transit events before routing them to your warehouse or other preferred destinations.
  • High Availability: RudderStack’s sophisticated error handling and retry system ensures all of your event data will be delivered despite any network or destination downtime.

For more information on RudderStack, feel free to join our Slack channel and start a conversation. We’d love to hear from you!

How to set up RudderStack

The easiest and fastest way to get started with RudderStack is using the Docker setup. However, if you wish to use RudderStack in production environments, we strongly recommend using our Kubernetes Helm charts.

The steps for setting up RudderStack using Docker are as follows:

  • Sign up on the RudderStack app. You can set up and configure your event data sources and destinations through the RudderStack dashboard. RudderStack hosts these configurations and does not charge you for this.
    Note: If you want to host your own source and destination configurations, you can use the open-source RudderStack Config Generator. However, this open-source dashboard lacks features such as user-defined transformations and live event debugging, which are available in the RudderStack-hosted dashboard.
  • Then, copy the workspace token at the top of the dashboard page, as shown:
[Image: Connections]


  • Next, download the docker-compose file rudder-docker.yml.
  • Open this file, and replace <your_workspace_token> with the workspace token that you have copied above:
[Image: Workspace Token]


  • Finally, navigate to the directory where you saved rudder-docker.yml and run the command: docker-compose -f rudder-docker.yml up
  • To verify that the setup is successful, send test events to your destination by following this guide.
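
As a rough illustration of the verification step, the sketch below builds a test track request for a locally running data plane. The /v1/track path, port 8080, and Basic authentication with the source write key as the username are assumptions based on RudderStack's HTTP API conventions, so confirm them against the guide before relying on them. The helper name buildTrackRequest and all placeholder values are hypothetical.

```javascript
// Hypothetical helper: builds (but does not send) a test track request.
// Assumed: the data plane listens on http://localhost:8080 and accepts
// POST /v1/track with Basic auth (write key as username, empty password).
function buildTrackRequest(dataPlaneUrl, writeKey, payload) {
  const credentials = Buffer.from(`${writeKey}:`).toString("base64");
  return {
    // Strip trailing slashes so the path is joined cleanly.
    url: `${dataPlaneUrl.replace(/\/+$/, "")}/v1/track`,
    options: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Basic ${credentials}`,
      },
      body: JSON.stringify(payload),
    },
  };
}

const request = buildTrackRequest("http://localhost:8080/", "<your_write_key>", {
  userId: "test-user-1",
  event: "Test Event",
  properties: { source: "docker-setup-check" },
});

// To actually send it (Node 18+ ships a global fetch):
// fetch(request.url, request.options).then((res) => console.log(res.status));
```

If the request is accepted, the event should then show up in the destination you connected in the dashboard.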

RudderStack Architecture

RudderStack's architecture consists of two major components:

  • Control Plane: This component handles the source and destination configurations and the user-specified connections.
  • Data Plane: This is the RudderStack backend - the core engine that collects, transforms, and routes the events to the specified destinations.

Here’s a broad visual representation of RudderStack’s architecture:

[Image: RudderStack Architecture]

For more details on the architecture, check out our documentation.
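
The split between the two planes can be pictured with a toy model: the control plane holds configuration (which source feeds which destinations), and the data plane consults it for every incoming event. Everything in this sketch (controlPlaneConfig, routeEvent, the source and destination names) is hypothetical and heavily simplified. It is not RudderStack's actual implementation.

```javascript
// Toy control plane: configuration only, no event handling.
// Source and destination names are made up for illustration.
const controlPlaneConfig = {
  connections: {
    "web-app": ["warehouse", "analytics-tool"],
    "mobile-app": ["warehouse"],
  },
};

// Toy data plane: looks up the destinations configured for an event's
// source and produces one delivery job per destination.
function routeEvent(event, config) {
  const destinations = config.connections[event.source] || [];
  return destinations.map((destination) => ({ destination, event }));
}

const jobs = routeEvent({ source: "web-app", event: "Page Viewed" }, controlPlaneConfig);
console.log(jobs.map((job) => job.destination)); // [ 'warehouse', 'analytics-tool' ]
```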

RudderStack Transformations

RudderStack’s Transformations module allows you to transform and enrich your in-transit events into a destination-specific format before routing them to the desired destination. Transformations are written in JavaScript.
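
Here is a minimal sketch of what such a transformation might look like. It assumes the convention of a transformEvent(event, metadata) function that returns the modified event, or returns nothing to drop it. In the RudderStack editor the function is typically declared with export, which is omitted here so the snippet runs as a plain script. The event fields and the filter rule are made up for illustration.

```javascript
// Sketch of a transformation, assuming the transformEvent(event, metadata)
// convention. The "internal" flag and "emailDomain" property are hypothetical.
function transformEvent(event, metadata) {
  // Drop internal test traffic by returning nothing.
  if (event.properties && event.properties.internal === true) {
    return;
  }

  // Enrich the event with the domain of the user's email, if present.
  const email = event.context && event.context.traits && event.context.traits.email;
  return {
    ...event,
    properties: {
      ...event.properties,
      emailDomain: email ? email.split("@")[1] : null,
    },
  };
}

const enriched = transformEvent(
  {
    event: "Order Completed",
    properties: { total: 42 },
    context: { traits: { email: "ada@example.com" } },
  },
  {}
);
console.log(enriched.properties.emailDomain); // example.com
```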

We recently released our Transformations API that makes it easier for you to write custom transformation functions for your business-specific use-cases. With this API, you can also reuse specific blocks of JavaScript code (called libraries) by simply importing them in the desired transformation using the module name, thus reducing your engineering work.
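
To illustrate the reuse idea, the sketch below keeps a masking helper separate from the transformation that calls it. In RudderStack the helper would live in a library and be imported by module name. Here both are defined in one file so the example runs standalone. The helper maskEmail and the library name mentioned in the comment are hypothetical.

```javascript
// In a RudderStack library this helper would be exported, then imported in
// the transformation, e.g. (hypothetical name):
//   import { maskEmail } from "piiUtils";
function maskEmail(email) {
  const [local, domain] = email.split("@");
  if (!local || !domain) return email; // not an email address; leave untouched
  return `${local[0]}***@${domain}`;
}

// Transformation that reuses the helper to mask the email trait.
function transformEvent(event, metadata) {
  const traits = (event.context && event.context.traits) || {};
  if (typeof traits.email !== "string") return event;
  return {
    ...event,
    context: {
      ...event.context,
      traits: { ...traits, email: maskEmail(traits.email) },
    },
  };
}

const masked = transformEvent(
  { event: "Signed Up", context: { traits: { email: "ada@example.com" } } },
  {}
);
console.log(masked.context.traits.email); // a***@example.com
```

Keeping the helper in a library means several transformations can share one masking rule instead of each carrying its own copy.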

RudderStack Integrations

RudderStack currently supports more than 80 integrations, with newer sources and destinations added to the catalog almost every week.

Event Streams

RudderStack Event Streams allow you to track and collect event data from your websites and applications in real-time. This feature includes client-side SDKs for website, mobile, and server-side event tracking, as well as integrations with some third-party platforms like Looker, PostHog, and Customer.io.

Read more about this feature in our docs.

Cloud Extract

With RudderStack Cloud Extract, you can seamlessly build ELT pipelines from your cloud applications to your data warehouse. RudderStack also gives you the ability to choose what data you want to ingest, and specify the sync time when the data should be loaded into the warehouse.

Warehouse Actions

RudderStack’s Warehouse Actions feature lets you leverage the enriched warehouse data as a source for your entire customer data stack. This way, you can send the warehoused data to your preferred customer tools.
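
Conceptually, this means turning warehouse rows into events that downstream tools understand. The sketch below maps a row to an identify-style payload. The column names, the rowToIdentify helper, and the exact payload shape are assumptions for illustration, not the feature's actual output.

```javascript
// Hypothetical mapping from a warehouse row to an identify-style event.
// The chosen column becomes the user ID; every other column becomes a trait.
function rowToIdentify(row, userIdColumn) {
  const { [userIdColumn]: userId, ...traits } = row;
  return { type: "identify", userId: String(userId), traits };
}

// Example rows, as they might come back from a warehouse query.
const rows = [{ user_id: 101, plan: "pro", lifetime_value: 1250 }];
const events = rows.map((row) => rowToIdentify(row, "user_id"));
console.log(events[0].userId); // 101
```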

Support for 80+ Destinations

With support for over 80 third-party tools and destinations, RudderStack reliably routes all the tracked customer events to your preferred destinations for various activation use-cases like analytics, attribution, marketing, CRM, and personalization. Check out all our supported destinations here.

For an in-depth comparison of RudderStack and Segment, check out this post on Marketing Arsenal: An open source Segment alternative? Rudderstack vs Segment

Sign up for Free and Start Sending Data

Test out our event stream, ELT, and reverse-ETL pipelines. Use our HTTP source to send data in less than 5 minutes, or install one of our 12 SDKs in your website or app. Get started.

About the author
Soumyadeb Mitra
Founder and CEO of RudderStack. Passionate about finding engineering solutions to real-world problems.