RudderStack provides a sample data set for the Snowflake warehouse, available in the Snowflake marketplace. You can use this data to run the Profiles project and Propensity Scores through the UI or the CLI.
The number of columns in this data set are intentionally limited to make the data set easily understandable. Also, all email addresses are generated randomly and no PII is used in the generation of this data set.
The following tables, properties, and user information is included in the data set:
Tables
This data set includes below-mentioned RudderStack event data tables:
PAGES - Page view events from anonymous and known users.
IDENTIFIES - Identify calls run when a user provides a unique identifier (i.e., upon signup).
ORDER_COMPLETED - Detailed payloads from tracked order_completed events.
As of January 2023, here are the approximate number of rows in each table:
PAGES: ~43k
TRACKS: ~14k
IDENTIFIES: ~4.8k
ORDER_COMPLETED: ~2.2k
These volumes follow the pattern of a normal eCommerce conversion funnel (pageview, signup, order). Specifically, here’s a rough breakdown of the user journey by volume:
30% - Never sign in
10% - Sign in but never add an item to cart
40% - Add to cart and abandon
20% - Make purchases
Note that this data includes future data until Apr 2024, and starts in June 2023. This is to ensure that future users can still run the project with ‘current’ data. RudderStack team will refresh the data periodically throughout the year.
Properties
This data set includes a subset of the standard properties found in the Warehouse schema spec for each table. The required columns for running Profiles and Predictions projects are also present.
User information
The user data includes a subset of our standard properties for identify calls.
This data set contains a total of ~10k unique users by anonymousId. About half of these unique users (~4.8k) are known users (with an associated identify call).
This site uses cookies to improve your experience while you navigate through the website. Out of
these
cookies, the cookies that are categorized as necessary are stored on your browser as they are as
essential
for the working of basic functionalities of the website. We also use third-party cookies that
help
us
analyze and understand how you use this website. These cookies will be stored in your browser
only
with
your
consent. You also have the option to opt-out of these cookies. But opting out of some of these
cookies
may
have an effect on your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. This
category only includes cookies that ensures basic functionalities and security
features of the website. These cookies do not store any personal information.
This site uses cookies to improve your experience. If you want to
learn more about cookies and why we use them, visit our cookie
policy. We'll assume you're ok with this, but you can opt-out if you wish Cookie Settings.