Creating New Datasets

  • Updated

Datasets are the primary artifact in the Analytics data model. A dataset is a tabular view (rows and columns) of data used for analytics. Datasets are logical views in Analytics that map to tables or views in your data warehouse and represent two main categories of data: Event data and Actor data

Event Data

Event data is composed of events that may come from many sources: internal data collection, third party CDPs, marketing sources, customer care sources, etc. They represent an action or fact that took place at a point in time. Actors represent the entity in your product ecosystem that initiated the event, typically Users, but this can also include Accounts, Vendors, Service Providers, etc.

Actor Data

Actor datasets are typically mapped into Analytics one to one from existing source tables or views in the warehouse. These often take the form of dimensions tables, where each row represents a single, unique Actor (ie: User) with a primary key or UUID. Columns represent attributes that describe the entity associated with that ID. There are no special semantic rules required to create a new actor dataset.

In Analytics, there are four types of datasets:

Before you continue, you must first create an application and establish a successful connection with a data warehouse in Analytics. This can be achieved by following the steps outlined in the Quick Start Guide.