Column actor dataset

  • Updated

In Analytics, an actor is any entity in your business commonly used in analysis, such as a user, account, subscription, product, and so on.

A dataset represents an actor – typically, this maps to a dimension table in the data warehouse. However, in some cases, there is no such dimension table in the warehouse; rather, there is only an identifier column in an event table such as
(Events.user_id)
In this case, you can create a column actor dataset. It does not have any direct mapping to a warehouse table; it is defined solely by its relationship to the identifiers in other tables. However, you can still use column actors in all the ways you can use a regular actor dataset.

Create a column actor dataset

​From the datasets page​

  1. Go to Data > Datasets > + New DatasetColumn Actor Dataset.
  2. Name your dataset, add a description, and click Save. Then, configure the dataset.

ca-dataset-1.png

Within explorations

You can also create column actor datasets inline – that is, within the exploration templates. Wherever there is a requirement for actor selection within any exploration template, the inline column actor creation option is available within the terminal selection. To create a column actor dataset within an exploration,

  1. Click + on the navigation panel and choose an exploration template. 
  2. Select a measure and the actor. Open the drop-down list and click the Create Column Actor from an ID column option in the terminal selection.

    ca-dataset-2.png

  3. Select the related dataset and columns, choose the destination folder, and click Save.

    ca-dataset-3.png

Suppose you have already selected the events for the Funnel stages. In that case, when you click the Create Column Actor from an ID column option, you preselect the related dataset – that is, you preselect the event dataset of the events selected in the stages. Otherwise, the app dataset would be preselected, if that exists, and if none of these are present, then it would be empty, and you can choose a dataset of your choice.

Dataset configuration

Related datasets

The Related datasets tab lets you define logical relationships between datasets, similar to how tables relate in a data warehouse. You can combine event and attribute data for richer analysis without understanding the underlying data structure. Analytics uses these relationships to determine which datasets to join in an exploration and only shows relevant options in the UI. For example, using common user ID columns, you can form a relationship between an events dataset and a users dataset.

  1. Click + Add Related Dataset.
  2. From the Dataset drop-down list, select the dataset you want to link to the current source dataset.
  3. Select the relevant Cardinality for the relationship:
    • many to one – Match many rows in the current dataset to one row in the related dataset.
    • one to many – Match one row in the current dataset to multiple rows in the related dataset based on the selected columns.
    • one to one – Establish a relationship by directly matching a selected column in a single row of datasets.
  4. Click + and use the drop-down lists to select the columns that must match to establish a successful relationship.
  5. Click Save.
ca-dataset-4.png
 

Cohorts and derived columns

In this section, you can define new derived columns and cohorts of rows that represent logical groupings within the respective tabs. This is an optional step. Learn more about cohorts and derived columns.

opti-cadataset-3.png

You can view and access your dataset on the Datasets page when the configuration is complete.

Edit your dataset

  1. Go to Data > Datasets.
  2. Click the dataset you want to edit on the Datasets page.
  3. Make necessary changes and click Save.
ca-dataset-5.png

Duplicate your dataset

  1. Go to Data > Datasets.
  2. Click the dataset you want to duplicate on the Datasets page.
  3. Click More (⋮) > Save As. Give your copy a name and a description, choose a destination folder, and click OK.

ca-dataset-6.png

Delete your dataset

  1. Go to Data > Datasets.
  2. Click the dataset you want to delete on the Datasets page.
  3. Click More (⋮) > Delete. Once again, click Delete in the confirmation box.

ca-dataset-7.png