Skip to content
English
  • There are no suggestions because the search field is empty.

Getting Started with Self-Service Data Collection

This article helps you focus on the information most useful to getting up and running with Self Service Data Collection

 Before you begin, it’s helpful to understand who this guide is for.

This article is written for Dataset Moderators — the people responsible for setting up datasets, managing structure, and deciding how data is collected and shared.

If you’re a Contributor (someone invited to add data to a dataset, often without access to configuration or admin settings), you’ll find more relevant guidance in this article instead.

Understanding the different roles

In Self-Service Data Collection, there are two main roles:

  • Dataset Moderators
    Dataset owners or administrators who configure datasets, create data tables, invite contributors, and control how data is shared.

  • Contributors
    People who have been invited to submit data to a dataset. Contributors can add or edit data they’ve been given access to, but they don’t manage dataset settings or structure.

Understanding this distinction will help you navigate the platform with confidence.

Start with the core concepts

If you’re new to Self-Service Data Collection, we recommend starting with these short explainers:

These concepts form the foundation for everything that follows.

Working with Unit Record Level data files

Unit Record Level data, also known as microdata, is defined as a dataset of “unit records”, where each record contains information about the “unit”.

Imagine a dataset full of census information. You might expect each row of the dataset to represent a single person. This would mean that the unit is a person, and therefore, the unit records are information about each person.

The platform enables easy upload, transformation and ingestion of URL data as CSV or XLSX files. The URL upload tool enables you to:

  • Upload and merge up to 2 data files

  • Protect sensitive data
  • Clean your data file by removing or renaming columns
  • Set data types and validations

Step through the process here!

Create Aggregate Data Table

Aggregate data, also known as macrodata or tabular data, is produced by grouping information into categories and combining values within these categories. Aggregate tables can be configured by a moderator and then shared with external contributors for easy data collection.

Follow this guide to get started:


If you are ready to dive deeper on Data Tables you can access more helpful articles in the Working with Data Tables section, but you don’t need to read them all right away. The basics are enough to begin collecting data.

Add data and invite Contributors (optional)

After your Data Table is set up, you can start adding data in one of two ways:

Contributors will only see what they need to submit data — they won’t have access to dataset configuration or admin settings.

Share your dataset for analysis and insights

Once your dataset contains data, you can make it available on the platform so others can explore, analyse, and generate insights.

To share your dataset, see:


Note: All new data first passes through a staging and ingestion process. Learn how to ingest your data here.

 


What’s next

That’s all you need to get started with Self-Service Data Collection.

If you’d like to go further, you can explore the other articles in the Data Collection section at your own pace — they’re there to support you as your datasets and use cases grow.