Validating and uploading your data files
Ensure your data is clean, accurate, and ready for analysis by applying validations during file uploads.
With this guide, learn how to add validations, handle errors, and manage column mismatches to streamline your data integration process.
When uploading Excel or CSV files to your dataset, you can add specific validations to each column to ensure data consistency and accuracy.
In order to start validating your data you should be in the Create a Self-Service Dataset area of the platform.
You will have gone through the steps to Create a Dataset, and be in the Data Configuration after selecting your file to upload.
Adding Column Validations
-
Start Validation: In the “Validation” step of the upload process, click on "Add Validation".

-
Click Add Validation +, then Select a Column: Choose the column you want to validate.

There are a few options to select:
Allow Empty: Specify if the column can contain empty cells.
Choose a Validation Type:
-
Number: Requires the data to be numerical.
-
Whole Number: Requires the data to be a whole number.
-
List: Provide a list of acceptable values (one per line) that the data must match.
-
Regular Expression: Choose a template from the dropdown or input a custom regular expression for specific formatting rules.
There are some extra configuration options you can select if you choose to, for each validation type. You don't have to fill these out if they are not needed.

3. Finalise Validation: Click "Add Validation" to save your column validation.
Number-type Columns Validation
If you have configured the data types, the system will validate number-type columns before uploading. Learn more about How to configure the data type of each column.
Uploading Data with Validations
- After setting your validations, proceed to the final step of the upload and click "UPLOAD DATA".

- If there are errors:
- 5 Rows or Fewer: You'll receive an immediate notification of the errors.

- More Than 5 Rows: Click on "See details" to review a summary of errors in a popup window, allowing you to make necessary corrections.


- 5 Rows or Fewer: You'll receive an immediate notification of the errors.
Managing Column Mismatches
When uploading new Excel or CSV data to an existing dataset, the system will flag any column mismatches. This ensures your new data aligns with the current dataset structure.
Types of Column Mismatches
- Missing Column: A column in the original dataset is missing from the uploaded data file.
- New Column Detected: The uploaded data includes a new column not present in the original dataset.
- Column Order Mismatch: The column order in the new upload does not match the current dataset’s order.
You can hover over the info icon for detailed information on these errors.

Overwriting Existing Data
If you want to replace the current dataset structure with the new upload format:
- Select "Overwrite" and check the "Force upload" option.
- Confirm the action in the popup window.
- Click "UPLOAD DATA".
Note: Overwriting will permanently replace the existing dataset with the new data structure. Ensure that all uploaded data is accurate before proceeding.
Number-type Columns Validation
If you have configured the data types, the system will validate number-type columns before uploading. Learn more about How to configure the data type of each column

