# Data Cleaning Form

The Data Cleaning lets automation do the heavy lifting.

## Requirements Before Using the Data Cleaning App

&#x20;Have the following done before completing the form.&#x20;

* [Find Sales Leads](/apps/data-cleaning/data-cleaning-form/sources.md)
  * Parvenu (with emails) - imported to your Google Sheet and you have the List name
  * Sales Navigator Search (without emails) - have the search URL and have [connected your Linkedin](/apps/linkedin-outreach/accounts/connect-a-new-linkedin-account.md) account to Big Brain
  * Google Sheets - Ensure that the Google Sheet is publicly editable.&#x20;
  * Mailshake - add your Mailshake API Key to Big Brain.&#x20;
* [Configure your Filters](/apps/data-cleaning/data-cleaning-form/filters.md)
* [Connect a Linkedin Account](/apps/linkedin-outreach/accounts/connect-a-new-linkedin-account.md) - You must have a Linkedin account connected to Big Brain to use any of the following
  * Source - Sales Navigator Search URL - Must be a Sales Navigator account
  * Data Enrichment - Linkedin Profile Scraper
  * Data Enrichment - Personalize with AI

## Filling Out the Data Cleaning Form

* Navigate to the [Data Cleaning](https://www.big-brain.co/app/data-cleaning#data-cleaning) tab
* Fill out
  * Sources
  * Data Enrichment
  * Filters
  * Destinations
* Save the form

### Sources

Sources are the inputs you want to clean.

<figure><img src="/files/5o9WuKofLdlugfG9wS6p" alt=""><figcaption><p>Data Cleaning App</p></figcaption></figure>

#### List

Give your List a name.&#x20;

{% hint style="info" %}
When using Parvenu or Google Sheets as a source, the list name should be an **exact match** to the list name in Parvenu / have a column in the Google Sheet source that matches the list name in Big Brain. Otherwise, the list name is descriptive and can be anything.&#x20;
{% endhint %}

#### Sources

The available sources are

* [Parvenu](/apps/data-cleaning/data-cleaning-form/sources/parvenu.md) - best option for finding emails from Linkedin
* [Sales Navigator Search ](/apps/data-cleaning/data-cleaning-form/sources/sales-navigator-search.md)- this option does not find email addresses
* [Google Sheets](#sheets) - import your own custom data
* [Mailshake](/apps/data-cleaning/integrations/mailshake.md) - Sync Mailshake contacts with Big Brain and send them to a Linkedin campaign for multi-channel outreach.&#x20;

{% hint style="info" %}
All Google Sheets must having sharing set to publicly editable.&#x20;
{% endhint %}

You can view how to configure each source separately below.&#x20;

{% content-ref url="/pages/q0do9wg1Q9EtH1CWspEK" %}
[Sources](/apps/data-cleaning/data-cleaning-form/sources.md)
{% endcontent-ref %}

### Data Enrichment

We can enrich your input data with additional data from external sources. The available data enrichments are

* [Linkedin Profile Finder](/apps/data-cleaning/data-cleaning-form/data-enrichment/linkedin-profile-finder.md) - Find Linkedin Profile URLs from name, company, and location.&#x20;
* [Profile Scraper](/apps/data-cleaning/data-cleaning-form/data-enrichment/profile-scraper.md) - Get additional fields by scraping their profile.&#x20;
* [Personalize with AI](/apps/data-cleaning/data-cleaning-form/data-enrichment/personalize-with-ai.md) - Create personalized icebreakers for engaging outreach messages.&#x20;
* [Email Verification](/apps/data-cleaning/data-cleaning-form/data-enrichment/email-verification.md) - Verify emails to minimize bounce rates.&#x20;

You can view how to configure each data enrichment separately below.&#x20;

{% content-ref url="/pages/5vPHY7uDtfVQrOB8eGMx" %}
[Data Enrichment](/apps/data-cleaning/data-cleaning-form/data-enrichment.md)
{% endcontent-ref %}

### Filters

You can filter contacts to remove less relevant ones from your outreach. Filter options are as follows

* Title Contains - The prospect's job title must contain at least one of these words. IE director
* Title Not Contains - The prospect's job title cannot contain any of these words. IE assistant.&#x20;
* Company Not Contains - The prospect's company name cannot contain any of these words. IE staffing
* Website Not Contains - The prospect's company domain cannot contain any of these extensions. IE .edu

You can view how to configure your filters below.&#x20;

{% content-ref url="/pages/R7K75cSrDsogiA5LD8HT" %}
[Filters](/apps/data-cleaning/data-cleaning-form/filters.md)
{% endcontent-ref %}

### Destinations

Finally you can send your cleaned data directly to external apps or internal campaigns. Available destinations are as follows.&#x20;

* [Linkedin Outreach](#linkedin-outreach) - automatically connect and send follow up messages to prospects on Linkedin
* [Mailshake Campaign](/apps/data-cleaning/data-cleaning-form/destinations/mailshake-campaign.md) - automatically send email sequences to prospects with Mailshake.&#x20;
* [Google Sheets](/apps/data-cleaning/data-cleaning-form/sources/google-sheets.md) - send data to Google Sheets automatically.&#x20;

You can view how to configure each of the destinations below.&#x20;

{% content-ref url="/pages/zPKBNgbkwZUbBXI14LCw" %}
[Destinations](/apps/data-cleaning/data-cleaning-form/destinations.md)
{% endcontent-ref %}


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.big-brain.co/apps/data-cleaning/data-cleaning-form.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
