Data

Data used in this project was collected and released by the Federal Election Commission.

Among different types of disbursement data, "Operating Expenditure" data was selected for analysis and visualization since this records "A committee's day-to-day expenditures".

At the time of the project, the data obtained in March 2022 was the latest processed data available on the FEC's website.

The data set contains 64320 rows, each of which describes a single disbursement of the Trump 2020 campaign between January 1st 2017 and December 31st 2020.

It's important to note that due to the FEC's multi-step process could cause delays, the data we obtained at the time of the project might not be 100% accurate to reflect the total spending of the committee.

Methods

As defined by Tactical Tech, the influence industry is the private sector built around the acquisition and use of personal data for political elections.

Almost no company brands itself as an influence industry company, which makes identifying the companies difficult. Due to its shifting and dynamic nature, it's also hard to completely tie the industry to a few specific fields. We cannot, therefore, directly investigate all the companies that worked for a political campaign, but we can use a set of proxy terms – described below – to identify noteworthy and relevant firms in the data set.

We considered companies to be relevant to the influence industry if the spending data from the FEC shows that they provided data-driven influence services to a campaign. We classified spending on such services into three categories: “Data”, “Digital Advertising”, and “Consulting”.

We developed a glossary of all the service terms related to the three categories from the data set's "disbursement description" column and used it to screen out all the influence spending and companies in the 2020 Trump campaign.

glossary
Part of the glossary that used to screen out related spending

Below are the details about the three categories:

Data:

We first found all transactions with the keyword "DATA" in the data set. By further investigate them, we filtered out the ones irrelevant to the influence industry.

As an example, Red Curve Solutions received several transactions from the Trump campaign for "COMPLIANCE CONSULTING & DATA PROCESSING SERVICE" and "DATA PROCESSING SERVICES". The services were to keep the committee's data in FEC compliance. Although it deals with data, it has nothing to do with voting influence. These types of irrelevant spendings were excluded from the data set.

red curve solution
Transactions to Red Curve Solutions were excluded

Digital Advertising:

Spending on digital advertising can also be tied to the influence industry. In order to refine the selected transactions in this category, we excluded those that were described as production services because we believe the most part of this kind of expenditure was spent on producing ad, as opposed to using voter data to support public communications of political campaigns.

In the case of the description with different types of advertising services like "DIGITAL/SMS/PRINT MEDIA ADVERTISING", we decided to exclude them from the glossary as we tried to minimize incorrect defined expenditure.

Consulting:

After the Cambridge Analytica scandal, we began to believe that consulting services could be a more opaque option for using data to influence voters.

We carefully reviewed all consulting descriptions and only included those that were most likely to involve the use of voter data. Descriptions like "LEGAL CONSULTING", "COMPLIANCE CONSULTING", and "ADMINISTRATIVE CONSULTING" were excluded from the glossary.




We were aware that our methods are not perfect, some parts are based on judgment and presumption. In the diagram below, we illustrate the scope of our analysis by the three spending categories, which could include some expenditures that did not involve personal data, while others did but were not included.

diagram shows analysis scope

Despite its limitations, we hoped this project could provide a basic way to map the influence industry and may be of use in studying other political campaigns.