1 Importing and Filtering Data

The code below performs the following steps on U.S. Presidential Election data from 2000 to 2020. It reads the data, drops rows with missing values, and pads the county FIPS codes with leading zeros so that every code has five digits. It then filters the dataset to the two major parties, Democrat and Republican, and calculates each candidate's vote percentage, rounded to two decimal places.

For the county results, the data is grouped by state, county, and year, and the row with the highest vote percentage in each group is kept, which identifies the winning party in each county.

For the state results, the data is grouped by year, state, and party, and candidatevotes and totalvotes are summed within each group. A new column, percentage, stores the share of votes each party received in a state for a given year. The winning party for each state and year is the party with the maximum percentage.

Both results are exported to CSV files (ElectionDataHighest.csv and ElectionDataState.csv) for visualization. Together, these steps turn the raw returns into county-level and state-level winners that can be compared across the twenty-year period.

#Load dplyr for filter(), mutate(), group_by(), summarise(), slice(), and %>%
library(dplyr)

#Read the Presidential Election 2000 to 2020 data and drop rows with missing values
electionData <- na.omit(read.csv("https://tkelleman.github.io/tkweb/Week9/PresidentialElection2000To2020.csv"))
#Pad county FIPS codes to five digits with leading zeros; sprintf() already returns character
electionData$county_fips <- sprintf("%05d", as.integer(electionData$county_fips))
 
#Question 2A
#Keep only Democrat and Republican rows and add votePercent, rounded to 2 decimal places
electionData <- electionData %>%
  filter(party %in% c("DEMOCRAT", "REPUBLICAN")) %>%
  mutate(votePercent = round((candidatevotes / totalvotes) * 100, 2))

#Group Data by state_po and county_fips, year
groupedData <- group_by(electionData, state_po, county_fips, year)

#Use slice to keep only the row with the highest votePercent within each group
electionDataHighest <- groupedData %>%
  slice(which.max(votePercent))

#Export CSV
write.csv(electionDataHighest, "ElectionDataHighest.csv", row.names = FALSE)
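
The county-level steps can be tried on a small hand-made data frame before running them on the full file. The sketch below is illustrative only: the rows, the placeholder state codes "XX" and "YY", and the names toy and toyWinners are made up and do not come from the CSV. It assumes dplyr is installed.

```r
library(dplyr)

# Made-up rows for illustration only; not taken from the real CSV.
toy <- data.frame(
  year = c(2020, 2020, 2020, 2020),
  state_po = c("XX", "XX", "YY", "YY"),
  county_fips = c(1001, 1001, 56045, 56045),
  party = c("DEMOCRAT", "REPUBLICAN", "DEMOCRAT", "REPUBLICAN"),
  candidatevotes = c(300, 700, 650, 350),
  totalvotes = c(1000, 1000, 1000, 1000)
)

# Pad FIPS codes to five characters: 1001 becomes "01001".
toy$county_fips <- sprintf("%05d", as.integer(toy$county_fips))

# Keep the row with the highest vote percentage in each county and year.
toyWinners <- toy %>%
  mutate(votePercent = round(candidatevotes / totalvotes * 100, 2)) %>%
  group_by(state_po, county_fips, year) %>%
  slice(which.max(votePercent)) %>%
  ungroup()

print(toyWinners)
```

Note that which.max() returns only the first maximum, so a county with an exact tie would be assigned to whichever party appears first in the data.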

# Question 2B
# Sum county votes by year, state, and party, then compute each party's statewide vote percentage
electionData <- electionData %>%
  group_by(year, state, party) %>%
  summarise(candidatevotes = sum(candidatevotes), totalvotes = sum(totalvotes)) %>%
  mutate(percentage = (candidatevotes / totalvotes) * 100) %>%
  ungroup()

# Filter for the winning party in each state
winningParties <- electionData %>%
  group_by(year, state) %>%
  filter(percentage == max(percentage)) %>%
  ungroup()
write.csv(winningParties, "ElectionDataState.csv", row.names = FALSE)
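
The state-level aggregation can be checked in the same way on made-up rows. This is a sketch under stated assumptions, not the analysis itself: the data, the state names "STATE A" and "STATE B", and the names toy, toyState, and toyStateWinners are hypothetical. It also assumes each county contributes exactly one row per party; if a county's votes were split across several rows, summing totalvotes would count that county's total more than once. The sketch uses slice_max() with with_ties = FALSE as one possible alternative to filter(percentage == max(percentage)), which keeps every tied row.

```r
library(dplyr)

# Made-up county rows for illustration only.
toy <- data.frame(
  year = rep(2020, 6),
  state = c("STATE A", "STATE A", "STATE A", "STATE A", "STATE B", "STATE B"),
  party = rep(c("DEMOCRAT", "REPUBLICAN"), 3),
  candidatevotes = c(300, 700, 200, 300, 650, 350),
  totalvotes = c(1000, 1000, 500, 500, 1000, 1000)
)

# Sum votes per year, state, and party, then compute the statewide share.
toyState <- toy %>%
  group_by(year, state, party) %>%
  summarise(candidatevotes = sum(candidatevotes),
            totalvotes = sum(totalvotes),
            .groups = "drop") %>%
  mutate(percentage = candidatevotes / totalvotes * 100)

# Keep exactly one winner per year and state, even if two parties tie.
toyStateWinners <- toyState %>%
  group_by(year, state) %>%
  slice_max(percentage, n = 1, with_ties = FALSE) %>%
  ungroup()

# Sanity check: one row per year and state combination.
stopifnot(nrow(toyStateWinners) == nrow(distinct(toyState, year, state)))

print(toyStateWinners)
```

The same row-count check could be applied to winningParties before exporting it, since a duplicated state and year would otherwise appear twice in the map's source data.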

2 US Presidential Election Winner by County (2000-2020) - Tableau Interactive Map

This interactive choropleth map allows users to select U.S. Presidential election years from 2000 to 2020 to view the winning candidates by county. By hovering over the map, users can access details such as the year, state, county, winning party, winning candidate, vote percentage, and total votes. The map also enables users to compare different election cycles to identify counties that have switched parties. Incorporating additional data, like median salary and demographics, could provide deeper insights into the trends influencing electoral outcomes.
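
The party switches that the map reveals visually could also be listed in R. The following is a minimal sketch, assuming a table of county winners with county_fips, year, and party columns like the exported county file; the rows, the placeholder FIPS codes, and the names toyWinners, toyFlips, previousParty, and flipped are made up for illustration.

```r
library(dplyr)

# Made-up county winners for illustration only.
toyWinners <- data.frame(
  county_fips = c("99001", "99001", "99001", "99002", "99002", "99002"),
  year = c(2012, 2016, 2020, 2012, 2016, 2020),
  party = c("REPUBLICAN", "REPUBLICAN", "REPUBLICAN",
            "DEMOCRAT", "REPUBLICAN", "DEMOCRAT")
)

# Compare each election's winner with the previous election in the same county.
toyFlips <- toyWinners %>%
  arrange(county_fips, year) %>%
  group_by(county_fips) %>%
  mutate(previousParty = lag(party),
         flipped = !is.na(previousParty) & party != previousParty) %>%
  ungroup() %>%
  filter(flipped)

print(toyFlips)
```

Each remaining row marks an election in which a county's winning party differed from the one in the preceding election in the table.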

3 US Presidential Election Winner by State (2000-2020) - Tableau Interactive Map

This interactive choropleth map allows users to select U.S. Presidential election years from 2000 to 2020 to view the winning candidates by state. By hovering over the map, users can access details such as the year, state, winning party, vote percentage, and total votes. The map also enables users to compare different election cycles to identify states that have switched parties.

