Average Semi-detached house prices in UK by county – Statistical Analysis using R

This is my first data visualization attempt and uses data from HM Land Registry to show to average cost of a semi-detached house in four counties across the past ten years.

You can see the full repository for the project on Github.

The Code

Here I have included the code at the time of writing this post. The git repository code may now differ slightly.

library("tidyverse")

regions  <- c(
  "Derbyshire",
  "Leicestershire",
  "Staffordshire",
  "Warwickshire"
)

data  <- read.csv("props.csv")

data %>%
  filter(Region_Name %in% regions) %>%
  filter(Date > "2013-01-01") %>%
  ggplot(aes(
    Date,
    Semi_Detached_Average_Price
  )) +
  geom_point(aes(color = Region_Name), size = 3) +
  theme_bw() +
  theme(axis.text.x = element_text(angle = 90, vjust = 0.5, hjust = 1)) +
  labs(
    title = "Average Semi-detached house prices per county",
    x = "Month and Year",
    y = "Average Price",
    color = "County"
  )

ggsave(
  "semi-detached-house-prices-derby-leicester-staffs-warwickshire.png",
  width = 4096,
  height = 2160,
  unit = "px"
)

The Graph

Graph to show increasing semi-detached house prices by county.

Observations

Warwickshire has been the most expensive county to buy a semi-detached house out of the four counties observed.

Derbyshire has been the least expensive county to buy a semi-detached house out of the four counties observed.

The shapes of the line formed seem consistent across the counties; the rate of price increase seems similar between them.

A lot can happen over ten years.