YJ

Fine-Scale Genetic Structure of the British Population Study Notes

Genetic Structure of the British Population

Introduction

  • Fine-scale genetic variation in human populations reflects historical demographic events and has implications for disease studies.
  • Analysis of genome-wide SNP data from 2,039 individuals across the UK reveals genetic differentiation patterns corresponding to geography.
  • Regional genetic differentiation and shared ancestry with 6,209 Europeans indicate historical demographic events.
  • Anglo-Saxon migrations contributed less than half of the genetic material in southeastern England.
  • Significant pre-Roman, post-Mesolithic movement into southeastern England from continental Europe is suggested.
  • Genetically differentiated subgroups exist in non-Saxon parts of the UK, rather than a general ‘Celtic’ population.

Methods and Data

  • PoBI Collection:
    • 2,039 samples from rural UK areas were genotyped.
    • Participants had all four grandparents born within 80 km of each other.
    • Grandparents' average birth year was 1885 (s.d. 18 years).
  • Continental European Samples:
    • 6,209 samples from 10 European countries were genotyped.
    • Samples were genotyped in the WTCCC2 study of multiple sclerosis.
    • Approximately 500,000 autosomal SNPs were used for compatibility.

Fine-Scale UK Population Differentiation

  • Population structure within the PoBI collection is limited.
    • Average pairwise FST estimates between 30 sample collection districts: 0.0007, maximum: 0.003.
  • FineSTRUCTURE Method:
    • A method for detecting fine-scale population structure was applied.
    • It models the correlation between nearby SNPs using multi-marker haplotypes.
    • The algorithm divides samples into genetic clusters hierarchically.

Genetic Clustering and Geography

  • Genetic clustering was assessed by plotting individuals on a UK map based on grandparents’ birthplaces.
  • Figure 1:
    • Shows a map for 17 clusters and a tree illustrating cluster relationships.
    • Correspondence between genetic clusters and geography is striking.
    • Clusters are highly localized, often occupying non-overlapping regions.
  • The approach separates groups in close proximity, such as Cornwall and Devon.
  • The hierarchical splitting of individuals into clusters reveals genetic differentiation.
    • Coarsest level: separates Orkney samples from all others.
    • Then Welsh samples separate from non-Orkney samples.
    • Subsequent splits differentiate north and south Wales, north of England, Scotland, and Northern Ireland.
    • Cornwall separates from the large English cluster.
  • A large cluster covers most of central and southern England.
  • Estimated FST values between clusters are small (average 0.002, maximum 0.007).
  • Patterns of ancestry as estimated by fineSTRUCTURE show highly significant differences between clusters (Supplementary Table 3).

Comparison with Other Analysis Tools

  • Compared fineSTRUCTURE with principal components and ADMIXTURE.
  • Principal components and ADMIXTURE broadly separate samples from Wales and Orkney.
  • These methods are unable to distinguish many of the other clusters found by fineSTRUCTURE.
  • Analyses confirm that clustering is not an artifact of preferentially selecting related individuals.

UK Clusters in Relation to Europe

  • Genetic differences between UK clusters reflect isolation and differing migration patterns from outside the UK.
  • Similar fineSTRUCTURE analyses were applied to 6,209 samples from continental Europe.
  • The genetic composition of the UK clusters was characterized with respect to the genetic groups in Europe.
  • European samples were divided into 51 groups.
  • Each UK cluster was estimated for an ancestry profile, characterizing it as a mixture of the ancestry of the 51 European groups.
  • The analyses used no geographical information.
  • Figure 2:
    • Illustrates estimated ancestry profiles and the sampling locations in Europe.
  • Some European groups feature substantially in the ancestry profiles of all UK clusters:
    • GER6 (western Germany).
    • BEL11 (northern Belgium).
    • FRA14 (north-west France).
    • DEN18 (Denmark).
    • SFS31 (southern France and Spain).
  • Some European groups feature substantially in some UK clusters but are absent from others:
    • GER3 (northern Germany).
    • FRA12 (France).
    • FRA17 (France).
  • Swedish groups (SWE117 and SWE121) feature in the ancestry profiles of the UK clusters.
  • Norwegian groups feature substantially in the ancestry profiles of the Orkney clusters, and to a lesser extent clusters involving Scotland and Northern Ireland.

Discussion

  • Haplotype-based analysis of genome-wide SNP data reveals fine-scale genetic differentiation within the UK, showing concordance with geography.
  • Few of these details have been captured previously.
  • The clustering is notable for differentiation over small distances and the stability of some clusters over large distances.
  • Genetic differentiation within the UK is not simply related to geographical distance.
  • Examples of fine-scale differentiation include:
    • Separation of islands within Orkney.
    • Devon from Cornwall.
    • Welsh/English borders from surrounding areas.
  • Edges between clusters follow natural geographical boundaries in some instances.
  • Clusters in Northern Ireland span the sea to Scotland.
  • Branch lengths of the hierarchical clustering tree indicate the relative differentiation between UK clusters.
  • North and south Wales are about as distinct genetically from each other as are central and southern England from northern England and Scotland.

Historical Context

  • Figure 3:
    • Gives an overview of major population groups and movements within and into the UK at different times, based on archaeological, historical, and linguistic evidence.
  • Samples in Orkney differ genetically from those in the rest of the UK, consistent with historical settlement and long-term control by Norse Vikings.
  • Ancestry profiles of the Orkney clusters show substantial contributions from groups in Norway.
  • Finding a lack of subtle population structure in central and southern England is informative.
  • There must have been sufficient movement of people and DNA since the last major invasions to make the region genetically relatively homogeneous.

Migration and Ancestry

  • European ancestry profiles for each of the UK genetic clusters were estimated.
  • Modern-day groupings are used as surrogates for the sources and results of major migration events.
  • Genetic differences persist through many generations, and there is good concordance with historical evidence.
  • European groups (e.g., GER3, FRA12, FRA17) contributing substantially to some, but not all, UK clusters suggests differential input of DNA to different parts of the UK.
  • Groups contributing significantly to the ancestry profiles of all UK clusters most probably represent relatively old migration events.
  • Groups contributing to only some UK clusters likely represent more recent migration events.
  • Briefly, the earliest migrations are best captured by GER6 (western Germany), BEL11 (Belgium), and FRA14 (north-western France).
  • A subsequent migration captured by FRA17 (France) contributed a substantial amount of ancestry to the UK outside Wales.
  • Known historical migrations and settlements, including the Saxons (GER3, DEN18) and the Norse Vikings (NOR53–NOR90), are seen.

GLOBETROTTER Analysis

  • GLOBETROTTER was applied to shed light on two major migration events, in Orkney and central/southern England.
  • It tests for recent admixture, identifies contributing groups, and dates the admixture.
  • GLOBETROTTER detected strong evidence that the largest Orkney cluster was influenced by a recent admixture event:
    • Contribution of <25\% of the DNA from groups in Norway.
    • Estimated to have occurred 29 generations ago (95% CI: 18–39 generations), corresponding to year 1100 (95% CI: 830–1418).
  • For Cent./S England, the method also detected an admixture event:
    • Contribution of <35\% of DNA from GER3.
    • Estimated date of 38 generations (95% CI: 36–40 generations), corresponding to year 858 (95% CI: 802–914).
  • GLOBETROTTER analyses detect likely source populations for the known historical migrations (Norse Vikings and Saxons).
  • Admixture dates estimated by GLOBETROTTER should provide upper bounds on the dates of the migrations.

Saxon Migrations

  • After the Saxon migrations, the language, place names, cereal crops, and pottery styles all changed.
  • Two separate analyses (ancestry profiles and GLOBETROTTER) show clear evidence in modern England of the Saxon migration.
  • The proportion of Saxon ancestry in Cent./S England is estimated as very likely to be under 50%, and most likely in the range of 10–40%.

Celtic Population

  • No evidence of a general ‘Celtic’ population was found in non-Saxon parts of the UK.
  • There were many distinct genetic clusters in these regions.
  • The ancestry profile of Cornwall is quite different from that of the Welsh clusters, and much closer to that of Devon, and Cent./S England.
  • The Welsh clusters represent populations that are more similar to the early post-Ice-Age settlers of Britain than those from elsewhere in the UK.

Conclusion

  • The first fine-scale dissection of subtle levels of genetic differentiation within a country has been presented.
  • The resulting genetic clusters and ancestry characterization provide novel insights into the peopling of the British Isles.
  • Genetic information can augment archaeological, linguistic, and historical approaches to understanding population history.
  • The study acts as a proof-of-principle for the power of detailed genetic analyses.