R course
Daniel Vaulot
2023-01-26
Metabarcode analysis - Introduction
Metabarcoding data
Factors affecting protist communities
Diversity
Fastq files
In oceanic waters:
… which depend on:
Microbial species in a sample
Richness vs. Evenness
H=−∑Si=1pi⋅logpi
Where:
pi = fraction of the entire population made up of species i (proportion of a species i relative to total number of species present)
S = numbers of species encountered
A high value of H would be a representative of a diverse and equally distributed community and lower values represent less diverse community. A value of 0 would represent a community with just one species.
Compute distance between samples:
Bray-Curtis dissimilarity: use abundance information
BCjk=1−2∑pi=1min(Nij,Nik)∑pi=1(Nij+Nik)
where Nij is the abundance of species i in sample j and p the total number of species
Jaccard similarity index
Intro to metabarcoding