── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──
✔ dplyr 1.1.4 ✔ readr 2.1.5
✔ forcats 1.0.0 ✔ stringr 1.5.1
✔ ggplot2 3.5.1 ✔ tibble 3.2.1
✔ lubridate 1.9.4 ✔ tidyr 1.3.1
✔ purrr 1.0.2
── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
✖ dplyr::filter() masks stats::filter()
✖ dplyr::lag() masks stats::lag()
ℹ Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors
#ANOVA (Analysis of variance) with cuckoos data set
An analysis of variance (ANOVA) is a statistical test used to determine whether a continuous and a categorical variable differ. The categorical variable should have three or more groups, or levels.
library(DAAG)data("cuckoos")
Research question: Does egg length differ between the six species?
# A tibble: 6 × 2
species `var(length)`
<fct> <dbl>
1 hedge.sparrow 1.10
2 meadow.pipit 0.846
3 pied.wagtail 1.15
4 robin 0.465
5 tree.pipit 0.775
6 wren 0.569
##Running the anova
model1 <-aov(length ~ species, data = cuckoos)model2 <-lm(length ~ species, data = cuckoos)tukey <-TukeyHSD(model1)#make sure your object is called "tukey" for the code below.
#Extracting significant pairstukey_df <-as.data.frame(tukey$species)names(tukey_df)[names(tukey_df) =="p adj"] <-"p.adj"sig_pairs <-subset(tukey_df,p.adj <0.05)# Create list of comparisons from significant pairscomparisons <-strsplit(row.names(sig_pairs), "-")#Creating the plotlibrary(ggsignif)ggplot(cuckoos, aes(x = species, y = length)) +geom_boxplot() +geom_signif(comparisons = comparisons,annotations =ifelse(sig_pairs$p.adj <0.001, "***", ifelse(sig_pairs$p.adj <0.01, "**", ifelse(sig_pairs$p.adj <0.05, "*", ""))),y_position =seq(25, 35, length.out =nrow(sig_pairs)), #changing location of the barstip_length =0.02) +theme_bw()+theme(axis.text.x =element_text(angle =30, hjus =1, size =18),axis.text.y =element_text(size =18), axis.title.x =element_text(size =15), axis.title.y =element_text(size =20)) +labs(x ="Host species", y ="Egg length (mm)")