Additional file 3: Figure S3. Metadata analysis of viral species. (A) Number of sequencing reads in infant and maternal samples. Statistical significance assessed by Mann-Whitney. (B) Output of MaAslin and random forest analysis. (C) Important viral species and their statistical significance as assessed by MaAslin and random forest. (D) Heatmap of MaAslin identified viral species for mothers and infants. (E) ROC curves and AUC measures for delivery route and mother/infant classification using random forest and pROC packages in R. Pseudo-probabilities are plotted on graph using Out-Of-Bag (OOB) sample tree classification votes. Only infant data used in vaginal vs. C-section classification while both mother and infant data used for mother vs. infant classification. (F) Bray-Curtis Dissimilarity pairwise comparisons. Statistical significance assessed by Mann-Whitney. Kruskal-Wallis with Dunn’s multiple correction was used to assess feeding type (breastmilk vs. formula; breastmilk vs. mix; formula vs. mix) and Pre-pregnancy BMI (obese vs. overweight; obese vs. normal; overweight vs. normal). (G) Bray-Curtis Dissimilarity pairwise comparisons for related mother infant pairs. Statistical significance assessed by Mann-Whitney. Kruskal-Wallis with Dunn’s multiple correction was used to assess feeding type (breastmilk vs. formula; breastmilk vs. mix; formula vs. mix) and Pre-pregnancy BMI (obese vs. overweight; obese vs. normal; overweight vs. normal).