Skip to content

pauloizquierdo/GWASAssistedGenomicPrediction-YBC

Repository files navigation

Genome-Wide Association and Genomic Prediction in Yellow Bean Collection (YBC)

This repository contains code and data for the publication:

"Genome-wide association and genomic prediction for Fe and Zn concentration and Fe bioavailability in a yellow bean collection of dry beans"
🔗 Frontiers in Genetics, DOI: 10.3389/fgene.2024.1330361


🧬 Abstract

Dry bean (Phaseolus vulgaris L.) is a nutrient-dense crop central to global diets and the focus of biofortification efforts. This study explores the genetic basis and predictive modeling of iron (Fe), zinc (Zn), and Fe bioavailability (FeBio) in a 295-genotype Yellow Bean Collection (YBC) evaluated across two years and locations.

Key insights:

  • GWAS revealed trait-specific quantitative trait nucleotides (QTNs).
  • Genomic prediction (GP) accuracies ranged from 0.12 (Fe) to 0.72 (FeBio).
  • Adding GWAS-derived QTNs improved FeBio prediction by 0.03.
  • Fe concentration and FeBio were largely uncorrelated, supporting FeBio as an independent trait for breeding.

🗂 Repository Structure

📁 Scripts

File Description
1.Phenotype_varianCompo_GWAS_popStruct_YBC.R Analyzes correlations, performs variance component analysis, PCA, kinship matrix calculation, GWAS, and visualizations for phenotypes and seed color
2.RKHS_SSI_Yield_Fe_Zn_YBC.R Runs single-site inference (SSI) and RKHS models for Yield, Fe, and Zn across MI and NE (2018–2019)
3.RKHS_SSI_FeBio_YBC.R Fits RKHS models for FeBio including GWAS-assisted fixed effects
4.RKHS_SSI_YBC_predictionSet.R Predicts trait values in the full Yellow Bean Collection using RKHS/SSI models
5.RKHS_SSI_Andean_predictionSet.R Applies models specifically to Andean accessions in the YBC prediction set

📁 Data

  • YBC_GWASAssistedGP.RData:
    Contains phenotype, genotype, hapmap, and SNP position data for all analyses.

  • YBC_phenotype_Color_CT.csv:
    Contains agronomic, mineral, and seed quality traits, cooking time, and population structure annotations.
    Metadata was curated from:

  • GBS_barcode_plate_info_YBC.txt:
    Plate layout and metadata for genotyping-by-sequencing samples.

🔗 Raw sequencing data:
Available through NCBI BioProject PRJNA1061170


📦 Software Requirements

  • R (≥ 4.0.0)
  • Key packages:
    • tidyverse
    • data.table
    • rrBLUP
    • sommer
    • BGLR
    • FactoMineR, ggplot2 (for PCA, plots)

✅ Outputs

  • Trait variance components
  • GWAS summary tables and Manhattan plots
  • Prediction accuracy metrics
  • Marker effects and model coefficients

📣 Citation

If you use this repository, please cite:

Izquierdo, P., et al. (2024).
Genome-wide association and genomic prediction for Fe and Zn concentration and Fe bioavailability in a yellow bean collection of dry beans.
Frontiers in Genetics. DOI: 10.3389/fgene.2024.1330361


About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages