This project analyzes B cell receptor (BCR) sequences from single-cell RNA-seq data using the V(D)J output generated by the Cell Ranger pipeline. The goal is to assess immunoglobulin chain usage and clonotype frequency.
This analysis extracts and summarizes BCR clonotype information from 10x Genomics single-cell V(D)J sequencing data. The input includes:
all_contig_annotations.csvto identify and separate immunoglobulin heavy (IGH), kappa (IGK), and lambda (IGL) chainsclonotypes.csvto quantify and visualize clonotype frequency and chain usage
- Read
all_contig_annotations.csvfrom Cell Ranger VDJ output - Filter and extract entries for:
- IGH (immunoglobulin heavy chain)
- IGK (kappa light chain)
- IGL (lambda light chain)
- Save each chain as a separate CSV for downstream analysis
- Read
clonotypes.csvand count occurrences of each clonotype - Visualize clonotype frequencies using bar plots
- Calculate the proportion of IGK, IGL, and IGH chain usage across clonotypes
- Create pie plots to illustrate the distribution of immunoglobulin light and heavy chains
- Raw data is not publicly available due to client ownership and confidentiality.
- Some example outputs plots are organized by task in the
output/folder. - This project is designed for both reproducibility and clarity.
Author: Nasim Rahmatpour Email: nasimrahmatpour1@gmail.com GitHub: (https://github.com/nasimbio)