Benchmarks PWM and AAA models on regulatory SNPs from allele-specific binding (ChIP-seq, GHT-SELEX).
-
Scripts (
scripts/)01_get_pwms.py— convert submitted PFMs to PWMs02_generate_uniform_thrs.sh— create PERFECTOS-APE thresholds03_extract_asb.py— (optional) rebuild ASB SNP tables04_make_filelist_for_annotation.py— generate a file with all SNP–PWM–THR combinations used by05_run_annotation.sh05_run_annotation.sh— run of annotation- additional files
ape.jar— PERFECTOS-APE tool_run_perfectos.sh— run multiple annotation processes in parallel_annotate_with_motifs.py— annotation of a single SNP-PWM-THR combination
-
Notebooks (
notebooks/)01_pwm_calculate_auc.ipynb— compute PWM AUCs02_aaa_calculate_auc_and_join_pwms.ipynb— compute AAA AUCs and merge with PWM03_draw_curves_examples.ipynb— plot concordance curves and selected figures
Execution order:
- run all scripts (01 -> 05)
- run all notebooks (01 -> 03)
pwm_submissions/— raw PWM submissionsaaa_submissions/— raw AAA predictionsmex4/— top-4 reference PWMs (MEX set)finals.tsv— IBIS Final TFsasb/— pre-filtered allele-specific SNPspwms/— converted PWMs (log-odds)thrs/— uniform thresholds (used for motif P-Value calculation)
tables/pwm_auc.tsv— PWM AUC scoresibis_auc_scores.tsv— combined PWM + AAA scoresfigures/— plots and curve data:auc_box.png— AUC boxplotaaa_auc_weighed_mean_strip.png— weighted mean AUC strip plot (AAA)curves_plots/A2G.*,curves_plots/G2A.*— summary concordance curves (PDF/PNG/SVG)curves_plots/curves/…— per-TF concordance curve TSVs for TFs: GCM1 (A2G) and MYF6 (G2A), across selected teams/models