This repository contains a Jupyter notebook and the supporting files needed to filter ClinVar entries by gene attributes. The output is a dataframe and plots showing the number of missense variants of uncertain significance (VUS) in genes that are essential to HAP1 cells or that encode secreted, cytoplasmic, or nuclear proteins. To download the ClinVar file, navigate to the FTP site (https://ftp.ncbi.nlm.nih.gov/pub/clinvar/) and open tab_delimited/archive/ to find variant_summary_date.txt.gz.
All other required files are available for download here.
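
For orientation, the kind of filtering described above can be sketched with the Python standard library alone. This is not the notebook's code. It assumes the ClinVar summary file has tab-separated columns named GeneSymbol, Name, and ClinicalSignificance, that VUS rows carry the exact value "Uncertain significance", and that a missense change can be recognized from a protein change such as "(p.Arg175His)" in the Name column. The function names, the gene set, and the sample rows are hypothetical.

```python
import csv
import io
import re
from collections import Counter

# Assumption: a missense change shows up in the Name column as
# "(p.<three-letter aa><position><three-letter aa>)", e.g. "(p.Arg175His)".
PROTEIN_CHANGE = re.compile(r"\(p\.([A-Z][a-z]{2})\d+([A-Z][a-z]{2})\)")


def is_missense(name):
    """Heuristic: one amino acid replaced by a different one, no stop codon."""
    match = PROTEIN_CHANGE.search(name)
    if match is None:
        return False
    ref, alt = match.groups()
    return ref != alt and "Ter" not in (ref, alt)


def count_missense_vus(handle, genes):
    """Count distinct missense VUS per gene for the gene symbols in `genes`."""
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    seen = set()  # the same variant can appear once per genome assembly
    counts = Counter()
    for row in reader:
        gene, name = row["GeneSymbol"], row["Name"]
        if gene not in genes:
            continue
        if row["ClinicalSignificance"] != "Uncertain significance":
            continue
        if not is_missense(name) or (gene, name) in seen:
            continue
        seen.add((gene, name))
        counts[gene] += 1
    return counts


# Tiny made-up table for illustration; these rows are not real ClinVar data.
SAMPLE = "\n".join([
    "Name\tGeneSymbol\tClinicalSignificance\tAssembly",
    "c.524G>A (p.Arg175His)\tTP53\tUncertain significance\tGRCh37",
    "c.524G>A (p.Arg175His)\tTP53\tUncertain significance\tGRCh38",
    "c.586C>T (p.Arg196Ter)\tTP53\tUncertain significance\tGRCh38",
    "c.743G>A (p.Arg248Gln)\tTP53\tPathogenic\tGRCh38",
    "c.100A>G (p.Lys34Glu)\tALB\tUncertain significance\tGRCh38",
    "c.200C>T (p.Ser67Leu)\tBRCA1\tUncertain significance\tGRCh38",
])

demo_counts = count_missense_vus(io.StringIO(SAMPLE), {"TP53", "ALB"})
print(dict(demo_counts))
```

To run the same function on the downloaded file, open it with gzip.open(path, "rt", encoding="utf-8", newline="") and pass the resulting handle in place of the in-memory sample. The gene set would come from whichever attribute list is being examined (HAP1-essential, secreted, cytoplasmic, or nuclear).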