Skip to content

RadCLIP is a foundation model for radiologic imaging that leverages a Vision–Language Pre-training (VLP) framework to align 2D/3D radiologic images with their textual descriptions, improving diagnostic accuracy and efficiency in clinical workflows.

Notifications You must be signed in to change notification settings

luzhixiu/RadCLIP

Repository files navigation

This is the official Repository of RadCLIP: RadCLIP: Enhancing Radiologic Image Analysis Through Contrastive Language–Image Pretraining (https://pubmed.ncbi.nlm.nih.gov/40434863/)

RadCLIP

RadCLIP is trained on over 1.15 million 2D radiologic image–text pairs and 52,766 3D volumetric pairs spanning X-ray, CT, and MRI, drawn from 14 public collections.

Dataset

Our architecture builds on a dual-encoder CLIP framework: a frozen text encoder paired with a fine-tuned 2D image encoder, optimized with an InfoNCE contrastive loss to align image–text embeddings. Volumetric studies are handled by a lightweight, multi-head self-attention slice-pooling adapter that aggregates 2D slice features into a unified 3D representation—avoiding costly 3D convolutions.

RadCLIP

🔗 Pre-trained Model Links

All RadCLIP checkpoints are hosted on Hugging Face:

RadCLIP Model Weights : https://huggingface.co/zluvolyote/RadCLIP

How to Use RadCLIP

First, install libraries and dependencies specified in requirement.txt:

numpy==2.2.6

pandas==2.2.3

pydicom==3.0.1

torch==2.7.0

torchvision==0.22.0 scikit-learn==1.6.1

matplotlib==3.10.3

transformers==4.52.0

Pillow==11.2.1

Then, run through the inference example in "RadCLIP_Inference_Example_VQA.ipynb", which includes how to initialize the model, load the weights, and make image-text matching, if you are interested in doing classification instead, simple skip the similarity matching section and extract features using provided functions.

About

RadCLIP is a foundation model for radiologic imaging that leverages a Vision–Language Pre-training (VLP) framework to align 2D/3D radiologic images with their textual descriptions, improving diagnostic accuracy and efficiency in clinical workflows.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published