📦 New Sparse Matrix Storage Format: Contiguous Clustering (CC)

This project introduces Contiguous Clustering (CC) — a novel storage format optimized for diagonally dominant sparse matrices. It was developed as part of our Bachelor of Technology thesis at XIM University under the guidance of Dr. Chandan Misra.

Traditional formats like CSR, COO, CDS, and JDS each have limitations in space and time efficiency, especially when dealing with matrices that exhibit strong diagonal dominance. Our CC format addresses these with smarter clustering and minimal index storage.

🧠 Motivation

Sparse matrices arise in many real-world applications:

Scientific simulations
Social network analysis
Recommendation systems
Machine learning and deep learning

However, traditional storage formats often:

Suffer from excessive memory overhead
Have inefficient access patterns
Underperform in SpMV (Sparse Matrix-Vector Multiplication)

🆕 What is Contiguous Clustering (CC)?

A new storage method that:

Clusters non-zero values along diagonals
Stores only start row/column indices instead of every (row, col) pair
Improves cache locality and reduces index redundancy

Key Components:

storeValues[] – Non-zero matrix values
clusterSizes[] – Number of elements in each diagonal cluster
startRowClus[], startColClus[] – Starting index for each cluster
offset[] – Diagonal offset (col - row)

📈 Performance Highlights

Matrix	Format	Space (MB)	Time (s)	GFlops
mc2dmpi	CDS	28.0	0.12	0.03
mc2dmpi	CC	12.0	0.009	0.3

✅ 30–50% reduction in memory usage
✅ Significant improvement in SpMV time
✅ Higher GFlops performance across tested datasets

💻 Implementation

Implemented in C with benchmark support for:

COO Format
CDS, JDS, PDS Formats
Our Proposed CC Format

Compilation

gcc cc_sparse_matrix.c -o cc_sparse -lm

Run

./cc_sparse

📘 Thesis Details

Title: New Storage Format for Sparse Matrices

Institution: XIM University, Bhubaneswar

Authors:

Arupa Nanda Swain
Vadali S S Bharadwaja
Satyabhusan Sahu
A Anushruth Reddy

Guide: Dr. Chandan Misra

Year: 2025

🔭 Future Scope

Extend CC for general sparse matrices (non-diagonal structures)
Integration with libraries like SciPy, Eigen, PETSc
GPU-optimized and parallelized versions
Test on real-world large-scale industrial datasets

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
SpMVOpenMP		SpMVOpenMP
analysisPart		analysisPart
intelMKL		intelMKL
reportsPublishedThesisAndMore		reportsPublishedThesisAndMore
JDS		JDS
JDS.c		JDS.c
README.md		README.md
compressedDia		compressedDia
compressedDia.c		compressedDia.c
finalCCCode		finalCCCode
finalCCCode.c		finalCCCode.c
finalCDSCode		finalCDSCode
finalCDSCode.c		finalCDSCode.c
noTextDiag		noTextDiag
noTextDiag.c		noTextDiag.c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

📦 New Sparse Matrix Storage Format: Contiguous Clustering (CC)

🧠 Motivation

🆕 What is Contiguous Clustering (CC)?

Key Components:

📈 Performance Highlights

💻 Implementation

Compilation

Run

📘 Thesis Details

🔭 Future Scope

📜 License

About

Uh oh!

Releases

Packages

Languages

arupa444/Continues-Clustering-CC

Folders and files

Latest commit

History

Repository files navigation

📦 New Sparse Matrix Storage Format: Contiguous Clustering (CC)

🧠 Motivation

🆕 What is Contiguous Clustering (CC)?

Key Components:

📈 Performance Highlights

💻 Implementation

Compilation

Run

📘 Thesis Details

🔭 Future Scope

📜 License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages