Despite their immense success, deep convolutional neural networks (CNNs) are costly to train, often stacking dozens to hundreds of convolutional layers. Standard convolutional operations are fundamentally limited by their linear nature and fixed activations, so many layers are needed before meaningful patterns in the data can be learned, which makes the approach prone to optimization difficulties and computationally inefficient. We therefore introduce RKAN (Residual Kolmogorov-Arnold Network), a module that can be conveniently added into each stage (level) of traditional deep networks and contributes polynomial feature transformations that complement the existing convolutional layers. Our proposed module offers consistent improvements over baseline models across different vision tasks on common benchmark datasets, such as CIFAR-100, ImageNet, and Pascal VOC.
You can also find our paper on arXiv (arXiv:2410.05500).
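As a rough illustration of the idea described above, the PyTorch sketch below shows one way a Chebyshev-polynomial KAN convolution could be wrapped as a residual branch around a backbone stage and merged by addition. This is only a simplified sketch: the class names (`ChebyshevKANConv2d`, `RKANBlock`) and all internal details (bottleneck width, normalization, how the branch matches the stage's stride) are illustrative assumptions, not the repository's actual implementation.

```python
import torch
import torch.nn as nn

class ChebyshevKANConv2d(nn.Module):
    """Illustrative Chebyshev-polynomial convolution: expands each input channel
    into Chebyshev basis functions T_0..T_degree, then mixes them with a single
    learnable convolution."""
    def __init__(self, in_channels, out_channels, degree=4, kernel_size=3, padding=1):
        super().__init__()
        self.degree = degree
        # One convolution over all polynomial orders stacked along the channel axis.
        self.conv = nn.Conv2d(in_channels * (degree + 1), out_channels,
                              kernel_size, padding=padding, bias=False)

    def forward(self, x):
        # Squash inputs to [-1, 1], the domain of the Chebyshev polynomials.
        x = torch.tanh(x)
        basis = [torch.ones_like(x), x]
        for _ in range(2, self.degree + 1):
            # Recurrence: T_n(x) = 2x * T_{n-1}(x) - T_{n-2}(x)
            basis.append(2 * x * basis[-1] - basis[-2])
        return self.conv(torch.cat(basis[: self.degree + 1], dim=1))


class RKANBlock(nn.Module):
    """Sketch of a polynomial (KAN) branch running parallel to a backbone stage,
    merged by addition. The wrapped stage must map in_channels -> out_channels
    with the given stride; channel/stride handling here is deliberately simple."""
    def __init__(self, stage, in_channels, out_channels, stride=2, reduce_factor=2):
        super().__init__()
        reduced = out_channels // reduce_factor
        self.stage = stage                                               # original backbone stage, unchanged
        self.reduce = nn.Conv2d(in_channels, reduced, 1, stride=stride)  # bottleneck, matches stage downsampling
        self.kan = ChebyshevKANConv2d(reduced, reduced)
        self.expand = nn.Conv2d(reduced, out_channels, 1)                # back to the stage's width
        self.norm = nn.BatchNorm2d(out_channels)

    def forward(self, x):
        out = self.stage(x)                                        # standard convolutional path
        branch = self.norm(self.expand(self.kan(self.reduce(x))))  # polynomial path
        return out + branch                                        # "addition" aggregation
```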
Networks are trained from scratch for 200 epochs using stochastic gradient descent (SGD) with a weight decay of 0.0005 (100 epochs on ImageNet with a weight decay of 0.0001). RandAugment, CutMix (applied with 50% probability), and MixUp are used for data augmentation.
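For reference, here is a minimal PyTorch sketch of such a training setup using torchvision's v2 transforms for RandAugment, CutMix, and MixUp. The learning rate, momentum, cosine schedule, input size, and the way CutMix and MixUp are alternated are assumptions made for illustration; the exact hyperparameters are those reported in the paper and set in the notebook.

```python
import torch
import torchvision
from torch import optim
from torchvision.transforms import v2

num_classes = 100                                              # e.g. CIFAR-100
model = torchvision.models.resnet50(num_classes=num_classes)   # placeholder backbone

# SGD with the weight decay quoted above (0.0005; 0.0001 would be used on ImageNet).
# Learning rate, momentum, and the cosine schedule are assumed values.
optimizer = optim.SGD(model.parameters(), lr=0.1, momentum=0.9, weight_decay=5e-4)
scheduler = optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=200)  # 200 epochs

# Per-image augmentation, including RandAugment (crop size depends on the dataset).
train_transform = v2.Compose([
    v2.RandomResizedCrop(224),
    v2.RandomHorizontalFlip(),
    v2.RandAugment(),
    v2.ToImage(),
    v2.ToDtype(torch.float32, scale=True),
])

# Batch-level mixing: CutMix on roughly half of the batches, MixUp otherwise
# (one possible reading of "CutMix with a 50% probability" plus MixUp).
cutmix = v2.CutMix(num_classes=num_classes)
mixup = v2.MixUp(num_classes=num_classes)

def mix_batch(images, labels):
    if torch.rand(1).item() < 0.5:
        return cutmix(images, labels)
    return mixup(images, labels)
```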
It should be noted that the RKAN module performs exceptionally well on small datasets and on datasets that are prone to overfitting. In multi-stage setups, RKAN performs better when RKAN blocks are implemented only in the last two stages. More details can be found in our original paper. On ImageNet, performance gains for more recent architectures (ConvNeXt and Swin) have not yet been observed.
All necessary code is included in the repository to run RKAN with different backbone architectures on different datasets.
- Clone the repository or download the ZIP file
- Run the `training.ipynb` notebook
- Key configuration parameters:
```python
# Select dataset
dataset = "cifar_100"  # Options: cifar_100, cifar_10, svhn, tiny_imagenet, food_101, caltech_256, imagenet_1k

# Select model
model_name = "resnet50"  # See model_configs for all supported models

# RKAN configuration
reduce_factor = [2, 2, 2, 2]  # Reduce factor for each stage
mechanisms = ["addition", "addition", "addition", "addition"]  # Aggregation mechanism for each stage; set to None to remove RKAN from that stage (added only to stage 4 by default)
kan_type = "chebyshev"  # Type of KAN convolutions, including chebyshev, rbf, b_spline, jacobi, hermite, etc.
```
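For example, following the earlier note about implementing RKAN blocks only in the last two stages, the stage-wise settings could be changed as below. This is just a usage sketch based on the parameters above; the accepted values are defined in the notebook.

```python
# RKAN only in stages 3 and 4; None disables the RKAN branch for a stage.
reduce_factor = [2, 2, 2, 2]
mechanisms = [None, None, "addition", "addition"]
kan_type = "chebyshev"
```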
RKAN Model | Top-1 Accuracy (%) | Base Model | Top-1 Accuracy (%) |
---|---|---|---|
RKAN-ResNeXt-101 | 86.15 | ResNeXt-101 | 85.28 |
RKAN-ResNeXt-50 | 85.08 | ResNeXt-50 | 84.40 |
RKAN-ResNet-152 | 85.40 | ResNet-152 | 84.63 |
RKAN-ResNet-101 | 85.12 | ResNet-101 | 84.00 |
RKAN-ResNet-50 | 84.56 | ResNet-50 | 84.12 |
RKAN-RegNetY-32GF | 87.03 | RegNetY-32GF | 85.44 |
RKAN-RegNetY-8GF | 86.11 | RegNetY-8GF | 84.77 |
RKAN-RegNetY-3.2GF | 85.46 | RegNetY-3.2GF | 84.68 |
RKAN-DenseNet-201 | 85.35 | DenseNet-201 | 84.28 |
RKAN-DenseNet-169 | 84.84 | DenseNet-169 | 84.00 |
RKAN-DenseNet-121 | 84.73 | DenseNet-121 | 84.09 |
RKAN Model | Top-1 Accuracy (%) | Base Model | Top-1 Accuracy (%) |
---|---|---|---|
RKAN-ResNeXt-101 | 90.82 | ResNeXt-101 | 89.87 |
RKAN-ResNeXt-50 | 90.00 | ResNeXt-50 | 89.20 |
RKAN-ResNet-152 | 90.36 | ResNet-152 | 89.70 |
RKAN-ResNet-101 | 90.09 | ResNet-101 | 89.29 |
RKAN-ResNet-50 | 89.48 | ResNet-50 | 88.84 |
RKAN-RegNetY-32GF | 91.62 | RegNetY-32GF | 90.72 |
RKAN-RegNetY-8GF | 91.17 | RegNetY-8GF | 90.43 |
RKAN-RegNetY-3.2GF | 90.09 | RegNetY-3.2GF | 89.54 |
RKAN-DenseNet-201 | 89.58 | DenseNet-201 | 88.83 |
RKAN-DenseNet-169 | 89.74 | DenseNet-169 | 89.17 |
RKAN-DenseNet-121 | 89.43 | DenseNet-121 | 88.98 |
RKAN Model | Top-1 Accuracy (%) | Base Model | Top-1 Accuracy (%) |
---|---|---|---|
RKAN-Wide-ResNet-101 | 77.56 | Wide-ResNet-101 | 75.46 |
RKAN-ResNeXt-101 | 77.48 | ResNeXt-101 | 75.57 |
RKAN-ResNeXt-50 | 75.41 | ResNeXt-50 | 73.56 |
RKAN-ResNet-152 | 76.82 | ResNet-152 | 74.88 |
RKAN-ResNet-101 | 76.29 | ResNet-101 | 74.51 |
RKAN-ResNet-50 | 74.43 | ResNet-50 | 72.85 |
RKAN-ResNet-34 | 72.03 | ResNet-34 | 70.96 |
RKAN-RegNetY-32GF | 77.79 | RegNetY-32GF | 75.90 |
RKAN-RegNetY-8GF | 77.13 | RegNetY-8GF | 75.58 |
RKAN-RegNetY-3.2GF | 76.05 | RegNetY-3.2GF | 74.07 |
RKAN-RegNetX-3.2GF | 75.26 | RegNetX-3.2GF | 73.83 |
RKAN-DenseNet-161 | 75.79 | DenseNet-161 | 74.14 |
RKAN-DenseNet-201 | 75.12 | DenseNet-201 | 73.10 |
RKAN-DenseNet-169 | 74.88 | DenseNet-169 | 73.55 |
RKAN-DenseNet-121 | 74.13 | DenseNet-121 | 72.76 |
RKAN-ConvNeXt-T | 72.07 | ConvNeXt-T | 70.78 |
RKAN-Swin-T | 68.48 | Swin-T | 67.05 |
RKAN Model | Top-1 Accuracy (%) | Base Model | Top-1 Accuracy (%) |
---|---|---|---|
RKAN-ResNet-152 | 80.73 | ResNet-152 | 80.22 |
RKAN-ResNet-101 | 80.09 | ResNet-101 | 79.31 |
RKAN-ResNet-50 | 77.93 | ResNet-50 | 77.21 |
RKAN-ResNet-34 | 74.33 | ResNet-34 | 73.72 |
RKAN-RegNetY-8GF | 81.38 | RegNetY-8GF | 81.02 |
RKAN-RegNetY-3.2GF | 79.62 | RegNetY-3.2GF | 79.03 |
RKAN-RegNetX-3.2GF | 79.11 | RegNetX-3.2GF | 78.70 |
RKAN-DenseNet-201 | 79.02 | DenseNet-201 | 78.41 |
RKAN-DenseNet-169 | 78.00 | DenseNet-169 | 77.25 |
RKAN-DenseNet-121 | 76.34 | DenseNet-121 | 75.05 |
If you find our work useful, please consider citing our paper:
```bibtex
@article{yu2024rkan,
  title={Residual Kolmogorov-Arnold Network for Enhanced Deep Learning},
  author={Yu, Ray Congrui and Wu, Sherry and Gui, Jiang},
  journal={arXiv preprint arXiv:2410.05500},
  year={2024}
}
```