Video Frame Interpolation Rankings
and Video Deblurring Rankings

Researchers! Please develop joint video deblurring and frame interpolation models. Use the best available method for resolving time-to-location ambiguity between two input frames, which BiM-VFI currently has, and train at least one of your models with Style loss, also called Gram matrix loss (the best perceptual loss function):

Source: FILM - Loss Functions Ablation https://film-net.github.io/

Source: MoSt-DSA - Loss Function Comparison https://arxiv.org/html/2407.07078
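
For readers unfamiliar with the Style (Gram matrix) loss mentioned above, here is a minimal sketch in plain Python. It assumes the feature maps have already been extracted (in practice they typically come from a pretrained network such as VGG) and are given as C channels of flattened values. The function names `gram_matrix` and `style_loss` and the normalization are illustrative assumptions; individual papers use their own layer choices and weights.

```python
def gram_matrix(features):
    """Gram matrix of a feature map given as C channels, each a flat list of N values.

    Entry (i, j) is the mean product of channel i and channel j over all positions.
    """
    n = len(features[0])
    return [[sum(a * b for a, b in zip(ci, cj)) / n for cj in features]
            for ci in features]


def style_loss(pred_features, target_features):
    """Mean squared difference between the two Gram matrices (assumed normalization)."""
    g_pred = gram_matrix(pred_features)
    g_tgt = gram_matrix(target_features)
    c = len(g_pred)
    return sum((g_pred[i][j] - g_tgt[i][j]) ** 2
               for i in range(c) for j in range(c)) / (c * c)


# Hypothetical 2-channel feature maps with 3 spatial positions each.
target = [[1, 2, 3], [4, 5, 6]]
shifted = [[3, 1, 2], [6, 4, 5]]  # same values, different spatial arrangement
print(style_loss(shifted, target))
```

Because the Gram matrix sums over spatial positions, the loss compares channel co-activation statistics rather than pixel-aligned values, which is why it is usually described as a texture-oriented perceptual loss.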

List of Rankings

Each ranking includes only the best model of each method.

The rankings exclude all event-based and spike-guided models.

Joint Video Deblurring and Frame Interpolation Rankings

👑 RBI with real motion blur✔️: LPIPS😍 (no data)
This will be the King of all rankings. We look forward to ambitious researchers.
RBI with real motion blur✔️: PSNR😞>=28.5dB

Video Deblurring Rankings

(to do)

RBI with real motion blur✔️: PSNR😞>=28.5dB

📝 Note: Pre-BiT++ is pre-trained on Adobe240 and then fine-tuned on RBI.

RK	Model Links: Venue Repository	PSNR ↑ {Input fr.} Table 1&6 BiT
1	Pre-BiT++	31.32 {3}
2	DeMFI-Net_rb(5,3)	29.03 {4}
3	PRF₄ -Large	28.55 {5}

X-TEST (×8): LPIPS😍<=0.098

📝 Note: This ranking has the most up-to-date layout.

RK	Model Links: Venue Repository	LPIPS ↓ {Input fr.} Table 1 DvP	LPIPS ↓ {Input fr.} Table 4 BiM-VFI	LPIPS ↓ {Input fr.} Table 4&7 GIMM-VFI
1	DvP+	0.062 {4}	-	-
2	BiM-VFI	-	0.068 {2}	-
3	M2M-PWC	0.086 {2}	0.080 {2}	0.158 {2}
4	XVFI (S_tst=5)	0.089 {2}	-	-
5	UPR-Net LARGE	-	0.093 {2}	0.154 {2}
6-7	GIMM-VFI-F-P	-	-	0.098 {2}
6-7	IA-Clearer [D,R]_u AMT-S	-	0.098 {2}	-
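
The ordering in the table above appears consistent with ranking each model by its lowest reported LPIPS across the source tables, keeping only models at or below the threshold in the heading, and giving tied models a shared rank range such as 6-7. This is an inferred reading of the table, not a documented procedure of this repository. A minimal sketch under that assumption, with the hypothetical helper `rank_models`:

```python
def rank_models(scores, threshold):
    """Rank models by their best (lowest) reported LPIPS.

    scores: dict mapping model name -> list of values, None where a table has no score.
    Returns (rank_label, model, best_value) tuples; ties share a label like "6-7".
    """
    best = {}
    for model, values in scores.items():
        reported = [v for v in values if v is not None]
        if reported and min(reported) <= threshold:
            best[model] = min(reported)

    ordered = sorted(best.items(), key=lambda kv: kv[1])
    ranked = []
    i = 0
    while i < len(ordered):
        j = i
        # Extend the group while the next model has exactly the same score.
        while j + 1 < len(ordered) and ordered[j + 1][1] == ordered[i][1]:
            j += 1
        label = str(i + 1) if i == j else f"{i + 1}-{j + 1}"
        for model, value in ordered[i:j + 1]:
            ranked.append((label, model, value))
        i = j + 1
    return ranked


# Values copied from the X-TEST (×8) table above; None marks "-".
x_test = {
    "DvP+": [0.062, None, None],
    "BiM-VFI": [None, 0.068, None],
    "M2M-PWC": [0.086, 0.080, 0.158],
    "XVFI (S_tst=5)": [0.089, None, None],
    "UPR-Net LARGE": [None, 0.093, 0.154],
    "GIMM-VFI-F-P": [None, None, 0.098],
    "IA-Clearer [D,R]_u AMT-S": [None, 0.098, None],
}
ranking = rank_models(x_test, threshold=0.098)
for label, model, value in ranking:
    print(f"{label}\t{model}\t{value:.3f}")
```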

SNU-FILM-arb Extreme (×16): LPIPS😍<=0.095

📝 Note: This ranking has the most up-to-date layout.

RK	Model Links: Venue Repository	LPIPS ↓ {Input fr.} Table 4&7 GIMM-VFI	LPIPS ↓ {Input fr.} Table 1 BiM-VFI
1	GIMM-VFI-F-P	0.058 {2}	-
2	BiM-VFI	-	0.070 {2}
3	M2M-PWC	0.112 {2}	0.089 {2}
4	UPR-Net LARGE	0.111 {2}	0.092 {2}
5	IA-Clearer [D,R]_u IFRNet	-	0.095 {2}

SNU-FILM-arb Hard (×8): LPIPS😍<=0.048

📝 Note: This ranking has the most up-to-date layout.

RK	Model Links: Venue Repository	LPIPS ↓ {Input fr.} Table 7 GIMM-VFI	LPIPS ↓ {Input fr.} Table 1 BiM-VFI
1	GIMM-VFI-F-P	0.030 {2}	-
2	BiM-VFI	-	0.039 {2}
3	IA-Clearer [D,R]_u IFRNet	-	0.048 {2}

SNU-FILM-arb Medium (×4): LPIPS😍<=0.026

📝 Note: This ranking has the most up-to-date layout.

RK	Model Links: Venue Repository	LPIPS ↓ {Input fr.} Table 7 GIMM-VFI	LPIPS ↓ {Input fr.} Table 1 BiM-VFI
1	GIMM-VFI-R-P	0.016 {2}	-
2	BiM-VFI	-	0.023 {2}
3	IA-Clearer [D,R]_u IFRNet	-	0.026 {2}

SNU-FILM Extreme (×2): LPIPS😍<=0.1099

📝 Note: This ranking has the most up-to-date layout.

RK	Model Links: Venue Repository	LPIPS ↓ {Input fr.} Table 1 HFD	LPIPS ↓ {Input fr.} Table 1 UGFI	LPIPS ↓ {Input fr.} Table 1 MoMo	LPIPS ↓ {Input fr.} Table 2 BiM-VFI	LPIPS ↓ {Input fr.} Table 2 DvP	LPIPS ↓ {Input fr.} Table 1 EDEN	LPIPS ↓ {Input fr.} Table 1 CBBD
1	HFD	0.0839 {2}	-	-	-	-	-	-
2	UGFI 𝓛_S	-	0.0864 {2}	-	-	-	-	-
3	MoMo	-	-	0.0872 {2}	-	-	-	-
4	FILM-𝓛_S	-	0.0899 {2}	0.0889 {2}	-	-	-	-
5	PerVFI	0.0901 {2}	-	0.0902 {2}	-	-	-	-
6-7	BiM-VFI	-	-	-	0.097 {2}	-	-	-
6-7	DvP+	-	-	-	-	0.097 {4}	-	-
8	EDEN	-	-	-	-	-	0.0986 {2}	-
9	CBBD	0.1040 {2}	-	-	-	-	0.1101 {2}	0.104 {2}
10	EMA-VFI	0.1099 {2}	-	0.1099 {2}	0.113 {2}	0.119 {2}	-	0.114 {2}

SNU-FILM Hard (×2): LPIPS😍<=0.052

📝 Note: This ranking has the most up-to-date layout.

RK	Model Links: Venue Repository	LPIPS ↓ {Input fr.} Table 1 HFD	LPIPS ↓ {Input fr.} Table 2 DvP	LPIPS ↓ {Input fr.} Table 1 MoMo	LPIPS ↓ {Input fr.} Table 1 UGFI	LPIPS ↓ {Input fr.} Table 1 CBBD	LPIPS ↓ {Input fr.} Table 2 BiM-VFI
1	HFD	0.0405 {2}	-	-	-	-	-
2	DvP+	-	0.041 {4}	-	-	-	-
3	MoMo	-	-	0.0419 {2}	-	-	-
4	UGFI 𝓛_S	-	-	-	0.0420 {2}	-	-
5	FILM-𝓛_S	-	-	0.0429 {2}	0.0434 {2}	-	-
6	CBBD	0.0467 {2}	-	-	-	0.047 {2}	-
7	PerVFI	0.0480 {2}	-	0.0561 {2}	-	-	-
8	BiM-VFI	-	-	-	-	-	0.052 {2}

SNU-FILM Medium (×2): LPIPS😍<=0.024

📝 Note: This ranking has the most up-to-date layout.

RK	Model Links: Venue Repository	LPIPS ↓ {Input fr.} Table 1 HFD	LPIPS ↓ {Input fr.} Table 2 DvP	LPIPS ↓ {Input fr.} Table 1 MoMo	LPIPS ↓ {Input fr.} Table 1 UGFI	LPIPS ↓ {Input fr.} Table 1 CBBD	LPIPS ↓ {Input fr.} Table 6 EDSC
1	HFD	0.0191 {2}	-	-	-	-	-
2	DvP+	-	0.020 {4}	-	-	-	-
3	MoMo	-	-	0.0202 {2}	-	-	-
4	UGFI 𝓛_S	-	-	-	0.0209 {2}	-	-
5	FILM-𝓛_S	-	-	0.0213 {2}	0.0215 {2}	-	-
6	CBBD	0.0274 {2}	-	-	-	0.022 {2}	-
7-8	EDSC-𝓛_F	-	-	-	-	-	0.024 {2}
7-8	SepConv - 𝓛_F	-	-	-	-	-	0.024 {2}

Vimeo-90K triplet: LPIPS😍<=0.018

RK	Model	LPIPS ↓ {Input fr.}	Training dataset	Official repository	Practical model	VapourSynth
1	EAFI-𝓛_ecp	0.012 {2}	Vimeo-90K triplet	-	EAFI-𝓛_ecp	-
2	UGFI 𝓛_S	0.0126 {2}	Vimeo-90K triplet	-	UGFI 𝓛_S	-
3	SoftSplat - 𝓛_F	0.013 {2}	Vimeo-90K triplet		SoftSplat - 𝓛_F	-
4	FILM-𝓛_S	0.0132 {2}	Vimeo-90K triplet		FILM-𝓛_S	-
5	MoMo	0.0136 {2}	Vimeo-90K triplet		MoMo	-
6	EDSC_s-𝓛_F	0.016 {2}	Vimeo-90K triplet		EDSC_s-𝓛_F	-
7	CtxSyn - 𝓛_F	0.017 {2}	proprietary	-	CtxSyn - 𝓛_F	-
8	PerVFI	0.018 {2}	Vimeo-90K triplet		PerVFI	-

Vimeo-90K triplet: LPIPS😍(SqueezeNet)<=0.014

RK	Model	LPIPS ↓	Originally announced	Official repository	Practical model	VapourSynth
1	CDFI w/ adaP/U	0.008 ¹	March 2021 ²		-	-
2	EDSC_s-𝓛_F	0.010 ²	June 2020 ³		EDSC_s-𝓛_F	-
3	DRVI	0.013 ⁴	August 2021 ⁴	-	-	-

Vimeo-90K triplet: PSNR😞>=36dB

RK	Model	PSNR ↑ {Input fr.}	Originally announced or Training dataset	Official repository	Practical model	VapourSynth
1	MA-GCSPA-triplets	36.85 {2}	Vimeo-90K triplet		-	-
2	VFIformer + HRFFM ENH:	36.69 {2}	Vimeo-90K triplet	ENH: -	-	-
3	LADDER-L	36.65 {2}	Vimeo-90K triplet	-	-	-
4-5	EMA-VFI	36.64dB ⁵	March 2023 ⁵		-	-
4-5	VFIMamba	36.64 {2}	Vimeo-90K triplet & X-TRAIN		-	-
6	IQ-VFI	36.60 {2}	Vimeo-90K triplet	-	-	-
7	DQBC-Aug	36.57dB ⁶	April 2023 ⁶		-	-
8	TTVFI	36.54dB ⁷	July 2022 ⁷		-	-
9	AMT-G	36.53dB ⁸	April 2023 ⁸		-	-
10	AdaFNIO	36.50dB ⁹	November 2022 ⁹		-	-
11	FGDCN-L	36.46dB ¹⁰	November 2022 ¹⁰		-	-
12	VFIFT	36.43 {2}	Vimeo-90K triplet	-	-	-
13	UPR-Net LARGE	36.42dB ¹¹	November 2022 ¹¹		-	-
14	EAFI-𝓛_ecc	36.38dB ¹²	July 2022 ¹²	-	EAFI-𝓛_ecp	-
15	H-VFI-Large	36.37dB ¹³	November 2022 ¹³	-	-	-
16	UGFI 𝓛₁	36.34 {2}	Vimeo-90K triplet	-	UGFI 𝓛_S	-
17	VFIT-B	36.33 {2}	?		-	-
18	SoftSplat - 𝓛_Lap with ensemble	36.28dB ¹⁴	March 2020 ¹⁵		SoftSplat - 𝓛_F	-
19	ProBoost-Net (448x256)	36.23 {2}	?	-	-	-
20	NCM-Large	36.22dB ¹⁶	July 2022 ¹⁶	-	-	-
21-22	IFRNet large	36.20dB ¹⁷	May 2022 ¹⁷		-	-
21-22	RAFT-M2M++ ENH:	36.20 {2}	Vimeo-90K triplet		-	-
23-24	EBME-H*	36.19dB ¹⁸	June 2022 ¹⁸		-	-
23-24	RIFE-Large	36.19 {2}	Vimeo-90K triplet		Practical-RIFE 4.25
25	ABME	36.18dB ¹⁹	August 2021 ¹⁹		-	-
26	HiFI	36.12 {2}	Pretraining: Raw videos Training: Vimeo-90K triplet & X-TRAIN	-	-	-
27	TDPNet_nv w/o MRTM	36.069 {2}	Vimeo-90K triplet	-	TDPNet	-
28	FILM-𝓛₁	36.06 {2}	Vimeo-90K triplet		FILM-𝓛_S	-

Vimeo-90K septuplet: PSNR😞>=36dB

RK	Model	PSNR ↑ {Input fr.}	Originally announced or Training dataset	Official repository	Practical model	VapourSynth
1	Swin-VFI	38.04 {6}	Vimeo-90K septuplet	-	-	-
2	JNMR	37.19dB ²⁰	June 2022 ²⁰		-	-
3	VFIT-B	36.96 {4}	Vimeo-90K septuplet		-	-
4	VRT	36.53 {4}	Vimeo-90K septuplet		-	-
5	ST-MFNet	36.507dB ²¹	November 2021 ²²		-	-
6	EDENVFI PVT(15,15)	36.387dB ²¹	July 2023 ²¹	-	-	-
7	IFRNet	36.37 {2}	Vimeo-90K septuplet		-	-
8	RN-VFI	36.33 {4}	Vimeo-90K septuplet	-	-	-
9	FLAVR	36.3 {4}	Vimeo-90K septuplet		-	-
10	DBVI	36.17dB ²³	October 2022 ²³		-	-
11	EDC	36.14dB ²⁰	February 2022 ²⁴		-	-

Appendix 1: Runtime

Model Links: Venue Repository	Runtime(s) (×2) A100 1280×768 Table 7 BiM-VFI
BiM-VFI	0.151
EMA-VFI	0.104
GIMM-VFI-R	0.494
UPR-Net LARGE	0.053

Appendix 3: Metrics selection for the rankings

Currently, the most commonly used metrics in existing work on video frame interpolation and video deblurring are PSNR, SSIM and LPIPS, in that order.

The main purpose of creating my rankings is to look for the best perceptually-oriented model for practical applications - hence the primary metric in my rankings will be the most common perceptual image quality metric in scientific papers: LPIPS.

At the time of writing, in October 2023, I have found only one other perceptual image quality metric used for VFI, DISTS, in one paper, and one bespoke VFI metric, FloLPIPS [arXiv], also in one paper. Unfortunately, neither paper evaluates the models that perform best on the LPIPS metric. If a researcher later evaluates the top LPIPS models using alternative, better perceptual metrics, I will of course be happy to add rankings based on those metrics.

I would like to use only one metric, LPIPS. Unfortunately, many of the best VFI and video deblurring methods are still evaluated only with PSNR, or with PSNR and SSIM. For this reason, I additionally present rankings based on PSNR. They show the models that could, after perceptually-oriented training, be the best for practical applications, and they provide a source of knowledge for building even better practical models in the future.
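
As a reminder of what the PSNR-based rankings measure, PSNR is defined as 10·log10(MAX² / MSE), where MSE is the mean squared error between the reference and the output and MAX is the largest possible pixel value. The sketch below is a plain-Python illustration on flattened pixel lists; the function name `psnr` and the default `max_value=255.0` (8-bit images) are assumptions, and the papers in the rankings may differ in details such as color space or how scores are averaged over frames.

```python
import math


def psnr(reference, distorted, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized flat pixel lists."""
    if len(reference) != len(distorted):
        raise ValueError("inputs must have the same number of pixels")
    mse = sum((r - d) ** 2 for r, d in zip(reference, distorted)) / len(reference)
    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / mse)


# Hypothetical 4-pixel example: every pixel is off by one gray level.
print(psnr([10, 20, 30, 40], [11, 21, 31, 41]))
```

Since PSNR depends only on per-pixel error, two outputs with the same MSE get the same score regardless of how the error looks, which is the usual argument for preferring a perceptual metric such as LPIPS.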

I have decided to completely abandon rankings based on the SSIM metric. Below are the main reasons for this decision, ranked from the most important to the least important.

The main reason is the following quote, which I found in a paper by researchers at Adobe Research: ¹⁴. In the quote they refer to a paper by researchers at NVIDIA: [arXiv].

We limit the evaluation herein to the PSNR metric since SSIM [57] is subject to unexpected and unintuitive results [39].
The second reason is that more and more papers report PSNR scores without SSIM, for example ²¹. A model from such a paper would appear in the PSNR-based ranking but not in the SSIM-based ranking, which may give the misleading impression that its SSIM score is too poor to reach the ranking eligibility threshold, when the paper simply reports no SSIM score.
The third reason is that the SSIM scores of individual models are often very close to each other or identical. This is the case in the SNU-FILM Easy test, as shown in Table 3: [CVPR 2023], where as many as 6 models achieve the same score of 0.991 and as many as 5 models achieve the same score of 0.990. In the same test, PSNR reported with the same number of significant digits makes it easier to determine the order of the ranking.
The fourth reason is that PSNR-based rankings are only ancillary when a model does not have an LPIPS score. For this reason, SSIM rankings do not add value to my repository and only reduce its readability.
The fifth reason is that I want to encourage researchers who want to use only two metrics in their paper to use LPIPS and PSNR instead of PSNR and SSIM.
The sixth reason is that the time saved by dropping the SSIM-based rankings will allow me to add new rankings based on other test data, which will be more useful and valuable.

Appendix 4: List of all research papers from the above rankings

Method	Abbr.	Paper	Official repository
BiM-VFI	-	BiM-VFI: Bidirectional Motion Field-Guided Frame Interpolation for Video with Non-uniform Motions
BiT	-	Blur Interpolation Transformer for Real-World Motion from Blur
CBBD	-	Frame Interpolation with Consecutive Brownian Bridge Diffusion
CtxSyn	-	Context-aware Synthesis for Video Frame Interpolation	-
DeMFI	-	DeMFI: Deep Joint Deblurring and Multi-Frame Interpolation with Flow-Guided Attentive Correlation and Recursive Boosting
DvP	-	Dual-view Pyramid Network for Video Frame Interpolation	-
EDEN	-	EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation
EDSC	-	Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution
EMA-VFI	-	Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation
FILM	-	FILM: Frame Interpolation for Large Motion
GIMM-VFI	-	Generalizable Implicit Motion Modeling for Video Frame Interpolation
HFD	-	Hierarchical Flow Diffusion for Efficient Frame Interpolation	-
InterpAny-Clearer	IA-Clearer	Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame Interpolation
M2M	-	Many-to-many Splatting for Efficient Video Frame Interpolation
MoMo	-	Disentangled Motion Modeling for Video Frame Interpolation
PerVFI	-	Perception-Oriented Video Frame Interpolation via Asymmetric Blending
PRF	-	Video Frame Interpolation and Enhancement via Pyramid Recurrent Framework
SepConv	-	Video Frame Interpolation via Adaptive Separable Convolution
UPR-Net	-	A Unified Pyramid Recurrent Network for Video Frame Interpolation
UGFI	-	Frame Interpolation Transformer and Uncertainty Guidance	-
XVFI	-	XVFI: eXtreme Video Frame Interpolation

Method	Paper	Venue
ABME
AdaFNIO
AMT
CDFI
DBVI
DQBC
DRVI
EAFI	Error-Aware Spatial Ensembles for Video Frame Interpolation
EBME
EDC	Enhancing Deformable Convolution based Video Frame Interpolation with Coarse-to-fine 3D CNN
EDENVFI
FGDCN
FLAVR	FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation
HiFI	High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion
HRFFM	Video Frame Interpolation with Region-Distinguishable Priors from SAM
H-VFI
IFRNet	IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation
IQ-VFI	IQ-VFI: Implicit Quadratic Motion Estimation for Video Frame Interpolation
JNMR
LADDER	LADDER: An Efficient Framework for Video Frame Interpolation
MA-GCSPA	Exploring Motion Ambiguity and Alignment for High-Quality Video Frame Interpolation
NCM
ProBoost-Net	Progressive Motion Boosting for Video Frame Interpolation
RIFE	Real-Time Intermediate Flow Estimation for Video Frame Interpolation
RN-VFI	Range-nullspace Video Frame Interpolation with Focalized Motion Estimation
SoftSplat	Softmax Splatting for Video Frame Interpolation
SSR	Video Frame Interpolation with Many-to-many Splatting and Spatial Selective Refinement
ST-MFNet
Swin-VFI	Video Frame Interpolation for Polarization via Swin-Transformer
TDPNet	Textural Detail Preservation Network for Video Frame Interpolation
TTVFI
VFIformer	Video Frame Interpolation with Transformer
VFIFT	Video Frame Interpolation with Flow Transformer
VFIMamba	VFIMamba: Video Frame Interpolation with State Space Models
VFIT	Video Frame Interpolation Transformer
VRT	VRT: A Video Restoration Transformer

1. AdaPool: Exponential Adaptive Pooling for Information-Retaining Downsampling [TIP 2022] [arXiv]
2. CDFI: Compression-Driven Network Design for Frame Interpolation [CVPR 2021] [arXiv]
3. Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution [TPAMI 2021] [arXiv]
4. DRVI: Dual Refinement for Video Interpolation [Access 2021]
5. Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation [CVPR 2023] [arXiv]
6. Video Frame Interpolation with Densely Queried Bilateral Correlation [IJCAI 2023] [arXiv]
7. TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation [TIP 2023] [arXiv]
8. AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation [CVPR 2023] [arXiv]
9. AdaFNIO: Adaptive Fourier Neural Interpolation Operator for video frame interpolation [arXiv]
10. Flow Guidance Deformable Compensation Network for Video Frame Interpolation [TMM 2023] [arXiv]
11. A Unified Pyramid Recurrent Network for Video Frame Interpolation [CVPR 2023] [arXiv]
12. Error-Aware Spatial Ensembles for Video Frame Interpolation [arXiv]
13. H-VFI: Hierarchical Frame Interpolation for Videos with Large Motions [arXiv]
14. Revisiting Adaptive Convolutions for Video Frame Interpolation [WACV 2021] [arXiv]
15. Softmax Splatting for Video Frame Interpolation [CVPR 2020] [arXiv]
16. Neighbor Correspondence Matching for Flow-based Video Frame Synthesis [MM 2022] [arXiv]
17. IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation [CVPR 2022] [arXiv]
18. Enhanced Bi-directional Motion Estimation for Video Frame Interpolation [WACV 2023] [arXiv]
19. Asymmetric Bilateral Motion Estimation for Video Frame Interpolation [ICCV 2021] [arXiv]
20. JNMR: Joint Non-linear Motion Regression for Video Frame Interpolation [TIP 2023] [arXiv]
21. Efficient Convolution and Transformer-Based Network for Video Frame Interpolation [ICIP 2023] [arXiv]
22. ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation [CVPR 2022] [arXiv]
23. Deep Bayesian Video Frame Interpolation [ECCV 2022]
24. Enhancing Deformable Convolution based Video Frame Interpolation with Coarse-to-fine 3D CNN [ICIP 2022] [arXiv]
