This code uses geometric measures to evaluate feature subsets and selects an optimal feature subset based on them.
The feature subset evaluation implements the method introduced in the paper 'Feature selection based on geometric distance for high-dimensional data.'
This is a hybrid feature selection method that combines the filter and wrapper approaches.
Therefore, even if you do not apply this technique as is, it is a good reference for anyone who wants to implement a hybrid method.
It can be installed with 'pip install gdbfs'.
Please refer to the repository 'https://github.com/seo-young-kim/GDBFS_deploy' for details.
This page provides a brief description of the technique.
Reference: Lee, J. H., & Oh, S. Y. (2016). Feature selection based on geometric distance for high-dimensional data. Electronics Letters, 52(6), 473-475.
This technique uses the distance between classification classes, so it can only be applied to classification problems.
The measure for a feature subset is the product of two geometric distance terms.
The first is the distance between classes in the corresponding feature subspace.
As shown in Figure 1, the larger the distance between class centers and the smaller the intra-class variance, the easier the classes are to separate.
Therefore, (distance between class centers) - (intra-class variance) is used as the first measure.
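As a rough sketch of this first term, the helper below computes the mean pairwise distance between class centroids minus the mean intra-class variance on a candidate feature subspace. The function name class_separation and the exact averaging are illustrative assumptions, not the paper's exact formulation.

```python
import numpy as np

def class_separation(X, y):
    """First geometric term (a sketch, not the paper's exact formula):
    mean pairwise distance between class centroids minus the mean
    intra-class variance, evaluated on the feature subspace X."""
    classes = np.unique(y)
    centroids = np.array([X[y == c].mean(axis=0) for c in classes])
    # Average Euclidean distance between every pair of class centroids.
    inter = np.mean([np.linalg.norm(a - b)
                     for i, a in enumerate(centroids)
                     for b in centroids[i + 1:]])
    # Average total variance of the samples around their own class mean.
    intra = np.mean([X[y == c].var(axis=0).sum() for c in classes])
    return inter - intra
```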
The second is the evenness of the distances between classes: the more evenly the classes are spaced in the feature subspace, the easier classification is expected to be.
Therefore, distance evenness is used as the second measure.
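The evenness term can be sketched similarly. Building on the class_separation helper above, the code below uses the ratio of the smallest to the largest pairwise centroid distance as a hypothetical evenness measure, and multiplies the two terms to score a subset; the paper defines its own evenness formula, so treat this as an approximation.

```python
def distance_evenness(X, y):
    """Second geometric term (an assumed formulation): how evenly the
    class centroids are spaced, equal to 1.0 when all pairwise centroid
    distances are identical and smaller as they become unbalanced."""
    classes = np.unique(y)
    centroids = np.array([X[y == c].mean(axis=0) for c in classes])
    dists = np.array([np.linalg.norm(a - b)
                      for i, a in enumerate(centroids)
                      for b in centroids[i + 1:]])
    return dists.min() / dists.max()

def gdbfs_score(X, y):
    # The subset measure is the product of the two geometric terms.
    return class_separation(X, y) * distance_evenness(X, y)
```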
Figure 2. Comparison of the inter-class distance of the class distributions and the distance evenness. From: Lee, J. H., & Oh, S. Y. (2016). Electronics Letters, 52(6), 473-475.
Through sequential forward selection, the feature subset with the maximum GDBFS value is selected, as sketched in the example after Figure 3.
Figure 3. Sequential forward selection. From: https://www.cc.gatech.edu/~bboots3/CS4641-Fall2018/Lecture16/16_FeatureSelection.pdf
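Continuing from the sketches above, a minimal greedy sequential forward selection loop on top of gdbfs_score might look like this; the n_features parameter and the simple stopping rule are assumptions for illustration, not taken from the paper or the package.

```python
def sequential_forward_selection(X, y, n_features):
    """Greedy SFS: start from an empty subset and repeatedly add the
    single feature whose inclusion gives the highest gdbfs_score."""
    selected = []
    remaining = list(range(X.shape[1]))
    while len(selected) < n_features:
        best_feat, best_score = None, -np.inf
        for f in remaining:
            score = gdbfs_score(X[:, selected + [f]], y)
            if score > best_score:
                best_feat, best_score = f, score
        selected.append(best_feat)
        remaining.remove(best_feat)
    return selected
```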