Many multidimensional/multimodal data sets contain continuous features that are collinear, correlated, or otherwise associated. The goal of spatial transformations is to find a set of [latent variables](https://en.wikipedia.org/wiki/Latent_and_observable_variables) with minimal correlation, so that downstream data analysis is simplified. Common data-transformation approaches include statistically driven methods such as [principal component analysis](https://en.wikipedia.org/wiki/Principal_component_analysis) (PCA), [exploratory factor analysis](https://en.wikipedia.org/wiki/Exploratory_factor_analysis) (EFA), and [canonical-correlation analysis](https://en.wikipedia.org/wiki/Canonical_correlation) (CCA). An algorithmic alternative to these statistical approaches is Iterative Decorrelation Analysis (IDeA). The main advantage of the iterative approach is that it is driven by specific requirements on the generated output. The specific requirements are:
1. All output variables $Q=(q_1,\dots,q_n)$ have a parent input variable in $X=(x_1,\dots,x_n)$ (see Fig. 1).
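The first requirement can be pictured with a minimal NumPy sketch (an illustration of the idea, not the FRESA.CAD implementation): a child feature is residualized against its parent, so the parent passes through unchanged while the pairwise correlation is removed.

```python
# Minimal sketch (NumPy, not FRESA.CAD): remove the correlation between a
# parent feature x1 and its child x2 by residualization, keeping the parent.
import numpy as np

rng = np.random.default_rng(0)
x1 = rng.normal(size=500)
x2 = 0.8 * x1 + 0.2 * rng.normal(size=500)  # child, strongly correlated

x1c = x1 - x1.mean()
x2c = x2 - x2.mean()
beta = np.dot(x1c, x2c) / np.dot(x1c, x1c)  # least-squares coefficient

q1 = x1              # the parent input passes through unchanged
q2 = x2 - beta * x1  # child with the parent's contribution removed

print(np.corrcoef(x1, x2)[0, 1])  # strongly correlated inputs
print(np.corrcoef(q1, q2)[0, 1])  # near zero after the transform
```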
```r
library("FRESA.CAD")

data('iris')

## IDeA decorrelation at 0.25 threshold, Pearson and fast estimation
irisDecor <- IDeA(iris, thr = 0.25)
```

### Print the latent variables

```r
print(getLatentCoefficients(irisDecor))
```
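As a sanity check on why a 0.25 threshold is active for iris, the raw feature correlations can be inspected directly (here with NumPy and scikit-learn's copy of the iris data; this is an illustration, not part of FRESA.CAD):

```python
# Inspect iris feature correlations against the 0.25 threshold used above.
import numpy as np
from sklearn.datasets import load_iris

X = load_iris().data                  # 150 x 4 numeric feature matrix
corr = np.corrcoef(X, rowvar=False)   # 4 x 4 feature correlation matrix

iu = np.triu_indices(4, k=1)          # the six distinct feature pairs
print(np.round(corr[iu], 2))
print(int(np.sum(np.abs(corr[iu]) > 0.25)))  # pairs above the threshold
```

Most of the iris feature pairs exceed the threshold, so the decorrelation step has real work to do on this data set.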
This repository shows some examples of the **FRESA.CAD::IDeA()** function and related FRESA.CAD decorrelation functions.

**irisexample.R** showcases the effect of the IDeA algorithm on the iris data set; the repository includes an example of its output.
**ParkisonAnalysis_TrainTest.Rmd** is a demo that shows the use of UPSTM and BSWiMS to gain insight into the features associated with a relevant outcome. It highlights the process and functions that help authors discern and statistically describe the features associated with a specific outcome.
# Effect of UPSTM-Based Decorrelation on Feature Discovery
Here I showcase how to use the BSWiMS feature selection/modeling function coupled with the Goal Driven Sparse Transformation Matrix (UPSTM) as a pre-processing step to decorrelate highly correlated features. The aims are:
1. To improve model performance by uncovering the hidden information between correlated features.
2. To simplify the interpretation of the machine learning models.
This demo will use:
- *FRESA.CAD::IDeA()*. For the decorrelation of multidimensional data sets.
- *FRESA.CAD::getDerivedCoefficients()*. For the extraction of the models of the newly discovered decorrelated features.
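The coefficient extraction can be pictured with a toy example (plain Python; the names and coefficients below are made up for illustration and are not FRESA.CAD output): each decorrelated feature is a sparse linear combination of the original features, so its "model" is a small coefficient map that can be re-applied to any data row.

```python
# Toy coefficient map for one hypothetical latent feature:
# latent = 1.0 * petal_len - 0.9 * petal_wid  (made-up values)
coeffs = {"petal_len": 1.0, "petal_wid": -0.9}

row = {"sepal_len": 5.1, "petal_len": 1.4, "petal_wid": 0.2}
latent = sum(w * row[name] for name, w in coeffs.items())
print(round(latent, 2))  # 1.4 - 0.9 * 0.2 = 1.22
```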
#### Decorrelation: Training and Testing Sets Creation
I compute a decorrelated version of the training and testing sets using the *IDeA()* function of FRESA.CAD. The first decorrelation is driven by features associated with the outcome; the second finds the UPSTM without the outcome restriction.
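The important discipline here is that the transformation is estimated on the training set only and then applied unchanged to the testing set, so no test information leaks into the transform. A minimal NumPy sketch of that pattern (not the FRESA.CAD internals):

```python
# Fit the decorrelation coefficient on training data, reuse it on test data.
import numpy as np

rng = np.random.default_rng(1)
x_tr = rng.normal(size=(200, 1))
y_tr = 0.7 * x_tr + 0.3 * rng.normal(size=(200, 1))
x_te = rng.normal(size=(50, 1))
y_te = 0.7 * x_te + 0.3 * rng.normal(size=(50, 1))

# Coefficient estimated from the training set only.
beta = np.linalg.lstsq(x_tr, y_tr, rcond=None)[0][0, 0]

y_tr_dec = y_tr - beta * x_tr  # decorrelated training feature
y_te_dec = y_te - beta * x_te  # same fixed transform on the test set

print(abs(np.corrcoef(x_te.ravel(), y_te.ravel())[0, 1]))      # before
print(abs(np.corrcoef(x_te.ravel(), y_te_dec.ravel())[0, 1]))  # after
```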
# Effect of UPSTM-Based Decorrelation on Feature Discovery: The DARWIN Evaluation
Here I showcase how to use the BSWiMS feature selection/modeling function coupled with the Goal Driven Sparse Transformation Matrix (UPSTM) as a pre-processing step to decorrelate highly correlated features. The aims are:
1. To improve model performance by uncovering the hidden information between correlated features.
2. To simplify the interpretation of the machine learning models.
This demo will use:
- FRESA.CAD::IDeA(). For the decorrelation of multidimensional data sets.
- FRESA.CAD::getDerivedCoefficients(). For the extraction of the models of the newly discovered decorrelated features.
#### Decorrelation: Training and Testing Sets Creation
I compute a decorrelated version of the training and testing sets using the *IDeA()* function of FRESA.CAD. The first decorrelation is driven by features associated with the outcome; the second finds the UPSTM without the outcome restriction.