Statistics-L12023P1STAGE3-tests-for-ordinal-and-nominal-data-python-and-R

STAGE 3 Statistical tests

PROJECT DESCRIPTION This worksheet combines statistical tests and effect size functionalities (inbuilt or personal) to demo results next to each other for better inferencing.

Usually tests are run solo, and one might forget other important statements to add to interpret tests results better. This project is an attempt to combine such insight into one place to deliver wholesome inference on data

STAGE 1 - CHECKED - Descriptive data exploration (basically descriptive stats and general over view. First glace into the ether, trying to make sense) STAGE 2 - CHECKED - Comprehensive data exploration, nailing relationships, trends and correlations where some sense emerges STAGE 3 - this is where we are

The data set that has both ordinal and nominal DVs and ordinal and nominal IVs. Doing this tasks felt like code waltzes through the statistical realm after 3 shots of Absinthe. Most brain wracking went into test selection and running others for basic comparison.

METADATA EXPLANATION

RAW DATA - 20 Ordinal DVs, 36 Nominal DVs, 4 Ordinal IVs, 1 Nominal IV
DATA - ran a data wrangling python script to generate 1. Ordinal.xlsx, with combination of each Ordinal DV to each IV (each sheet with one combination) 2. Nominal.xlsx, with combination of each Nominal DV to each IV (each sheet with one combination) 3. Added ranking columns whilst generating the work-ready data This helped split the scripts vastly related to tests for either Ordinal or Nominal data This helped reduce data cleaning and wrangling code for further processes
SampleData - Sample datasheet shared. Column A in SampleData is Independent variable and Column B is dependent variable with responses. The Headers are decoded, that is why they have values like 7-40-1 etc.
Ratings in SampleData are conversions from text based input to numbered inputs

TEST SELECTION

Goodness of fit test - With any survey on a sample, must check if sample aptly represents the population. This can be carried out using

'Goodness of fit' tests like Chi2 or
Plotting sample distribution vs Population (if sample size is too small). Some plots attached as examples.

IV to DV | Group size

Ordinal to Ordinal | > 2, > 2 -> Linear to Linear (Ordinal Chi2 test), Kendalltau, Jonckheere–Terpstra, Cuzick tests

Ordinal to Nominal | > 2, = 2 -> Cochran–Armitage test for trend

Nominal to Ordinal | > 2, > 2 -> Kruskal Wallis (works on Ordinal IV too , as treats it as Nominal anyway)

Nominal to Nominal | > 2, = 2 -> Chi2

EFFECT SIZE

Ordinal - Ordinal -> Kendall tau -> tau [-1 to 1] (b and c) or Kruskal's gamma

Ordinal - Nominal -> Kruskal Wallis -> epsilon-squared [0 to 1]

Ordinal - Nominal -> Freeman’s theta [0 to 1] or epsilon-squared [0 to 1] or eta-squared (more biased) [0 to 1]

Nominal - Nominal -> Chi2 -> Cramver V [0 to 1] or phi

CODE AND INSTALLATION DESCRIPTION

Python - for quicker statistical tests and data wrangling
R - for more advanced statistical tests not available in python. Somethings are better left to R
Installation - I worked on VS Code IDE, Python 3.10.10, RStudio, R 4.3, Jupyter Notebook
Installation - Refer requirement files to install dependencies

CODE RUN

Sample data provided, ready for the script

COMMUNITY CONTRIBUTION

Thank you for considering contributing to this statistics project! I appreciate your interest in helping to improve and grow.

Ways to Contribute There are several ways you can contribute to the project:

Bug Reports: If you come across any bugs, errors, or unexpected behavior, please report them by opening an issue. Include as much detail as possible, such as steps to reproduce the issue and any error messages or relevant information.

Feature Requests: If you have ideas for new features or enhancements, we welcome feature requests. Open an issue and describe the feature or improvement you would like to see. Explain why it would be valuable and how it aligns with the project's scope.

Documentation: Will endevour to add one in near future. If you have any ideas or areas that need improvement, please contribute by adding the documentation. You can submit a pull request with your proposed changes.

Code Contributions: If you're interested in improving the project's codebase, you can contribute by fixing bugs, optimizing performance, or implementing new features. Fork the repository, make your changes in a new branch, and submit a pull request with a clear description of the changes and their purpose.

GETTING STARTED To get started with contributing, follow these steps:

Fork the repository to your own GitHub account.
Clone the forked repository to your local machine.
Create a new branch for your contributions.
Make the necessary changes and additions.
Test your changes thoroughly.
Commit your changes with descriptive commit messages.
Push the changes to your forked repository.
Submit a pull request to the main repository.

Truely appreciate your time and effort in contributing to the project. The team will review your contributions as soon as possible and provide feedback or merge them into the main project.

Once again, thank you for your contributions and for helping us make this statistics project even better!

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
CodeJsonVar		CodeJsonVar
CodeOrdinalTest.R		CodeOrdinalTest.R
CodeStatisticalTest.ipynb		CodeStatisticalTest.ipynb
Plot GoF age subgroups in each gender.png		Plot GoF age subgroups in each gender.png
Plot GoF age.png		Plot GoF age.png
Plot GoF gender.png		Plot GoF gender.png
README.md		README.md
SampleData.xlsx		SampleData.xlsx
SampleDataNominal.xlsx		SampleDataNominal.xlsx
SampleDataOrdinal.xlsx		SampleDataOrdinal.xlsx
SampleDataPopulationRepresentativeness.xlsx		SampleDataPopulationRepresentativeness.xlsx
SampleResults.xlsx		SampleResults.xlsx
ordinalTest.R		ordinalTest.R
requirement.txt		requirement.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Statistics-L12023P1STAGE3-tests-for-ordinal-and-nominal-data-python-and-R

About

Uh oh!

Releases

Packages

Uh oh!

Languages

RDB-bit/Statistics-L12023P1STAGE3-tests-for-ordinal-and-nominal-data-python-and-R-

Folders and files

Latest commit

History

Repository files navigation

Statistics-L12023P1STAGE3-tests-for-ordinal-and-nominal-data-python-and-R

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages