Skip to content

๐Ÿง  Case study on data preprocessing and behavioral analysis of TechnoMagicLand visitors. Includes clustering, correlation, and visualization in R, with focus on identifying repeat visitors and improving engagement strategies.

License

Notifications You must be signed in to change notification settings

SergeyFilipov/tml-visitor-behavior-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

8 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿง  tml-visitor-behavior-analysis

A case study in data preprocessing and behavioral analysis of real visitor interactions from TechnoMagicLand (TML).
The project leverages clustering, correlation, and statistical visualization in R to uncover key usage patterns and optimize user engagement.


๐Ÿงพ Project Scope

This project follows a full CRISP-DM cycle for behavior-driven analytics based on museum interaction logs.

๐Ÿงช Key Objectives

  • Clean and prepare real-world visitor data
  • Identify high-value vs casual users
  • Understand exhibit engagement patterns
  • Cluster users based on time, visits, and interaction profiles
  • Visualize behavioral and statistical relationships

๐Ÿงฎ Code execution is available in the R script:
โžก๏ธ all-tables-preprocessed_final_1.R


๐Ÿงฐ Tools & Technologies

  • Language: R
  • Libraries: dplyr, ggplot2, tidyr, cluster, factoextra, corrplot
  • Clustering: k-means with elbow method
  • Visualization: Histograms, scatter plots, boxplots, heatmaps
  • Data source: Interaction logs from TechnoMagicLand

๐Ÿ“Š Visual Outputs

The analysis includes a variety of visualizations available in the repository under the PNG files section. These include:

  • Boxplots showing the distribution of total points and interaction durations
  • Histograms for visitor scores, durations, and behavioral segments
  • Bar charts ranking exhibits by average time, total interactions, and difficulty
  • Correlation heatmaps for both exhibit co-occurrence and user metrics
  • K-Means clustering outputs with elbow method validation and scatter plots
  • Visitor segmentation visuals (e.g. top users by points, visits, or consistency)

All visualizations can be found in the /figures or root folder as .png files with descriptive filenames.


๐Ÿ” Key Findings

  • ๐Ÿงโ€โ™‚๏ธ Most visitors only visit once, but some accumulate many points
  • ๐Ÿง  High repeat users tend to interact with more exhibits on average
  • ๐ŸŽฏ Certain exhibits have consistently high interaction time but low scores, indicating high difficulty or engagement
  • ๐Ÿ“‰ Many visitors drop off after only 1โ€“2 interactions
  • ๐Ÿค– Clustering identified meaningful segments (e.g. speedrunners vs explorers)

๐Ÿ“ Repository Structure

๐Ÿ“ฆ tml-visitor-behavior-analysis โ”œโ”€โ”€ ๐Ÿ“Š PNG visualizations โ”œโ”€โ”€ ๐Ÿ“„ all-tables-preprocessed_final_1.R โ”œโ”€โ”€ ๐Ÿ“˜ README.md โ”œโ”€โ”€ ๐Ÿ“‘ LICENSE (MIT) โ”œโ”€โ”€ ๐Ÿ“• Report PDF (Technical Summary)


๐Ÿง  Topics

r, clustering, data-preprocessing, data-visualization,
visitor-behavior, user-segmentation, education-project, technomagicland


๐Ÿ“œ License

This project is licensed under the MIT License โ€“ see LICENSE for details.


๐Ÿ™Œ Credits

Conducted as part of a data science and education analytics project based on real museum interaction data.
Developed by Sergey Filipov

About

๐Ÿง  Case study on data preprocessing and behavioral analysis of TechnoMagicLand visitors. Includes clustering, correlation, and visualization in R, with focus on identifying repeat visitors and improving engagement strategies.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages