A case study in data preprocessing and behavioral analysis of real visitor interactions from TechnoMagicLand (TML).
The project leverages clustering, correlation, and statistical visualization in R to uncover key usage patterns and optimize user engagement.
This project follows a full CRISP-DM cycle for behavior-driven analytics based on museum interaction logs.
- Clean and prepare real-world visitor data
- Identify high-value vs casual users
- Understand exhibit engagement patterns
- Cluster users based on time, visits, and interaction profiles
- Visualize behavioral and statistical relationships
The full analysis code is in the R script all-tables-preprocessed_final_1.R.
- Language: R
- Libraries: dplyr, ggplot2, tidyr, cluster, factoextra, corrplot
- Clustering: k-means with elbow method
- Visualization: Histograms, scatter plots, boxplots, heatmaps
- Data source: Interaction logs from TechnoMagicLand
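As a rough illustration of k-means with the elbow method, the sketch below uses only base R (`kmeans` from the `stats` package) on synthetic data. The column names `total_time`, `n_visits`, and `n_exhibits` are hypothetical and are not taken from the TML logs; the project script may structure this step differently.

```r
# Synthetic per-visitor features (hypothetical columns, not the real TML schema)
set.seed(42)
visitors <- data.frame(
  total_time = c(rnorm(50, mean = 10, sd = 2), rnorm(50, mean = 40, sd = 5)),
  n_visits   = c(rpois(50, 1) + 1, rpois(50, 5) + 1),
  n_exhibits = c(rpois(50, 3) + 1, rpois(50, 12) + 1)
)

# Standardize so that no single feature dominates the Euclidean distance
features <- scale(visitors)

# Elbow method: total within-cluster sum of squares for k = 1..8
k_values <- 1:8
wss <- sapply(k_values, function(k) {
  kmeans(features, centers = k, nstart = 25)$tot.withinss
})

# Look for the k after which the curve flattens
plot(k_values, wss, type = "b",
     xlab = "Number of clusters k",
     ylab = "Total within-cluster sum of squares")

# Fit the final model; k = 2 is an assumption that matches this synthetic data
fit <- kmeans(features, centers = 2, nstart = 25)
visitors$cluster <- fit$cluster
```

With factoextra installed, `fviz_nbclust(features, kmeans, method = "wss")` should produce a comparable elbow plot.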
The analysis includes a variety of visualizations available in the repository under the PNG files section. These include:
- Boxplots showing the distribution of total points and interaction durations
- Histograms for visitor scores, durations, and behavioral segments
- Bar charts ranking exhibits by average time, total interactions, and difficulty
- Correlation heatmaps for both exhibit co-occurrence and user metrics
- K-Means clustering outputs with elbow method validation and scatter plots
- Visitor segmentation visuals (e.g. top users by points, visits, or consistency)
All visualizations can be found in the /figures or root folder as .png files with descriptive filenames.
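For readers who want to reproduce something like the user-metric correlation heatmap, here is a minimal base R sketch on synthetic data. The metric names (`n_visits`, `total_points`, `avg_duration`) are assumptions for illustration, and the repository's figures may have been produced differently, for example with corrplot.

```r
# Synthetic user metrics (hypothetical columns, not the real TML data)
set.seed(1)
n <- 200
n_visits <- rpois(n, 2) + 1
metrics <- data.frame(
  n_visits     = n_visits,
  total_points = n_visits * 50 + rnorm(n, mean = 0, sd = 20),  # tied to visits
  avg_duration = rnorm(n, mean = 120, sd = 30)                 # independent
)

# Pairwise Pearson correlations between the metrics
cor_mat <- cor(metrics, method = "pearson")

# Draw the matrix as a heatmap without reordering or rescaling
heatmap(cor_mat, Rowv = NA, Colv = NA, scale = "none",
        symm = TRUE, margins = c(8, 8))
```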
- Most visitors only visit once, but some accumulate many points
- High repeat users tend to interact with more exhibits on average
- Certain exhibits have consistently high interaction time but low scores, indicating high difficulty or engagement
- Many visitors drop off after only 1–2 interactions
- Clustering identified meaningful segments (e.g. speedrunners vs explorers)
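The comparison between single-visit and repeat visitors can be sketched as follows. This is a toy example in base R: the log layout (`visitor_id`, `visit_date`, `exhibit_id`) and the "more than one visit day" definition of a repeat visitor are assumptions, not the project's actual schema or thresholds.

```r
# Hypothetical interaction log: one row per exhibit interaction
logs <- data.frame(
  visitor_id = c("a", "a", "a", "b", "c", "c", "c", "c", "d"),
  visit_date = as.Date(c("2024-01-05", "2024-01-05", "2024-02-10",
                         "2024-01-07", "2024-01-08", "2024-02-01",
                         "2024-02-01", "2024-03-15", "2024-01-09")),
  exhibit_id = c("e1", "e2", "e3", "e1", "e1", "e2", "e3", "e4", "e2"),
  stringsAsFactors = FALSE
)

count_unique <- function(x) length(unique(x))

# Distinct visit days and distinct exhibits per visitor
n_visits   <- tapply(logs$visit_date, logs$visitor_id, count_unique)
n_exhibits <- tapply(logs$exhibit_id, logs$visitor_id, count_unique)

per_visitor <- data.frame(
  visitor_id = names(n_visits),
  n_visits   = as.integer(n_visits),
  n_exhibits = as.integer(n_exhibits)
)

# Label visitors with more than one visit day as repeat visitors
per_visitor$segment <- ifelse(per_visitor$n_visits > 1, "repeat", "single")

# Average number of distinct exhibits per segment
summary_by_segment <- aggregate(n_exhibits ~ segment, data = per_visitor, FUN = mean)
print(summary_by_segment)
```

In this toy data the repeat visitors average 3.5 distinct exhibits and the single-visit visitors average 1; the real figures depend on the TML logs.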
tml-visitor-behavior-analysis
- PNG visualizations
- all-tables-preprocessed_final_1.R
- README.md
- LICENSE (MIT)
- Report PDF (Technical Summary)
r, clustering, data-preprocessing, data-visualization, visitor-behavior, user-segmentation, education-project, technomagicland
This project is licensed under the MIT License; see LICENSE for details.
Conducted as part of a data science and education analytics project based on real museum interaction data.
Developed by Sergey Filipov