Seoul National University Big Data Institute 4th Industrial Revolution Academy Business Analytics Course Portfolio (2017.06 ~ 2018.02)
Introduction to the Education Program
- Acquiring insurance domain knowledge through interviews with key sales channel managers (FC, GA, TA)
- Creating variables from settlement management data to build analysis tables
- Selecting significant variables (indicators) through regression analysis and performing clustering based on them (using R)
- Sharing analysis results with operational practitioners and proposing management approaches tailored to each cluster's characteristics
- Preprocessing the given bill proposal data, consisting of individual sponsorship, co-sponsorship, co-sponsor names, and each member's party affiliation (anonymized), into "distance data" between members
- Clustering using k-means and hierarchical clustering algorithms (unsupervised learning)
- Tuning the clustering parameters by comparing the resulting clusters with the 2016 political situation
- Crawling blog posts using the Naver Open API service
- Text data preprocessing using R packages (tm, KoNLP)
- Identifying correlations among key keywords related to snack products using the Graphical Lasso algorithm
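
The bill co-sponsorship preprocessing listed above (turning sponsorship records into "distance data" between members) can be illustrated with a minimal sketch. The original work was done in R and its exact distance definition is not stated here, so this Python version assumes a Jaccard distance over the sets of bills each member signed. The function name `cosponsorship_distances` and the toy data are hypothetical.

```python
from itertools import combinations


def cosponsorship_distances(bills):
    """Return pairwise Jaccard distances between members.

    bills: dict mapping a bill id to the set of member ids that
    sponsored or co-sponsored it (assumed input layout).
    """
    # Collect, for every member, the set of bills they signed.
    signed = {}
    for bill_id, members in bills.items():
        for member in members:
            signed.setdefault(member, set()).add(bill_id)

    # Jaccard distance: 1 - |A ∩ B| / |A ∪ B| for each pair of members.
    distances = {}
    for a, b in combinations(sorted(signed), 2):
        shared = signed[a] & signed[b]
        either = signed[a] | signed[b]
        distances[(a, b)] = 1.0 - len(shared) / len(either)
    return distances


# Hypothetical toy data: four anonymized members, three bills.
bills = {
    "bill_1": {"A", "B"},
    "bill_2": {"A", "B", "C"},
    "bill_3": {"C", "D"},
}
distances = cosponsorship_distances(bills)
```

Members who always sign the same bills end up at distance 0 and members with no bills in common at distance 1. A matrix built this way can be fed to hierarchical clustering; k-means generally needs coordinates rather than pairwise distances, so an embedding step would be required first.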
* RFM exercise (Performance Comparison of Machine Learning Techniques, Variable Selection)
- Analyzing customer purchase data, including Recency, Frequency, and Monetary variables
- Predicting the likelihood that issuing a coupon induces a purchase
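
As a rough illustration of the Recency, Frequency, and Monetary variables, the sketch below derives them from raw transactions. It is written in Python under assumed inputs: the `(customer, date, amount)` tuple layout, the function name `rfm_table`, and the sample values are all hypothetical and are not taken from the course data.

```python
from datetime import date


def rfm_table(transactions, as_of):
    """Summarize transactions into per-customer RFM variables.

    transactions: iterable of (customer_id, purchase_date, amount).
    as_of: reference date used to measure recency in days.
    """
    latest, frequency, monetary = {}, {}, {}
    for customer, day, amount in transactions:
        # Recency is based on the most recent purchase date.
        if customer not in latest or day > latest[customer]:
            latest[customer] = day
        # Frequency counts purchases; Monetary sums the amounts spent.
        frequency[customer] = frequency.get(customer, 0) + 1
        monetary[customer] = monetary.get(customer, 0) + amount
    return {
        customer: {
            "recency": (as_of - latest[customer]).days,
            "frequency": frequency[customer],
            "monetary": monetary[customer],
        }
        for customer in latest
    }


# Hypothetical sample transactions.
transactions = [
    ("c1", date(2018, 1, 5), 12000),
    ("c1", date(2018, 1, 20), 8000),
    ("c2", date(2017, 12, 1), 30000),
]
rfm = rfm_table(transactions, as_of=date(2018, 1, 31))
```

The resulting three columns are the usual inputs to a coupon-response model; which classifiers were compared in the exercise is not specified in this portfolio.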