Emoji semantic change over time

Simple dashboard made in Dash.

Code and data for https://emoji-semantic-change.herokuapp.com/

Paper available at https://arxiv.org/abs/2105.00846

`change.csv`

Monthly semantic change score for emoji. The anchor point for calculating semantic change is usually 01/01/2012 (the first data point for such emoji will be 2012-02-01). For newer emoji, or those with missing data, the anchor point is the date of the blank line preceding the first non-blank line. For example, the anchor point for 🌼 is 2013-05-01.

The cluster assigned to each emoji (if there is one - not all emoji were clustered due to lack of data) is also given.

Semantic scores are provided in two formats: smoothed and z-normed. Smoothed scores are plotted in the dashboard, but z-normed scores are better for things like clustering. See paper for more details.

`year_list.csv`

For each emoji, for each year, shows the top 15 most similar words by cosine similarity.

This data is generated from Twitter content, so be prepared for terms you might find offensive!

`month_list.csv`

For each emoji, for each month, shows the top 15 most similar words by cosine similarity.

Monthly data is calculated using aggregate data across all years. So the top 15 most similar words for 🌀 in January is based on lemmatised word frequencies for January in 2012, 2013 and so on until 2018.

This data is generated from Twitter content, so be prepared for terms you might find offensive!

`nn_sim.csv`

For each emoji, in a given month, shows the mean cosine similarity to that emoji's 25 nearest neighbours, along with upper/lower bounds on 95% confidence intervals. (This data is not used by the dashboard and didn't make it into the paper!)

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
assets		assets
.gitignore		.gitignore
Procfile		Procfile
README.md		README.md
app.py		app.py
change.csv		change.csv
month_list.csv		month_list.csv
nn_sim.csv		nn_sim.csv
requirements.txt		requirements.txt
year_list.csv		year_list.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Emoji semantic change over time

`change.csv`

`year_list.csv`

`month_list.csv`

`nn_sim.csv`

About

Uh oh!

Releases

Packages

Languages

alexanderrobertson/emoji-semantic-change

Folders and files

Latest commit

History

Repository files navigation

Emoji semantic change over time

change.csv

year_list.csv

month_list.csv

nn_sim.csv

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

`change.csv`

`year_list.csv`

`month_list.csv`

`nn_sim.csv`

Packages