This project explores which kinds of Hacker News submissions spark the most conversation and when authors should publish questions to maximize engagement. Using a historical snapshot of posts from the popular tech forum, we compare comment activity on Ask HN (questions) versus Show HN (project showcases) and identify the peak posting hours for lively discussion.
hacker-news-engagement/
├── data/ # Raw dataset (CSV)
│ └── hacker_news.csv
├── notebook/ # Jupyter Notebook with full analysis
│ └── hacker-news-posts.ipynb
├── .gitignore # Lists ignored files for version control
├── pyproject.toml # Dependency and project metadata
├── .python-version # Defines Python version for virtual environments
├── uv.lock # Lockfile for reproducible installs
└── README.md # Project overview and setup instructions
The analysis relies on hacker_news.csv
, a subset of the Hacker News Posts dataset originally published on Kaggle. The CSV contains the submission title, author, Unix timestamp, number of comments, and other metadata for tens of thousands of posts between 2015 – 2017.
Note: To keep the repository lightweight, only the filtered CSV used in this notebook is tracked. For a full refresh you can download the latest dataset from Kaggle and place it in the
data/
folder with the same filename.
Launch the notebook directly in your browser via Binder (no installation required):
Follow these steps to reproduce the analysis locally.
git clone https://github.com/xjwllmws/hacker-news-engagement.git
cd hacker-news-engagement
uv venv # create a virtual environment with uv
uv pip install -r requirements.txt # or use pyproject.toml directly
jupyter notebook notebook/hacker-news-posts.ipynb
This repository is provided for educational purposes. The Hacker News dataset is redistributed under its original public‑domain terms; all analysis code and notebook content are released under the MIT License.