Add additional popular repositories to the corpus test, such as `rails` for better coverage of syntax and ability to catch regressions.