Without going through all your commits, I wouldn't have known how messy the CMUDict dataset is. Thanks for cleaning it up and releasing your updates!