Skip to content
View soodoku's full-sized avatar

Block or report soodoku

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
soodoku/README.md

Website Blog Stats

Metascience  Metascience: Tools to flag retracted articles in citations, research on how often retracted articles or articles with major errors are approvingly cited, tools for counting how often software is used based on replication files

Data for South Asia  Data, Research, and Tools Focused on South Asia: Parsed Electoral Rolls (~1B people), Local Election datasets with reservation status, Land records data, Ration data, Hindi-English transliteration tools, Effect of quotas in local elections, MNREGA

Online Safety  Online Safety: Domain content classification tools, Generative password models using real-world data, Research on data breach patterns (including politicians) and privacy

Geosensing  Geographically Distributed Data Collection: Tools for randomly sampling locations on streets, Heuristic route planning, Google Street View for assessing the quality of public infrastructure and demographics

Names  Names: ML tools for name standardization and cleaning, Name parsing algorithms, Ethnicity inference from names using voter registration data using deep learning

Pinned Loading

  1. appeler/ethnicolr Public

    Predict Race and Ethnicity Based on the Sequence of Characters in a Name

    Jupyter Notebook 242 65

  2. gojiplus/tuber Public

    🍠 Access YouTube from R

    R 185 54

  3. in-rolls/indicate Public

    transliterate hindi to english

    Jupyter Notebook 14 2

  4. recite/autosum Public

    Summarize Publications Automatically

    Python 37 10

  5. themains/password Public

    A password generator using an encoder-decoder model trained on ~881M passwords

    Jupyter Notebook 41 1

  6. notnews/archive_news_cc Public

    Closed Caption Transcripts of News Videos from archive.org 2014--2023

    HTML 47 4

Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.