Suggestion: augment pre-training with lichess open database #5

@linux-leo

See: https://database.lichess.org/#standard_games

Maybe use every nth game from the year 2013, before lichess grew in size, so the dataset covers a roughly equal number of games per month while still spanning a long period, and so fewer games need to be processed.
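
To make the idea concrete, here is a rough sketch of how the subsampling could work. It is only an illustration, not tested against the real dumps: the function names (`iter_pgn_games`, `every_nth`, `stride_for_month`) are made up for this example, and it assumes every game in the PGN text starts with an `[Event ...]` tag line. The monthly files are distributed as `.pgn.zst`, so they would have to be decompressed first (not shown here).

```python
from typing import Iterable, Iterator


def iter_pgn_games(lines: Iterable[str]) -> Iterator[str]:
    """Yield one game at a time from PGN text lines.

    Assumption: each game begins with a line starting with '[Event '.
    """
    current = []
    for line in lines:
        # A new [Event ...] tag closes the game collected so far.
        if line.startswith("[Event ") and current:
            yield "".join(current).strip() + "\n"
            current = []
        # Skip blank lines that appear before the first game.
        if line.strip() or current:
            current.append(line)
    if current:
        yield "".join(current).strip() + "\n"


def every_nth(games: Iterable[str], n: int) -> Iterator[str]:
    """Keep games at positions 0, n, 2n, ... and drop the rest."""
    if n < 1:
        raise ValueError("n must be at least 1")
    for index, game in enumerate(games):
        if index % n == 0:
            yield game


def stride_for_month(games_in_month: int, target_per_month: int) -> int:
    """Pick n so a month contributes roughly target_per_month games.

    Months with fewer games than the target are kept whole (n = 1).
    """
    if target_per_month < 1:
        raise ValueError("target_per_month must be at least 1")
    return max(1, games_in_month // target_per_month)
```

With something like this, each monthly file would get its own stride, e.g. `every_nth(iter_pgn_games(f), stride_for_month(count, target))`, where `count` is the number of games in that month and `target` is whatever per-month size turns out to be practical. Small early months stay complete and large later months get thinned out.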

PS: I'm happy to provide some compute for this project with my Google Colab Pro+ subscription :)
