Inquiry About Issues and Pull Requests Data in The Stack V2 Dataset #13

@xiuxiu

Hello,

I hope this message finds you well. I noticed that the recently released The Stack V2 dataset does not include issues and pull request data. I am interested in understanding whether there are any plans to incorporate this information in future releases.

Having access to issues and pull requests would significantly enhance the dataset's utility for research and analysis. Any updates or insights you could provide would be greatly appreciated.

Thank you for your work on this project!

NOTE: repo_licenses_s3 and commit_paris_files_s3 will be released later, and we recommend compiling your own sets for up-to-date information; those datasets are compiled in other parts of the SC2 data pipeline. opt_outs_dataset_name will not be released because it is confidential data, so you will need to compile such data yourself for your project. Please ask on the BigCode community general forums on Slack for more details.

Best regards,
xiuxiu
