Fetch data for 24 days to stay within quota #49
@hugovk Would it be possible to enable a GitHub cron job to generate the data every month automatically?
on:
  schedule:
    # * is a special character in YAML so you have to quote this string
    - cron: '0 0 1 * *'
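For reference, the expression `'0 0 1 * *'` fires at 00:00 on the 1st of every month, and GitHub Actions evaluates schedules in UTC. A minimal sketch of when the next run would fall, assuming a timezone-aware UTC timestamp as input; the helper name `next_monthly_run` is hypothetical and not part of this repository:

```python
from datetime import datetime, timezone


def next_monthly_run(now: datetime) -> datetime:
    """Return the next firing of '0 0 1 * *' strictly after `now` (UTC).

    Hypothetical helper for illustration only; `now` is assumed to be
    a timezone-aware datetime in UTC.
    """
    # The schedule fires at midnight on the 1st, so the next firing after
    # any moment in a month is the 1st of the following month.
    if now.month == 12:
        year, month = now.year + 1, 1
    else:
        year, month = now.year, now.month + 1
    return datetime(year, month, 1, tzinfo=timezone.utc)


if __name__ == "__main__":
    print(next_monthly_run(datetime.now(timezone.utc)).isoformat())
```

GitHub documents that scheduled workflows can be delayed under load, so the actual start time may be later than the computed one.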
It should account for the actually fetched values, because it's the result of the query sent. You can see a snippet in #42 which does more or less the same thing.
Yes, but there's already a cron running on Digital Ocean (see the README for details) that's meant to fetch the data each month automatically. Unfortunately the free quota is becoming too little and I need to adjust the amount fetched. About the 2025.06 data: that also used up too much quota and so didn't complete. I'd meant to merge #50 before the 1st June, but I was travelling. I'll have to do a manual run instead.
Done re: #50 (comment) -> https://github.com/hugovk/top-pypi-packages/releases/tag/2025.06
Thank you! Maybe time to start thinking if the magic number 15000 should be reduced to be able to show a full month.
5k or 8k or 15k or 600k doesn't make a difference!
There are so many "big data" tools; it's sad that for counting PyPI downloads no other solution has been chosen but BigQuery... Now that Microsoft is helping CPython development, why don't they store the download data / metadata? So many packages are abandoned, or incompatible with Python 3.13, or missing binary wheels for some platforms or architectures. Identifying those problems would be more important than counting downloads and should be provided for free to the community.
Is the compatibility with 3.13 so bad? https://pyreadiness.org/3.13/ shows 55% of the top 360 packages have declared compatibility by adding the 3.13 Trove classifier, but many more are nevertheless compatible but either don't use classifiers or haven't added/released yet. Are there any in particular you're missing?
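To make "declared compatibility" concrete: the 3.13 Trove classifier is the string `Programming Language :: Python :: 3.13` in a package's classifier list. A minimal sketch of that check, assuming the classifiers have already been obtained as a list of strings; the function name `declares_python_version` is hypothetical, and this is not necessarily how pyreadiness.org implements it:

```python
def declares_python_version(classifiers: list[str], version: str = "3.13") -> bool:
    """Return True if the classifier list declares support for `version`.

    Hypothetical helper for illustration only. A False result does not mean
    the package is incompatible, only that it has not declared support.
    """
    wanted = f"Programming Language :: Python :: {version}"
    # Exact match, so "3.1" does not accidentally match "3.13".
    return wanted in classifiers


if __name__ == "__main__":
    example = [
        "Programming Language :: Python :: 3",
        "Programming Language :: Python :: 3.12",
        "Programming Language :: Python :: 3.13",
    ]
    print(declares_python_version(example))
```

As noted above, a missing classifier only shows that support is undeclared, so a check like this undercounts the packages that actually work on 3.13.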
Dry runs: