CI: Make pep8 validation work with multiple files simultaneously #57914

dontgoto · 2024-03-19T18:26:07Z

The validation is bottlenecked due to the repeated sys calls, subprocess spawning, and pep8 calls that only validate a single file each. Running the validation on a list of files is approximately as fast as running it on a single file. This PR makes the first step, enabling the pep8 validation of a list of files, but for now still running it only on a single file. It will be followed by a few more PRs. The end result will be a complete refactoring of validate_docstrings.py, that will then take about 10 seconds to validate all docstrings.

All code checks passed.
Added type annotations to new arguments/methods/functions.

@datapythonista

The validation is bottlenecked due to the repeated sys calls, subprocess spawning, and pep8 calls that only validate a single file each. Running the validation on a list of files is approximately as fast as running it on a single file.

datapythonista · 2024-03-19T21:58:35Z

I'm travelling and I could only revew with the phone, but looks good.

Quick question, would it make sense to use ruff instead of flake8 for this?

dontgoto · 2024-03-19T22:28:18Z

Thanks for the quick review!

Quick question, would it make sense to use ruff instead of flake8 for this?

I think it would be slightly faster, but someone would have to configure ruff to behave the same way as we intend flake8 to behave here. On my machine, the whole code_check.sh docstrings validation finishes in about 10 seconds, 7s of those are separate calls to numpydoc.validate (I put in a PR there that brings this down to 5.5s), and the remaining 3s are split between our validate.pyand pep8.

So I think even with ruff we would at best get to 5.5s instead of 8.5s if we keep numpydoc.validate. Might still be worth an issue to discuss the (future) role of ruff here and in other places.

datapythonista · 2024-03-19T22:36:44Z

Yep, trying to improve 10s doesn't make sense. I know we are using ruff in the CI nowz bit I didn't check the exact state. The question was more about consistency, I didn't check if we already replaced flake8 with ruff.

dontgoto · 2024-03-19T23:05:16Z

I see ruff being used as part of the pre-commit hooks. And there it let a docstring error that I introduced in a non-replaced docstring through. It's also replaced flake8 for the docstring formatting, but back when I introduced my error, only the code_check.sh showed up as failed.

github-actions · 2024-04-25T00:05:30Z

This pull request is stale because it has been open for thirty days with no activity. Please update and respond to this comment if you're still interested in working on this.

mroeschke · 2024-05-08T16:55:46Z

Thanks for the PR but appears there not much appetite for this change so closing

dontgoto added 2 commits March 19, 2024 19:20

Make pep8 validation work with multiple files

495e46d

The validation is bottlenecked due to the repeated sys calls, subprocess spawning, and pep8 calls that only validate a single file each. Running the validation on a list of files is approximately as fast as running it on a single file.

Shorten function name

574de8a

datapythonista added Docs CI Continuous Integration labels Mar 19, 2024

dontgoto added 3 commits March 20, 2024 08:40

Merge branch 'main' into allow_pep8_validation_of_multiple_files

39b9e2e

Merge branch 'main' into allow_pep8_validation_of_multiple_files

e626ae1

Merge branch 'main' into allow_pep8_validation_of_multiple_files

2d3004a

github-actions bot added the Stale label Apr 25, 2024

mroeschke closed this May 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

CI: Make pep8 validation work with multiple files simultaneously #57914

CI: Make pep8 validation work with multiple files simultaneously #57914

Uh oh!

dontgoto commented Mar 19, 2024 •

edited

Loading

Uh oh!

datapythonista commented Mar 19, 2024

Uh oh!

dontgoto commented Mar 19, 2024

Uh oh!

datapythonista commented Mar 19, 2024

Uh oh!

dontgoto commented Mar 19, 2024

Uh oh!

github-actions bot commented Apr 25, 2024

Uh oh!

mroeschke commented May 8, 2024

Uh oh!

Uh oh!

Uh oh!

CI: Make pep8 validation work with multiple files simultaneously #57914

CI: Make pep8 validation work with multiple files simultaneously #57914

Uh oh!

Conversation

dontgoto commented Mar 19, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

datapythonista commented Mar 19, 2024

Uh oh!

dontgoto commented Mar 19, 2024

Uh oh!

datapythonista commented Mar 19, 2024

Uh oh!

dontgoto commented Mar 19, 2024

Uh oh!

github-actions bot commented Apr 25, 2024

Uh oh!

mroeschke commented May 8, 2024

Uh oh!

Uh oh!

dontgoto commented Mar 19, 2024 •

edited

Loading