Add caches to hot functions in `utils.py` that are called from multiple locations #79

AlexWaygood · 2023-10-11T08:53:24Z

This gives around a ~~16%~~ 24% speedup when running sphinx-lint on 7 ~large .rst files in CPython.

Benchmark script, that needs to be run from a directory that has a clone of CPython in it

import sys
from sphinxlint.__main__ import main


files = [f"cpython/Doc/library/{module}.rst" for module in ("os", "typing", "sqlite3", "stdtypes", "argparse", "enum")]
files.append("cpython/Doc/reference/datamodel.rst")
args = ["foo"] + files

def test():
    main(args)

If the benchmark script is saved as benchmark.py, run the benchmark script using python -m timeit -s "from benchmark import test" "test()".

Part of #76

AlexWaygood · 2023-10-11T10:54:47Z

I temporarily converted this back to a draft, as I got worried that my benchmark script was unreliable, since timeit doesn't clear caches in between runs of the benchmark. I wrote a more reliable benchmark script, however, and it still shows a clear speedup.

New benchmark script

import statistics
import subprocess
import sys


files = [f"cpython/Doc/library/{module}.rst" for module in ("os", "typing", "sqlite3", "stdtypes", "argparse", "enum")]
files.append("cpython/Doc/reference/datamodel.rst")
argv = ["foo"] + files

script = f"""\
import time
from sphinxlint.__main__ import main
t0 = time.perf_counter()
main({argv})
t1 = time.perf_counter() - t0
raise SystemExit(t1)"""


command = [sys.executable, "-c", "; ".join(script.splitlines())]
timings = [
    float(subprocess.run(command, capture_output=True, text=True).stderr.strip())
    for _ in range(5)
]
print(statistics.mean(timings))

On main:

1.9073304200079293

With this PR branch:

1.685813060007058

hugovk · 2023-10-11T14:07:44Z

A 6% improvement on an 8-core macOS M2 (the basic benchmark script limits the number of files so multiprocessing isn't used):

main: 0.8339760334005405
PR: 0.7873101998004131

A 17% improvement when adjusting the benchmark script to run on all 293 Doc/library/*.rst (uses multiprocessing):

main: 1.5562639416013553
PR: 1.2917625166002835

hugovk

Nice improvement! Almost too easy!

AlexWaygood added 3 commits October 11, 2023 10:32

Add caches to several functions in utils.py

a7ae4ea

A more complex cache for hide_non_rst_blocks()

7d63ef3

Also do paragraphs()

aad06d0

AlexWaygood marked this pull request as draft October 11, 2023 09:18

AlexWaygood marked this pull request as ready for review October 11, 2023 10:52

hugovk approved these changes Oct 11, 2023

View reviewed changes

hugovk merged commit dc0514d into sphinx-contrib:main Oct 11, 2023

AlexWaygood deleted the caches branch October 11, 2023 14:15

hugovk mentioned this pull request Oct 11, 2023

Use pre-compiled regular expressions #77

Merged

This was referenced Oct 11, 2023

Micro-optimise check_missing_space_after_role() #80

Merged

Add a cache to rst.inline_markup_gen() #81

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add caches to hot functions in `utils.py` that are called from multiple locations #79

Add caches to hot functions in `utils.py` that are called from multiple locations #79

AlexWaygood commented Oct 11, 2023 •

edited

Loading

AlexWaygood commented Oct 11, 2023

hugovk commented Oct 11, 2023

hugovk left a comment

Add caches to hot functions in utils.py that are called from multiple locations #79

Add caches to hot functions in utils.py that are called from multiple locations #79

Conversation

AlexWaygood commented Oct 11, 2023 • edited Loading

AlexWaygood commented Oct 11, 2023

hugovk commented Oct 11, 2023

hugovk left a comment

Choose a reason for hiding this comment

Add caches to hot functions in `utils.py` that are called from multiple locations #79

Add caches to hot functions in `utils.py` that are called from multiple locations #79

AlexWaygood commented Oct 11, 2023 •

edited

Loading