Add totals report #18

meshy · 2022-11-29T17:45:41Z

This adds:

~~New isort and flake8 configs.~~
~~Very basic arg parsing. (More on this below.)~~
~~A help command.~~
A totals command. This produces a basic summary of a previously created JSON report file.

This lacks (todo):

Docstrings
A command to print the summary of a report file over time.
isort in pre-commit. (see Expanded linting (isort and flake8 configs) #24)

Question: Is it time to break this into modules?
Question: Would it be better to use Typer (or similar) instead?

Tenzer

Broadly speaking, this looks good.

I'd suggest using `argparse` for the argument parsing. It's part of the standard library and would handle edge cases such as missing or unknown arguments for us.

I think we should rename the `errors` subcommand to `parse`, as that's more descriptive of what it does.
At the same time, we could require that the `parse` subcommand is provided and bump the version to 0.2.0. That avoids having to maintain the legacy way of calling the tool without a subcommand, especially considering how few people are presumably using this for now.
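
As a rough sketch of what that could look like: `argparse` can make the subcommand mandatory by passing `required=True` to `add_subparsers` (available from Python 3.7). The handler name, its return value, and the `func` default are illustrative assumptions, not code from this PR.

```python
import argparse
from typing import List, Optional


def _parse_command(args: argparse.Namespace) -> str:
    # Hypothetical handler; the real command would read mypy output here.
    return "parse"


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(prog="mypy-json-report")
    # required=True makes argparse exit with an error when no subcommand
    # is given, so there is no legacy "no subcommand" path to maintain.
    subparsers = parser.add_subparsers(dest="command", required=True)
    parse_parser = subparsers.add_parser("parse", help="Parse mypy output.")
    parse_parser.set_defaults(func=_parse_command)
    return parser


def main(argv: Optional[List[str]] = None) -> str:
    args = build_parser().parse_args(argv)
    return args.func(args)
```

Calling `main([])` exits with a usage error, and `--help` is generated automatically for both the top-level parser and the subcommand.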

meshy · 2023-01-30T23:59:19Z

Thanks :)

argparse is a good idea. I've now written a commit that uses it, and it's definitely nicer.

I'm not able to push it right now because I no longer have permissions on this repo. Would you be happy to grant me access again? If not, that's fine. Let me know, and I'll fork and create a new PR.

No need to rush that decision. There's still plenty more for me to do on this before I'm happy with it.

Tenzer · 2023-01-31T09:08:19Z

I have invited you as a collaborator on the repo.

meshy · 2023-01-31T09:29:52Z

Thanks!

At the same time, I've renamed the `errors` subcommand to `parse`, and dropped the `help` command in favour of the `--help` flag that `argparse` automatically generates.

Tenzer · 2023-02-18T16:34:13Z

I guess this PR can be closed now?

meshy · 2023-02-18T16:39:26Z

I'm going to re-work this. I'd still like a command that can:

Print the number of errors in the report, and the number of files with errors.
Print the history of that same data (as json or csv) over time, so that graphs can easily be produced.
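
To make those two outputs concrete, here is a minimal sketch. It assumes the report maps each filename to a mapping of error message to count; the function names and the `label` column are hypothetical.

```python
import csv
import io
from typing import Dict, Iterable, Tuple

# Assumed report shape: {filename: {error message: count}}.
Report = Dict[str, Dict[str, int]]


def count_errors(report: Report) -> Dict[str, int]:
    """Return the total number of errors and the number of files with errors."""
    return {
        "total_errors": sum(sum(messages.values()) for messages in report.values()),
        "files_with_errors": len(report),
    }


def history_as_csv(snapshots: Iterable[Tuple[str, Report]]) -> str:
    """Render one CSV row per (label, report) snapshot, ready for graphing."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=["label", "total_errors", "files_with_errors"],
        lineterminator="\n",
    )
    writer.writeheader()
    for label, report in snapshots:
        writer.writerow({"label": label, **count_errors(report)})
    return buffer.getvalue()
```

The label could be a commit hash or a date, depending on where the snapshots come from.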

Tenzer · 2023-03-05T21:41:40Z

I had some free time and thought I would see if I could help this along, and came up with this:

diff --git a/mypy_json_report.py b/mypy_json_report.py
index 05992c3..149ecd4 100644
--- a/mypy_json_report.py
+++ b/mypy_json_report.py
@@ -13,11 +13,13 @@
 # limitations under the License.
 
 import argparse
+import csv
 import enum
 import itertools
 import json
 import operator
 import pathlib
+import subprocess
 import sys
 import textwrap
 from collections import Counter, defaultdict
@@ -90,6 +92,26 @@ def main() -> None:
 
     parse_parser.set_defaults(func=_parse_command)
 
+    report_parser = subparsers.add_parser(
+        "report", help="Generate a report from the ratchet file."
+    )
+    report_parser.add_argument("ratchet_file", help="Path to the ratchet file.")
+    report_parser.add_argument(
+        "-g",
+        "--git",
+        action="store_true",
+        help="Generate historic report by using the history from Git.",
+    )
+    report_parser.add_argument(
+        "-o",
+        "--output-file",
+        type=argparse.FileType("w"),
+        default=sys.stdout,
+        help="The file to write the report to. If omitted, the report will be written to STDOUT. Only used in historic mode.",
+    )
+
+    report_parser.set_defaults(func=_report_command)
+
     parsed = parser.parse_args()
     parsed.func(parsed)
 
@@ -295,5 +317,65 @@ class ChangeTracker:
         return None
 
 
+def _count_errors(errors: dict[str, dict[str, int]]) -> dict[str, int]:
+    total_errors = sum(sum(file_errors.values()) for file_errors in errors.values())
+    files_with_errors = len(errors.keys())
+
+    return {
+        "total_errors": total_errors,
+        "files_with_errors": files_with_errors,
+    }
+
+
+def _report_command(args: argparse.Namespace) -> None:
+    if not args.git:
+        with open(args.ratchet_file) as file_pointer:
+            errors = json.load(file_pointer)
+
+        print(
+            "Total errors: {total_errors}\nFiles with errors: {files_with_errors}".format(
+                **_count_errors(errors)
+            )
+        )
+        return None
+
+    result = subprocess.run(
+        [
+            "git",
+            "log",
+            r"--format=%H|%ad",
+            "--date=format:%F %T",
+            "--reverse",
+            "--",
+            args.ratchet_file,
+        ],
+        check=True,
+        capture_output=True,
+    )
+    writer = csv.DictWriter(
+        args.output_file,
+        fieldnames=["commit_hash", "timestamp", "total_errors", "files_with_errors"],
+    )
+    writer.writeheader()
+
+    for line in filter(None, result.stdout.decode().split("\n")):
+        commit_hash, timestamp = line.split("|")
+        show_result = subprocess.run(
+            ["git", "show", f"{commit_hash}:./{args.ratchet_file}"],
+            check=True,
+            capture_output=True,
+        )
+        errors = json.loads(show_result.stdout.decode())
+
+        try:
+            errors_count = _count_errors(errors)
+        except AttributeError:
+            continue
+
+        writer.writerow(
+            {"commit_hash": commit_hash, "timestamp": timestamp} | errors_count
+        )
+
+
 if __name__ == "__main__":
     main()

It's pretty rough and is bound to need changes to work with Python 3.7, but it might serve as some inspiration.
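
For reference, the parts of the patch that would not run on Python 3.7 are the `dict[str, ...]` annotations and the `|` operator on dicts, both of which need Python 3.9. A possible 3.7-compatible way to build the CSV row is sketched below; the `build_row` helper is hypothetical and only illustrates the substitution.

```python
from typing import Dict, Union

# typing.Dict works on 3.7, unlike subscripting the built-in dict.
Row = Dict[str, Union[str, int]]


def build_row(commit_hash: str, timestamp: str, counts: Dict[str, int]) -> Row:
    # Dict unpacking replaces `{...} | counts`, which requires Python 3.9.
    return {"commit_hash": commit_hash, "timestamp": timestamp, **counts}
```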

I opted for shelling out to Git because that means we don't need any extra dependencies. Thinking about it, though, it would probably be nicer to use a Git Python library for the Git interactions and specify that as an optional dependency.
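
One way the optional dependency could work is sketched below, assuming GitPython as the library (an assumption; no library is named in this thread). If it is importable it is used, otherwise the code falls back to `git log` as in the patch. The function names are hypothetical.

```python
import subprocess
from typing import List, Tuple

try:
    import git  # GitPython, assumed here to be an optional extra
except ImportError:
    git = None


def parse_log_output(output: str) -> List[Tuple[str, str]]:
    """Split `git log --format=%H|%ad` output into (hash, timestamp) pairs."""
    pairs = []
    for line in output.splitlines():
        if not line.strip():
            continue
        commit_hash, timestamp = line.split("|", 1)
        pairs.append((commit_hash, timestamp))
    return pairs


def ratchet_history(path: str) -> List[Tuple[str, str]]:
    """List commits touching `path`, oldest first."""
    if git is not None:
        repo = git.Repo(search_parent_directories=True)
        return [
            (commit.hexsha, commit.authored_datetime.strftime("%Y-%m-%d %H:%M:%S"))
            for commit in repo.iter_commits(paths=path, reverse=True)
        ]
    # Fallback: shell out to the git executable, as the patch above does.
    result = subprocess.run(
        ["git", "log", "--format=%H|%ad", "--date=format:%F %T", "--reverse", "--", path],
        check=True,
        capture_output=True,
        text=True,
    )
    return parse_log_output(result.stdout)
```

Keeping the parsing in its own function means the fallback path can be tested without a repository.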

meshy · 2023-03-07T09:10:56Z

Thanks for that! I think shelling out to git is a reasonable approach for now.

meshy added 11 commits November 29, 2022 17:39

Add flake8 config

7394cac

Add isort config

9c4c7dd

Break calculation into separate statement

3f1eac6

Isolate calculation from serialization

29c61e3

Rename test

2a9a692

Split logic from entrypoint

c0e8d3e

Add basic support for subcommands

627ecd3

Allow calling with mypy-json-report errors

55d92f1

Add help command

e2d4021

Print only the command when bad arguments passed

ab7b417

Add totals command to quantify errors in a file

3142897

Tenzer reviewed Nov 30, 2022

Use argparse for parsing arguments

daf7260

This was referenced Jan 31, 2023

Expanded linting (isort and flake8 configs) #24

Merged

Move parsing into parse subcommand (introduce argparse) #25

Merged

This was referenced Jan 3, 2024

Add flag for printing reports in colour #89

Merged

Refactor into submodules #90

Merged
