Enable Dialyzer and ETC cross checks #429

erszcz · 2022-07-13T14:51:52Z

This adds a few changes to facilitate comparisons between existing Erlang type checkers:

- a new Make rule to enable running Dialyzer on Gradualizer tests
- renamed test modules to avoid name clashes: when running Dialyzer on multiple modules, their names have to be unique, which was not the case across should_pass / should_fail / ...
- an ignore rule for *.erltypes files generated by ETC

All in all, there are no functional changes to the type checker itself.

Intermediate comparison results are available at https://gist.github.com/erszcz/4d43a77464c87a514e71eecf2811af63.

zuiderkwast

Nice! What are the results? Were any Dialyzer errors found?

zuiderkwast · 2022-07-13T16:19:07Z

Makefile

+.PHONY: dialyze-tests
+dialyze-tests: app $(DIALYZER_PLT)
+	dialyzer $(DIALYZER_OPTS) $(test_data_erls)
+


Can we run this in CI to maintain expected results? I.e. to prevent name clashes from being introduced again.

Are there any dialyzer errors in the tests?

Hmm, we probably could, but the errors do not overlap completely, so we would need to come up with some should-pass/should-fail/known-problems mapping mechanism. I don't think we can run it just to detect name clashes; we would also have to check the error code and determine which one is returned and why.

Below is an interesting case:

Gradualizer/test/should_pass/andalso_any.erl

Lines 5 to 10 in 51a8029

-spec f1() -> boolean().
f1() ->
    true andalso g1().

-spec g1() -> any().
g1() -> 3.

Dialyzer reports:

andalso_any.erl:5:2: Invalid type specification for function andalso_any:f1/0. The success typing is () -> 3

Whereas Gradualizer is fine with it, because any() is compatible with every other type. AFAIU a full-blown gradual typing system according to Siek and Taha would inject a dynamic check in such a place.

Dialyzer infers the type of an integer literal, but it couldn't do so if the value was not a literal.
Gradualizer just believes the spec, even with --infer, and checks against it, so it assumes everything is fine.

I imagine there are more cases like this, but haven't analysed all of them yet. The current summary is available at https://gist.github.com/erszcz/4d43a77464c87a514e71eecf2811af63#file-check-2022-07-13_170833-tsv.

Interesting. Yes, Dialyzer does a full inference, while Gradualizer is silent on purpose in this case.

Regarding Siek & Taha: Since Erlang/BEAM is a dynamic language, there are already runtime checks for everything anyway. The runtime catches it and raises an exception such as badarg. The paper speaks about the case where a dynamic type is added to a statically typed language without runtime type checks. This is my interpretation anyway.

Could we check for unique module filenames in another way then? Just some simple shell script?

Since Erlang/BEAM is a dynamic language, there are already runtime checks for everything anyway.

I agree. However, I think the mentioned example shows unsoundness of the type system. Let's consider this (and the following) paragraph from https://papl.cs.brown.edu/2014/safety-soundness.html#%28part._type-soundness%29:

The central result we wish to have for a given type-system is called soundness. It says this. Suppose we are given an expression (or program) e. We type-check it and conclude that its type is t. When we run e, let us say we obtain the value v. Then v will also have type t.

In our case the type checker will pretend that f1() -> boolean(), but in fact it turns out to be f1() -> integer() upon evaluation.

Could we check for unique module filenames in another way then? Just some simple shell script?

I think we can use some find | sort | uniq magic to do it. I'll come up with something tomorrow.
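A minimal sketch of what such a find | sort | uniq check could look like. This is only an illustration, not necessarily the script that was later added to the PR. The function name find_module_name_clashes is hypothetical, and the sketch assumes test modules are *.erl files whose paths contain no newlines.

```shell
# Hypothetical helper: print every .erl file name that occurs more than once
# anywhere under the directory given as $1 (one name per line, sorted).
find_module_name_clashes() {
    find "$1" -type f -name '*.erl' \
        | sed 's|.*/||' \
        | sort \
        | uniq -d
}
```

A CI step could call it on the test directory and fail when the output is non-empty, since a non-empty result means at least two test modules share a name.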

Are you saying Gradualizer (or any sound gradual type system) should insert an assertion to make sure g1() in your example above returns a boolean? This is to ensure we get a runtime type error rather than a value of the wrong type...? So f1 above is translated into something like this:

-spec f1() -> boolean().
f1() ->
    true andalso (begin G1 = g1(), ?assert(is_boolean(G1)), G1 end).

This is interesting. I didn't think about it in this way before.

I think this is explained in a simple way in the introduction of Max S. New, Dustin Jamner, and Amal Ahmed. 2020. Graduality and Parametricity: Together Again for the First Time. Proc. ACM Program. Lang. 4, POPL, Article 46 (January 2020), 32 pages. https://doi.org/10.1145/3371114

Are you saying Gradualizer (or any sound gradual type system) should insert an assertion to make sure g1() in your example above returns a boolean?

I think so. The fragment you quote is aligned with what I remembered from the paper 👍

-spec f1() -> boolean().
f1() ->
    true andalso (begin G1 = g1(), ?assert(is_boolean(G1)), G1 end).

So f1 above is translated into something like this [...]

Yes, more or less. Their paper didn't cover Erlang, obviously, and the semantics of the language start to matter at this point. I'd type the Erlang andalso as andalso(boolean(), any()) -> any() based on the below:

1> 3 andalso 5.
** exception error: bad argument: 3
2> false andalso 5.
false
3> true andalso 5.
5

So I think the inserted assertion, due to the spec we declare for f1, should enclose the application of andalso:

-spec f1() -> boolean().
f1() ->
    R = true andalso g1(),
    ?assert(is_boolean(R)),
    R.

However, I don't think we can consider this a goal for Gradualizer given it's not part of a compiler. Perhaps one day it could rewrite .beam files... but that's a far-fetched goal. I think there are more tangible and important milestones before that.

@josefs What do you think about our conclusions above? The example functions f1 and g1...

However, I don't think we can consider this a goal for Gradualizer given it's not part of a compiler. Perhaps one day it could rewrite .beam files... but that's a far fetched goal.

True, but without this I don't believe we have soundness and the types can't really be trusted. I always regarded Gradualizer as an experiment. The next experiment could be "Gradual Erlang", a language which looks like Erlang and compiles to Erlang with assertions inserted...

Interesting comment that A andalso B actually is syntactic sugar for case A of true -> B ... - erlang/otp#5456 (comment).

As you noted @zuiderkwast, inserting checks when the typechecker coerces to/from any() is referred to as "sound gradual typing" in the literature. My intention with Gradualizer was to refrain from doing it, simply because I thought that it would be too much work. Adding coercions has the advantage that runtime type errors would be reported earlier and in places that are often easier to understand.

In some implementations it is required to insert these coercions because not doing so would lead to a crash/segfault/undefined behaviour. But since we're relying on executing things on top of BEAM we don't have such problems.

There are quite a few papers on measuring the cost of these coercions and there are example programs for which they are quite expensive. Those examples typically involve having any() embedded in some structure so that the coercions have to traverse the structure.

I don't have anything in principle against adding these kinds of coercions. But from an efficiency point of view it might be preferable to have this not be the default behaviour and have a flag that enables it.

erszcz · 2022-07-14T11:53:55Z

Ok, we have a check for name clashes in place. Unless you have some comments, @zuiderkwast, I'd be happy to merge this.

zuiderkwast

Yes, looks good!

erszcz · 2022-07-14T16:04:03Z

FYI, https://gist.github.com/erszcz/4d43a77464c87a514e71eecf2811af63#some-examples has a summary of the discrepancies between Dialyzer and Gradualizer in should_pass tests. Two of them are already fixed: one here, one in #430. One or two of them need bug tickets, which I'll create at some point.

I'm keen on learning about the differences in should_fail tests, though. Dialyzer seems to be way more permissive.

erszcz added 2 commits July 12, 2022 17:34

Add dialyze-tests make rule

f4839d9

Ignore *.erltypes

ac1cd06

erszcz requested a review from zuiderkwast July 13, 2022 14:51

erszcz changed the title ~~Dialyzer and ETC cross checks~~ Enable Dialyzer and ETC cross checks Jul 13, 2022

erszcz force-pushed the dialyzer-cross-check branch from 1d159ea to 21451ed Compare July 13, 2022 14:57

erszcz added 2 commits July 13, 2022 16:59

Avoid module name clashes in tests

03cb63e

Add a remark about some refinements in Dialyzer

748cbf8

erszcz force-pushed the dialyzer-cross-check branch from 21451ed to 748cbf8 Compare July 13, 2022 14:59

zuiderkwast reviewed Jul 13, 2022


Add a script to check for name clashes in test module file names

82e3ea9

erszcz force-pushed the dialyzer-cross-check branch from 1572dbb to 82e3ea9 Compare July 14, 2022 11:51

zuiderkwast approved these changes Jul 14, 2022


Export f/1 and nospec_update_bug/1 to fix Dialyzer warnings

20fd644

erszcz merged commit 8d7fbf4 into josefs:master Jul 14, 2022

erszcz deleted the dialyzer-cross-check branch July 14, 2022 15:28

This was referenced Jul 25, 2022

Export function in tests to fix a Dialyzer error #431

Merged

Map type inference not working when map creation expression is not an argument #432

Closed

erszcz mentioned this pull request Aug 30, 2022

Warn on every cast from any() to a well known type? #446

Open


erszcz commented Jul 13, 2022

zuiderkwast left a comment

zuiderkwast Jul 13, 2022

erszcz Jul 13, 2022

zuiderkwast Jul 13, 2022

erszcz Jul 13, 2022

zuiderkwast Jul 14, 2022

zuiderkwast Jul 14, 2022

erszcz Jul 14, 2022

zuiderkwast Jul 14, 2022

erszcz Aug 11, 2022

josefs Aug 30, 2022

erszcz commented Jul 14, 2022

zuiderkwast left a comment

erszcz commented Jul 14, 2022
