Convention for test data which uses APIs defined by Python stubs #862

Closed
vlasovskikh wants to merge 3 commits

Conversation

vlasovskikh
Member

This PR addresses #754 about testing typeshed stubs from the static analysis perspective.

Currently we check only for syntax errors in stub files. The idea is to add test data for static analyzers similar to how it's done in DefinitelyTyped for TypeScript.

@vlasovskikh
Member Author

The Travis tests are almost OK, except for pytype, which apparently doesn't yet support the comment-based function type hints added to PEP 484 last year. @matthiaskramm I was unable to find an issue about this in the pytype issue tracker. Could you take a look at this problem?

@matthiaskramm
Contributor

@vlasovskikh: pytype supports this now. We can export a new open-source release.

Out of curiosity, why do you need support for this syntax? And why in Python 3.6? What happens if these comments are just ignored?

@vlasovskikh
Member Author

@matthiaskramm I used the comment-based syntax in a 3.6 test accidentally, by analogy to my other 2and3 test data. I use type hints for functions in the test data because mypy doesn't check functions without type hints by default. If the mypy folks have no objections, we could enable that option when running mypy on the typeshed tests.
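
For reference, here is a minimal illustration of the two equivalent annotation styles (the check_ord function is made up for the example; the comment-based form is the one pytype was tripping over):

# Python-3-only annotation syntax:
def check_ord(c: str) -> int:
    return ord(c)

# Equivalent comment-based annotation from PEP 484, usable in 2and3 test data:
def check_ord(c):
    # type: (str) -> int
    return ord(c)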

@gvanrossum
Member

I'm still not excited about this. The tests just require you to say everything twice -- there's nothing that verifies that the tests actually match the implementation, so the problem remains essentially the same: if the author of a set of stubs misreads the docs, the stubs will be wrong, because the tests will be based on the same misreading of the docs.

@JukkaL
Contributor

JukkaL commented Jan 30, 2017

I think that these might only be worthwhile if we made it possible to run the tests using, say, pytest. This way we would be able to verify that both the tests and the stubs conform to the implementation. The stubs could still be too general or too narrow, but at least they couldn't be totally inaccurate. It would also be helpful to have tests that are expected not to pass type checking, to verify that bad code is diagnosed correctly.

Again, having tests for stubs wouldn't be required, and maybe not even generally recommended, but they might be helpful in some cases, and they could make reviewing changes to stubs easier, as we wouldn't need to always manually verify that types are correct (or blindly trust the contributor). If something looks fishy in a PR, we could always ask the contributor to write some tests.

Here's a hypothetical test case for ord:

from typeshed_util import assert_type
...
    def test_ord_str(self):
        n = ord('A')
        assert_type(n, int)

When this test case is run using pytest, we test that ord accepts a str argument at runtime and may return an int (assert_type would perform an isinstance check at runtime).

When we type check the test case using mypy (with --check-untyped-defs so we don't need an annotation) mypy will verify that the stub allows calling ord with a str argument, and we'd teach mypy to process assert_type in a special way so that it would generate an error on that line unless the static type of n is int.

ord is perhaps too trivial to warrant a test case, but for more complex things like dict.get and requests.get these could be useful.
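
As a rough illustration of the runtime half of this idea (typeshed_util and assert_type above are hypothetical helpers, not an existing API), assert_type could be little more than an isinstance check:

# Hypothetical helper following the sketch above; at runtime it only checks
# the dynamic type, while the static check would come from the type checker
# treating assert_type specially.
def assert_type(value, expected_type):
    assert isinstance(value, expected_type), \
        "expected an instance of %r, got %r" % (expected_type, type(value))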

We could also have tests that verify that invalid arguments are rejected. For example:

from typeshed_util import expect_type_error
...
    def test_ord_invalid_arg(self):
        with expect_type_error():
            ord(1)

When the test case is run using pytest, we'd fail the test case unless the call to ord raises a suitable exception (such as TypeError).

When we'd type check the test case using mypy, mypy would recognize the with expect_type_error() block and generate an error if the body, checked without the with statement, wouldn't produce a type error on its own.
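
Under the same assumptions, the runtime side of expect_type_error could be a small context manager that passes only when the body raises a TypeError:

# Hypothetical helper matching the sketch above.
from contextlib import contextmanager

@contextmanager
def expect_type_error():
    try:
        yield
    except TypeError:
        return  # the invalid call was rejected at runtime, as expected
    raise AssertionError("expected a TypeError, but none was raised")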

@vlasovskikh
Member Author

@JukkaL I like your idea and I'll experiment with it. It looks viable at least for the stdlib stubs. With so many pull requests coming into the repository, I feel like having working code examples for suggested stubs is a good idea.

As for third-party stubs, it may require installing unspecified versions of dependencies (including ones that are incompatible with one another).

@JukkaL
Contributor

JukkaL commented Jan 30, 2017

Starting with stdlib sounds reasonable.

We may be able to use pip freeze and pinned package versions to get repeatable dependencies, at the cost of making it a little harder to update and add dependencies. Using unspecified package versions would be asking for trouble, in my opinion.

Another option would be to use multiple virtualenvs, e.g. one per third-party package. We might have to do this anyway, since we could have two third-party packages with conflicting dependencies.
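
A minimal sketch of the one-virtualenv-per-package idea, assuming a hypothetical layout where each third-party stub directory carries a pinned requirements-tests.txt (none of this exists in typeshed today):

# Hypothetical sketch: create an isolated environment for one stubbed package
# and install its pinned test dependencies into it (POSIX bin/ layout assumed).
import subprocess
import sys
from pathlib import Path

def make_env_for(package_dir: Path, envs_root: Path) -> Path:
    env_dir = envs_root / package_dir.name
    subprocess.check_call([sys.executable, "-m", "venv", str(env_dir)])
    requirements = package_dir / "requirements-tests.txt"
    if requirements.exists():
        pip = env_dir / "bin" / "pip"
        subprocess.check_call([str(pip), "install", "-r", str(requirements)])
    return env_dir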

@vlasovskikh
Member Author

@gvanrossum @JukkaL @matthiaskramm I've sent another PR #917 that proposes both static and run-time tests. Closing this one.

@vlasovskikh vlasovskikh closed this Feb 5, 2017
@vlasovskikh vlasovskikh deleted the test-data branch February 19, 2017 15:50