1302800 - Verify taskgraph implementations against documentation

Reporter

Description

•

8 years ago

The docs in taskcluster/docs contain a few lists of things -- attributes, parameters, kinds, caches, run-using implementations.

It would be great if something, either as part of the `mach taskgraph python-tests` or as part of the decision task, would ensure that those stay in sync with the code.

Some simple regexps over the documentation files would do the trick -- I don't think that trying to parse the files with docutils is a good use of time.

Dustin J. Mitchell [:dustin] (he/him)

Reporter

Updated

•

8 years ago

Assignee: nobody → hammad13060

Comment hidden (mozreview-request)

Review commit: https://reviewboard.mozilla.org/r/95702/diff/#index_header
See other reviews: https://reviewboard.mozilla.org/r/95702/

HAMMAD AKHTAR

Assignee

Comment 2

•

8 years ago

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

I am trying to develop generic method for doc verification against implementation. 
Please provide your feedback.
P.S patch is incomplete right now

Comment hidden (mozreview-request)

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

Review request updated; see interdiff: https://reviewboard.mozilla.org/r/95702/diff/1-2/

Dustin J. Mitchell [:dustin] (he/him)

Reporter

Comment 4

•

8 years ago

mozreview-review

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

https://reviewboard.mozilla.org/r/95702/#review96026

::: taskcluster/taskgraph/generator.py:240
(Diff revision 2)
> +    @VerifyDoc.verify_doc_file("documentation and implementation for 'kinds' do not match", os.getcwd() + "/taskcluster/docs/")
> +    def verify_kinds(self, kinds):
> +        return {
> +            "kinds.rst": [kind for kind in kinds]
> +        }

I don't see a great advantage to using a decorator here.  Why not just

    def verify_kinds(self, kinds):
        verify_docs(
            filename='kinds.rst',
            identifiers=kinds.keys(),
            appearing_as='heading')

::: taskcluster/taskgraph/generator.py:256
(Diff revision 2)
> +            for attribute in task.attributes:
> +                attribute_set.add(attribute)

`attribute_set.update(task.attributes)` will be faster

::: taskcluster/taskgraph/generator.py:265
(Diff revision 2)
> +        for kind_label, kind in kinds.iteritems():
> +            if "jobs" in kind.config:
> +                for job_label, job in kind.config["jobs"].iteritems():
> +                    if "run" in job and "using" in job["run"]:
> +                        key = job["run"]["using"]
> +                        job_set.add(key)

This is reaching a little too deeply into the kind definitions.  In particular, they might have `jobs-from` instead of `jobs`.  Or some kinds might use completely different config.

However, `taskgraph.transforms.job.registry` has exactly the list of of run-using's that you need.

::: taskcluster/taskgraph/util/verifydoc.py:6
(Diff revision 2)
> +import re
> +import os
> +
> +class VerifyDoc(object):
> +
> +	@classmethod

A class with a single class method is just about always better implemented as a simple function.

Attachment #8814487 - Flags: review?(dustin)

Dustin J. Mitchell [:dustin] (he/him)

Reporter

Comment 5

•

8 years ago

I really like this comprehensive approach -- keep it up!

HAMMAD AKHTAR

Assignee

Comment 6

•

8 years ago

Hey Dustin thanks for the feedback :)
Can you elaborate on `taskgraph.transforms.job.registry`. Are you referring to registry dictionary in `transforms/__init__.py` ?

Dustin J. Mitchell [:dustin] (he/him)

Reporter

Comment 7

•

8 years ago

That's the one!

Comment hidden (mozreview-request)

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

Review request updated; see interdiff: https://reviewboard.mozilla.org/r/95702/diff/2-3/

HAMMAD AKHTAR

Assignee

Comment 9

•

8 years ago

Hey Dustin, can you shed some light on caches ?

Comment hidden (mozreview-request)

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

Review request updated; see interdiff: https://reviewboard.mozilla.org/r/95702/diff/3-4/

Comment hidden (mozreview-request)

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

Review request updated; see interdiff: https://reviewboard.mozilla.org/r/95702/diff/4-5/

Comment hidden (mozreview-request)

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

Review request updated; see interdiff: https://reviewboard.mozilla.org/r/95702/diff/5-6/

Comment hidden (mozreview-request)

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

Review request updated; see interdiff: https://reviewboard.mozilla.org/r/95702/diff/6-7/

Dustin J. Mitchell [:dustin] (he/him)

Reporter

Comment 14

•

8 years ago

mozreview-review

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

https://reviewboard.mozilla.org/r/95702/#review96888

I love the design!  This is great!

Just a few (ok, a lot) Python style points.  Some of these are probably applicable to any kind of Python, while others are specific to how we write Python in the Gecko source.

::: taskcluster/taskgraph/util/verifydoc.py:1
(Diff revision 7)
> +import re

This will need a copyright header like the other Python files (just copy/paste it)

Also, please use four-space indents.

::: taskcluster/taskgraph/util/verifydoc.py:4
(Diff revision 7)
> +import re
> +import os
> +
> +base_path = os.getcwd() + "/taskcluster/docs/"

best to use `os.path.join` here

::: taskcluster/taskgraph/util/verifydoc.py:8
(Diff revision 7)
> +
> +base_path = os.getcwd() + "/taskcluster/docs/"
> +
> +
> +def verify_docs(filename, identifiers, appearing_as):
> +	fileObject = open(base_path + filename)

In in-tree Python, we prefer `with open`:

    with open(os.path.join(base_path, filename)) as f:
        doctext = f.read()

::: taskcluster/taskgraph/util/verifydoc.py:10
(Diff revision 7)
> +	if appearing_as == "inline-literal":
> +		expression_list = ["``" + identifier + "``" for identifier in identifiers]
> +	elif appearing_as == "heading":
> +		expression_list = [identifier + "\n[-+\n*]+|[.+\n*]+" for identifier in identifiers]

Nicely done!

Please add an `else` branch here that raises an exception, to avoid typos like `"haeding"`.

::: taskcluster/taskgraph/util/verifydoc.py:15
(Diff revision 7)
> +	if appearing_as == "inline-literal":
> +		expression_list = ["``" + identifier + "``" for identifier in identifiers]
> +	elif appearing_as == "heading":
> +		expression_list = [identifier + "\n[-+\n*]+|[.+\n*]+" for identifier in identifiers]
> +
> +	for i in range(0, len(expression_list)):

`for expression, identifier in zip(expression_list, identifiers)`

Attachment #8814487 - Flags: review?(dustin)

Comment hidden (mozreview-request)

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

Review request updated; see interdiff: https://reviewboard.mozilla.org/r/95702/diff/7-8/

Comment hidden (mozreview-request)

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

Review request updated; see interdiff: https://reviewboard.mozilla.org/r/95702/diff/8-9/

Comment hidden (mozreview-request)

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

Review request updated; see interdiff: https://reviewboard.mozilla.org/r/95702/diff/9-10/

Comment hidden (mozreview-request)

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

Review request updated; see interdiff: https://reviewboard.mozilla.org/r/95702/diff/10-11/

Dustin J. Mitchell [:dustin] (he/him)

Reporter

Comment 19

•

8 years ago

mozreview-review

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

https://reviewboard.mozilla.org/r/95702/#review97232

At a glance, this is looking better, but I see more review requests coming .. let me know when you're ready for me to do the final review!

Attachment #8814487 - Flags: review?(dustin)

HAMMAD AKHTAR

Assignee

Comment 20

•

8 years ago

(In reply to Dustin J. Mitchell [:dustin] from comment #19)
> Comment on attachment 8814487 [details]
> Bug 1302800 - Verify taskgraph implementations against documentation;
> 
> https://reviewboard.mozilla.org/r/95702/#review97232
> 
> At a glance, this is looking better, but I see more review requests coming
> .. let me know when you're ready for me to do the final review!

I am ready for final review :)

Comment hidden (mozreview-request)

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

Review request updated; see interdiff: https://reviewboard.mozilla.org/r/95702/diff/11-12/

HAMMAD AKHTAR

Assignee

Comment 22

•

8 years ago

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

I am ready for final review :)

Dustin J. Mitchell [:dustin] (he/him)

Reporter

Comment 23

•

8 years ago

mozreview-review

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

https://reviewboard.mozilla.org/r/95702/#review97638

::: taskcluster/docs/transforms.rst:154
(Diff revision 12)
> +  ``hazard``
> +  ``mach``
> +  ``mozharness``
> +  ``run-task``
> +  ``spidermonkey`` or ``spidermonkey-package`` or ``spidermonkey-mozjs-crate``
> +  ``toolchain-script``

I suspect this will render as a paragraph in the final output?  We'll probably want to make it a list.

Attachment #8814487 - Flags: review?(dustin) → review+

Pulsebot

Comment 24

•

8 years ago

Pushed by dmitchell@mozilla.com:
https://hg.mozilla.org/integration/autoland/rev/632a2fddbc63
Verify taskgraph implementations against documentation; r=dustin

Phil Ringnalda (:philor)

Comment 25

•

8 years ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/632a2fddbc63

Status: NEW → RESOLVED

Closed: 8 years ago

Resolution: --- → FIXED

Target Milestone: --- → mozilla53

Justin Wood (:Callek)

Comment 26

•

8 years ago

Soooo, how do I run this test locally?

At first glance it looks to be part of the decision task alone, but if thats true, I think we should back out because a test that can break the entire build that can only be run in production (including try) isn't ideal for local iteration. (While the rest of the mach taskgraph stuff can indeed be run locally)

If it should be running as part of `./mach taskgraph full` it doesn't seem to be failing for me locally when I'm missing docs for some of the kinds on the `date` branch after merging this in...

Flags: needinfo?(hammad13060)

Justin Wood (:Callek)

Comment 27

•

8 years ago

mozreview-review

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

https://reviewboard.mozilla.org/r/95702/#review97700

::: taskcluster/taskgraph/util/verifydoc.py:9
(Diff revision 12)
> +# file, You can obtain one at http://mozilla.org/MPL/2.0/.
> +
> +import re
> +import os
> +
> +base_path = os.path.join(os.getcwd(), "taskcluster/docs/")

Shouldn't this be using __file__ in some way, rather than relying on a known cwd that just-happens-to-match the topsrcdir

Justin Wood (:Callek)

Comment 28

•

8 years ago

mozreview-review

Comment on attachment 8814487 [details]
Bug 1302800 - Verify taskgraph implementations against documentation;

https://reviewboard.mozilla.org/r/95702/#review97704

::: taskcluster/taskgraph/util/verifydoc.py:18
(Diff revision 12)
> +    with open(os.path.join(base_path, filename)) as fileObject:
> +        doctext = "".join(fileObject.readlines())
> +        if appearing_as == "inline-literal":
> +            expression_list = ["``" + identifier + "``" for identifier in identifiers]
> +        elif appearing_as == "heading":
> +            expression_list = [identifier + "\n[-+\n*]+|[.+\n*]+" for identifier in identifiers]

Ah-Ha, found my issue with some debug logging...

 0:00.20 Testing match for expression u'balrog-l10n(?:\\n[-+\\n*]+|[\\.+\\n*]+)' using identifier u'balrog-l10n'
Traceback (most recent call last):
  File "/home/callek/mozilla/hg/mozilla-central/taskcluster/mach_commands.py", line 231, in show_taskgraph

That was after modifying this expression, the | here was effectively causing it to match either 'balrog-l10n\n[-+\n*]+' OR '[.+\n*]+'  I'll get a followup bug on file and a patch, unless you beat me to it.

Justin Wood (:Callek)

Updated

•

8 years ago

Blocks: 1322193

Justin Wood (:Callek)

Comment 29

•

8 years ago

Filed Bug 1322193  for the followup work, and assigned to Hammad for now.

Flags: needinfo?(hammad13060)

BMO Automation

Updated

•

6 years ago

Product: TaskCluster → Firefox Build System