Closed
Bug 1300345
Opened 9 years ago
Closed 8 years ago
Intermittent-infra BadZipfile: File is not a zip file
Categories
(Release Engineering :: Applications: MozharnessCore, defect)
Release Engineering
Applications: MozharnessCore
Tracking
(Not tracked)
RESOLVED
WORKSFORME
People
(Reporter: intermittent-bug-filer, Unassigned)
References
Details
(Keywords: intermittent-failure, Whiteboard: [stockwell infra])
Attachments
(1 obsolete file)
Updated•9 years ago
|
Component: General → Mozharness
Product: Taskcluster → Release Engineering
QA Contact: jlund
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
Comment 3•9 years ago
|
||
I think this is just the new signature for bug 1300048 with the patch on bug 1272083 landed.
Depends on: 1272083
Comment 4•9 years ago
|
||
Just offhand, I'd say the frequency of this is somewhere around 20 times the frequency of that.
| Comment hidden (Intermittent Failures Robot) |
Comment 6•9 years ago
|
||
(In reply to Phil Ringnalda (:philor) from comment #4)
> Just offhand, I'd say the frequency of this is somewhere around 20 times the
> frequency of that.
Sure, because downloading the build and tests is performed via the new method now. That's why the higher frequency of this bug compared to bug 1300048 now.
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
Updated•9 years ago
|
Summary: Intermittent BadZipfile: File is not a zip file → Intermittent-infra BadZipfile: File is not a zip file
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
Comment 16•9 years ago
|
||
I don't know how OF counted 67.
Yesterday there were 22 instances:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1300413&startday=2016-09-01&endday=2016-09-15&tree=all
Comment 17•9 years ago
|
||
This link instead:
https://brasstacks.mozilla.com/orangefactor/?display=Bug&bugid=1300345&startday=2016-09-01&endday=2016-09-15&tree=all
I see 52 for yesterday.
I expect today to be less than 10 and tomorrow close to 0.
| Comment hidden (Intermittent Failures Robot) |
Comment 19•9 years ago
|
||
This got fixed by bug 1300812.
Status: NEW → RESOLVED
Closed: 9 years ago
Resolution: --- → FIXED
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
Comment 22•9 years ago
|
||
Well... the spike from last week is from someone pushing to inbound code a couple of weeks old.
See comments on bug 1305501 for details.
Comment 23•9 years ago
|
||
It seems that a lot of code was retriggered or backfilled and starred which meant making this spike again.
3:49 PM <armenzg> now I'm even more confused
3:49 PM this log
3:49 PM https://treeherder.mozilla.org/logviewer.html#?repo=mozilla-inbound&job_id=36399206#L612
3:49 PM says September 23rd
3:50 PM which takes me to inbound
3:50 PM https://treeherder.mozilla.org/#/jobs?repo=mozilla-inbound&revision=72c8ea085d7227204666b528d001f34c1cf4b113
3:50 PM now that says Sep. 13th
3:50 PM <philor> talk to jmaher and his heavy retrigger finger
3:51 PM he wants to know when his Linux talos broke, so he ran around fifty billion retriggers, and various pushes to try
3:52 PM <armenzg> philor: I think I should make pulse_actions takes Paypal payments after a certain threshold
philor> armenzg: yes! though charge RyanVM for that one, it's mostly chrome that he starred so he probably also retriggered
3:53 PM since nobody else would have seen it that much later
3:54 PM <armenzg> Armen Zambrano Gasparnian I wonder if we need to modify OF
3:54 PM hrmm
3:54 PM this is tough
3:54 PM <philor> ITYM "rewrite", that being what we do instead of fixing
3:55 PM <armenzg> Armen Zambrano Gasparnian philor: we should track this bug on the rewrite
3:55 PM it seems that we would want to ignore retriggers
3:55 PM wlach: do you have a document to track requirements for the OF rewrite?
3:55 PM maybe we should start one
3:56 PM for whoever ends up working on it
3:56 PM philor: but who starred the retries from old pushes?
3:56 PM <armenzg> Armen Zambrano Gasparnian or did auto-starring do that?
3:56 PM <philor> armenzg: same person who triggered them, no doubt
3:56 PM no, there is no such thing as auto-starring
3:57 PM <philor> there is only autoclassify, which does a thing distinct from starring
3:57 PM <armenzg> Armen Zambrano Gasparnian philor: but that would have meant that somebody backfilled over few weeks of pushes and starred those oranges?
3:57 PM I'm scared of the thought of taking such painful path
3:58 PM <philor> armenzg: yes, that is far far far less surprising than they they would retrigger the piss out of things and then leave them unstarred
3:58 PM <ekyle> armenzg: https://docs.google.com/document/d/1LB4Ppj55rw9IFC-ggQr6Vy0yx8Gq-c6rcD7aNt_0Ib4/edit
3:58 PM <philor> well, jmaher may well have, because he believes in "green or it was my failure", but whatever RyanVM was looking for, he would have been looking for a particular failure, so he would have wanted the things which were not it to go away
3:59 PM <armenzg> Armen Zambrano Gasparnian jmaher|afk: RyanVM|mtg did you guys retrigger a lot on the 22nd and starred the failures?
| Comment hidden (mozreview-request) |
Updated•9 years ago
|
Attachment #8795777 -
Attachment is obsolete: true
Attachment #8795777 -
Flags: review?(dustin)
Updated•9 years ago
|
Whiteboard: [stockwell infra]
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
Comment 34•8 years ago
|
||
This is not fixed and happens quite often recently.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
| Comment hidden (Intermittent Failures Robot) |
| Comment hidden (Intermittent Failures Robot) |
Comment 37•8 years ago
|
||
Hi! There is a pretty large increase in no. of failures for this bug in the last couple of hours. This is from central but encountered them also on autoland.
https://treeherder.mozilla.org/#/jobs?repo=mozilla-central&filter-resultStatus=testfailed&filter-resultStatus=busted&filter-resultStatus=exception&fromchange=cadd4b3c2bcb0ea8a0a82f37df184224ac16efc7&tochange=980d4dc65afca8ded0b22b6d95568e1a2f301346&selectedJob=171967746
Not sure what sheriffs need to do in these cases so I chose to comment on the bug and let people know about this issue. On the retriggers the jobs were green.
https://treeherder.mozilla.org/#/jobs?repo=autoland&revision=ab684911d52067d31f43986511da8d0c97fd6484&filter-resultStatus=testfailed&filter-resultStatus=busted&filter-resultStatus=exception&filter-resultStatus=success&filter-searchStr=xpc&selectedJob=171955698
Flags: needinfo?(jmaher)
Flags: needinfo?(catlee)
| Comment hidden (Intermittent Failures Robot) |
Comment 39•8 years ago
|
||
Well, there is something clearly broken in the content delivery on the TaskCluster (AWS) side:
> [task 2018-04-04T22:40:17.473Z] 22:40:17 INFO - Downloading packages: [u'target.common.tests.zip', u'target.mochitest.tests.zip'] for test suite categories: ['mochitest']
> [task 2018-04-04T22:40:17.474Z] 22:40:17 INFO - Downloading and extracting to /builds/worker/workspace/build/tests these dirs * from https://queue.taskcluster.net/v1/task/VdeVvSVLQQep9yV4J5x3Lw/artifacts/public/build/target.common.tests.zip
> [task 2018-04-04T22:40:17.475Z] 22:40:17 INFO - retry: Calling fetch_url_into_memory with args: (), kwargs: {'url': u'https://queue.taskcluster.net/v1/task/VdeVvSVLQQep9yV4J5x3Lw/artifacts/public/build/target.common.tests.zip'}, attempt #1
> [task 2018-04-04T22:40:17.475Z] 22:40:17 INFO - Fetch https://queue.taskcluster.net/v1/task/VdeVvSVLQQep9yV4J5x3Lw/artifacts/public/build/target.common.tests.zip into memory
> [task 2018-04-04T22:40:18.986Z] compiz (core) - Warn: Attempted to restack relative to 0x1400006 which is not a child of the root window or a window compiz owns
> [task 2018-04-04T22:40:22.233Z] 22:40:22 INFO - Content-Length response header: 282
> [task 2018-04-04T22:40:22.233Z] 22:40:22 INFO - Bytes received: 282
The file target.common.tests.zip cannot be 282 bytes in size!
John, who from the Taskcluster team could have a look at this? Do we have some AWS logs which might show what was going wrong at this time?
Flags: needinfo?(jmaher)
Flags: needinfo?(jhford)
Flags: needinfo?(catlee)
Comment 40•8 years ago
|
||
Sounds like bug 1364463 again.
| Comment hidden (Intermittent Failures Robot) |
Comment 42•8 years ago
|
||
Until we've moved to the new Artifact API, we cannot really do anything here other than delete bad objects as they are found. We'll be able to move things to the new API as soon as a few Queue PRs land. I will communicate broadly when this happens.
Flags: needinfo?(jhford)
| Comment hidden (Intermittent Failures Robot) |
Updated•8 years ago
|
Status: REOPENED → RESOLVED
Closed: 9 years ago → 8 years ago
Resolution: --- → WORKSFORME
You need to log in
before you can comment on or make changes to this bug.
Description
•