<a class="header-button" href="https://bugzilla.mozilla.org/home" title="Go to home page"> Bugzilla

Smokey Ardisson (offline for a while; not following bugs - do not email)

Assignee

Comment 1

•

18 years ago

Attached patch wip patch for windows/linux (obsolete) — Details — Splinter Review

WIP, works fine on Windows, didn't test Linux yet. I'm not quite sure how to do Mac, since we persist that value via interface builder.

Mike Schroepfer

Comment 2

•

18 years ago

We need to resolve throttling on server or client one way or another. I'm a fan of the server-side throttling - but then again I'm not doing any of the work :-).

Flags: blocking1.9? → blocking1.9+

Priority: -- → P2

Comment 3

•

18 years ago

(In reply to comment #2) > I'm fan of the server-side throttling By that you surely don't mean "don't accept (100 - X)% of reports sent in," right? If a user submits a crash report and then goes to file a bug (or comment on an existing bug) and then finds the crash he thought he reported was not reported, it's going to be 1) confusing and 2) dataloss of that crash data. 1) is bad because we want the reporter to be considered reliable; 2) is bad because we never know what user is going to have that key piece of data, and might cause Bugzilla churn closing bugs filed in anticipation of crash reports that don't exist. If that's not what you meant, than just ignore that paragraph ;) (In reply to comment #1) > I'm not quite sure how to do Mac, since we persist that value via interface builder. I'm not sure I follow; does that checkbox state not persist on other platforms?

Mike Schroepfer

Comment 4

•

18 years ago

(In reply to comment #3) > (In reply to comment #2) > > I'm fan of the server-side throttling > > By that you surely don't mean "don't accept (100 - X)% of reports sent in," > right? No - I mean store 100% of reports and process a subset. Storage and acceptance is cheap. Processing is expensive. > If a user submits a crash report and then goes to file a bug (or comment on an > existing bug) and then finds the crash he thought he reported was not reported, > it's going to be 1) confusing and 2) dataloss of that crash data. 1) is bad > because we want the reporter to be considered reliable; 2) is bad because we > never know what user is going to have that key piece of data, and might cause > Bugzilla churn closing bugs filed in anticipation of crash reports that don't > exist. I suggested that if you ever requested a particular crash id and it wasn't processed it would get added to the queue and processed. This was deemed as "hard". Again just my recommendations :-)

Smokey Ardisson (offline for a while; not following bugs - do not email)

Assignee

Comment 5

•

18 years ago

(In reply to comment #3) > I'm not sure I follow; does that checkbox state not persist on other platforms? Yes, but we handle it manually in the code. On Mac, it's just bound to "Shared Defaults.values.submitReport" from IB. Is there a way to detect when that value is unset and set it appropriately? Schrep: I think that server-side throttling would be nicer, but I do think we'll run into storage and management issues. I think this patch will be pretty low-impact, and allow people to easily opt-in, so that testers/developers shouldn't have to worry about having their reports sent. My only concern is that it might make it *too* easy to opt-in, so we'll still have to be prepared for a pretty high volume of reports.

Comment 6

•

18 years ago

(In reply to comment #5) > (In reply to comment #3) > > I'm not sure I follow; does that checkbox state not persist on other platforms? > > Yes, but we handle it manually in the code. On Mac, it's just bound to "Shared > Defaults.values.submitReport" from IB. Is there a way to detect when that value > is unset and set it appropriately? Presumably there's a nice way to do that from code, but you'd have to consult a real Cocoa-head. All IB really should be doing there is writing the state to the user defaults (and reading the defaults when launching), so, at worst case, you could defaults read org.mozilla.crashreporter submitReport and then defaults write org.mozilla.crashreporter submitReport [-bool|-int] the other value to manipulate your X% default (not really sure how that's being done, so my comment could be completely off in left field).

Assignee

Comment 7

•

18 years ago

Yeah, I realized that I could just do it manually the other day. I'll whip up a comprehensive patch in a bit.

Assignee

Comment 8

•

17 years ago

Attached patch complete patch [checked in] — Details — Splinter Review

Ok, this works on Mac as well.

Attachment #311590 - Attachment is obsolete: true

Attachment #314124 - Flags: review?(benjamin)

Updated

•

17 years ago

Attachment #314124 - Flags: review?(benjamin) → review+

Assignee

Comment 9

•

17 years ago

Comment on attachment 314124 [details] [diff] [review] complete patch [checked in] Checked this in. I guess we should use this option on Windows trunk, just to make sure it doesn't have any ill effects. I'll attach a patch for that in a sec.

Attachment #314124 - Attachment description: complete patch → complete patch [checked in]

Assignee

Comment 10

•

17 years ago

Attached patch set enable percent to 25% on fx-win32-tbox — Details — Splinter Review

This will only enable the "submit" checkbox by default 25% of the time on Windows builds. This only takes effect if there's no existing value, so existing nightly testers will not be affected, only new users. Note that you can still check the box at any time if it's unchecked to submit a report. (Or uncheck it if it's checked, of course.)

Attachment #314177 - Flags: review?(robert)

Robert Helmer [:rhelmer]

Comment 11

•

17 years ago

Comment on attachment 314177 [details] [diff] [review] set enable percent to 25% on fx-win32-tbox Should this be on for release builds too or just nightlies?

Attachment #314177 - Flags: review?(robert) → review+

Comment 12

•

17 years ago

For releases. Actually more important for releases than for nightlies.

Jesse Ruderman

Comment 13

•

17 years ago

>This will only enable the "submit" checkbox by default 25% of the time on >Windows builds. This only takes effect if there's no existing value, so >existing nightly testers will not be affected, only new users. Does this design mean it will be hard to change the percentage between, say, Firefox 3 and Firefox 3.0.1?

Comment 14

•

17 years ago

just to be clear, we want to try and get 100% of the crashes for all nightlies, betas, and even release candidates that will get downloaded less than a million or so times. when we get bits that we know will be pushed as "final" and could eventually be downloaded by tens of millons of users we want to flip some bit that allows us to throttle back to 10 or 25%. Using a build config flag seems and requiring a rebuild sounds like more work/risk than we might want to apply to bits that are in transition from an RC to Final.

Comment 15

•

17 years ago

RCs are RCs: we don't change the bits from RC to final. We discussed being able to set the percentage from a text file over IRC. That solution is not optimal because it re-uses a l10n file for other content. It is more practical to compile in the random percentage at this point. I don't think we need 100% of crashes for RCs in any case... the statistical sample sizes are already huge, so we don't gain much statistical knowledge with more users; and since any users can check the box to send a particular report, to help QA particular crashes, I don't think we're going to miss a lot.

Comment 16

•

17 years ago

Jesse, changing between 3 and 3.0.1 would involve a mozconfig change, but isn't "hard" or risky.

Assignee

Comment 17

•

17 years ago

I think what Jesse means is that because this is "sticky", once a user has crashed the value is saved, so changing the percentage won't change any existing users. We could tweak this slightly so that we only persist actual user-selected values, and let the random value always be random, or persist the random value separately from the user-selected value, but I don't know how useful that would be.

Assignee

Comment 18

•

17 years ago

Also, did we actually have a way to change the Talkback percentage via updates, or was it solely through the installer?

Comment 19

•

17 years ago

> I don't think we need 100% of crashes for RCs in any case... the statistical sample sizes are already huge, so we don't gain much statistical knowledge with more users; we are respinning 2.0.0.13 now because a top crash regression went undetected in release candidate and it turned out to be serious enough to require reaction. You're probably right that 3.0RC1,2,3... will have several million users, pretty good sample size, and long enough bake time to detect a possible topcrash ship blocker. The question is if 3.0.0.1-13 will have all those things working for them. Historically it has been tough to build in enough bake time and big enough sample size to spot top crash and other problems that create the need for respins of maintenance releases. > Also, did we actually have a way to change the Talkback percentage via updates, or was it solely through the installer? Talkback provided a protocol where the client would ask for instructions from the server when it started the submission process. One of the instructions the server could respond with is "shut yourself down and don't send reports for this release in the future", so in effect we could throttle post install and turn off clients where the data was no longer valuable.

Comment 20

•

17 years ago

> topcrash ship blocker. The question is if 3.0.0.1-13 will have all those > things working for them. Historically it has been tough to build in enough > bake time and big enough sample size to spot top crash and other problems that > create the need for respins of maintenance releases. How do you propose to fix that? RCs are bit-identical to the actual release (if we don't find blockers, we release those precise bits). > the server could respond with is "shut yourself down and don't send reports for > this release in the future", so in effect we could throttle post install and We have the same functionality in breakpad, see bug 412788

Comment 21

•

17 years ago

> How do you propose to fix that? RCs are bit-identical to the actual release (if we don't find blockers, we release those precise bits). here is the idea we have kicked around before. use the installer package naming convention that we have been using for the last couple of years.. have the installer run some code like if the name of the installer package matches "beta" "alpha" "pre" then no throttling else throttle

Comment 22

•

17 years ago

RC builds do not contain beta/alpha/pre in the filename: http://ftp.mozilla.org/pub/mozilla.org/firefox/nightly/2.0.0.14-candidates/rc1/

Comment 23

•

17 years ago

but they could (and should?) to avoid confusion on what they are, and make this work to our advantage for gathering the maximum amount of crash data.

Comment 24

•

17 years ago

Well, it's somewhat irrelevant to breakpad since we don't choose the random percentage at install-time, but rather at crash-time.

Michael Morgan [:morgamic]

Assignee

Comment 25

•

17 years ago

I checked the second patch in on trunk and the 'release' branch, so it should be set for RC1. We can back it out on trunk after it bakes without problems.

Status: NEW → RESOLVED

Closed: 17 years ago

Resolution: --- → FIXED

Comment 26

•

17 years ago

How did we arrive at 25%?

Michael Morgan [:morgamic]

Assignee

Comment 27

•

17 years ago

I don't recall. Anyway, I wonder if this is somewhat ineffective, given how easy it is to check the box. If we're still having scaling problems, we could do something more effective.

Comment 28

•

17 years ago

Well, we are getting 200K+ reports a day when we expected 60K a day, so something is messed up. Seems like 25% is arbitrarily picked anyway, so I was wondering if there's a statistical reason for that number. We should pick whatever is the minimum to have statistically significant samples. Have we discussed what that % would be?