Closed Bug 1815253 Opened 2 years ago Closed 1 year ago

Instrument database loading cases

Categories

(Data Platform and Tools :: Glean: SDK, task, P2)

Product:

Component:

Type:

task

Priority:

P2

Severity:

--

Tracking

(Not tracked)

Status:

RESOLVED FIXED

People

(Reporter: chutten, Assigned: perry.mcmanis)

References

(Blocks 1 open bug)

Details

Attachments

(2 files)

[mozilla/glean] Bug 1815253 - Record what caused an RKV db load failure (#2677) 1 year ago BMO Github Automation 42 bytes, text/x-github-pull-request		Details \| Review
Data Review request 1 year ago Perry McManis [:perry.mcmanis] 2.97 KB, text/plain	travis_ : data-review+	Details

Chris H-C :chutten

Reporter

Description

•

2 years ago

rkv likely tells us the difference between "Tried to open the db and there wasn't one" and "Tried to open the db and it was broken" cases that result in Glean performing first run actions (generating client_id, resetting seq and first_run_hour, etc).

We should consider instrumenting these cases of first run so we can help diagnose what proportions of first runs aren't actually "first"

Travis Long [:travis_]

Updated

•

2 years ago

Assignee: nobody → pmcmanis

Priority: -- → P3

Chris H-C :chutten

Reporter

Comment 1

•

2 years ago

(( Could've sworn I wrote a comment about this already... ah well ))

We're looking to instrument three cases:

db isn't present (Glean starts afresh)
db is present, and is bad (Glean starts afresh)
db is present, and is good (Glean uses existing data)

I looked at this a couple weeks ago and rkv_new is the right place to look for Case 2, but for differentiating Cases 1 and 3 you'll need to use something like Path::exists.

Plus, there's the wrinkle that this is happening while opening the db. Glean doesn't exist, meaning you can't directly instrument this using e.g. set_sync. If you can get to the dispatcher, you might be able to dispatch something for after init's done... but otherwise, your data can't be added to a Glean that doesn't fully exist yet.

Perry McManis [:perry.mcmanis]

Assignee

Comment 2

•

2 years ago

Moving to p1 for myself

Flags: needinfo?(rpierzina)

Perry McManis [:perry.mcmanis]

Assignee

Updated

•

2 years ago

Priority: P3 → P1

Perry McManis [:perry.mcmanis]

Assignee

Comment 3

•

2 years ago

Implementation is in progress. RKV does give us a handy error that makes this a bit simpler.

Adding complexity is the need to persist the error (since well, there's no RKV storage if RKV got messed up) and getting that into a metric. Will update as I make progress.

Raphael Aurich [:raphael] UTC+01:00

Comment 4

•

2 years ago

Thank you, Perry!

Flags: needinfo?(rpierzina)

Raphael Aurich [:raphael] UTC+01:00

Updated

•

2 years ago

Blocks: 1816530

Perry McManis [:perry.mcmanis]

Assignee

Updated

•

2 years ago

No longer blocks: 1816530

Priority: P1 → P2

Perry McManis [:perry.mcmanis]

Assignee

Updated

•

2 years ago

Blocks: 1816530

Perry McManis [:perry.mcmanis]

Assignee

Updated

•

2 years ago

Depends on: 1820792

Alessio Placitelli [:Dexter]

Comment 5

•

2 years ago

Perry is this work complete?

Flags: needinfo?(pmcmanis)

Perry McManis [:perry.mcmanis]

Assignee

Comment 6

•

2 years ago

•

No, not the instrumentation.

I landed the code to correctly handle the error and make the behavior itself match what we described it as doing (and desired it to do). However, after discussing with Travis, it became apparent that actually plumbing this all the way up to being sent in a new error metric was a pretty meaty task and we decided to split the work off into this ticket for dealing with later.

Flags: needinfo?(pmcmanis) → needinfo?(alessio.placitelli)

Alessio Placitelli [:Dexter]

Comment 7

•

2 years ago

(In reply to Perry McManis [:perry.mcmanis] from comment #6)

No, not the instrumentation.

Thanks, please untake it if you're no longer planning on working on it :-) Consider bringing this up in the next SDK meeting for re-triage?

Flags: needinfo?(alessio.placitelli)

Perry McManis [:perry.mcmanis]

Assignee

Updated

•

2 years ago

Assignee: pmcmanis → nobody

Travis Long [:travis_]

Updated

•

2 years ago

Priority: P2 → --

Perry McManis [:perry.mcmanis]

Assignee

Updated

•

2 years ago

Assignee: nobody → pmcmanis

Priority: -- → P2

Perry McManis [:perry.mcmanis]

Assignee

Comment 8

•

2 years ago

Update: after discussing we have decided this is worth doing.

I will take it back and get it completed with help from Jan-Erik.

Travis Long [:travis_]

Comment 9

•

1 year ago

This might be worth adding any errors around trying to clear the database or write to it (or at least ensuring we have adequate instrumentation around these things already)

Perry McManis [:perry.mcmanis]

Assignee

Updated

•

1 year ago

Assignee: pmcmanis → nobody

Perry McManis [:perry.mcmanis]

Assignee

Updated

•

1 year ago

Priority: P2 → --

Travis Long [:travis_]

Updated

•

1 year ago

Priority: -- → P2

Perry McManis [:perry.mcmanis]

Assignee

Updated

•

1 year ago

Assignee: nobody → pmcmanis

BMO Github Automation

Comment 10

•

1 year ago

Attached file [mozilla/glean] Bug 1815253 - Record what caused an RKV db load failure (#2677) — Details

Perry McManis [:perry.mcmanis]

Assignee

Comment 11

•

1 year ago

Attached file Data Review request — Details

Attachment #9368167 - Flags: data-review?(tlong)

Travis Long [:travis_]

Comment 12

•

1 year ago

Comment on attachment 9368167 [details]
Data Review request

Data Review

Is there or will there be documentation that describes the schema for the ultimate data set in a public, complete, and accurate way?

Yes, through the metrics.yaml file and the Glean Dictionary.

Is there a control mechanism that allows the user to turn the data collection on and off?

Yes, through the data preferences in the application settings.

If the request is for permanent data collection, is there someone who will monitor the data over time?

Permanent collection to be monitored over time by pmcmanis@mozilla.com and glean-team@mozilla.com

Using the category system of data types on the Mozilla wiki, what collection type of data do the requested measurements fall under?

Category 1, Technical data

Is the data collection request for default-on or default-off?

Default-on

Does the instrumentation include the addition of any new identifiers (whether anonymous or otherwise; e.g., username, random IDs, etc. See the appendix for more details)?

No

Is the data collection covered by the existing Firefox privacy notice?

Yes

Does the data collection use a third-party collection tool?

No

Result

data-review+

Attachment #9368167 - Flags: data-review?(tlong) → data-review+

Perry McManis [:perry.mcmanis]

Assignee

Comment 13

•

1 year ago

A small update, we will be changing the name/description:

Name: rkv_load_error
Description: If there was an error loading the RKV database, record it.

All other aspects of this collection are identical.

Perry McManis [:perry.mcmanis]

Assignee

Updated

•

1 year ago

Status: NEW → RESOLVED

Closed: 1 year ago

Resolution: --- → FIXED

You need to log in before you can comment on or make changes to this bug.