Closed Bug 1885529 Opened 1 year ago Closed 1 year ago

Investigate increase in DNS queueing time with native HTTPS records

Categories

(Core :: Networking: DNS, defect, P2)

Unspecified
Windows 11
defect

Tracking

()

VERIFIED FIXED
126 Branch
Tracking Status
firefox-esr115 --- unaffected
firefox123 --- unaffected
firefox124 --- unaffected
firefox125 --- disabled
firefox126 --- fixed

People

(Reporter: valentin, Assigned: valentin)

References

(Blocks 1 open bug, Regression)

Details

(Keywords: regression, Whiteboard: [necko-triaged][necko-priority-queue])

Attachments

(4 files)

https://glam.telemetry.mozilla.org/firefox/probe/dns_native_queuing/explore?currentPage=1&os=Windows&process=parent&ref=20240308044528&timeHorizon=QUARTER&visiblePercentiles=%5B99.9%2C99%2C95%5D

This seems to only affect windows. It's unclear whether some HTTPS requests are hanging forever, or there's a bug that makes records wait in the queue without being resolved.

Set release status flags based on info from the regressing bug 1874464

Set release status flags based on info from the regressing bug 1874464

Attached file 1885529-data-review.md
Attachment #9394438 - Flags: data-review?(chutten)
Attachment #9394438 - Flags: data-review?(cboozarjomehri)

Comment on attachment 9394438 [details]
1885529-data-review.md

(I'll let Cameron take this one)

Attachment #9394438 - Flags: data-review?(chutten)

The issue here is only obvious on Windows, when network.dns.get-ttl=true
and network.dns.native_https_query=true.

GetTTL will cause another resolution to happen with a low priority,
but these don't get picked up unless
mActiveAnyThreadCount < MaxResolverThreadsAnyPriority()

This causes the DNS_NATIVE_QUEUING probe to increase since these
queries don't get picked up immediately.

Also decrementing mActiveAnyThreadCount for ByType records makes
sure it always has the correct value.

When calling MoveToAnotherPendig the record gets removed from the
pending queue, and InsertRecord is called. As InsertRecord again
increments mPendingCount, we need to decrement it when removing from
a pending queue to ensure it's accurate.

Depends on D206498

Pushed by valentin.gosu@gmail.com: https://hg.mozilla.org/integration/autoland/rev/2b8d47203a6b Also update nsHostResolver::mActiveAnyThreadCount in CompleteLookupByTypeLocked when resolving via native DNS r=necko-reviewers,jesup https://hg.mozilla.org/integration/autoland/rev/e633b6ff34e8 Make sure HostRecordQueue::mPendingCount is accurate r=necko-reviewers,jesup
Status: NEW → RESOLVED
Closed: 1 year ago
Resolution: --- → FIXED
Target Milestone: --- → 126 Branch

glam is broken, but I ran this in SQL https://sql.telemetry.mozilla.org/queries/98978/source#244273 and it shows us back to normal.

Status: RESOLVED → VERIFIED

Comment on attachment 9394438 [details]
1885529-data-review.md

Is there or will there be documentation that describes the schema for the ultimate data set available publicly, complete and accurate?

Yes.

Is there a control mechanism that allows the user to turn the data collection on and off?

Yes. This collection can be controlled through the product's preferences.

If the request is for permanent data collection, is there someone who will monitor the data over time?

Yes, vgosu@mozilla.com and necko@mozilla.com are responsible.

Using the category system of data types on the Mozilla wiki, what collection type of data do the requested measurements fall under?

Category 1, Technical.

Is the data collection request for default-on or default-off?

Default on for all channels.

Does the instrumentation include the addition of any new identifiers?

No.

Is the data collection covered by the existing Firefox privacy notice?

Yes.

Does the data collection use a third-party collection tool?

No.

Result: datareview+

Attachment #9394438 - Flags: data-review?(cboozarjomehri) → data-review+
Pushed by valentin.gosu@gmail.com: https://hg.mozilla.org/integration/autoland/rev/0ce4fc8cc12d Measure time spent in native HTTPS call r=necko-reviewers,jesup
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: