Closed Bug 1874471 Opened 8 months ago Closed 7 months ago

Possible race condition when demuxing video packets of new transceiver

Tracking

()

Status:

RESOLVED FIXED

Milestone:

125 Branch

Tracking Flags:

Tracking

Status

firefox125

---

fixed

People

(Reporter: deadbeef, Assigned: bwc)

Details

Attachments

(3 files)

mozlog_demux_error.txt 8 months ago Taylor Brandstetter 6.05 KB, text/plain		Details
Bug 1874471: Only use receive payload types in MediaPipelineFilter. r?mjf 7 months ago Byron Campen [:bwc] 48 bytes, text/x-phabricator-request		Details \| Review
Bug 1874471: Disable payload-type matching for filters that know an ssrc. r?mjf 7 months ago Byron Campen [:bwc] 48 bytes, text/x-phabricator-request		Details \| Review

Taylor Brandstetter

Reporter

Description

•

8 months ago

Attached file mozlog_demux_error.txt — Details

Steps to reproduce:

Set up a RTCPeerConnection with a sendonly video transceiver and perform an offer/answer exchange.
Apply a remote description adding a new recvonly video m= section containing an a=ssrc line (MID and RID header extensions not in use, demuxing by SSRC only). In the attached log this is SSRC 2804095052.
At the same time, or sooner, packets from this SSRC are arriving from the remote endpoint.

Actual results:

There appears to be a race condition when setting up the receive stream. From the attached log:

VideoConduit 7f1d6cd24900 (presumably representing the recvonly transceiver) sets up the receive stream as expected for SSRC 2804095052
VideoConduit 7f1d7b51fc00 (presumably the sendonly transceiver) processes a packet from SSRC 2804095052, and due to code that handles the "unknown ssrc (and ssrc-not-signaled case)", adopts this as its new remote SSRC
This unsets 2804095052 from VideoConduit 7f1d6cd24900, which generates a new remote SSRC.
VideoConduit 7f1d6cd24900 now fails to demux packets from 2804095052.

My guess is that, while there's only one video m= section, all packets are routed to it in order to handle unknown/unsignaled SSRCs. This presumably changes when the new remote description is applied containing a new video m= section, but packets may still be queued up in the first VideoConduit, which when processed unset the SSRC of the VideoConduit which is actually meant to be handling that SSRC.

We had the same exact problem in Chrome, which was quite a pain to work around: https://source.chromium.org/chromium/_/webrtc/src.git/+/15e078c574981597c5d6ecc13476f54e667dc568

It looks like it can be avoided by making sure there's never exactly one video m= section, but adding a superfluous m= section seems to be causing other issues which I'm still looking into.

Expected results:

Demuxing by SSRC works as expected.

Taylor Brandstetter

Reporter

Comment 1

•

8 months ago

Just wanted to clarify, in #2, the m= section is sendonly in the remote description, recvonly in the local description.

Michael Froman [:mjf]

Comment 2

•

8 months ago

Byron, any thoughts here?

Flags: needinfo?(docfaraday)

Byron Campen [:bwc]

Assignee

Comment 3

•

7 months ago

... so this is a scenario that uses bundle, which requires the use of SDP mid, but does not use RTP mid?

At the risk of being labelled a smart-aleck:

Patient: Doctor, it hurts when I do this!
Doctor: Then don't do that.

More seriously, I think that we must be seeing early arrival of that new RTP stream. That causes the filter for the first conduit to see a unique payload type, and learn the SSRC. Then, when we negotiate, the filter for the second conduit also will let that SSRC through. Lastly, because we're doing bundle without the MID RTP extension, we disable the ssrc switchover logic, because if we don't we'll just get random assignment with unsignaled SSRCs (this is what prevents us getting into an infinite back-and-forth ssrc switching).

I suppose we could add a "filter out everything because you're not negotiated to receive" bit on the filters.

Flags: needinfo?(docfaraday)

Taylor Brandstetter

Reporter

Comment 4

•

7 months ago

No, I agree, we should definitely be using the RTP MID extension. Unfortunately, given how things currently work, that's easier said than done...

Byron Campen [:bwc]

Assignee

Comment 5

•

7 months ago

Ok, I think I'm going to do a couple of things:

Disregard the negotiated payload types of non-receiving m-sections for the purposes of unique payload type filtering.
If a filter knows an ssrc, either from negotiation or learning it from RTP, we disable payload type matching for that filter. This will avoid incorrectly learning a new m-section's ssrc if the RTP arrives early.

Byron Campen [:bwc]

Assignee

Updated

•

7 months ago

Assignee: nobody → docfaraday

Byron Campen [:bwc]

Assignee

Comment 6

•

7 months ago

https://treeherder.mozilla.org/jobs?repo=try&revision=9dd96fa754f6e060515f4aa10c71b37e375a50b7
https://treeherder.mozilla.org/jobs?repo=try&revision=a6cdfecb18a45545e06710ae2ff4311b6181429f
https://treeherder.mozilla.org/jobs?repo=try&revision=18fa9bdcf9e6702b5fa11b3fbf8ede9772b9d21d

Byron Campen [:bwc]

Assignee

Comment 7

•

7 months ago

Attached file Bug 1874471: Only use receive payload types in MediaPipelineFilter. r?mjf — Details

Byron Campen [:bwc]

Assignee

Comment 8

•

7 months ago

Attached file Bug 1874471: Disable payload-type matching for filters that know an ssrc. r?mjf — Details

Depends on D201766

Byron Campen [:bwc]

Assignee

Comment 9

•

7 months ago

Try looks good.

Could you grab a binary from one of those try pushes and see if it addresses the issue?

Flags: needinfo?(deadbeef)

Taylor Brandstetter

Reporter

Comment 10

•

7 months ago

I'm not sure what I'm missing, but I'm having trouble reproducing the issue in an isolated way. I have a certain integration test which reproduces it (at least 10% of the time), but am not able to run that on a custom browser... Would it be ok for me to verify when the next release comes around?

Flags: needinfo?(deadbeef)

Byron Campen [:bwc]

Assignee

Comment 11

•

7 months ago

I'm a little bit leary of landing a non-verified behavioral change the day before a soft freeze, so maybe this will go into the next nightly. If that helps you substantially, I can try an early uplift.

Pulsebot

Comment 12

•

7 months ago

Pushed by bcampen@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/d26e5bcff843 Only use receive payload types in MediaPipelineFilter. r=mjf https://hg.mozilla.org/integration/autoland/rev/a7847a74255d Disable payload-type matching for filters that know an ssrc. r=mjf

Cosmin Sabou [:CosminS]

Comment 13

•

7 months ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/d26e5bcff843
https://hg.mozilla.org/mozilla-central/rev/a7847a74255d

Status: UNCONFIRMED → RESOLVED

Closed: 7 months ago

status-firefox125: --- → fixed

Resolution: --- → FIXED

Target Milestone: --- → 125 Branch

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Quick Search

Possible race condition when demuxing video packets of new transceiver

Categories

(Core :: WebRTC: Signaling, defect)

Tracking

()

People

(Reporter: deadbeef, Assigned: bwc)

References

Details

Crash Data

Security

(public)

User Story

Attachments

(3 files)

Description

Comment 1

Comment 2

Comment 3

Comment 4

Comment 5

Updated

Comment 6

Comment 7

Comment 8

Comment 9

Comment 10

Comment 11

Comment 12

Comment 13

Attachment

General

Description

File Name

Content Type