Closed Bug 927579 Opened 7 years ago Closed 7 years ago

crash [@ mlp_process] [@ tansig_approx]

Categories

(Core :: Audio/Video, defect)

x86_64
macOS
defect
Not set
critical

Tracking


RESOLVED FIXED
mozilla28
Tracking Status
firefox27 --- wontfix
firefox28 --- fixed
firefox-esr24 --- unaffected
b2g-v1.1hd --- unaffected
b2g-v1.2 --- wontfix
b2g-v1.3 --- fixed
b2g-v1.3T --- fixed
b2g-v1.4 --- unaffected

People

(Reporter: jruderman, Assigned: padenot)

References

Details

(Keywords: crash, sec-low, testcase, Whiteboard: [adv-main28+])

Attachments

(3 files)

No description provided.
Group: core-security
Attached file stack (gdb)
Attached file stack (ASan)
Passing this to MediaRecorder, libopus.
Please return to Web Audio if there is a problem with the output from the MediaStreamDestinationNode.
Component: Web Audio → Video/Audio
Rob -- As Karl says, this looks to be a bug in MediaRecorder (not Web Audio).  Since this is a sec bug, can you have a look and either take this bug yourself or find the right owner?  (Feel free to pass it back if the root cause problem is with Web Audio.)  Thanks.
Assignee: nobody → roc
Summary: WebAudio crash [@ mlp_process] [@ tansig_approx] → crash [@ mlp_process] [@ tansig_approx]
This also reproduces on Linux, at the same crash point.
Assignee: roc → rlin
I checked the AudioChunk and found that if ac.createDelay() is called with a non-zero parameter,
NotifyQueuedTrackChanges in the encoder gets a chunk with duration = 2.
createDelay(0) doesn't show this.
Is this behavior normal?
Hi Karlt, 
I checked the AudioChunks from the media stream and found that all chunks have duration = 2 and mChannelData size = 2 at 44100 Hz. Is that normal? If so, we may need somebody to fix the crash in libopus.
Flags: needinfo?(karlt)
Hi Timothy, 
Could you help check this libopus crash?
#0  0x00007ffff1f36aab in tansig_approx (x=-nan(0x400000))
    at /media/randy-lin/aa890c59-befb-4bc6-9138-def8ebeea61c/gecko_c2/media/libopus/src/mlp.c:81
#1  0x00007ffff1f36c0f in mlp_process (m=0x7ffff5244610 <net>, in=0x7fffbb39dd60, out=0x7fffbb39dc50)
    at /media/randy-lin/aa890c59-befb-4bc6-9138-def8ebeea61c/gecko_c2/media/libopus/src/mlp.c:100
#2  0x00007ffff1f35bd2 in tonality_analysis (tonal=0x7fffb74d6ff8, info_out=0x0, celt_mode=0x7ffff5244460 <mode48000_960_120>, 
    x=0x7fffbb39dfc0, len=480, offset=480, C=2, lsb_depth=24, downmix=0x7ffff1f1ce81 <downmix_float>)
Flags: needinfo?(tterribe)
Backing out bug 922247 (When encoding to Opus, resample the input to 48kHz if its sample rate is not suitable) makes it work well...
Flags: needinfo?(karlt)
Paul, can you look at this given the patch for bug 922247 is implicated?  Thanks
Flags: needinfo?(paul)
I am looking at it as we speak.
Assignee: rlin → paul
Flags: needinfo?(paul)
The problem seems to be that NaNs were being fed to the opus encoder, eventually causing a crash in tansig_approx(). The bug is now fixed in git. See: http://git.xiph.org/?p=opus.git;a=commitdiff;h=d6b5679

The crash was due to an out-of-bound read, so it's unlikely to be exploitable for anything worse than a crash. Also, it's probably a good idea to understand (and ideally fix) why the encoder was being fed NaNs in the first place.
(In reply to Jean-Marc Valin (:jmspeex) from comment #13)
> The problem seems to be that NaNs were being fed to the opus encoder,
> eventually causing a crash in tansig_approx(). The bug is now fixed in git.
> See: http://git.xiph.org/?p=opus.git;a=commitdiff;h=d6b5679

Paul, do you want to make a patch back-porting this fix and assign me as the reviewer?

> The crash was due to an out-of-bound read, so it's unlikely to be
> exploitable for anything worse than a crash. Also, it's probably a good idea
> to understand (and ideally fix) why the encoder was being fed NaNs in the
> first place.

I assume Paul's still looking at that, but yeah, libopus shouldn't crash if it's fed NaNs, either.
Flags: needinfo?(tterribe)
(In reply to Randy Lin [:rlin] from comment #7)
> I checked the AudioChunks from the media stream and found that all chunks
> have duration = 2 and mChannelData size = 2 at 44100 Hz. Is that normal? If
> so, we may need somebody to fix the crash in libopus.

If an AudioChunk directly from the DelayNodeEngine has duration = 2, then there is a bug there, but I assume this duration is from the resampler in the encoder.  Other streams may have duration = 2, I assume.
(In reply to Timothy B. Terriberry (:derf) from comment #14)
> (In reply to Jean-Marc Valin (:jmspeex) from comment #13)
> > The problem seems to be that NaNs were being fed to the opus encoder,
> > eventually causing a crash in tansig_approx(). The bug is now fixed in git.
> > See: http://git.xiph.org/?p=opus.git;a=commitdiff;h=d6b5679
> 
> Paul, do you want to make a patch back-porting this fix and assign me as the
> reviewer?

Yes, I'll do that.
 
> > The crash was due to an out-of-bound read, so it's unlikely to be
> > exploitable for anything worse than a crash. Also, it's probably a good idea
> > to understand (and ideally fix) why the encoder was being fed NaNs in the
> > first place.
> 
> I assume Paul's still looking at that, but yeah, libopus shouldn't crash if
> it's fed NaNs, either.

Of course, there is clearly a bug on the Gecko side here.
(In reply to Karl Tomlinson (:karlt) from comment #15)
> (In reply to Randy Lin [:rlin] from comment #7)
> > I checked the AudioChunks from the media stream and found that all chunks
> > have duration = 2 and mChannelData size = 2 at 44100 Hz. Is that normal? If
> > so, we may need somebody to fix the crash in libopus.
> 
> If an AudioChunk directly from the DelayNodeEngine has duration = 2, then
> there is a bug there, but I assume this duration is from the resampler in
> the encoder.  Other streams may have duration = 2, I assume.

An AudioChunk with duration = 2 is observed at the callback called directly from MediaStreamGraph. The Opus encoder queues up chunks and feeds them to libopus at once, so a duration of 2 shouldn't be a problem in theory; I'm guessing libopus doesn't like the data being fed to it.
(gdb) p mSourceSegment->mChunks.Length()
$37 = 22

(gdb) p iter
$31 = {
  mSegment = @0x7fffbf486bb0, 
  mIndex = 5
}

(gdb) p *(float*)((SharedBuffer*)mSourceSegment->mChunks[5].mBuffer)->Data()@128
$33 = {0 <repeats 127 times>, -nan(0x400000)}

(gdb) p *(float*)((SharedBuffer*)mSourceSegment->mChunks[6].mBuffer)->Data()@128
$38 = {-nan(0x400000) <repeats 128 times>}

(gdb) p *(float*)((SharedBuffer*)mSourceSegment->mChunks[7].mBuffer)->Data()@128
$39 = {-nan(0x400000) <repeats 128 times>}

and so on until mChunks[21].

It seems like we are getting NaNs from the graph somehow.
Setting sec-low as it sounds like this may just be a harmless crash.
Keywords: sec-low
Some news: this seems completely unrelated to MediaRecorder, and is pretty silent, in fact. It seems related to cycles and DelayNode, but I can't repro with the revision of the initial cycle handling. I've started bisecting; I'll have more info tomorrow.

Basically, we read some trash memory somewhere (I suspect the DelayNode, but I'm not sure), put it into an AudioChunk, and it walks its way through the graph and eventually reaches the MediaRecorder, where it crashes.

When the Opus patch lands, this will be non-security sensitive, I think. Worst case, we output some trash audio.
Paul told me he found the underlying bug here. But I guess he didn't get around to attaching a patch.
Found a similar crash when running this bug's testcase:
Bug 935774 - "Assertion failure: meta" in mozilla::MediaEncoder::GetEncodedData
try: https://tbpl.mozilla.org/?tree=Try&rev=13cbbbc439a4
==>
https://tbpl.mozilla.org/php/getParsedLog.php?id=30500102&tree=Try

16:10:29     INFO -   0  XUL!mlp_process [mlp.c:13cbbbc439a4 : 81 + 0xa]
16:10:29     INFO -      rbx = 0x00000001042b7978   r12 = 0x0000000115d4fdb0
16:10:29     INFO -      r13 = 0x00000001042b7900   r14 = 0x0000000000000019
16:10:29     INFO -      r15 = 0x0000000000000000   rip = 0x000000010336b12c
16:10:29     INFO -      rsp = 0x0000000115d4d5a0   rbp = 0x0000000115d4d780
16:10:29     INFO -      Found by: given as instruction pointer in context
16:10:29     INFO -   1  XUL!tonality_analysis [analysis.c:13cbbbc439a4 : 480 + 0x4]
16:10:29     INFO -      rbx = 0x0000000106fdb3f8   r12 = 0x0000000000000012
16:10:29     INFO -      r13 = 0x0000000115d4df10   r14 = 0x0000000000000014
16:10:29     INFO -      r15 = 0x0000000000000000   rip = 0x000000010336a8be
16:10:29     INFO -      rsp = 0x0000000115d4d790   rbp = 0x0000000115d4ff10
16:10:29     INFO -      Found by: call frame info
16:10:29     INFO -   2  XUL!run_analysis [analysis.c:13cbbbc439a4 : 627 + 0x3c]
This was fixed in Opus, but it seems like the fix hasn't gone into FF yet:
http://git.xiph.org/?p=opus.git;a=commitdiff;h=d6b5679
When can we merge this patch into FF?
iirc, this was a problem related to cycles and DelayNode, and we were grabbing the wrong input chunks at the stream level. I'll look into it again, but when I told roc I found the cause, I was wrong; the actual cause was deeper.
Also, this seems to happen because the graph in the testcase does not contain a node capable of producing audio.
(In reply to Paul Adenot (:padenot) from comment #26)
> Also, this seems to happen because the graph in the testcase does not
> contain a node capable of producing audio.

Can you still post the backport patch from comment 14?
I tried including the fix from comment 14 for
Bug 935774 - Assertion failure: meta in mozilla::MediaEncoder::GetEncodedData.

It avoids the failure in the test_mediarecorder_record_crash_audiocontext.html test case; result: https://tbpl.mozilla.org/?tree=Try&rev=3d6a880c5b47
See Also: → 945618
Is this fixed by bug 944538 which also fixed bug 945618?
Yep.
So can we close this? If not, what needs to be done to resolve it?

Things unclear to me: Tim asked about backporting the tansig fix to ff27, but bug 945618 etc. mark that version as wontfix.

Is there a bug for not feeding NaNs to the encoder in the first place?
Now there is: bug 962548.
Status: NEW → RESOLVED
Closed: 7 years ago
Resolution: --- → FIXED
Depends on: 944538
Target Milestone: --- → mozilla28
Flags: in-testsuite?
What versions of Firefox were affected by this?
Flags: needinfo?(paul)
This started to happen on 26, I think, since bug 842243 landed.
Flags: needinfo?(paul)
B2G RelMan isn't approving sec-lows for v1.1/v1.2 uplift.
Whiteboard: [adv-main28+]
Group: core-security