Closed Bug 1328642 Opened 3 years ago Closed 3 years ago

Crash in FileClose

Categories

(Core :: DOM: Workers, defect, critical)

50 Branch
All
Windows
defect
Not set
critical

Tracking

RESOLVED FIXED
mozilla53
Tracking Status
firefox-esr45 51+ fixed
firefox50 --- wontfix
firefox51 --- fixed
firefox52 --- fixed
firefox53 --- fixed

People

(Reporter: marco, Assigned: bkelly)

References

Details

(Keywords: crash, csectype-uaf, sec-high, Whiteboard: [adv-main51+][adv-esr45.7+])

Crash Data

Attachments

(1 file, 1 obsolete file)

This bug was filed from the Socorro interface and is 
report bp-13855a66-0e50-4177-83a0-2fdf32170104.
=============================================================

(40.43% in signature vs 00.65% overall) address = 0xffffffffe5e5e5e5
These sorts of crashes should be filed as security bugs.
Group: core-security
I added the csectype-uaf keyword but forgot to mark the bug as core-security :(
Ben, can you look at these crashes and see if there's something we can do? Thanks.
No crash reports in FF53 yet.  Some small hope it's already been fixed.

So far nothing is jumping out at me unfortunately.  I might need to push some diagnostic asserts.
One weird thing is that the reports started Dec 28, 2016, but across a variety of releases.  Zero reports before Dec 28, but now we are getting them on FF47 to FF52.
Assignee: nobody → bkelly
Status: NEW → ASSIGNED
Also, some of the crashes don't go through dom/cache.  It appears to be an issue with nsFileInputStream closing files on Windows:

https://crash-stats.mozilla.com/report/index/91f698ca-c487-4573-bc49-bc5522161229
(In reply to Ben Kelly [:bkelly] from comment #5)
> One weird thing is that the reports started Dec 28, 2016, but across a
> variety of releases.  Zero reports before Dec 28, but now we are getting
> them on FF47 to FF52.

Which reports started on Dec 28?

There are reports earlier than Dec 28 if you restrict the search to product=Firefox and address contains 'e5':
https://crash-stats.mozilla.com/signature/?product=Firefox&address=~e5&signature=FileClose&date=%3E%3D2016-10-04T19%3A29%3A00.000Z&date=%3C2017-01-04T19%3A29%3A00.000Z#graphs

Did you restrict the search with other parameters?
I think we might need to push some diagnostic assertions to try to figure out what's going on.
(In reply to Marco Castelluccio [:marco] from comment #7)
> (In reply to Ben Kelly [:bkelly] from comment #5)
> > One weird thing is that the reports started Dec 28, 2016, but across a
> > variety of releases.  Zero reports before Dec 28, but now we are getting
> > them on FF47 to FF52.
> 
> Which reports started on Dec 28?

I just clicked on "more reports" link from the crash report in comment 0.  Sorry for my confusion.
BTW, my theory is that the CacheStreamControlChild is being UAF'd in some way.  Bug 1328686 will add some assertions that will try to show if this is the case.  Not linking the bug due to security issues, though.
Keywords: sec-high
I pushed some diagnostic asserts to m-i in bug 1328686.  I'll uplift them to aurora next week.
I was looking through FileClose crash comments and noticed this one referenced weather.com:

https://crash-stats.mozilla.com/report/index/7563555a-a6fd-48a9-b15c-101b82160819

Bug 1329989 is a UAF bug that is triggered by bughunter on weather.com.  Perhaps the issue over there is stomping on memory leading to this bug.
See Also: → 1329989
Duplicate of this bug: 1329447
Group: core-security → dom-core-security
Unfortunately this is still happening in FF51.0b14 which has the fix for bug 1329989:

https://crash-stats.mozilla.com/report/index/568dc20c-bee3-4680-a80b-d20ea2170116
I have a new theory:

1) The Cache API ReadStream is basically a wrapper around nsFileInputStream
2) There are places we read and close the ReadStream from an IO thread like StreamTransportService, etc.
3) Cache API also tries to close the nsFileInputStream on its owning thread under some circumstances
4) nsFileInputStream does not provide any cross-thread protection for its underlying mFD

So if we have a Cache ReadStream being read in a worker thread, we can get a situation where the stream is closed on the IO thread right when the worker thread is terminated and also tries to close the stream.  This could result in one of the two closes operating on garbage data.
In particular, we have a narrow window where nsFileStreamBase's mFD is non-null, but has already had PR_FreeFileDesc() called on it.  This internally frees its memory.

I'm going to see if I can add some locking to nsFileStream.  The tricky part is to avoid blocking the main thread.
Attached patch patch (obsolete) — Splinter Review
In the end I decided it made more sense to put the locking in the ReadStream.  This Cache API class is purposely designed to be accessed on an IO thread while simultaneously being closed on the owning thread.  Therefore the locking belongs in this object.  (It also makes this patch less risky than changing all file streams.)

I decided to make the mutex clearly scoped to only the mStream and mSnappyStream objects.  I did not try to pull in any of the other ReadStream state into the mutex.  It seemed safer to leave the current runnables and atomic<> variables used to handle that state.  Also, I wanted to minimize the chance for a recursive lock or other unexpected contention.

Overall this lock should have very little perf impact.  Typically there should only be one thread accessing the stream.  We only need to potentially block in the case where shutdown or a worker thread is forcing the stream to close.

https://treeherder.mozilla.org/#/jobs?repo=try&revision=cd58c73fc83cff3d7fb02c67df7f4c57ece56f13
Attachment #8827654 - Flags: review?(bugmail)
Oh, I also fixed a stupid bug where mHasEverBeenRead was not being initialized.
Comment on attachment 8827654 [details] [diff] [review]
patch

Review of attachment 8827654 [details] [diff] [review]:
-----------------------------------------------------------------

I also verified that this covers all uses of mStream/mSnappyStream in the file.  (Noting that of course NoteClosedRunnable and ForgetRunnable have their own logically separate mStream which refers to a ReadStream::Inner.)

::: dom/cache/ReadStream.cpp
@@ +285,5 @@
>    // stream ops can happen on any thread
> +  nsresult rv = NS_OK;
> +  {
> +    MutexAutoLock lock(mMutex);
> +    rv = mSnappyStream->Close();

Affirming that I see the change from mStream to mSnappyStream and that this makes sense and is a semantic improvement.
Attachment #8827654 - Flags: review?(bugmail) → review+
Updated with commit message.
Attachment #8827654 - Attachment is obsolete: true
Attachment #8827689 - Flags: review+
Attachment #8827689 - Attachment is patch: true
Comment on attachment 8827689 [details] [diff] [review]
Protect multi-thread stream access with a mutex. r=asuth

[Security approval request comment]
How easily could an exploit be constructed based on the patch?

The patch is clearly about protecting shared data from multi-threaded access.  With digging you could probably figure out mFD is a prime target here.

That being said, you also would need to figure out how to force two threads to access the stream simultaneously from content script.

You can get the ReadStream::Close() called on the owning thread by creating it in a worker thread and calling worker.terminate().  That is not too hard.

The harder part is getting the other thread to access it simultaneously.  You can try using Response.text(), although that does not call Close() automatically, so it may not trigger there.  You could try using ServiceWorker respondWith(), but that would be harder since you cannot then use worker.terminate().

It's probably possible, but it would take some effort and would still be quite racy.

Do comments in the patch, the check-in comment, or tests included in the patch paint a bulls-eye on the security problem?

The comments and commit message mention protecting against cross-thread access.  That's pretty clear from the code, though, since it's adding a Mutex.

Which older supported branches are affected by this flaw?

All branches with Cache API enabled.  This includes 45ESR even though service workers themselves are disabled there.  It may be harder to trigger without service workers to use with it, though.

If not all supported branches, which bug introduced the flaw?

All branches basically.

Do you have backports for the affected branches? If not, how different, hard to create, and risky will they be?

This code has not changed recently and should be relatively easy to backport.

How likely is this patch to cause regressions; how much testing does it need?

I think the risk is low.  I scoped the mutex locks to be as small as possible.  This code also has a good test suite and it passed a try build.

Also note that we won't really know if this fixes the problem until we get the patch into a high volume branch.
Attachment #8827689 - Flags: sec-approval?
Attachment #8827689 - Flags: approval-mozilla-release?
Attachment #8827689 - Flags: approval-mozilla-esr45?
Attachment #8827689 - Flags: approval-mozilla-beta?
Attachment #8827689 - Flags: approval-mozilla-aurora?
Attachment #8827689 - Flags: sec-approval? → sec-approval+
Comment on attachment 8827689 [details] [diff] [review]
Protect multi-thread stream access with a mutex. r=asuth

Another fix for service workers crash, let's take it.
Attachment #8827689 - Flags: approval-mozilla-release?
Attachment #8827689 - Flags: approval-mozilla-release+
Attachment #8827689 - Flags: approval-mozilla-esr45?
Attachment #8827689 - Flags: approval-mozilla-esr45+
Attachment #8827689 - Flags: approval-mozilla-beta?
Attachment #8827689 - Flags: approval-mozilla-beta+
Attachment #8827689 - Flags: approval-mozilla-aurora?
Attachment #8827689 - Flags: approval-mozilla-aurora+
I'll land this in the next hour or two.  Sorry I didn't see the flag approvals last night.
I had to rebase around my MOZ_DIAGNOSTIC_ASSERT changes to ReadStream.  I'm doing a local build and test before landing.
Apparently there are conflicts on ESR also.  I am checking it out to do the landing there as well.
https://hg.mozilla.org/mozilla-central/rev/c6e104aef85f
Status: ASSIGNED → RESOLVED
Closed: 3 years ago
Resolution: --- → FIXED
Target Milestone: --- → mozilla53
Whiteboard: [adv-main51+][adv-esr45.7+]
We still need to watch crash stats to verify this fixed it.
Group: dom-core-security → core-security-release
We've started to get crash reports for the new RC, but I don't see any instances of this crash.  So far so good.
Group: core-security-release