Closed Bug 1044193 Opened 11 years ago Closed 11 years ago

crash in gray_rgb_convert mostly on Android ARM

Tracking

()

Status:

RESOLVED FIXED

Milestone:

mozilla34

Tracking Flags:

Tracking

Status

firefox31

---

wontfix

firefox32

fixed

firefox33

fixed

firefox34

---

fixed

b2g-v2.0

---

fixed

b2g-v2.1

---

fixed

fennec

---

People

(Reporter: kbrosnan, Assigned: mwu)

References

(
URL
)

Details

(Keywords: crash, regression, reproducible)

Crash Data

Attachments

(2 files, 1 obsolete file)

assert log 11 years ago Kelsey Gilbert [:jgilbert] 10.05 KB, text/plain		Details
Always close FDs on heap allocation path 11 years ago Michael Wu [:mwu] 908 bytes, patch		Details \| Diff \| Splinter Review
Always cleanup on heap allocation path 11 years ago Michael Wu [:mwu] 1.06 KB, patch	glandium : review+ Sylvestre : approval-mozilla-aurora+ lmandel : approval-mozilla-beta+	Details \| Diff \| Splinter Review

Kevin Brosnan [Ex-Mozilla]

Reporter

Description

•

11 years ago

This bug was filed from the Socorro interface and is report bp-aa2a8597-c144-4aaf-869e-097d52140711. ============================================================= I am able to reproduce this crash by opening the following page http://www.retronaut.com/2013/06/runnymede-eggs-for-graf-zeppelin/?utm_content=buffer614db&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer on a Nexus 4. https://crash-stats.mozilla.com/report/list?signature=gray_rgb_convert&range_value=7&range_unit=days 0 libxul.so gray_rgb_convert media/libjpeg/jdcolext.c 1 libxul.so sep_upsample media/libjpeg/jdsample.c 2 libxul.so process_data_simple_main media/libjpeg/jdmainct.c 3 libxul.so jpeg_read_scanlines media/libjpeg/jdapistd.c 4 libxul.so mozilla::image::nsJPEGDecoder::OutputScanlines(bool*) image/decoders/nsJPEGDecoder.cpp 5 libxul.so mozilla::image::nsJPEGDecoder::WriteInternal(char const*, unsigned int, mozilla::image::DecodeStrategy) image/decoders/nsJPEGDecoder.cpp 6 libxul.so mozilla::image::Decoder::Write(char const*, unsigned int, mozilla::image::DecodeStrategy) image/src/Decoder.cpp 7 libxul.so mozilla::image::RasterImage::WriteToDecoder(char const*, unsigned int, mozilla::image::DecodeStrategy) image/src/RasterImage.cpp 8 libxul.so mozilla::image::RasterImage::DecodePool::DecodeSomeOfImage(mozilla::image::RasterImage*, mozilla::image::DecodeStrategy, mozilla::image::RasterImage::DecodePool::DecodeType, unsigned int) image/src/RasterImage.cpp 9 libxul.so mozilla::image::RasterImage::DecodePool::DecodeJob::Run() image/src/RasterImage.cpp 10 libxul.so nsThreadPool::Run() xpcom/threads/nsThreadPool.cpp 11 libxul.so nsThread::ProcessNextEvent(bool, bool*) xpcom/threads/nsThread.cpp 12 libxul.so NS_ProcessNextEvent(nsIThread*, bool) xpcom/glue/nsThreadUtils.cpp 13 libxul.so mozilla::ipc::MessagePumpForNonMainThreads::Run(base::MessagePump::Delegate*) ipc/glue/MessagePump.cpp 14 libxul.so MessageLoop::RunInternal() ipc/chromium/src/base/message_loop.cc 15 libxul.so MessageLoop::Run() ipc/chromium/src/base/message_loop.cc 16 libxul.so nsThread::ThreadFunc(void*) xpcom/threads/nsThread.cpp 17 libnss3.so _pt_root nsprpub/pr/src/pthreads/ptthread.c

Brad Lassey [:blassey] (use needinfo?)

Comment 1

•

11 years ago

Milan, reproducible crash needs an assignee

tracking-fennec: ? → +

Flags: needinfo?(milan)

Milan Sreckovic [:milan] (needinfo for best results)

Comment 2

•

11 years ago

libjpeg was updated in 31, but this crash predates that. I imagine it'd be somewhere in our code anyway.

Kelsey Gilbert [:jgilbert]

Comment 3

•

11 years ago

I can also repro this.

URL: http://www.retronaut.com/2013/06/runn...

Kelsey Gilbert [:jgilbert]

Comment 4

•

11 years ago

Also repros on my Nexus 10.

Kelsey Gilbert [:jgilbert]

Comment 5

•

11 years ago

Attached file assert log — Details

Kelsey Gilbert [:jgilbert]

Comment 6

•

11 years ago

I get an assert different from the crash location, at least on my Nexus 10: 0x8298d1f4 in mozilla::image::CreateLockedSurface (vbuf=0x945f62e0, size=..., format=mozilla::gfx::B8G8R8A8) at /media/jgilbert/fast-ext4/dev/mozilla/central/image/src/imgFrame.cpp:55 GDB log is attached above.

Kelsey Gilbert [:jgilbert]

Comment 7

•

11 years ago

I don't have knowledge of imgFrame. Maybe :seth will know if this is bad?

Flags: needinfo?(seth)

Seth Fowler [:seth] [:s2h]

Comment 8

•

11 years ago

Michael, looks like Jeff is seeing an assert in code related to volatile buffers in imgFrame. (See comment 5) Could you take a look?

Flags: needinfo?(seth) → needinfo?(mwu)

Milan Sreckovic [:milan] (needinfo for best results)

Comment 9

•

11 years ago

Are these the volatile buffers for decoded images? If that's the case, Android is the only platform where we currently use that, having run into some regressions on other platforms (see bug 1046277 for a history of bugs that led us there), and it may be worth disabling this on Android as well, and fixing it together on all platforms? Jeff, since you can reproduce this, could you try making imgFrame::SetDiscardable() a no-op on Android as well, and see if that stops the crash?

Flags: needinfo?(milan) → needinfo?(jgilbert)

Michael Wu [:mwu]

Assignee

Comment 10

•

11 years ago

I think I know what's going on. This looks like a problem with the ashmem implementation of VolatileBuffer - it doesn't fall back on normal allocations properly when the kernel refuses to mmap a huge ashmem buffer.

Flags: needinfo?(mwu)

Michael Wu [:mwu]

Assignee

Comment 11

•

11 years ago

Guess that makes sense - we're trying to allocate 250+MB for an image here..

Milan Sreckovic [:milan] (needinfo for best results)

Comment 12

•

11 years ago

(In reply to Michael Wu [:mwu] from comment #10) > I think I know what's going on. This looks like a problem with the ashmem > implementation of VolatileBuffer - it doesn't fall back on normal > allocations properly when the kernel refuses to mmap a huge ashmem buffer. Nice catch.

Assignee: nobody → mwu

Flags: needinfo?(jgilbert)

Michael Wu [:mwu]

Assignee

Comment 13

•

11 years ago

Attached patch Always close FDs on heap allocation path (obsolete) — Details — Splinter Review

Going to send to try and then test on device.

James Willcox (:snorp) (jwillcox@mozilla.com) (he/him)

Comment 14

•

11 years ago

(In reply to Michael Wu [:mwu] from comment #13) > Created attachment 8466476 [details] [diff] [review] > Always close FDs on heap allocation path > > Going to send to try and then test on device. Nice catch

Michael Wu [:mwu]

Assignee

Comment 15

•

11 years ago

Attached patch Always cleanup on heap allocation path — Details — Splinter Review

I actually managed to load parts of the page with this. It doesn't always work, but it doesn't crash 100% now. This page requires about a gigabyte or more of memory to load all images successfully and I'm not sure jemalloc/Android is able to give us that on these 2GB ram devices.

Attachment #8466476 - Attachment is obsolete: true

Michael Wu [:mwu]

Assignee

Updated

•

11 years ago

Attachment #8467501 - Flags: review?(mh+mozilla)

Mike Hommey [:glandium]

Updated

•

11 years ago

Attachment #8467501 - Flags: review?(mh+mozilla) → review+

Michael Wu [:mwu]

Assignee

Comment 16

•

11 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/c607db1a32d0

Nigel Babu [:nigelb]

Comment 17

•

11 years ago

Does this failure look related to the push? https://tbpl.mozilla.org/php/getParsedLog.php?id=45225017&tree=Mozilla-Inbound

Michael Wu [:mwu]

Assignee

Comment 18

•

11 years ago

(In reply to Nigel Babu [:nigelb] from comment #17) > Does this failure look related to the push? > https://tbpl.mozilla.org/php/getParsedLog.php?id=45225017&tree=Mozilla- > Inbound Don't see a way this would've caused that.

Ed Morley [:emorley]

Comment 19

•

11 years ago

https://hg.mozilla.org/mozilla-central/rev/c607db1a32d0

Status: NEW → RESOLVED

Closed: 11 years ago

Resolution: --- → FIXED

Target Milestone: --- → mozilla34

Michael Wu [:mwu]

Assignee

Comment 20

•

11 years ago

Comment on attachment 8467501 [details] [diff] [review] Always cleanup on heap allocation path Approval Request Comment [Feature/regressing bug #]: Regression from bug 962670. [User impact if declined]: Pages with large images are more likely to crash. [Describe test coverage new/current, TBPL]: Tests cover basic functionality of this code, but don't cover this low memory fallback path. [Risks and why]: Minimal - fixes a fallback path used in low memory conditions. [String/UUID change made/needed]: None

Attachment #8467501 - Flags: approval-mozilla-aurora?

Kevin Brosnan [Ex-Mozilla]

Reporter

Updated

•

11 years ago

Blocks: 962670

status-firefox31: --- → affected

status-firefox32: --- → affected

status-firefox33: --- → affected

status-firefox34: --- → fixed

tracking-firefox32: --- → ?

tracking-firefox33: --- → ?

Keywords: regression

Michael Wu [:mwu]

Assignee

Updated

•

11 years ago

status-b2g-v2.0: --- → affected

status-b2g-v2.1: --- → fixed

Sylvestre Ledru [:Sylvestre]

Updated

•

11 years ago

status-firefox31: affected → wontfix

tracking-firefox32: ? → +

tracking-firefox33: ? → +

Sylvestre Ledru [:Sylvestre]

Comment 21

•

11 years ago

Comment on attachment 8467501 [details] [diff] [review] Always cleanup on heap allocation path Michael, 32 seems affected too. Why didn't you requested an uplift for beta too?

Attachment #8467501 - Flags: approval-mozilla-aurora? → approval-mozilla-aurora+

Flags: needinfo?(mwu)

Michael Wu [:mwu]

Assignee

Comment 22

•

11 years ago

Comment on attachment 8467501 [details] [diff] [review] Always cleanup on heap allocation path Sure. This should also be good for beta.

Attachment #8467501 - Flags: approval-mozilla-beta?

Flags: needinfo?(mwu)

Ryan VanderMeulen [:RyanVM]

Comment 23

•

11 years ago

https://hg.mozilla.org/releases/mozilla-aurora/rev/2e81f30a3b26

status-firefox33: affected → fixed

Brad Lassey [:blassey] (use needinfo?)

Updated

•

11 years ago

Blocks: 1036219

Lawrence Mandel [:lmandel] (use needinfo)

Comment 24

•

11 years ago

Comment on attachment 8467501 [details] [diff] [review] Always cleanup on heap allocation path Beta+

Attachment #8467501 - Flags: approval-mozilla-beta? → approval-mozilla-beta+

Michael Wu [:mwu]

Assignee

Comment 25

•

11 years ago

[Blocking Requested - why for this release]: This is a general stability fix that's going into Firefox 32 that should also help with B2G.

blocking-b2g: --- → 2.0?

Ryan VanderMeulen [:RyanVM]

Comment 26

•

11 years ago

https://hg.mozilla.org/releases/mozilla-beta/rev/e51295fe2c54

status-firefox32: affected → fixed

Ryan VanderMeulen [:RyanVM]

Comment 27

•

11 years ago

https://hg.mozilla.org/releases/mozilla-b2g32_v2_0/rev/e51295fe2c54

status-b2g-v2.0: affected → fixed

Michael Wu [:mwu]

Assignee

Updated

•

11 years ago

blocking-b2g: 2.0? → ---

Liz Henry (:lizzard) (relman/hg->git project)

Updated

•

11 years ago

Flags: qe-verify+

Adina Petridean QA [:apetridean]

Updated

•

3 years ago

Flags: qe-verify+

You need to log in before you can comment on or make changes to this bug.