Closed Bug 1025913 Opened 11 years ago Closed 11 years ago

CacheIOThread hogging cpu

Tracking

()

Status:

RESOLVED FIXED

Milestone:

mozilla33

People

(Reporter: jrmuizel, Assigned: mayhemer)

References

Details

Attachments

(1 file, 1 obsolete file)

v1 11 years ago Honza Bambas (:mayhemer) 12.47 KB, patch	michal : review+	Details \| Diff \| Splinter Review
v1.1 11 years ago Honza Bambas (:mayhemer) 14.36 KB, patch	mayhemer : review+	Details \| Diff \| Splinter Review

Jeff Muizelaar [:jrmuizel]

Reporter

Description

•

11 years ago

Version: 32.0a1 (2014-06-05) Spins without ceasing with all of the time spent in PurgeByFrecency

Honza Bambas (:mayhemer)

Assignee

Comment 1

•

11 years ago

How often can you reproduce? Is it debuggable?

Honza Bambas (:mayhemer)

Assignee

Updated

•

11 years ago

Comment 3

•

11 years ago

See bug 1028415

Honza Bambas (:mayhemer)

Assignee

Comment 4

•

11 years ago

(In reply to Yuan Pengfei from comment #3) > See bug 1028415 I don't think it's related (these are actually separate bugs). The thing is that Jeff claims this loops at CacheStorageService::MemoryPool::PurgeByFrecency and not CacheFileIOManager::OverLimitEvictionInternal - those are totally unrelated. I'm concerned about but 1027028 that might indicate we do something wrong with removing entries from the pool (the frecency and exp time memory array actually). If there is something fishy, we may get to a situation the purging code loops. I've put this to the tree with a knowledge this could happen (Murphy's law). And here it is! There are two solutions: - try to find the true cause (there were a bug like this once, but it's well understood and fixed a very long time ago) - avoid the loop even there is a problem in the surrounding logic by some additional checks

Honza Bambas (:mayhemer)

Assignee

Comment 5

•

11 years ago

Oh, I think I got it. Hmm... it may happen that we open an existing (warmed) memory-only entry as a disk entry. That will switch the mUseDisk flag and hence the entry will not be able to remove itself from the correct pool... This needs some thinking first.

Honza Bambas (:mayhemer)

Assignee

Comment 7

•

11 years ago

Attached patch v1 (obsolete) — Details — Splinter Review

- constify the mUseDisk flag in CacheEntry (fix for this bug) - when there is a warmed disk entry for the context/url and we are opening it again as memory only, the warmed entry is doomed (replaced by a new memory-only one) - we also check for a disk file (when there were no warmed disk entry) when opening a new memory-only entry and doom the file ; it's then consistent with the case when there already has been a warmed entry https://tbpl.mozilla.org/?tree=Try&rev=705c869dcf88

Assignee: nobody → honzab.moz

Status: NEW → ASSIGNED

Attachment #8444137 - Flags: review?(michal.novotny)

Honza Bambas (:mayhemer)

Assignee

Updated

•

11 years ago

Blocks: 986179

Honza Bambas (:mayhemer)

Assignee

Comment 8

•

11 years ago

better try: https://tbpl.mozilla.org/?tree=Try&rev=6dbe2e07da2c

Honza Bambas (:mayhemer)

Assignee

Comment 9

•

11 years ago

(In reply to Honza Bambas (:mayhemer) from comment #8) > better try: > https://tbpl.mozilla.org/?tree=Try&rev=6dbe2e07da2c Bug 1005696 ?

Honza Bambas (:mayhemer)

Assignee

Updated

•

11 years ago

Blocks: 1029213

Honza Bambas (:mayhemer)

Assignee

Comment 10

•

11 years ago

(In reply to Honza Bambas (:mayhemer) from comment #9) > (In reply to Honza Bambas (:mayhemer) from comment #8) > > better try: > > https://tbpl.mozilla.org/?tree=Try&rev=6dbe2e07da2c > > Bug 1005696 ? Apparently: https://tbpl.mozilla.org/?tree=Try&rev=0b6907de0eed

Michal Novotny [:michal]

Comment 11

•

11 years ago

Comment on attachment 8444137 [details] [diff] [review] v1 Review of attachment 8444137 [details] [diff] [review]: ----------------------------------------------------------------- ::: netwerk/cache2/CacheEntry.cpp @@ +334,5 @@ > + // 1. When this is a disk entry and not told to truncate, check there is a disk file. > + // If not, set the 'truncate' flag to true so that this entry will open instantly > + // as a new one. > + // 2. When this is a memory-only entry, check there is a disk file. > + // If there is or could be, doom that file. I think this should be documented in nsICacheStorageService.idl. It isn't obvious that storing an entry to memoryCacheStorage could remove entries from diskCacheStorage. ::: netwerk/test/unit/test_cache2-07a-open-memory.js @@ +17,5 @@ > + asyncOpenCacheEntry("http://disk-first/", "disk", Ci.nsICacheStorage.OPEN_NORMALLY, null, > + // Must wait for write, since opening the entry as memory-only before the disk one > + // is written would cause NS_ERROR_NOT_AVAILABLE from openOutputStream when writing > + // this disk entry. > + new OpenCallback(NEW|WAITFORWRITE, "m2m", "m2d", function(entryD1) { Just a nit. The way you choose the metadata and data content is a bit chaotic. If the first letter 'm' in "m1m" and "m1d" should mean the first letter from "mem-first" then this metadata and data should be "d1m" and "d2d".

Attachment #8444137 - Flags: review?(michal.novotny) → review+

Honza Bambas (:mayhemer)

Assignee

Comment 12

•

11 years ago

Attached patch v1.1 — Details — Splinter Review

https://hg.mozilla.org/integration/mozilla-inbound/rev/b502c2fd110d

Attachment #8444137 - Attachment is obsolete: true

Attachment #8445926 - Flags: review+

Wes Kocher (:KWierso) (Not reading bugmail; email directly if needed)

Comment 13

•

11 years ago

https://hg.mozilla.org/mozilla-central/rev/b502c2fd110d

Status: ASSIGNED → RESOLVED

Closed: 11 years ago

Resolution: --- → FIXED

Target Milestone: --- → mozilla33

Gustavo Homem

Comment 14

•

11 years ago

I'm still seeing this on 32.0.3 PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND 11162 gustavo 20 0 1626m 711m 44m R 93.1 35.3 3924:18 Cache2 I/O Is this bug really fixed?

Honza Bambas (:mayhemer)

Assignee

Comment 15

•

11 years ago

(In reply to Gustavo Homem from comment #14) > I'm still seeing this on 32.0.3 > > PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND > > 11162 gustavo 20 0 1626m 711m 44m R 93.1 35.3 3924:18 Cache2 I/O > > Is this bug really fixed? I think there is another one. See bug 1064091. Are you able to provide some additional info that could help diagnose?

Gustavo Homem

Comment 16

•

11 years ago

I've looked at bug 1064091 but here I never got to the point of a hang. It's just the FF gets slower and slower until is unusable, keeps on using CPU even though I'm not doing anything while Facebook (the main CPU killer within Firefox) isn't open. It gets to a point where playing a low resolution (ex: 240p) youtube video becomes impossible. If I restart the browser than all is well again. This easily reproducible on a system that is tight on resources (an old 3 core AMD machine). If I run top and press shitf+H I see a thread called "Cache2 I/O" permanently using 100% CPU.

Paul Templeton

Comment 17

•

11 years ago

I made a comment on bug Bug 1064091 which could be related.

sam113101

Comment 18

•

11 years ago

Today I was using firefox and it started using 100% CPU. I launched htop and saw that cache2 was the culprit. It did not crash and did not significantly slow down. I'm using the official firefox 32 on fedora 20. I should also say that I tried to investigate this bug further, but quickly realized that the firefox profiler wasn't showing the usage for cache2.

Gustavo Homem

Comment 19

•

11 years ago

Currently I can browse for like one hour until the Cache threads begins hogging. @Paul Templeton: I added a related comment as well. Maybe these two bugs should me merged.

Paul Templeton

Comment 20

•

11 years ago

Just an update - Version 33.0 has fixed our problems - typical CPU <0.2% to 5% under load. I don't know what was changed since the last two versions but this one is stable in RDP

Gustavo Homem

Comment 21

•

11 years ago

After I upgraded to version 33 I'm no longer seeing the problem.

Ulrich Windl

Comment 22

•

10 years ago

I have the problem with 44.0 on Linux 64 bit ("Cache2 I/O" takes all the power of on CPU core (98.34%), and Firefox takes another 26.82%. All while being idle for minutes!). See my comment in bug 1085172.

Honza Bambas (:mayhemer)

Assignee

Comment 23

•

10 years ago

(In reply to Ulrich Windl from comment #22) > I have the problem with 44.0 on Linux 64 bit ("Cache2 I/O" takes all the > power of on CPU core (98.34%), and Firefox takes another 26.82%. All while > being idle for minutes!). See my comment in bug 1085172. Please open a new bug and provide us with an HTTP log when you are able to reproduce the problem: https://developer.mozilla.org/en-US/docs/Mozilla/Debugging/HTTP_logging Just please change NSPR_LOG_MODULES=timestamp,cache2:5 Thanks.

Flags: needinfo?(Ulrich.Windl)

Honza Bambas (:mayhemer)

Assignee

Updated

•

9 years ago

Flags: needinfo?(Ulrich.Windl)

You need to log in before you can comment on or make changes to this bug.