Bugzilla

Reporter

Comment 1

•

15 years ago

Sorry, I should have said, doing a "search everywhere" locks up the mail client completely.  CPU drops and eventually returns so it is probably busy doing a lookup against my gloda db (global-messages-db.sqlite is 2.4GB), but there is no "progress" and it says "application not responding" for minutes at a time, probably not the intended behavior.  I'm not sure if this is a consequence of the 100%CPU issue, or just the size of the DB.

My system is a 2.53GHz Core2Duo running Windows XPSP3.  HD hardly sees any activity (some, but not solid) and total commit is under physical amount.

Assignee

Comment 2

•

15 years ago

I've noticed as well that gloda seems to be a little bit more in the way than it was after rkent's checkin, and the changes to reduce the amount of nsMsgHdr js garbage created. I notice it especially when I start up in the morning, and a 100 or more new messages are added to my inbox.

I don't know about the hang on exit - I've exited a lot of times when gloda is busy, and haven't had it hang. If you want to e-mail me the password to the zip file, I can look at it.

Kent James (:rkent)

Comment 3

•

15 years ago

(In reply to comment #2)
> I've noticed as well that gloda seems to be a little bit more in the way than
> it was after rkent's checkin

Are you saying that the checkin helped, and it since has gotten worse, or that it got worse when the checkin was done?

If the latter, which checkin do you mean?

Ludovic Hirlimann [:Usul]

Assignee

Comment 4

•

15 years ago

the former - initially, things were pretty good, but when I got back from my vacation this Monday, it seemed a bit worse.

Updated

•

15 years ago

Keywords: perf

Comment 5

•

15 years ago

so the real the issue/root cause is redownloaded/resynced messages?

Reporter

Comment 6

•

15 years ago

I don't know if I can answer that.  From a user perspective, background indexing should be just that, background and unnoticed.  I think that the redownloaded/resynced folders caused a surge of emails to be indexed and it caused the client to become unresponsive (and now that indexing is completed, the client seems to be behaving fairly normally), so if it is determined to be acceptable or at least unavoidable that the indexing causes noticeable/significant slowdown then I suppose you are correct.  

However, if it is intended to be nearly transparent to normal use, then I would say the root issue is that the code meant to do indexing in the background needs more work so that even when large mail stores are "discovered" by the indexing process.

However, the one issue that is definitely a problem is the 100% CPU when the client is taken offline in the middle of indexing.  Maybe some indexing subroutine is being hung/orphaned when going offline rather than properly closed.

I've had a few updates since this occurred, I'll see if I can take the client offline in the middle of the next indexing and see if the behavior occurs on smaller indexing jobs, or is some artifact of the larger indexing jobs.

Comment 7

•

15 years ago

My question in comment 5 was asked because "Yes" might make this a dupe of another bug. Versus indexing overhead. And "Search Everywhere" slowness is noted in another bug.

OTOH, 700+meg may be a better symptom of what's causing the slowness. What the size of your largest .msf files that would be getting updated?

Dan Mosedale (:dmosedale, :dmose)

Reporter

Comment 8

•

15 years ago

In order of descending sizes 13MB, 9MB, 7.5MB, etc.

Updated

•

15 years ago

Flags: blocking-thunderbird3? → blocking-thunderbird3+

Comment 9

•

15 years ago

need strategy to narrow this.

For Michael...
question #1 is whether this is only during search. 
if not, we can remove that from the summary
question #2 is your machine specs - memory, cpu, and do you see this with no other apps running

rkent, bienevenu - a thought, might it be nice if we could turn off indexing of new messages, without turning off gloda, so that we can eliminate indexing as a cause for issues such as this?  Or, can we approximate that by not syncing new messages?

Summary: Thunderbird 3pre is consuming 100%CPU (doing Gloda search?) making mail nearly unusable → Thunderbird 3pre high memory and consuming 100%CPU (doing Gloda search?) making mail nearly unusable

David Ascher (:davida)

Comment 10

•

15 years ago

Going offline turns off indexing (as a proxy for being on battery power).

Kent James (:rkent)

Comment 11

•

15 years ago

It would also be fairly easy to add something, say a hidden preference, that would "pause" gloda indexing.

Simon Paquet [:sipaq]

Updated

•

15 years ago

Whiteboard: [no l10n impact]

Reporter

Comment 12

•

15 years ago

So I turned off gloda for testing another bug (bug 506809), and I see that 100% CPU continues with it turned off.  This issue is not during a search (searching with standard to/cc/subject filter doesn't seem to have an effect).  

Currently on Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.1.4pre) Gecko/20090929 Shredder/3.0pre.  System is Windows XP (32bit) SP3, 2GB RAM, 2.53GHz Intel core2duo (Dell E6400), standard HDD.

I've just stopped and restarted Shredder, and it isn't doing it now, but was earlier today.  While in the 100% CPU condition, activity manager is blank, and using sysinternal filemon, NOTHING is being written to disk.  This activity was tested after a file->exit (all windows close but process is still running).  Currently waiting for activities that might kick off the run away process.

Summary: Thunderbird 3pre high memory and consuming 100%CPU (doing Gloda search?) making mail nearly unusable → Thunderbird 3pre high memory and consuming 100%CPU making mail nearly unusable

Ludovic Hirlimann [:Usul]

Reporter

Comment 13

•

15 years ago

I've found out how to recreate this.  CPU goes to 100% every time I select my drafts folder (saving a file to drafts does not do this), nothing appears in error console or activity console when I do this.  I'm out of time now, but I will startup up a IMAP:5 log tomorrow.  Is there any other logging I should turn on to help with this?

David Ascher (:davida)

Comment 14

•

15 years ago

Oh, cool.  Yes, you can turn on gloda logging.  In the Config Editor (Advanced prefs) turn on: mailnews.database.global.logging.console and mailnews.database.global.logging.dump

(.dump will go to stderr, so that works if you know how to run thunderbird from a shell/console)

Comment 15

•

15 years ago

Might be a good idea to imap:5,timestamp so we'll see if there's a timeout or something.

Reporter

Comment 16

•

15 years ago

Attached file IMAP:5, timestamp log of opening thunderbird, selecting drafts, then quitting. — Details

OK, attached is my IMAP:5, timestamp log, it is a password-protected .zip, please email me for the password, Wayne already has it though.  

I did set the mailnews.database.global.dump to true (.console was already true), and did a launch of tbird with >stderr.log, but it just created a 0 length file.

I launched the mail app with the drafts folder set as my last opened folder (so it would enter the error state as quickly as possible, I then did a file->exit at 15.45.30, waited a minute plus, and killed the process at 15.46.50.

Reporter

Comment 17

•

15 years ago

Further troubleshooting results:
On the positive side: reindexing "drafts" fixes the 100%CPU problem.
On the negative side: I have no good way of inducing this behavior, so I don't know if this was an offline copy issue, replay IMAP instruction issue, etc.

Comment 18

•

15 years ago

did you keep the bad .msf ?

Assignee

Comment 19

•

15 years ago

From the log, it looks like we're trying to set a flag on an invalid msg uid in the drafts folder (looks like a small negative number, probably having to do with an offline operation) - the imap protocol code seems to bail out before we talk to the server, but then subsequent attempts to do things with the drafts folder, like update its message counts via STATUS fail, because we think we're still trying to do something to the folder. I'm not sure why this would lead to 100% cpu, but I can try to simulate this in the debugger.

Reporter

Comment 20

•

15 years ago

I did find a copy of my drafts offline copy and .msf from yesterday, I closed
thunderbird, and replaced those copies into the IMAP folder, restarted client,
and still not seeing 100% CPU issue.  I would assume this is NOT a offline copy
issue (?), but could this be a folder replay issue?  If so, where are the
replay instructions so that I can restore those to see if they initiate the
issue? I have a copy of my entire profile directory from before reindexing.

Assignee

Comment 21

•

15 years ago

did you select the drafts folder when you did the experiment?

Whenever you select a folder, we try to playback offline operations - it looks like the .msf file has operations to mark a couple messages read, 0xFFFFFF7E and 0xFFFFFF80, which obviously don't exist in your drafts folder. 

We have a folder flag which tells us if the folder has any offline events to play back. This is stored in the .msf file, but also panacea.dat - you may need to restore panacea.dat from your copy of your profile, along with the .msf file.

Assignee

Comment 22

•

15 years ago

Both those logs show the same behavior with the drafts folder.

Assignee

Comment 23

•

15 years ago

I suspect the thread trying to store the sent flag on 0xFFFFFF7E is in an infinite loop, or trying to do a very large string operation (e.g., a length has gone negative). I'll try to reproduce it in the debugger.

Reporter

Comment 24

•

15 years ago

Attached file Panacea.dat, drafts, and drafts.msf causing issue. — Details

Bingo, readding back panacea and the drafts and drafts.msf caused the bug to reappear.  Attaching all three files to this bug, using same password as other attachments.

Assignee

Comment 25

•

15 years ago

thx, I've almost got this fixed, actually.

Assignee

Updated

•

15 years ago

Assignee: nobody → bienvenu

Status: NEW → ASSIGNED

Assignee

Comment 26

•

15 years ago

Attached patch proposed fix — Details — Splinter Review

Never minding how we ended up with those UIDs, since we should be handling UID's > 0x7FFFFFFF anyway, this fixes the problem for me.

Asking Neil for review since he might have some better ideas about how to convince all the relevant code that we want unsigned int and the string equivalents. nsTString.AppendInt(PRUint32, radix) is charmingly broken, from what I can see:

      inline void AppendInt( PRUint32 aInteger, PRInt32 aRadix = kRadix10 )
        {
          AppendInt(PRInt32(aInteger), aRadix);
        }

I'm going to try to make an xpcshell test case for these UID's > 0x7FFFFFFF - I think the fake server lets me assign UID's, so if I can just get JS to do the right thing, I should be able to do so.

Attachment #404123 - Flags: superreview?(neil)

Attachment #404123 - Flags: review?(neil)

Assignee

Updated

•

15 years ago

Target Milestone: --- → Thunderbird 3.0rc1

Assignee

Comment 27

•

15 years ago

Since these UID's are hard to recreate in the wild, I tweaked nsImapService as follows and used the debugger and a breakpoint to change the PC in order to recreate the issue and test the fix:

+++ b/mailnews/imap/src/nsImapService.cpp
@@ -1727,17 +1727,20 @@ nsresult nsImapService::DiddleFlags(nsIE
       urlSpec.Append('>');
       urlSpec.Append(messageIdsAreUID ? uidString : sequenceString);
       urlSpec.Append(">");
       urlSpec.Append(hierarchyDelimiter);
       nsCString folderName;
       GetFolderName(aImapMailFolder, folderName);
       urlSpec.Append(folderName);
       urlSpec.Append(">");
-      urlSpec.Append(messageIdentifierList);
+      if (PR_TRUE)
+        urlSpec.Append(messageIdentifierList);
+      else
+        urlSpec.Append("4294967166:4294967168");

Whiteboard: [no l10n impact] → [no l10n impact][has patch for review]

Assignee

Comment 28

•

15 years ago

this probably accounts for some percentage of the hangs on shutdown, though I doubt it's a very high percentage.

neil@parkwaycc.co.uk

Comment 29

•

15 years ago

Comment on attachment 404123 [details] [diff] [review]
proposed fix

Worse, external API only has one AppendInt which takes an int(!)

>-    // we don't need to null terminate currentKeyToken because atoi 
>+    // we don't need to null terminate currentKeyToken because strtoul 
>     // stops at non-numeric chars.
>-    curToken = atoi(currentKeyToken); 
>+    curToken = strtoul(currentKeyToken, nsnull, 10); 
Nit: might as well trim those trailing spaces ;-)

Attachment #404123 - Flags: superreview?(neil)

Attachment #404123 - Flags: superreview+

Attachment #404123 - Flags: review?(neil)

Attachment #404123 - Flags: review+

neil@parkwaycc.co.uk

Comment 30

•

15 years ago

The solution that works with the external API is to snprintf into a local temporary character array and append that to the string.

Assignee

Comment 31

•

15 years ago

fix checked in. I'll file a bug on the string classes just for fun...

Status: ASSIGNED → RESOLVED

Closed: 15 years ago

Resolution: --- → FIXED

Whiteboard: [no l10n impact][has patch for review] → [no l10n impact]

Assignee

Comment 32

•

15 years ago

bug 520366 filed for the string library issue.

Comment 33

•

15 years ago

Michael Baffoni, can  you confirm problem gone using RC1?

Severity: major → critical

Keywords: hang

Summary: Thunderbird 3pre high memory and consuming 100%CPU making mail nearly unusable → Thunderbird 3pre high memory and consuming 100% CPU making mail nearly unusable, need to handle UID's > 0x7FFFFFFF