Closed Bug 403603 Opened 17 years ago Closed 14 years ago

Over One minute IMAP online copy or move retried forever and mail FLOODING!

Tracking

(blocking-thunderbird3.1 rc1+, thunderbird3.1 rc1-fixed)

Status:

RESOLVED FIXED

Tracking Flags:

Tracking

Status

blocking-thunderbird3.1

---

rc1+

thunderbird3.1

---

rc1-fixed

People

(Reporter: lenin, Assigned: Bienvenu)

References

Details

(Keywords: qawanted, Whiteboard: [ready to land])

Attachments

(3 files, 1 obsolete file)

OK! 17 years ago Reinin Oyama 2.33 KB, patch	Bienvenu : review-	Details \| Diff \| Splinter Review
IMAP4 Server log showing the bug in effect 16 years ago Paul Smith 4.04 KB, text/plain		Details
wip 14 years ago David :Bienvenu 33.89 KB, patch		Details \| Diff \| Splinter Review
proposed fix 14 years ago David :Bienvenu 34.04 KB, patch	standard8 : review+ standard8 : superreview+	Details \| Diff \| Splinter Review

Reinin Oyama

Reporter

Description

•

17 years ago

User-Agent:       Mozilla/5.0 (Windows; U; Windows NT 5.0; ja; rv:1.8.1.9) Gecko/20071025 Firefox/2.0.0.9
Build Identifier: HEAD

If IMAP online Copy or Move timeouted, thunderbird retries it forever.
And mail FLOODING!
And IMAP server is saturated.

Reproducible: Always

Steps to Reproduce:
1.
2.
3.
Actual Results:  
I made a patch!

Index: mailnews/imap/src/nsImapProtocol.cpp
===================================================================
RCS file: /cvsroot/mozilla/mailnews/imap/src/nsImapProtocol.cpp,v
retrieving revision 1.671
diff -u -8 -p -r1.671 nsImapProtocol.cpp
--- mailnews/imap/src/nsImapProtocol.cpp	9 Nov 2007 15:07:23 -0000	1.671
+++ mailnews/imap/src/nsImapProtocol.cpp	13 Nov 2007 11:04:01 -0000
@@ -311,17 +311,17 @@ static PRBool gFetchByChunks = PR_TRUE;
 static PRInt32 gMaxChunkSize = 40960;
 static PRBool gInitialized = PR_FALSE;
 static PRBool gHideUnusedNamespaces = PR_TRUE;
 static PRBool gHideOtherUsersFromList = PR_FALSE;
 static PRBool gUseEnvelopeCmd = PR_FALSE;
 static PRBool gUseLiteralPlus = PR_TRUE;
 static PRBool gExpungeAfterDelete = PR_FALSE;
 static PRBool gCheckDeletedBeforeExpunge = PR_FALSE; //bug 235004
-static PRInt32 gResponseTimeout = 60;
+static PRInt32 gResponseTimeout = 8400;
 static nsCStringArray gCustomDBHeaders;
 
 nsresult nsImapProtocol::GlobalInitialization()
 {
     gInitialized = PR_TRUE;
     nsresult rv;
     nsCOMPtr<nsIPrefBranch> prefBranch(do_GetService(NS_PREFSERVICE_CONTRACTID, &rv));
     NS_ENSURE_SUCCESS(rv, rv);
@@ -1596,18 +1596,23 @@ PRBool nsImapProtocol::RetryUrl()
   nsCOMPtr <nsIImapMockChannel> saveMockChannel;
   m_runningUrl->GetMockChannel(getter_AddRefs(saveMockChannel));
   ReleaseUrlState(PR_TRUE);
   nsresult rv;
   nsCOMPtr<nsIImapIncomingServer> imapServer  = do_QueryReferent(m_server, &rv);
   kungFuGripImapUrl->SetMockChannel(saveMockChannel);
   if (NS_SUCCEEDED(rv))
     imapServer->RemoveConnection(this);
-  if (m_imapServerSink)
-    m_imapServerSink->RetryUrl(kungFuGripImapUrl);
+  if (m_imapServerSink){
+    nsImapAction imapaction;
+    kungFuGripImapUrl->GetImapAction(&imapaction);
+    if(imapaction != nsIImapUrl::nsImapOnlineCopy && imapaction != nsIImapUrl::nsImapOnlineMove){
+        m_imapServerSink->RetryUrl(kungFuGripImapUrl);
+    }
+  }
   return (m_imapServerSink != nsnull); // we're running a url (the same url)
 }
 
 // ignoreBadAndNOResponses --> don't throw a error dialog if this command results in a NO or Bad response
 // from the server..in other words the command is "exploratory" and we don't really care if it succeeds or fails.
 void nsImapProtocol::ParseIMAPandCheckForNewMail(const char* commandString, PRBool aIgnoreBadAndNOResponses)
 {
     if (commandString)

Reinin Oyama

Reporter

Updated

•

17 years ago

Version: unspecified → Trunk

Magnus Melin [:mkmelin]

Comment 1

•

17 years ago

Please add the patch as an attachment (use the "Add an attachment" link). Then ask review by setting the r? flag of the patch to the email address of a suitable reviewer like David.

Assignee: nobody → bienvenu

Component: General → Networking: IMAP

Product: Thunderbird → Core

QA Contact: general → networking.imap

Reinin Oyama

Reporter

Comment 2

•

17 years ago

Attached patch OK! — Details — Splinter Review

David :Bienvenu

Assignee

Comment 3

•

17 years ago

Comment on attachment 294922 [details] [diff] [review]
OK!

We don't want to not retry online move/copies at all - we simply want to retry them once, which is what I thought the code did...

Attachment #294922 - Flags: review-

Dario Corti

Comment 4

•

17 years ago

I confirm the issue, if tried to copy a 180mb folder in another and I had to kill TB when the new folder was over 1gb..... (around 750 messages copied in 4k+ messages, over and over).

Paul Smith

Comment 5

•

16 years ago

Attached file IMAP4 Server log showing the bug in effect — Details

Paul Smith

Comment 6

•

16 years ago

We get this all the time as well. It's totally reproducible for us.

I've uploaded a server log showing the bug take effect.

It's only an extract as the same thing repeated itself over and over. It did it 20 times before I killed the server.

Description of log:
Session 0x28 started a copy at 9:48:54
At 9:49.55, Thunderbird started a new session (0x2a), got the message UIDs/flags, and started another copy at 9:49:57

At 9:50:43, session 0x28's copy finished

At 9:51:02, Thunderbird started a new session (0x2b), got the message details and started another copy at 9:51:03

At 9:52:07, Thunderbird started a new session (0x2d), got the message details and started another copy at 9:52:09

At 9:52:21, session 0x2a's copy finished


My thoughts:
- retrying a command like 'copy' is potentially not a good idea anyway. Even if you just retry it once, you end up with two copies of the messages if they both succeed. Commands like store/fetch etc are OK to retry as they have no side effects.

- retrying the command after it has received a successful response back from a previous attempt of the command is a bad idea.

- maybe the timeout should be dynamic - it will probably take longer to copy 1000 messages than it takes to copy 1 message. 

- maybe Thunderbird should break down copies into smaller 'chunks' as an option

Aaron Wormus

Comment 7

•

16 years ago

This bug is effecting me as well - each time I try to copy several email messages from one folder to another the emails are duplicated MANY times.

Wayne Mery (:wsmwk)

Comment 8

•

16 years ago

Reinin, do you also see this using trunk?  And if so, is there a way forward?

(bug 391853, bug 410692 are possible duplicates)

Nobody; OK to take it and work on it

Updated

•

16 years ago

Product: Core → MailNews Core

Michael Baffoni

Comment 9

•

15 years ago

I agree, bug 410692 appears to be a dupe of this one, please dupe it.

Please note my comments in 410692; with current TB3 capability of storing commands on a per folder basis, we are getting into a never-ending loop with the move/copy commands that timeout because they are never removed from the execute queue.

Status: UNCONFIRMED → NEW

Ever confirmed: true

Flags: blocking-thunderbird3?

Wayne Mery (:wsmwk)

Comment 10

•

15 years ago

bienvenu has targeted bug 410692 for 3.0b3 so not duping. Let's see if that fixes this one

Depends on: 410692

Mark Banner (:standard8)

Comment 11

•

15 years ago

(In reply to comment #10)
> bienvenu has targeted bug 410692 for 3.0b3 so not duping. Let's see if that
> fixes this one

If anyone can still reproduce on latest trunk (ftp://ftp.mozilla.org/pub/thunderbird/nightly/) please let us know asap.

Marking as not blocking as I believe bug 410692 and others I've seen have covered this already. Please re-request blocking if that's not the case (and provide steps to repeat).

Flags: blocking-thunderbird3? → blocking-thunderbird3-

Keywords: qawanted

Michael Baffoni

Comment 12

•

15 years ago

Well, it is different now.  My understanding of the fix is that it increased the timeout greatly (even dynamically ?) so that copies of emails (etc.) should no longer hit the timeout, and therefore shouldn't hit failure until the operation actually finished.

The timeout does seem longer, and now one copy does not seem to create many times the number of messages in a single copy/move operation.  However, I'm still seeing the long moves as a failure, so it is like it is still hitting the 60 second timeout, but it isn't immediately retrying the failed command.  With storage of the command in the IMAP folder, if I leave the folder, and come back to it, it will retry the operation.

So although now it is not _flooding_, I'm still seeing multiple copies created from a single move/copy operation (with revisits of the source folder).

Is this behavior sufficiently different to be it's own bug and "fix" this one, or do you want to keep this here?

Pehr Söderman

Comment 13

•

15 years ago

This bug is not dead yet. I just ran into it with 3.0b3. The worst part is that the tool that could previous clean up after these errors (Remove Duplicate Messages (alternative)) seems to trigger the bug also if the destination folder is in IMAP and the original number of duplicates large enough.

I am not sure what is the best way to handle this case, but my feeling is that to offer a list of pending IMAP commands (like for example mail.app does) and allow the user to cancel commands or let them run instead would be better. The current behavior is... Not acceptable. Reposting potentially dangerous commands are not a good idea.

Now, time to remove 1860000 messages from my mail server :(

Paul Smith

Comment 14

•

15 years ago

My view is that Thunderbird should simply not retry commands (especially those such as COPY, SEARCH, APPEND). By all means, tell the user that the action is taking a long time, but an automatic silent retry is inappropriate.

The only reason a retry WOULD be appropriate is if the IMAP protocol allowed servers to ignore commands - which it doesn't. So, if Thunderbird sends a command, then either the server will act on it, report an error, or there's a bug with the server. In the first case, a retry is harmful, in the second case, Thunderbird will (or at least should) tell the user the error, and in the third case, it should just tell the user that nothing is happening, and let the user decide what to do (which may involve them contacting the server vendor for help).

I'm not sure what the rationale behind the automatic retry is, but I can't think of a good enough reason for it.

Pehr Söderman

Updated

•

15 years ago

Flags: blocking-thunderbird3.1?

Dan Mosedale (:dmosedale, :dmose)

Comment 15

•

14 years ago

It's not clear to me from the comments that this is still happening on 3.0.x (though it might well be!).  If people here are still seeing it, please comment here in detail and renominate.

blocking-thunderbird3.1: --- → -

Flags: blocking-thunderbird3.1?

Pehr Söderman

Comment 16

•

14 years ago

Just tested. This bug is certainly alive and ready to destroy your IMAP folders in 3.0.1.

Pehr Söderman

Updated

•

14 years ago

Flags: blocking-thunderbird3.1?

Dan Mosedale (:dmosedale, :dmose)

Updated

•

14 years ago

blocking-thunderbird3.1: - → rc1+

Flags: blocking-thunderbird3.1?

David :Bienvenu

Assignee

Comment 17

•

14 years ago

Pehr, what steps are you using to reproduce the problem? Move/copy of a large message between imap folders on the same server? Or a lot of large messages? Are you getting a timeout error, or a server error?

Pehr Söderman

Comment 18

•

14 years ago

Running 3.0.3 now, against hMailServer 5.2.1-B361.

To reproduce the bug follow the following steps: 

1: Create an IMAP folder with too many messages to copy before the timeout (10 000 is more than enough at my system).
2: Create a new IMAP folder, using the same IMAP account.
3: Choose all the messages, move them to the new folder.
4: Thunderbird will be sluggish but work.
5: Wait...

Result:
In the background thunderbird will post new copy operations regularly, creating multiple copies of the original 10 0000 messages in the destination folder, but never removing them from the original folder.

Pehr Söderman

Comment 19

•

14 years ago

If I read the patch in Bug 409259 (which supposedly fixes the related bugs) correctly it increases the timeout to #messages/40 seconds for large amounts of messages (over 2400). It's not impossible that my IMAP server can't mannage an average 40/second as many of the mails in my archive are rather large. And once the bug triggers the first time the load on the server makes it certain the bug will trigger again.

Chris

Comment 20

•

14 years ago

This bug still exists in 3.0.4

When moving approx. 1000 messages from one folder to another, the copy operations are being continuously retried and the destination folder is filled with duplicates. Currently up to 19000.

Messages are of a variety of sizes 2k-1M, depending on attachment size.

This is a common operation for many users, as they may archive their incoming messages (as I do) to monthly subfolders.

David :Bienvenu

Assignee

Comment 21

•

14 years ago

I've hacked our fake imap server to simulate timeouts, so I can probably use that to work on the imap code not to do retries in this case, but I also have to worry about the offline imap playback code, to make the sure the offline playback operation is removed.

Unfortunately, our fakeserver code doesn't allow multiple connections or cleanup particular well after itself, which makes it a bit tricky to use for testing this.

David :Bienvenu

Assignee

Comment 22

•

14 years ago

Attached patch wip (obsolete) — Details — Splinter Review

This WIP does several things:

Makes nsImapProtocol::m_connectionStatus an nsresult, and propagates it back to the url listener OnStopRunningUrl call. This allows us to detect timeouts.

Makes imap not internally retry move/copies.

Propagates NO/BAD server responses back to OnStopRunningUrl via a new error code, NS_MSG_ERROR_IMAP_COMMAND_FAILED. In general, retrying commands that failed on the server is not useful (there are exceptions, e.g., temporary bad state on the server, but we can't know something is temporary.)

If an offline move/copy fails because of a timeout or a command error, remove the offline operation, so that we won't keep retrying it.

Extends the fake imap server to optionally timeout imap copy commands.

Adds an xpcshell imap move/copy test that times out.

Makes pseudo-offline playback requests hold a reference to the msg window. This allows the new imap move/copy timeout xpcshell test to pass in a msg window w/o having it get garbage collected out from under us (xpconnect's wrapper object was not getting add-reffed, so it got gc'd before the timeout fired).

I need to test this some more, and actually make the test fail. This is a little tricky because fake server doesn't allow multiple connections.

David :Bienvenu

Assignee

Updated

•

14 years ago

Whiteboard: [patch in progress]

David :Bienvenu

Assignee

Comment 23

•

14 years ago

Attached patch proposed fix — Details — Splinter Review

See prev comment for description of what this patch does. Most of the patch is simply making connectionStatus an nsresult, and propagating it around.

Attachment #441633 - Attachment is obsolete: true

Attachment #441819 - Flags: superreview?(bugzilla)

Attachment #441819 - Flags: review?(bugzilla)

David :Bienvenu

Assignee

Updated

•

14 years ago

Whiteboard: [patch in progress] → [has patch for review]

Mark Banner (:standard8)

Updated

•

14 years ago

Whiteboard: [has patch for review] → [has patch for review][eta for review Mon]

Dan Mosedale (:dmosedale, :dmose)

Updated

•

14 years ago

Whiteboard: [has patch for review][eta for review Mon] → [has patch; needs review Standard8]

Mark Banner (:standard8)

Updated

•

14 years ago

Attachment #441819 - Flags: superreview?(bugzilla)

Attachment #441819 - Flags: superreview+

Attachment #441819 - Flags: review?(bugzilla)

Attachment #441819 - Flags: review+

Mark Banner (:standard8)

Updated

•

14 years ago

Whiteboard: [has patch; needs review Standard8] → [ready to land]

David :Bienvenu

Assignee

Comment 24

•

14 years ago

fixed for trunk and 3.1 on 1.9.2 branch

Status: NEW → RESOLVED

Closed: 14 years ago

status-thunderbird3.1: --- → rc1-fixed

Flags: in-testsuite+

Resolution: --- → FIXED

You need to log in before you can comment on or make changes to this bug.