IRC Chat stuck on "Connecting..." after a network outage. Should show timeout
Categories
(Chat Core :: General, defect)
Tracking
(thunderbird_esr91+ fixed, thunderbird92 affected)
People
(Reporter: diomede979, Assigned: diomede979, Mentored, NeedInfo)
Details
(Whiteboard: [support])
Attachments
(1 file, 2 obsolete files)
1.91 KB, patch | clokep: review+ | wsmwk: approval-comm-esr91+
User Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/88.0.4324.104 Safari/537.36
Steps to reproduce:
Connect to an IRC server, then cut the Internet connection at the OS level (for example, by disabling the connection) until the IRC client detects the disconnection, and then reconnect.
Actual results:
The IRC connections get stuck in the "Disconnecting..." state. Every time this happens I have to force the reconnection manually.
Expected results:
As in older versions, a countdown to the reconnection should start after a disconnection.
Comment 1•4 years ago
Could not find a dupe or reproduce using 78.7.1 on Fedora 33 Workstation.
Connected to Chat.
Turned off my wireless connection.
Chat disconnected as expected.
Turned my wireless connection on.
Chat reconnected.
Should I reconnect while the application is still showing "Disconnecting..."?
If it isn't completely disconnected, why should it reconnect?
Comment 2•4 years ago (Assignee)
(In reply to WaltS48 [:walts48] from comment #1)
> Could not find a dupe or reproduce using 78.7.1 on Fedora 33 Workstation.
> Connected to Chat.
> Turned off my wireless connection.
> Chat disconnected as expected.
> Turned my wireless connection on.
> Chat reconnected.
> Should I reconnect while the application is still showing "Disconnecting..."?
> If it isn't completely disconnected, why should it reconnect?
In my case, the application gets stuck in the "Disconnecting..." state, even when the network comes back.
In older versions, a countdown starts, and when it expires the connection is attempted again.
Comment 3•4 years ago
I'm connecting to chat.freenode.net using IRC.
I tested with 68.11.0 on Linux and did see that countdown, and it reconnected when I either removed my wireless dongle and reinserted it or just disconnected my user.
Testing with 78.7.1 on Windows 10 and disconnecting my wireless account.
When I reconnect the Wi-Fi account, I get an "Error: Peer reports it experienced an internal error" message in my Instant messaging status dialog and the account does not reconnect. I need to click the "Connect" button.
In the Error Console I see:
Bad certificate or SSL connection for (username)@chat.freenode.net:
Peer reports it experienced an internal error. irc.jsm:921
I removed my chat name from that error.
Do you see something similar?
Comment hidden (typo)
Comment 5•4 years ago (Assignee)
(In reply to WaltS48 [:walts48] from comment #3)
> I'm connecting to chat.freenode.net using IRC.
> I tested with 68.11.0 on Linux and did see that countdown, and it reconnected when I either removed my wireless dongle and reinserted it or just disconnected my user.
> Testing with 78.7.1 on Windows 10 and disconnecting my wireless account.
> When I reconnect the Wi-Fi account, I get an "Error: Peer reports it experienced an internal error" message in my Instant messaging status dialog and the account does not reconnect. I need to click the "Connect" button.
> In the Error Console I see:
> Bad certificate or SSL connection for (username)@chat.freenode.net:
> Peer reports it experienced an internal error. irc.jsm:921
> I removed my chat name from that error.
> Do you see something similar?
How do I show the status dialog? I don't have it.
Comment 6•4 years ago
In the Chat tab click the "Show Accounts" button.
Do you see the same message in the Error Console that I do about a bad certificate?
Comment 7•4 years ago (Assignee)
(In reply to WaltS48 [:walts48] from comment #6)
> In the Chat tab click the "Show Accounts" button.
> Do you see the same message in the Error Console that I do about a bad certificate?
No. But I guess it's just a problem related to the Freenode service. Try some other IRC server.
Comment 8•4 years ago
Which one are you having a problem with?
Comment 9•4 years ago (Assignee)
(In reply to WaltS48 [:walts48] from comment #8)
> Which one are you having a problem with?
I am having problems with all of them, for example: irc.azzurra.org
Comment 10•4 years ago
Well, I've already wasted enough of my time on this and have documented my results.
For review:
- I didn't find a duplicate.
- I have no problem reconnecting using Thunderbird release and beta on Linux.
- I need to click the "Connect" button in the Instant messaging status dialog using those Thunderbird versions on Windows 10.
- I never get stuck in "Disconnecting..." when manually forcing a disconnection.
I guess I'll have to wait for an outage and hope I'm using chat when it occurs.
Do you often have outages in your location?
Does it happen in safe mode, a test profile, with any AV applications disabled?
Comment 11•4 years ago (Assignee)
(In reply to WaltS48 [:walts48] from comment #10)
> Well, I've already wasted enough of my time on this and have documented my results.
> For review:
> - I didn't find a duplicate.
> - I have no problem reconnecting using Thunderbird release and beta on Linux.
> - I need to click the "Connect" button in the Instant messaging status dialog using those Thunderbird versions on Windows 10.
> - I never get stuck in "Disconnecting..." when manually forcing a disconnection.
> I guess I'll have to wait for an outage and hope I'm using chat when it occurs.
> Do you often have outages in your location?
> Does it happen in safe mode, a test profile, with any AV applications disabled?
That was my mistake: I get stuck on "Connecting...", not "Disconnecting...". In the past, every time a network outage occurred, a red countdown started. Now I do not see that countdown and the status gets stuck on "Connecting...", so I have to click "Disconnect" and then "Connect". That way it connects again. It happens with all of my configured servers.
Comment 12•4 years ago
Could not reproduce with chat.freenode.net in my testing of the 78.8.0 release candidate on Windows 10 or Linux.
Comment 13•4 years ago (Assignee)
(In reply to WaltS48 [:walts48] from comment #12)
> Could not reproduce with chat.freenode.net in my testing of the 78.8.0 release candidate on Windows 10 or Linux.
I have the problem on the 78.7.1 release.
Comment 14•4 years ago
Please reread comment #1, comment #3 and comment #10.
You have the problem.
I can't find any other reports.
You have a support issue not a bug IMHO.
It would be nice if you answered questions asked.
How often do you experience a network outage?
Comment 15•4 years ago
It could be interesting to take a look at the protocol logs (via "Show Accounts" on the "Chat" tab, then right click on the account and click "Copy Debug Log"). It is possible the server is sending odd data we're not expecting.
Note that this log could include "sensitive" data (it will contain at least the domain you're trying to connect to).
Comment 16•4 years ago (Assignee)
(In reply to Patrick Cloke [:clokep] from comment #15)
> It could be interesting to take a look at the protocol logs (via "Show Accounts" on the "Chat" tab, then right click on the account and click "Copy Debug Log"). It is possible the server is sending odd data we're not expecting.
> Note that this log could include "sensitive" data (it will contain at least the domain you're trying to connect to).
Connection reset.
[2/24/2021, 8:36:56 PM] DEBUG (@ prpl-irc: disconnect resource:///modules/socket.jsm:216)
Disconnect
[2/24/2021, 8:36:57 PM] DEBUG (@ prpl-irc: connect resource:///modules/socket.jsm:171)
Connecting to: apple.bnc4free.com:1339
[2/24/2021, 8:36:57 PM] DEBUG (@ prpl-irc: onTransportStatus resource:///modules/socket.jsm:560)
onTransportStatus(STATUS_RESOLVING)
[2/24/2021, 8:36:57 PM] DEBUG (@ prpl-irc: onTransportStatus resource:///modules/socket.jsm:560)
onTransportStatus(STATUS_RESOLVED)
[2/24/2021, 8:36:57 PM] DEBUG (@ prpl-irc: onStartRequest resource:///modules/socket.jsm:500)
onStartRequest
[2/24/2021, 8:36:57 PM] DEBUG (@ prpl-irc: onStopRequest resource:///modules/socket.jsm:509)
onStopRequest (2152398878)
Comment 17•4 years ago (Assignee)
After further investigation, I discovered that the behavior is caused by a network adapter named "vEthernet (Default Switch)", a component of Hyper-V on Windows 10.
It looks like Thunderbird, when the Internet connection drops, treats this adapter as a working Internet connection and gets stuck waiting to connect through it. This network adapter can't be disabled; it must be removed from the Windows components.
The following link has instructions on how to disable it:
(https://superuser.com/questions/1282014/how-to-remove-all-the-vethernet-default-switch-once-and-for-all).
I don't know if this is still a Thunderbird bug, in the sense that Thunderbird should be able to recognize a working Internet connection before trying to connect through it. But at least we now know what is causing the issue and how to work around it.
Comment 18•4 years ago
Thanks for clarifying. Let's close then.
Comment 19•4 years ago (Assignee)
I'm sorry, but this bug hasn't been resolved. I think that the "Connecting..." state should have a timeout. Right now it keeps this state
Comment 20•4 years ago (Assignee)
I investigated the cause of the issue.
This is the DEBUG log produced when you try to connect to a server while offline:
(Thunderbird 78.12.0 (20210712120515), Gecko 78.12.0 (20210712120515) on Windows NT 10.0; Win64; x64)
[8/7/2021, 5:48:10 PM] DEBUG (@ prpl-irc: connect resource:///modules/socket.jsm:171)
Connecting to: **********
[8/7/2021, 5:48:10 PM] DEBUG (@ prpl-irc: onTransportStatus resource:///modules/socket.jsm:560)
onTransportStatus(STATUS_RESOLVING)
[8/7/2021, 5:48:10 PM] DEBUG (@ prpl-irc: onTransportStatus resource:///modules/socket.jsm:560)
onTransportStatus(STATUS_RESOLVED)
[8/7/2021, 5:48:10 PM] DEBUG (@ prpl-irc: onStartRequest resource:///modules/socket.jsm:500)
onStartRequest
[8/7/2021, 5:48:10 PM] DEBUG (@ prpl-irc: onStopRequest resource:///modules/socket.jsm:509)
onStopRequest (2152398878)
According to https://searchfox.org/comm-central/source/suite/chatzilla/js/lib/connection-xpcom.js#9
the code 2152398878 in the DEBUG log above corresponds to "NS_ERROR_UNKNOWN_HOST = NS_ERROR_MODULE_NETWORK + 30".
The problem is here:
https://searchfox.org/comm-central/source/chat/modules/socket.jsm#460
The code needs to handle the "NS_ERROR_UNKNOWN_HOST" status, which occurs when a connection is attempted while offline.
I suggest starting a retry timer or disconnecting when this status occurs.
That way the connection will no longer stay stuck in the "Connecting..." state even after the machine goes back online.
Thank you.
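For readers who want to check the mapping between 2152398878 and NS_ERROR_UNKNOWN_HOST, here is a minimal sketch of how such a status value can be split into its parts. It assumes the usual Gecko nsresult layout (failure bit, module number offset by 0x45, 16-bit code) and that the network module is number 6; the helper name decodeNsresult is hypothetical and does not exist in the Thunderbird sources.

```javascript
// Assumed constants from the Gecko nsresult layout (not taken from this bug).
const NS_ERROR_MODULE_BASE_OFFSET = 0x45;
const NS_ERROR_MODULE_NETWORK = 6;

// Hypothetical helper: split a numeric status into failure bit, module and code.
function decodeNsresult(status) {
  const value = status >>> 0; // force an unsigned 32-bit value
  return {
    failed: (value & 0x80000000) !== 0,
    module: ((value >>> 16) & 0x1fff) - NS_ERROR_MODULE_BASE_OFFSET,
    code: value & 0xffff,
    hex: "0x" + value.toString(16).toUpperCase(),
  };
}

// The status printed by onStopRequest in the DEBUG log.
const decoded = decodeNsresult(2152398878);
console.log(decoded.hex, decoded.module === NS_ERROR_MODULE_NETWORK, decoded.code);
```

Under those assumptions the value decodes to the network module with code 30, which matches "NS_ERROR_MODULE_NETWORK + 30".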
Comment 21•4 years ago (Assignee)
Comment 22•4 years ago
Comment on attachment 9235847 [details] [diff] [review]
1690415-add-unknown-host.patch
I added Patrick as reviewer to let it go further.
Comment 23•4 years ago
diomede979 (aka :wallbroken), congratulations and thank you for offering your first patch after our mentoring sessions!
The patch looks formally correct at first glance, review will check if it does the right thing.
Comment 24•4 years ago
Comment 25•4 years ago
Comment 26•4 years ago
Updated the patch.
Comment 27•4 years ago
Comment 28•4 years ago
Now with correct check.
Comment 29•4 years ago
Comment 30•4 years ago
Pushed by mkmelin@iki.fi:
https://hg.mozilla.org/comm-central/rev/2675ff5a0d6b
Handle NS_ERROR_UNKNOWN_HOST error during connection by retrying. r=clokep
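The idea in the commit message, retrying when the connection stops with NS_ERROR_UNKNOWN_HOST, can be illustrated roughly as below. This is only a sketch of the technique, not the landed patch: handleStop, makeFakeAccount and the account callbacks (onClosed, scheduleReconnect, onError) are hypothetical names introduced for illustration, and the real logic lives in chat/modules/socket.jsm.

```javascript
// Sketch only: the names below are hypothetical, not the real socket.jsm API.
const NS_OK = 0;
const NS_ERROR_UNKNOWN_HOST = 0x804b001e; // 2152398878, as seen in the DEBUG log

// Decide what to do when the transport stops with `status`.
function handleStop(status, account) {
  const code = status >>> 0; // normalise to an unsigned 32-bit value
  if (code === NS_OK) {
    account.onClosed();
    return "closed";
  }
  if (code === NS_ERROR_UNKNOWN_HOST) {
    // Host lookup failed, which is what the log shows when offline:
    // schedule another attempt instead of staying in "Connecting...".
    account.scheduleReconnect();
    return "retry";
  }
  account.onError(code);
  return "error";
}

// Minimal fake account that records which callbacks fired.
function makeFakeAccount() {
  const calls = [];
  return {
    calls,
    onClosed: () => calls.push("closed"),
    scheduleReconnect: () => calls.push("reconnect"),
    onError: (code) => calls.push("error:" + code),
  };
}

const account = makeFakeAccount();
const outcome = handleStop(2152398878, account);
console.log(outcome, account.calls);
```

The point of routing the unknown-host status to a reconnect path is that the account returns to the normal reconnection countdown rather than waiting indefinitely.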
Comment 31•4 years ago
Time for uplift requests? NI'ing the reviewer since the assignee might not be familiar with the process.
Comment 32•3 years ago
Comment on attachment 9236257 [details] [diff] [review]
1690415-add-unknown-host.patch
[Approval Request Comment]
Regression caused by (bug #): N/A
User impact if declined: In some situations chat accounts might not reconnect properly.
Testing completed (on c-c, etc.): This has been on Daily for a few weeks and is in TB 93.0b1.
Risk to taking this patch (and alternatives if risky): I don't see how this can make the situation work -- it attempts to reconnect in more situations.
Comment 33•3 years ago
Comment on attachment 9236257 [details] [diff] [review]
1690415-add-unknown-host.patch
[Triage Comment]
Approved for esr91
Comment 34•3 years ago
> I don't see how this can make the situation work
I assume you mean "worse"
Comment 35•3 years ago
(In reply to Wayne Mery (:wsmwk) from comment #34)
> > I don't see how this can make the situation work
> I assume you mean "worse"
Yes, sorry about that!
Comment 36•3 years ago
bugherder uplift
Thunderbird 91.1.1
https://hg.mozilla.org/releases/comm-esr91/rev/90901e4e57c5
Comment 37•3 years ago
diomede979, please test; 91.1.1 is now available.