Closed
Bug 303633
Opened 20 years ago
Closed 20 years ago
Profile manager reports all profiles as locked
Categories
(Core Graveyard :: Profile: BackEnd, defect)
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: MatsPalmgren_bugz, Assigned: MatsPalmgren_bugz)
References
Details
(Keywords: fixed1.8, regression)
Attachments
(2 files, 1 obsolete file)
73.35 KB,
text/plain
|
Details | |
2.25 KB,
patch
|
brendan
:
review+
roc
:
superreview+
asa
:
approval1.8b4+
|
Details | Diff | Splinter Review |
STEPS TO REPRODUCE
1. have your home directory NFS-mounted
2. start SeaMonkey (Linux/GTK2/i386)
ACTUAL RESULTS
Profile manager says all profiles are locked, even creating a new one and
choosing it is reported as locked.
If I start it using "HOME=/tmp seamonkey" then it works, so I guess this
has something to do with my HOME being NFS-mounted...
The regression occured 2005-08-02-03 -- 2005-08-03-04
Looking at the checkins I'm guessing it was caused by bug 151188
![]() |
||
Comment 2•20 years ago
|
||
This was not intended. It sounds as though file locking over NFS is still
busted for some scenarios. Mats, can you strace and post a log of system calls?
/be
Assignee | ||
Comment 3•20 years ago
|
||
My home directory is /home/mats
Assignee | ||
Comment 4•20 years ago
|
||
This includes sub-processes as well... I had to delete a large chunk at the
beginning and end to make it fit within the maximum attachment size 300kb.
Hopefully I kept the bit you were interested in.
Assignee | ||
Updated•20 years ago
|
Attachment #191797 -
Attachment is obsolete: true
Assignee | ||
Comment 5•20 years ago
|
||
With a debug build I get:
fcntl(F_SETLK) failed. errno = 13
###!!! ASSERTION: Could not get profile directory lock.: 'Error', file
nsProfile.cpp, line 1175
![]() |
||
Comment 6•20 years ago
|
||
(In reply to comment #5)
> With a debug build I get:
>
> fcntl(F_SETLK) failed. errno = 13
> ###!!! ASSERTION: Could not get profile directory lock.: 'Error', file
> nsProfile.cpp, line 1175
Great, EACCES ("Permission denied").
I ported Sun's original vnode-based NFS to SGI Irix almost 20 years ago. The
design decision to keep NFS stateless, which was in the end hopeless, meant that
the "lock manager" woud have to solve the obvious hard problem, later (unobvious
hard problems NFS ended up having to solve involved avoiding replay of mutating
operations due to its crappy UDP-based error control, via an "idempotency cache"
indexed by RPC transaction id).
Being stateless at one layer pushes problems up the stack; it can be a good
trade-off for the whole system, or it can be a shortcut for implementors working
at only the lower layer. With NFS I think the latter was more the case. The
gain of a simpler (nominally; Sun RPC in full is not simple) protocol was offset
by the complexity of recovering Unix-ish statefull semantics, which costs were
borne by many others, not by the NFS designers.
When lockd first appeared several years later, it was a bug-ridden hunk of junk.
The situation did not improve while I was paying attention, wherefore the whole
pid-signed symlink profile locking scheme that I created first in 1995 for
Netscape 2.
Anyway, is locking over NFS still just unreliable? Or is it possible that your
home directory is exported with some option preventing you from locking your own
files? If you can write to files in your NFS-mounted profile directory, you
ought to be able to lock them. Perhaps the problem is that you get EACCES
always for any attempt to lock a symlink?
We should be able to work around this, whatever it is, but it would be best to
understand the problem.
/be
Assignee | ||
Comment 7•20 years ago
|
||
(In reply to comment #6)
> When lockd first appeared several years later, it was a bug-ridden hunk of junk.
After some testing, it appears it still is.
My NFS server is a FreeBSD 5.3/i386.
My NFS client is a Linux 2.6/i386 (SuSE 9.3).
I can create files but not lock them with fcntl().
It turns out 'lockd' does not come for free when you enable the NFS
server and export filesystems on FreeBSD, you have to explicitly say,
(in /etc/rc.conf):
rpc_lockd_enable="YES"
rpc_statd_enable="YES"
in addition to:
nfs_server_flags="-u -t -n 2 -h 192.168.1.1"
nfs_server_enable="YES"
which I already had.
So after adding that and rebooting both the client and server my test
program reports that fcntl(F_SETLK) succeeds.
The problem is that now my KDE desktop (on the client) is not very happy,
I can login but no new windows appears and it eventually just freezes.
I have a second NFS server that runs Linux 2.4/i386 and
if I set my HOME to a directory there I get:
fcntl(F_SETLK) failed. errno = 37
(ENOLCK 37 /* No record locks available */)
This system seems to have started 'lockd' automatically so I don't know
why it gives an error, but since the error isn't EAGAIN or EACCES
SeaMonkey falls back to using "LockWithSymlink()" which works.
Assignee | ||
Comment 8•20 years ago
|
||
So after disabling my flakey 'lockd' again this is a workaround that works
for me. It does a "fcntl(F_GETLK)" before doing the real lock operation
and if it fails we return NS_ERROR_FAILURE instead.
Normally "fcntl(F_GETLK)" should succeed both when the file is locked or not,
so a failure would indicate that fcntl() isn't working as expected for this
file. The Linux man-page fcntl(3) says:
F_GETLK
On input to this call, lock describes a lock we would like to place
on the file. If the lock could be placed, fcntl() does not actually
place it, but returns F_UNLCK in the l_type field of lock and leaves
the other fields of the structure unchanged. If one or more incom
patible locks would prevent this lock being placed, then fcntl()
returns details about one of these locks in the l_type, l_whence,
l_start, and l_len fields of lock and sets l_pid to be the PID of
the process holding that lock.
F_SETLK
Acquire a lock (when l_type is F_RDLCK or F_WRLCK) or release a lock
(when l_type is F_UNLCK) on the bytes specified by the l_whence,
l_start, and l_len fields of lock. If a conflicting lock is held by
another process, this call returns -1 and sets errno to EACCES or
EAGAIN.
I think it resolves the ambiguity of EACCES for F_SETLK, what do you think?
Assignee | ||
Comment 9•20 years ago
|
||
Comment on attachment 191833 [details] [diff] [review]
fix?
The second hunk fixes an unrelated compilation warning.
![]() |
||
Comment 10•20 years ago
|
||
Comment on attachment 191833 [details] [diff] [review]
fix?
Thanks, this looks good to me. I can't believe lockd hasn't been fixed in all
these years and Unix/Linux rewrites.
/be
Attachment #191833 -
Flags: superreview?(roc)
Attachment #191833 -
Flags: review+
Attachment #191833 -
Flags: approval1.8b4?
![]() |
||
Updated•20 years ago
|
Flags: blocking1.8b4+
Comment on attachment 191833 [details] [diff] [review]
fix?
looks good. thanks for handling this
Attachment #191833 -
Flags: superreview?(roc) → superreview+
Updated•20 years ago
|
Attachment #191833 -
Flags: approval1.8b4? → approval1.8b4+
Assignee | ||
Updated•20 years ago
|
Assignee: roc → mats.palmgren
Assignee | ||
Comment 12•20 years ago
|
||
Checked in to trunk at 2005-08-08 19:40 PDT.
-> FIXED
Status: NEW → RESOLVED
Closed: 20 years ago
Resolution: --- → FIXED
Updated•9 years ago
|
Product: Core → Core Graveyard
You need to log in
before you can comment on or make changes to this bug.
Description
•