Closed Bug 506693 Opened 15 years ago Closed 11 years ago

SELinux is preventing JIT from changing memory segment access

Categories

(Core :: JavaScript Engine, defect)

defect
Not set
normal

Tracking

()

RESOLVED WONTFIX
Tracking Status
blocking2.0 --- -
status2.0 --- wanted
status1.9.2 --- ?
status1.9.1 --- ?

People

(Reporter: dveditz, Unassigned)

References

Details

Attachments

(2 files, 21 obsolete files)

177.32 KB, patch
Details | Diff | Splinter Review
265.76 KB, patch
Details | Diff | Splinter Review
Jan Lieskovsky from the Red Hat Security Response Team reported the following to Mozilla Security via mail:

While using the Firefox 3.5 version in Fedora 11
(firefox-3.5.1-1.fc11), we noticed the following
security problem, related to Firefox 3.5's new
JavaScript Just-In-Time (JIT) compiler (SELinux setroubleshoot message):

The problem:
===========

<cite>
Summary:

SELinux is preventing firefox from changing a writable memory segment
executable.

Detailed Description:

[SELinux is in permissive mode, the operation would have been denied but was
permitted due to permissive mode.]

The firefox application attempted to change the access protection of memory
(e.g., allocated using malloc). This is a potential security problem.
Applications should not be doing this. Applications are sometimes coded
incorrectly and request this permission. The SELinux Memory Protection Tests
(http://people.redhat.com/drepper/selinux-mem.html) web page explains how to
remove this requirement. If firefox does not work and you need it to work, you
can configure SELinux temporarily to allow this access until the application is
fixed. Please file a bug report
(http://bugzilla.redhat.com/bugzilla/enter_bug.cgi) against this package.

Allowing Access:

If you trust firefox to run correctly, you can change the context of the
executable to execmem_exec_t. "chcon -t execmem_exec_t
'/usr/lib/firefox-3.5/firefox'". You must also change the default file context
files on the system in order to preserve them even on a full relabel. "semanage
fcontext -a -t execmem_exec_t '/usr/lib/firefox-3.5/firefox'"

Fix Command:

chcon -t execmem_exec_t '/usr/lib/firefox-3.5/firefox'
</cite>

Analysis and argumentation:
===========================

Further investigation / review indicated that this is a potential security
issue related to the way the new Just-In-Time compiler is designed.
Here is the argumentation from Ulrich Drepper related to this issue:

<snip>
It's simply not acceptable that the program which is most susceptible to the
problems the Internet exposes a machine to is the one which doesn't follow the
rules for secure programming.  If the new JIT requires writable and executable
memory it either has to be rewritten (have two mappings for the same data: one
writable, one executable) or it has to be disabled. 
</snip>

Conclusion:
===========

This leads to the following proposed solution -- if the newest Mozilla Firefox 3.5
JavaScript JIT compiler needs to map the relevant memory area for both writing and
execution, it should do that via two separate mappings. As Ulrich
already pointed out, the current implementation is prone to security attacks,
as demonstrated in the wild by:

    https://bugzilla.mozilla.org/show_bug.cgi?id=503286
    (CVE-2009-2477 CVE-2009-2478 CVE-2009-2479)

Separating the 'write' and 'exec' memory mappings into two separate requests
could prevent the occurrence of such exploits / flaws in the future.

  Ulrich also recommends / provides sample code showing how this problem can be
  solved in a secure way:
    http://people.redhat.com/drepper/selinux-mem.html

Could you have a further look into this issue and fix / reimplement the relevant
parts of Mozilla Firefox 3.5's JIT compiler to address this deficiency?
Whiteboard: [sg:investigate]
The basic SELinux problem is that you can't map writable and executable at the same time, correct?

I don't see any reason this needs to be private: it's well known.
Could I ask for adding stransky@redhat.com to the CC list so he can see this?
(In reply to comment #2)
> Could I ask for adding stransky@redhat.com to the CC list so he can see this?

Done.
blocking1.9.1: ? → ---
status1.9.1: --- → ?
I'll note that I agree this should not be private.

Also, this bug will block Fedora 12 which will deny execmem by default, in SELinux enforcing mode.  We had to revert that feature temporarily to be able to ship our alpha, but the intention is to ship it for final.
Anyone there who can help with a patch to the JIT to get the better behaviour?
Assignee: general → gal
I think graydon's merge work will help this, but I assume we have to fix this for 3.5.x so we are going to have to patch the old fragmento. I have a patch for r/o code fragments somewhere in bugzilla.
Depends on: 473872
Yeah. I'm hesitant to accept Ulrich's reasoning on this. The theory is that someone who wants to attack us might be able to pull off said attack by finding a bug and exploiting it to write to executable memory (this much I agree with) but is then *not* going to be able to pull off the exact same attack when it's done via "dual mapping", writing their attack into the W mapping and then convincing the program to run it via the X mapping.

The argument rests on the belief that "twice random = more random". It's a nice thought, but it's also a lot of work to implement, and I don't see anything other than wishful thinking motivating it. Just this notion that "since it's more complicated, it'll be secure". If that were true, we could just throw in a few more randomized hashtables in the middle of our page-mapping data structures and declare ourselves secure.
I think we have a unique opportunity with traces that we can actually implement a read-only code cache, since traces are specialized at recording time and don't require on-the-fly patching like PICs do. The previous patch was pretty expensive because we used to have 4k pages and we had to set and revoke the execute bit for each page individually. With the new code allocator that should be less of an issue. What I still don't know how to solve is how to address concurrent code cache access. As long we keep one code cache per thread though, that should be fine.
Dan, I would like to unhide the bug to get more feedback. I am pretty sure it's widely known that our code cache is writable and executable, and I don't think this bug contains information that warrants hiding it.
For what it's worth, I agree completely with comment 7.  Executing dynamically generated code is what can get you into trouble; you still have that with the proposed workaround.

I also agree completely with comment 9.  Are there any other serious JIT engines which actually do the two-views-of-one-page hackaround?  I'm not aware of any, which makes me think anyone malicious is going to expect precisely what we do without having looked, and it's not exactly hard to look and verify that assumption in any case.
Group: core-security
Let me say one thing first:

getting writable and executable memory in any form is nowadays a privilege, not a right.

I think even today's rules as implemented in SELinux are too loose and should be tightened even further.

In this light those who insist on having writable memory have the obligation to do everything possible to justify the trust.  Reading Graydon's comments does exactly the opposite.

Of course using the double-mapping scheme doesn't solve the problem 100%.  Only disallowing writable&executable memory completely does it.  If this is what you prefer, fine, it's easy enough to implement in the OS.

But this is not in the best interest of everybody and therefore we have to mitigate the risk with all means.  The double mapping does help, as simple math tells you.  If an attacker can guess the address of a mapping with, let's say, 5% probability (very high), then guessing both addresses at the same time succeeds only 0.25% of the time.  That's a significant reduction.

If only one address is needed, an attacker can write position-independent code which just has to be injected.  On x86-64 we have an appropriate addressing mode (relative to %rip).  Using this makes it far too easy to write exploits.  If the writable region is elsewhere, this becomes much harder.

It is no extravagant request/demand to fix this.  It is just common sense.  Firefox is *by far* the most problematic application, security-wise.  5 out of 7 critical advisories for RHEL5.3 were for firefox (see Mark Cox's blog).  *Anything* that can help here, reducing the severity level or whatever, must be done.

The changes also shouldn't be that problematic, unless the code is truly chaotic:

- change buffer management to keep two buffers aligned in size
- wrap all places where code/data is injected
- do some pointer arithmetic before the actual poking

If there are ways in which we (= Red Hat) can help let any of us know.  We'll see what can be done.
Personally I am shocked security folks are floating the double mapped memory hack. It seems like false security through obscurity to me, with very little randomization, even if we accept your math above. In my little academic ivory tower, executable and writable bits shouldn't co-exist, and the kernel should enforce this. I would prefer implementing this strategy in ff, but unfortunately it seems to have a pretty significant performance cost (ballpark 5% last time I measured).

The root cause for this overhead is the fact that dynamically typed programming languages like JS require frequent code cache updates, either because code is rewritten on the fly (PICs, V8 says hello) or in case of TraceMonkey a lot of specialized read-only code fragments are assembled in very quick succession, interleaved with actual code execution. Flipping the read/write/exec bits every time we update the code cache seems impractical. The syscall takes a small eternity, not to mention the havoc that causes at the hardware level (TLBs and what not).

An additional complication I ran into is concurrent code generation. I am playing around with a background compiler thread, and revoking the execute bit becomes extremely tricky without tripping the foreground thread that runs code from the same code cache.

Both frequent code cache updates and concurrent updates would be overhead-free with the proposed double-mapped page approach, so I am willing to go that route as long as there is buy-in from our security folks that this is actually a valid fix for the writable code cache issue.

As for an actual implementation, it's not as easy as the 1. 2. 3. in the above post. Our code cache is chunked, and native code fragments can span multiple regions in multiple chunks. Figuring out for each region what the local mapping to the sibling page(s) is will not be trivial, especially if we want the offset to differ for each chunk (which we want for randomization, but also because address space fragmentation will likely not always allow getting additional pages at a constant offset).

Ed, what is your take on this?
(In reply to comment #12)
> The root cause for this overhead is the fact that dynamically typed programming
> languages like JS require frequent code cache updates, either because code is
> rewritten on the fly (PICs, V8 says hello) or in case of TraceMonkey a lot of
> specialized read-only code fragments are assembled in very quick succession,
> interleaved with actual code execution. Flipping the read/write/exec bits every
> time we update the code cache seems impractical.

Nobody suggests this.  With two mappings you can always have a writable segment.  It seems you haven't even tried to understand what I described in the page linked in the original report.
(In reply to comment #13)
> Nobody suggests this.  With two mappings you can always have a writable
> segment.  It seems you haven't even tried to understand what I described in the
> page linked in the original report.

*Andreas* suggests this, since as you say the current different-mapping limitation is a mitigation, not a solution to the problem.  It would actually solve the problem if we didn't have any memory that was concurrently mapped executable and writable (via one or multiple address regions, the attacker probably doesn't care), but unfortunately the performance characteristics of making such changes frequently aren't really viable.

It seems you haven't tried to understand the comment to which you replied?

Thanks for your offer of help -- I think that Graydon and Andreas can point you to the appropriate code, and I know they'd be willing to review a patch.
In case this is helpful for you (to identify the particular code parts),
this was reported in the relevant Fedora bug BZ#509945:

<cite>
This is where xulrunner traces back when executable/writable mappings are
disallowed:

Program received signal SIGSEGV, Segmentation fault.
nanojit::LirBufWriter::ins0 (op=<value optimized out>, this=<value optimized
out>, this=<value optimized out>, 
    op=<value optimized out>) at nanojit/LIR.cpp:315
315                     l->initOpcode(op);
Missing separate debuginfos, use: debuginfo-install firefox-3.5-2.fc12.i586
(gdb) bt
#0  0x00a3890d in nanojit::LirBufWriter::ins0 (op=<value optimized out>,
this=<value optimized out>, 
    this=<value optimized out>, op=<value optimized out>) from
/usr/lib/xulrunner-1.9.1/libmozjs.so
#1  0x00a0202f in RegExpNativeCompiler::compile (this=0xbfffa26c,
cx=0xb1ba2c00) at jsregexp.cpp:2411
#2  0x009f94aa in CompileRegExpToNative (re=<value optimized out>, cx=<value
optimized out>, fragment=<value optimized out>)
    at jsregexp.cpp:2475
#3  GetNativeRegExp (re=<value optimized out>, cx=<value optimized out>,
fragment=<value optimized out>)
    at jsregexp.cpp:2510
#4  MatchRegExp (re=<value optimized out>, cx=<value optimized out>,
fragment=<value optimized out>) at jsregexp.cpp:3922
#5  js_ExecuteRegExp (re=<value optimized out>, cx=<value optimized out>,
fragment=<value optimized out>)
    at jsregexp.cpp:4090
#6  0x009fff92 in regexp_exec_sub (cx=0xb1ba2c00, obj=<value optimized out>,
argc=1, argv=0xb15cd0c0, test=1, 
    rval=0xb15cd0b8) at jsregexp.cpp:4889
#7  0x00a00062 in regexp_test (cx=0xb1ba2c00, argc=1, vp=0xb15cd0b8) at
jsregexp.cpp:4911
#8  0x009c8d99 in js_Interpret (cx=0xb1ba2c00) at jsinterp.cpp:5147

Just debuginfo-install xulrunner and you'll see several instances of such
mappings:

/usr/src/debug/xulrunner-1.9.1/mozilla-1.9.1/js/src/nanojit/Assembler.cpp
/usr/src/debug/xulrunner-1.9.1/mozilla-1.9.1/js/src/nanojit/Assembler.h
/usr/src/debug/xulrunner-1.9.1/mozilla-1.9.1/js/src/nanojit/Nativei386.cpp
/usr/src/debug/xulrunner-1.9.1/mozilla-1.9.1/js/src/nanojit/avmplus.h

And a rather funny comment as well:

    #elif defined AVMPLUS_UNIX
            /**
             * Don't use normal heap with mprotect+PROT_EXEC for executable
code.
             * SELinux and friends don't allow this.
             */
            return mmap(NULL,
                        pages * kNativePageSize,
                        PROT_READ | PROT_WRITE | PROT_EXEC,
                        MAP_PRIVATE | MAP_ANON,
                        -1,
                        0);
    #else  

</cite>

Cc-ed also original Fedora reporter Lubomir Rintel to this bug.
Thanks, we know where the regions are coming from, it's all quite centralized. 

I wish I could say Ulrich's response "shocks" me, but it's what I knew we'd get here. I still don't think this will make us one iota safer: heapspray through the writable mapping (of the sort we were just exploited by) will still work, and %rip-relative addressing is a red herring (you can build PIC code if you can find a single GOT-derived address in a register or on the stack). But that's beside the point.

The plain fact is that one can't win an argument like this with Ulrich, so we really only have two options: do what he says, or close the bug and/or maintain a perpetual disagreement with him. I don't have the energy for the latter. It'll be less work to just accept his preference and implement it. Let's put this on the "post merge" list of feature work on the code allocator and be done with it.
I'm working on a patch based on the double-mapped pages. The page allocating/altering seems to work but I expect the code is generated as position dependent and generated relatively to the write-pages. Can you point me where I should look when I want to generate the code relatively to the execution pages?
(In reply to comment #17)
> I'm working on a patch based on the double-mapped pages. The page
> allocating/altering seems to work but I expect the code is generated as
> position dependent and generated relatively to the write-pages. Can you point
> me where I should look when I want to generate the code relatively to the
> execution pages?

This is going to be a long effort. Look in the backends, such as nanojit/Nativei386.cpp and its associated header nanojit/Nativei386.h. See, for example, the macro JMP, which composes a pc-relative jump instruction. This is based on the distance to its target, but its target may not be in the same contiguous chunk of memory / page. So if you're going to have each writable page correspond to an executable page, all such arithmetic is going to have to change to go indirectly through a mapping table that transforms "addresses in a writable page" to "addresses in an executable page".

This is why I said "it'll be a lot of work". The code might be position-independent (or it might be possible to *make* it position-independent) but it's certainly not the case that all code referencing into or out of page X is contained within page X. Or even "multi-page chunk X". Instructions are written repeatedly into tiny bits of memory scattered all over the CodeAlloc's pages. You're going to be doing a lot of book-keeping to map between each writable bit and its executable partner.
Attached patch WIP (obsolete) — Splinter Review
First work in progress version, applies to 1.9.1.12 and passes trace-test.js. Should work fine (just need to change the home dir at GCHeap::Alloc()).

A short description&discussion:

- Each Page has attached its executable mirror (exec pointer at PageHeader)

- write_to_exec() translates any address in a write page to the corresponding executable page. Page verification is managed by the PAGE_SIGNATURE hack (should we traverse the page list here? disable the check? or something else?)

- The calcOffset() macro calculates the offset in executable space. Note that the source address has to be decremented: some code is generated from the end of the page, so the effective jump offset is actually calculated relative to the next (but unrelated) page. So we have to differentiate the source & target addresses.

- CALL (c->_address) is not translated; it is supposed to be in the code segment only. Do we even use calls into generated code?

- CALLr(c,r) is ignored for now.

- GCHeap::Alloc() has to map some sane tmp file.
Assignee: gal → stransky
Why is the PAGE_SIGNATURE hack necessary? Is that intended for debugging only? Why would we ever touch a page we don't have to translate?
(In reply to comment #20)
> Why is the PAGE_SIGNATURE hack necessary? Is that intended for debugging only?
> Why would we ever touch a page we don't have to translate?

I used it for debugging, and because I don't know how the code is composed and whether all translated addresses / jump targets are from our page space (e.g. destinations of CALL()). It can be removed if we don't translate any foreign addresses.

From my current experience (run trace-test.js) the PAGE_SIGNATURE check is not necessary.
(In reply to comment #20)
> Why is the PAGE_SIGNATURE hack necessary? Is that intended for debugging only?
> Why would we ever touch a page we don't have to translate?

btw. The PAGE_SIGNATURE check could be simplified to check whether the executable page points to itself:

assert(pageExec->exec == pageExec);

Another option is to add a reference to the write page to the page header, so we can provide full exec <=> write translation (and checks) like so:

struct PageHeader
{
  struct Page *next;
  struct Page *write;
  struct Page *exec;
};

assert(pageWrite->write == pageWrite);
pageExec = pageWrite->exec;
assert(pageExec->exec == pageExec);

Anyway, I'd like to hear whether you agree with the overall concept. If so, I'll provide a polished version.
(In reply to comment #19)
> Created an attachment (id=399047) [details]
> WIP
> 
> First work in progress version, applies to 1.9.1.12 and passes trace-test.js.
> Should work fine (just need to change home dir at GCHeap::Alloc().

You're working on old code (perhaps 3.5 release series?)

We replaced the entire allocation system after that release, and it is no longer based on single-page allocations or a Fragmento. Please base your work on trunk, unless you're just hoping to backport to 3.5.
Oops, I just learned that the new allocator is not even on 1.9.2 / 3.6 branch yet. So we're ... still in transition. But shifting 1.9.2 to the new allocator is proposed. May or may not happen. Trunk definitely has switched, as has Adobe's code, and it's the interface we're hoping to use for the future.
Martin, I am looking through your patch. I will give you some feedback based on it. As Graydon pointed out we reworked the code allocator path, so we would have to port to that. 3.6 will be based off the old code afaik.
Looks like the new code allocator _will_ land for 3.6, so this makes reworking the patch for the new CodeAlloc even more important.
(In reply to comment #23)
> We replaced the entire allocation system after that release, and it is no
> longer based on single-page allocations or a Fragmento. Please base your work
> on trunk, unless you're just hoping to backport to 3.5.

Yes, we (Red Hat) really want it for Fedora 12/ff3.5. But I will definitely work on a patch for the trunk too.
Attached patch WIP v2 (for trunk) (obsolete) — Splinter Review
Here is the updated trunk version. 

CodeAlloc::freePage(), CodeAlloc::allocPage() manage the physical memory allocation and CodeAlloc::getPage(NIns* p) returns appropriate page (CodeList *) for p. Address translation is realised by CodeList::toExec().

(btw. If pagesPerAlloc = 1 we can drop the page traversal at CodeAlloc::getPage() and just round up the address like in the first patch)
Ping, any feedback here?
Sorry for the delay. I will review the patch.
Comment on attachment 402318 [details] [diff] [review]
WIP v2 (for trunk)

> 
> #ifdef NANOJIT_IA32
>     void Assembler::patch(SideExit* exit, SwitchInfo* si)
>     {
>         for (GuardRecord* lr = exit->guards; lr; lr = lr->next) {
>             Fragment *frag = lr->exit->target;
>             NanoAssert(frag->fragEntry != 0);
>-            si->table[si->index] = frag->fragEntry;
>+            si->table[si->index] = TO_EXEC(frag->fragEntry);


We are trying to get rid of macro magic in NJ. If there is no strong reason to use a macro here, please don't.

>         underrunProtect(si->count * sizeof(NIns*) + 20);
>         _nIns = reinterpret_cast<NIns*>(uintptr_t(_nIns) & ~(sizeof(NIns*) - 1));
>         for (uint32_t i = 0; i < si->count; ++i) {
>             _nIns = (NIns*) (((uint8*) _nIns) - sizeof(NIns*));
>-            *(NIns**) _nIns = target;
>+            *(NIns**) _nIns = TO_EXEC(target);
>         }

Why not adjust this outside the loop?

>+nanojit::CodeAlloc::allocCodeChunk(size_t nbytes, void *&exec) {
>+    char tmpfname[] = "/home/komat/.execmemXXXXXX";

What's the final plan for this? /tmp or env[HOME]? Are there any security implications here? Reliability issues? What if the file exists at startup? Make sure you understand all aspects of this (because I am not sure I do).

> #elif defined AVMPLUS_UNIX
>     // fixme: __clear_cache is a libgcc feature, test for libgcc or gcc
>     void CodeAlloc::flushICache(CodeList* &blocks) {
>         for (CodeList *b = blocks; b != 0; b = b->next) {
>+            // TODO -> translate to exec space?

Yeah, we should translate here for sure.

>             __clear_cache((char*)b->start(), (char*)b->start()+b->size());
>         }
>     }
> #endif // AVMPLUS_MAC && NANOJIT_PPC
> 
>     void CodeAlloc::addBlock(CodeList* &blocks, CodeList* b) {
>         b->next = blocks;
>         blocks = b;
>     }
> 

>+        /** return true if just this block contains p */
>+        bool contains(NIns* p, uintptr_t blockSize)  { return uintptr_t(p) >= uintptr_t(start()) && 
>+                                                              uintptr_t(p) <  uintptr_t(start()) + blockSize; }

Style. Use
foo
{
}          

>+
>+        bool hasExecPage() { return write != exec; }
>+
>+        /** converts address to exec page */
>+        NIns* toExec(NIns *p) 
>+        { 
>+            if(!p || !hasExecPage())
>+                return(p);

Can these cases really happen? Might be better to assert here and only hand in useful conversion requests.

>+
>+            assert(p >= write->start());
>+            return (NIns*)(uintptr_t(exec) + (uintptr_t(p) - uintptr_t(write)));
>+        }
>+
>+        /** converts address to write page */
>+        NIns* toWrite(NIns *p) 
>+        { 
>+           if(!p || !hasExecPage())
>+                return(p);

Dito here.

>+
>+            assert(p >= exec->start());
>+            return (NIns*)(uintptr_t(write) + (uintptr_t(p) - uintptr_t(exec)));
>+        }
>+   };
> 

Looks promising. Thanks for working on this. The next round we should ask someone from Adobe to review the patch.
> >         underrunProtect(si->count * sizeof(NIns*) + 20);
> >         _nIns = reinterpret_cast<NIns*>(uintptr_t(_nIns) & ~(sizeof(NIns*) - 1));
> >         for (uint32_t i = 0; i < si->count; ++i) {
> >             _nIns = (NIns*) (((uint8*) _nIns) - sizeof(NIns*));
> >-            *(NIns**) _nIns = target;
> >+            *(NIns**) _nIns = TO_EXEC(target);
> >         }
> 
> Why not adjust this outside the loop?

Oh, sure, it's just overlooked.

> >+nanojit::CodeAlloc::allocCodeChunk(size_t nbytes, void *&exec) {
> >+    char tmpfname[] = "/home/komat/.execmemXXXXXX";
> 
> Whats the final plan for this? /tmp or env[HOME]? Are there any security
> implications here? Reliability issues? What if the file exists at startup? Make
> sure you understand all aspects of this (because I am not sure I do).

Me neither but http://gcc.gnu.org/viewcvs/trunk/libffi/src/closures.c?revision=150042&view=markup contains some ideas how to manage it. And I believe Ulrich can help here too.

> >+
> >+        bool hasExecPage() { return write != exec; }
> >+
> >+        /** converts address to exec page */
> >+        NIns* toExec(NIns *p) 
> >+        { 
> >+            if(!p || !hasExecPage())
> >+                return(p);
> 
> Can these cases really happen? Might be better to assert here and only hand in
> useful conversion requests.

Sometimes offsets are NULL. And !hasExecPage() happens when the double-page design is not enabled (win and other arches).

> Looks promising. Thanks for working on this. The next round we should ask
> someone from Adobe to review the patch.

Thanks. If you agree with the overall concept, I'll move toward the x86_64 implementation and the correct tmpfname[].
(In reply to comment #32)
> Me neither but
> http://gcc.gnu.org/viewcvs/trunk/libffi/src/closures.c?revision=150042&view=markup
> contains some ideas how to manage it. And I believe Ulrich can help here too.

The libffi code is the best possible solution.  It covers all the bases.  There are systems where home cannot be used for mmap and others where /tmp is mounted with noexec.  There must be one place where mmap(PROT_EXEC) is possible and this code should find it.  I suggest you implement the same for moz.
We should make sure the patch does not cause any overhead for systems where this won't be enabled.
Attached patch v3 (obsolete) — Splinter Review
Third version, includes the dynamic file allocation, i386 only (for now).
Attachment #402318 - Attachment is obsolete: true
Attachment #405024 - Flags: review?(gal)
Comment on attachment 405024 [details] [diff] [review]
v3

>-    {
>+    {        

Bogus tab at the end.

>+        // TODO -> Do we need to adjust something here for the separated read/write pages?
>         for (CodeList *b = blocks; b != 0; b = b->next)
>             VALGRIND_DISCARD_TRANSLATIONS(b->start(), b->size());

Lets get some feedback from Nick or Julian on this.


>+        /** convert address to exec page */
>+        NIns* toExec(NIns *p) 
>+        { 
>+            if(!p || !hasExecPage())
>+                return p;

Why is this not an assert? How can an already executable page end up here?

>+        { 
>+           if(!p || !hasExecPage())
>+                return p;
>+

Dito.

Lets get Ed to take a look too. I am pretty sure Adobe will want 0 overhead for this on Win32 if we don't use it there, so we would have to provide empty toWrite toExec wrappers for all platforms where this isn't going to be used.
Attachment #405024 - Flags: review?(edwsmith)
(In reply to comment #36)
> 
> >+        // TODO -> Do we need to adjust something here for the separated read/write pages?
> >         for (CodeList *b = blocks; b != 0; b = b->next)
> >             VALGRIND_DISCARD_TRANSLATIONS(b->start(), b->size());
> 
> Lets get some feedback from Nick or Julian on this.

I don't know this code, but it looks reasonable.  In general, if you generate code over the top of previously executed code, Valgrind needs to be told about the affected memory region via VALGRIND_DISCARD_TRANSLATIONS.  And that's what the above code seems to do.
Attached patch v4 (obsolete) — Splinter Review
Should address the comments. Added different toExec/calcOffset methods for unsupported platforms and fixed the valgrind notification.
Attachment #405024 - Attachment is obsolete: true
Attachment #406206 - Flags: review?(gal)
Attachment #405024 - Flags: review?(gal)
Attachment #405024 - Flags: review?(edwsmith)
Attached patch v5 (obsolete) — Splinter Review
Ah, hg diff ignores new files so the v4 is missing the MemMap class. Attaching a correct patch now.
Attachment #406206 - Attachment is obsolete: true
Attachment #406207 - Flags: review?(gal)
Attachment #406206 - Flags: review?(gal)
Attachment #406207 - Flags: review?(rreitmai)
Attachment #406207 - Flags: review?(edwsmith)
You should add this to your .hgrc:

# Use git diffs for binary diffing, emulate -p
[diff]
git = 1
showfunc = true
Ping - any update here?
Preliminary comments

* I can't find anything obviously wrong with the patch.  it looks like the invariant throughout Assembler is that code pointers point to writeable memory, and at every point we need the executable address, we use toExec().  cool... a final review will be needed just to make sure we don't miss a spot.

one way to mitigate that risk is to use a different pointer type for pointers to executable memory vs pointers to writeable memory.

* it's also important that the machinery for double-mapping memory be encapsulated (all the code with pathnames in it is clearly machine specific).  I think that's been done but the patch attached doesn't preserve filenames.

* the working version of toExec has lots of code in it, which worries me.  it could slow down the assembler by a lot.  I think the CodeList data structure should only track writeable memory (no twins) and have just one field, the offset to the executable memory.  The Assembler tracks the current pages being used, so the fast path through Assembler::toExec should be:  { return p + exec_offset } where exec_offset is computed once and only updated when we allocate new pages.

(its value also needs to be swapped in the swapCodeChunks() function when we swap codeStart/exitStart, etc).

is this feasible?  do we ever have to do anything with CodeList structures when starting solely from an executable address (i.e. I think all the calculations start with write addresses)?

If the fast path is { p + offset }, then in non-double-mapped builds offset can be a constant 0 and we don't need as many ifdefs.
Attachment #406207 - Flags: review?(edwsmith) → review-
Thanks for the comments!

> one way to mitigate that risk is to use a different pointer type for pointers
> to executable memory vs pointers to writeable memory.

Do you mean something like NIns* and NInsExec* ?

> * It's also important that the machinery for double-mapping memory be
> encapsulated (all the code with pathnames in it is clearly machine specific).
> I think that's been done, but the patch attached doesn't preserve filenames.
 
I'm not sure what you mean by "pathnames" here...

> * the working version of toExec has lots of code in it, which worries me.  it
> could slow down the assembler by a lot.  I think the CodeList data structure
> should only track writeable memory (no twins) and have just one field, the
> offset to the executable memory.  The Assembler tracks the current pages being
> used, so the fast path through Assembler::toExec should be:  { return p +
> exec_offset } where exec_offset is computed once and only updated when we
> allocate new pages.
> [....] 

Okay, I'll try to move the address translation to a higher level.
Let's divide the patch into smaller pieces. This one contains the secure memory allocation for Linux systems.
Attachment #417089 - Flags: review?(gal)
Here is a patch with the requested changes:

- The offset to the current executable pages has been moved to Assembler::_execOffset / Assembler::_exitExecOffset; they are set up by Assembler::codeAlloc().

- Assembler::toExec() simplification.

- CodeList::toExec has been removed completely; page twins have been
replaced by execOffset.

- Assembler::swapCodeChunks() update (i386 only for now)

I haven't done the NIns->NInsExec change for now.
Attachment #406207 - Attachment is obsolete: true
Attachment #417098 - Flags: review?(edwsmith)
Attachment #406207 - Flags: review?(rreitmai)
Attachment #406207 - Flags: review?(gal)
(In reply to comment #43)
> Thanks for the comments!
> 
> > one way to mitigate that risk is to use a different pointer type for pointers
> > to executable memory vs pointers to writeable memory.
> 
> Do you mean something like NIns* and NInsExec* ?

yes

> > * It's also important that the machinery for double-mapping memory be
> > encapsulated (all the code with pathnames in it is clearly machine specific).
> > I think that's been done, but the patch attached doesn't preserve filenames.
> 
> I'm not sure what do you mean with "pathnames" here...

I was referring to the code that creates the temporary file, unlinks it, then maps it twice.  You've already factored it out of CodeAlloc, which addressed my concern.

> > * the working version of toExec has lots of code in it, which worries me.  it
> > could slow down the assembler by a lot.  I think the CodeList data structure
> > should only track writeable memory (no twins) and have just one field, the
> > offset to the executable memory.  The Assembler tracks the current pages being
> > used, so the fast path through Assembler::toExec should be:  { return p +
> > exec_offset } where exec_offset is computed once and only updated when we
> > allocate new pages.
> > [....] 
> 
> Okay, will try move the address translation to higher level.
Does freeCodeChunk() need to take the exec address as an argument as well?  Seems like if allocCodeChunk returns two addresses then freeCodeChunk should free (munmap) them both at the same time.

In the patch for bug 460993, I'm moving protect calls out of CodeAlloc and into a host API; that way the host controls how/when page protections are set.  If we extend the API to handle addr and exec in each call, would it clean things up?

My last concern is still that we could spend too much compile time traversing data structures in calcOffset().  Some benchmark results would alleviate the concern.
(In reply to comment #47)
> my last concern is still that we could spend too much compile time traversing
> data structures in calcOffset().  Some benchmark results would alleviate the
> concern.

Which benchmark/CPU combination would you like to see? I did some SunSpider runs
(from http://www2.webkit.org/perf/sunspider-0.9/sunspider-driver.html) on an Intel Core 2 box (2 GHz) and there was no difference between the patched and unpatched versions.
Here is a SunSpider sample, run on Core 2, trunk, debug build, optimizations disabled.

Patched:
http://www2.webkit.org/perf/sunspider-0.9/sunspider-results.html?%7B%223d-cube%22:%5B516,513,514,516,509%5D,%223d-morph%22:%5B87,86,86,88,85%5D,%223d-raytrace%22:%5B349,349,352,354,348%5D,%22access-binary-trees%22:%5B156,157,157,157,155%5D,%22access-fannkuch%22:%5B272,255,256,256,263%5D,%22access-nbody%22:%5B130,138,131,130,134%5D,%22access-nsieve%22:%5B48,49,47,48,48%5D,%22bitops-3bit-bits-in-byte%22:%5B2,2,2,2,3%5D,%22bitops-bits-in-byte%22:%5B12,12,12,12,13%5D,%22bitops-bitwise-and%22:%5B4,4,4,4,4%5D,%22bitops-nsieve-bits%22:%5B79,82,78,79,79%5D,%22controlflow-recursive%22:%5B19,19,19,18,19%5D,%22crypto-aes%22:%5B219,217,215,219,264%5D,%22crypto-md5%22:%5B99,99,98,102,99%5D,%22crypto-sha1%22:%5B37,38,37,37,37%5D,%22date-format-tofte%22:%5B674,678,671,674,672%5D,%22date-format-xparb%22:%5B315,318,315,317,316%5D,%22math-cordic%22:%5B31,31,31,31,31%5D,%22math-partial-sums%22:%5B37,36,38,37,36%5D,%22math-spectral-norm%22:%5B25,25,25,25,25%5D,%22regexp-dna%22:%5B122,119,120,119,123%5D,%22string-base64%22:%5B106,106,106,105,107%5D,%22string-fasta%22:%5B476,479,473,473,497%5D,%22string-tagcloud%22:%5B514,512,515,518,517%5D,%22string-unpack-code%22:%5B650,642,661,645,648%5D,%22string-validate-input%22:%5B343,361,360,351,355%5D%7D

Not patched:
http://www2.webkit.org/perf/sunspider-0.9/sunspider-results.html?%7B%223d-cube%22:%5B511,511,516,505,517%5D,%223d-morph%22:%5B85,85,84,82,85%5D,%223d-raytrace%22:%5B350,345,345,345,345%5D,%22access-binary-trees%22:%5B160,158,157,158,161%5D,%22access-fannkuch%22:%5B250,251,264,264,250%5D,%22access-nbody%22:%5B127,125,126,122,129%5D,%22access-nsieve%22:%5B47,47,48,50,47%5D,%22bitops-3bit-bits-in-byte%22:%5B2,2,2,2,2%5D,%22bitops-bits-in-byte%22:%5B13,13,13,12,13%5D,%22bitops-bitwise-and%22:%5B3,3,3,3,3%5D,%22bitops-nsieve-bits%22:%5B79,80,79,79,78%5D,%22controlflow-recursive%22:%5B19,19,18,18,19%5D,%22crypto-aes%22:%5B214,214,217,214,212%5D,%22crypto-md5%22:%5B98,98,97,96,97%5D,%22crypto-sha1%22:%5B36,36,36,37,36%5D,%22date-format-tofte%22:%5B664,665,663,665,662%5D,%22date-format-xparb%22:%5B310,309,309,311,307%5D,%22math-cordic%22:%5B31,33,31,31,31%5D,%22math-partial-sums%22:%5B36,36,36,36,37%5D,%22math-spectral-norm%22:%5B25,25,25,25,25%5D,%22regexp-dna%22:%5B119,123,121,121,118%5D,%22string-base64%22:%5B106,105,105,108,108%5D,%22string-fasta%22:%5B499,500,498,497,497%5D,%22string-tagcloud%22:%5B506,517,515,511,512%5D,%22string-unpack-code%22:%5B651,634,637,636,633%5D,%22string-validate-input%22:%5B353,354,350,359,354%5D%7D

Total results:
--------------
Patched: 5335.2ms +/- 0.7%
Not patched: 5288.4ms +/- 0.2%

As for calcOffset() and the traversal of allocated pages: there are usually only 2-3 physical pages allocated, so the page search is pretty fast. I could implement a page cache, but it looks pointless for now.

Unfortunately I can't test the patch from bug 460993; protect-impl.patch does not apply to the Mozilla tree.
I did a small optimization to CodeAlloc::toExec()/CodeAlloc::calcOffset():

+    NIns* CodeAlloc::toExec(NIns* p, CodeList* page) {
+        if(!p) {
+            return NULL; 
+        }
+
+        if (!page)
+            page = getPage(p);
+
+        assert(page != NULL);
+
+        NIns *out = (NIns*)(uintptr_t(p) + page->execOffset);
+   
+        if (verbose)
+            avmplus::AvmLog("toExec write %p -> exec %p page = %d\n", p, out, page);
+       
+        return out;
+    }
+
+    uintptr_t CodeAlloc::calcOffset(NIns* target, NIns* source) {


+        CodeList* target_page = getPage(target);
+        CodeList* source_page = getPage(source);
+
+        uintptr_t offset;
+           
+        if(target_page == source_page) {
+            // both addresses are from one continuous memory block
+            offset = uintptr_t(target) - (uintptr_t(source) + sizeof(NIns));
+        }
+        else {
+            // different pages - run the translation machinery
+            uintptr_t t = uintptr_t(toExec(target, target_page));
+            uintptr_t s = uintptr_t(toExec(source, source_page));
+            assert(t != 0 || s != 0);
+       
+            offset = t - (s + sizeof(NIns));
+        }
+      
+        if (verbose)
+            avmplus::AvmLog("calcOffset target %p, source %p -> offset %p\n", target, source+1, offset);
+          
+        return offset;
+    }
BTW, with this small optimization in CodeAlloc::toExec()/CodeAlloc::calcOffset(), CodeAlloc::toExec() can take a page argument, so when it's called from calcOffset(), getPage() is computed only once.
Ping. Is there anything else you need?
Comment on attachment 417098 [details] [diff] [review]
write/exec code with requested changes

I don't see anything else glaringly wrong.  In Tamarin we decided to go with a more heavy-handed page protection scheme (flip between RX/RW with no dual mapping), but if TM wants this scheme to facilitate faster patching, it should be doable.  Again, the key is not impacting performance or maintainability when single-mapped code pages are wanted by the host VM.
Attachment #417098 - Flags: review?(gal)
Attachment #417098 - Flags: review?(edwsmith)
Attachment #417098 - Flags: review+
(In reply to comment #53)
> (From update of attachment 417098 [details] [diff] [review])
> In Tamarin we decided to go with a
> more heavy handed page protection scheme (flip between RX/RW with no dual
> mapping), but if TM wants this scheme to facilitate faster patching, it should
> be doable.

Ehm, switching between RW and RX is almost equally bad and also isn't allowed by SELinux.  If the program can do this then the bad guy can craft some code to do this as well (return-to-libc sequences to call into the mprotect wrappers in libc).

Once a page is marked writable it cannot be allowed to become executable again.  That's the only safe policy.
Comment on attachment 417098 [details] [diff] [review]
write/exec code with requested changes


>+// TODO -> when write/exec pages are enabled, c->_address has to be in exec code segment

What's the follow-up for this?

I recommend renaming CodeAlloc::toExec to something else. There is already a toExec function that does a very different mapping (without walking all heap blocks).

I am still not a big fan of the approach, but Ulrich seems to make a good point about RW<>RX mapping changes using return-to-libc, which somewhat sinks my favorite approach (disable X while emitting code).

I wonder what the next steps are. Windows support? Should we try to settle on one mechanism together with Adobe?
Attachment #417098 - Flags: review?(gal) → review+
(In reply to comment #55)
> (From update of attachment 417098 [details] [diff] [review])
> 
> >+// TODO -> when write/exec pages are enabled, c->_address has to be in exec code segment
> 
> What's the follow-up for this?

I'm going to put an assert here. From what I see, c->_address is always in external executable code space (libraries and so on), but we should catch it if the design changes.

> I recommend renaming CodeAlloc::toExec to something else. There is already a
> toExec function that does a very different mapping (without walking all heap
> blocks).

Will do.

> I am still not a big fan of the approach, but Ulrich seems to make a good point
> about RW<>RX mapping changes using return-to-libc, which somewhat sinks my
> favorite approach (disable X while emitting code).
> 
> I wonder what the next steps are. Windows support? Should we try to settle on
> one mechanism together with Adobe?

I've been doing a merge with the latest trunk, but it seems to contain the RW<->RX mapping change already. So what is the proposal here? Shall I integrate them together, or do you propose to remove it?
Attached patch exec patch v2 (obsolete) — Splinter Review
An updated patch with:

- new NInsExec in some places
- freeCodeChunk() has two arguments now (write&exec)
- CodeAlloc::toExec()/CodeAlloc::calcOffset() optimization
- c->_address is checked by isExec()
- CodeAlloc::toExec() changed to CodeAlloc::getExec()

I hope it addresses all the remarks. It supports i386 for now; I'd like to have a decision about the general approach, and then I can add support for other arches (x86_64/PPC).
Attachment #417098 - Attachment is obsolete: true
Attachment #426675 - Flags: review?(gal)
Attachment #426675 - Flags: review?(edwsmith)
> I've been doing a merge with latest trunk but it seems to contain the RW<>RX
> mapping change already. So what is the proposal here? Shall I integrate them
> together? Or do you suppose to remove it?

The intent with the RX<->RW implementation was to make it easy for it to do nothing, based on VM implementations of allocCodeChunk, etc.  For example, in Tamarin it's controlled by an ifdef, and in TraceMonkey the protection calls are no-ops (memory is mmap()ed as RWX and later munmap()ed with no intermediate permission changes).

One general question about dual-mapping: once we're done writing to the RW region, and assuming no patching, we could unmap it, right?  Any experience with this, or recommendations?
(In reply to comment #58)
> One general question about dual-mapping: Once we're done writing to the RW
> region, and assuming no patching, we could unmap it, right?  any experience
> with this, or recommendations?

If you never intend to write to the region again then the writable mapping can indeed be unmapped.  But is this the case?  It is terribly expensive to unmap everything just to create a new pair of mappings later.  But if this is how it is normally done then by all means, unmap the writable mapping.
Attached patch exec patch v3 (obsolete) — Splinter Review
Patch with markBlockWrite() routines; it's compatible with the latest trunk.
Attachment #426675 - Attachment is obsolete: true
Attachment #426949 - Flags: review?(gal)
Attachment #426949 - Flags: review?(edwsmith)
Attachment #426675 - Flags: review?(gal)
Attachment #426675 - Flags: review?(edwsmith)
Attached patch exec patch v.4 - i386 & x86_64 (obsolete) — Splinter Review
This patch adds x86_64 support. It has no extra hacks in the 64-bit part, so sometimes the offset is calculated twice.

Plus this patch has a safe version of the calcOffset routine: you can pass any kind of pointer (from exec/write/external space) and the pointer is translated only if a relevant page is found. It's useful for the CALL() instructions and the like.
Attachment #426949 - Attachment is obsolete: true
Attachment #426949 - Flags: review?(gal)
Attachment #426949 - Flags: review?(edwsmith)
Attachment #427321 - Flags: review?(gal)
Attachment #427321 - Flags: review?(edwsmith)
(In reply to comment #55)
> I wonder what the next steps are. Windows support? Should we try to settle on
> one mechanism together with Adobe?

So what is the current status here? Is there any agreement between Adobe and Mozilla about a unified approach?
Whiteboard: [sg:investigate]
Attachment #427321 - Flags: review?(gal) → review?(nnethercote)
Attachment #417089 - Flags: review?(gal) → review?(nnethercote)
Depends on: 555033
Comment on attachment 417089 [details] [diff] [review]
separated patch for secure memory allocation

Big nit: the whole file uses two space indents.  NJ uses four spaces.


>+    if (statfs ("/selinux", &sfs) >= 0 && (unsigned int)sfs.f_type == 0xf97cff8cU)
>+      return true;

What is 0xf97cff8cU?  Is there a name for that constant?


>+    f = fopen ("/proc/mounts", "r");
>+    if (f == NULL)
>+      return false;
>+    while (getline(&buf, &len, f) >= 0) {
>+      char *p = strchr(buf, ' ');
>+      if (p == NULL)
>+        break;
>+      p = strchr(p + 1, ' ');
>+      if (p == NULL)
>+        break;
>+      if (strncmp(p + 1, "selinuxfs ", 10) != 0) {
>+        free(buf);
>+        fclose(f);
>+        return true;
>+      }

A brief comment explaining the format of /proc/mounts would help here.


>+     writable and exexutable filesystem.  */

Nit: s/exexutable/executable/


>+    assert(!execFileReady());    

We use NanoAssert() in Nanojit.  And nanojit/VMPI.h has local versions of various other functions.  We might need to use them, although if this is Linux-only code I'm not sure if it's necessary.  Ed?


>+    /* Map the file for writting */

Nit: s/writting/writing/


>+    /* Open a map file */
>+    if (!execFileReady()) {
>+      pthread_mutex_lock(&fileMutex);
>+      bool ret = openExecFile();
>+      pthread_mutex_unlock(&fileMutex);
>+      if(!ret) {
>+        return false;
>+      }
>+    }
>+    
>+    /* Map memory */
>+    return mapExec(length, write, exec);
>+  }

mapExec() doesn't look thread-safe to me.  Why isn't there a lock protecting it?


>+    /** Our curent position inside loader table */

Nit: s/curent/current/
(In reply to comment #64)

> We use NanoAssert() in Nanojit.  And nanojit/VMPI.h has local versions of
> various other functions.  We might need to use them, although if this is
> Linux-only code I'm not sure if it's necessary.  Ed?

Not strictly necessary... In platform-specific code it's okay to use APIs that are known to be what you want on that platform, but unrelated calls should use portable APIs when possible (e.g., use NanoAssert anyway).

Shaver and I discussed this and we would like to try this across all platforms (no need to block the patch on that), so let's land this and file bugs on Windows and Mac support.

I will hold off with the alternative approach in bug 555033.
Depends on: 555111, 555112
(In reply to comment #66)
> Shaver and I discussed this and we would like to try this across all platforms

In that case all the "is this SELinux?" stuff can be removed.
Comment on attachment 427321 [details] [diff] [review]
exec patch v.4 - i386 & x86_64

In general, AFAICT the patch works as intended, i.e. it implements double
mapping.  As for whether this is what we want, I'll leave that up to others
to decide.

Martin, can you update the patch?  It's bit-rotted a bit and I'd like to run
some tests and timings.

Detailed comments below.

----

First of all, 8 lines of context would help with this patch -- put
"diff=-p -U 8 " in the "[defaults]" section of your .hgrc file.


We now have a MIPS backend that will need to be modified.


>+            uintptr_t   _execOffset;            // offset to executable code page for current normal code chunk
>+            uintptr_t   _exitExecOffset;        // offset to executable code page for current exit code chunk

There is no big comment explaining the whole separate-write-and-exec-pages
idea.  I'd like to see one, IMO it's quite non-obvious.  This would be a
good place for it.  A reference to this bug would be good.


>+            // write to exec page translations
>+            NInsExec*   toExec(NIns *p)                           { return (NInsExec*)(uintptr_t(p) + _execOffset); }
>+
>+#ifdef AVMPLUS_UNIX    
>+            uintptr_t   calcOffset(NIns *target, NIns *source)    { return _codeAlloc.calcOffset(target, source); }
>+#else      
>+            uintptr_t   calcOffset(NIns *target, NIns *source)    { return uintptr_t(target) - uintptr_t(source); }
>+#endif     

Nit: standard whitespace usage, please.

Non-nit: I'd like CodeAlloc::calcOffset() to exist in the non-AVMPLUS_UNIX
code, ie. have the #ifdef inside CodeAlloc::calcOffset() rather than here.


>+    NInsExec* CodeAlloc::getExec(NIns* p, CodeList* page) {
>+        if (!p || !page) {
>+            return p; 
>+        }

This should instead assert (!p && !page).


>+    NInsExec* CodeAlloc::getExec(NIns* p) {
>+        return(getExec(p,getPage(p)));
>+    }

This function appears to be dead, please remove it.


>+    uintptr_t CodeAlloc::calcOffset(NIns* target, NIns* source) {
>+
>+        // We don't expect that
>+        assert(source != NULL);

Is that comment truncated?


>+ 
>+        // Source address can lay on unmapped/forein page

Nit: s/forein/foreign/


>+        // (for instance exit code with JMP at the end)
>+        // Because offsets are calculated for jumps,
>+        // calculate it relatively to the jmp instructions (source-1)
>+        // and then shift it back.
>+        source--;
>+
>+        CodeList* target_page = getPage(target);
>+        CodeList* source_page = getPage(source);
>+
>+        uintptr_t offset;
>+           
>+        if(target_page == source_page) {
>+            // both adresses are from one continous memory block

Nit: s/adresses/addresses/

>+            offset = uintptr_t(target) - (uintptr_t(source) + sizeof(NIns));
>+        }
>+        else {
>+            // different pages - run the translation machinery
>+            uintptr_t t = uintptr_t(getExec(target, target_page));
>+            uintptr_t s = uintptr_t(getExec(source, source_page));
>+            assert(t != 0 || s != 0);
>+       
>+            offset = t - (s + sizeof(NIns));
>+        }
>+      
>+        if (verbose)
>+            avmplus::AvmLog("calcOffset target %p, source %p -> offset %p\n", target, source+1, offset);
>+          
>+        return offset;
>+    }
>+
>+#endif // AVMPLUS_UNIX

That looks like it could be slow.  What used to be just a pointer difference
now involves two calls to getPage(), each of which scans the CodeList.
Maybe the two calls could be combined into one that looks for both pages in
tandem?  Or maybe it's not that important.
Attached patch linux allocation v2 (obsolete) — Splinter Review
Fixed indentation, typos, and mutex handling; removed the SELinux check.
Attachment #417089 - Attachment is obsolete: true
Attachment #435577 - Flags: review?(nnethercote)
Attachment #417089 - Flags: review?(nnethercote)
Attached patch exec patch v5 (obsolete) — Splinter Review
Thanks for the review. Here's an updated patch, with fixed typos/comments and a getPages() routine which translates two pointers in one loop.
Attachment #427321 - Attachment is obsolete: true
Attachment #435614 - Flags: review?(nnethercote)
Attachment #427321 - Flags: review?(nnethercote)
Attachment #427321 - Flags: review?(edwsmith)
Attachment #399047 - Attachment is obsolete: true
> >+    NInsExec* CodeAlloc::getExec(NIns* p, CodeList* page) {
> >+        if (!p || !page) {
> >+            return p; 
> >+        }
> 
> This should instead assert (!p && !page).

No, we can't assert here because it's a valid situation: p is NULL if the target in calcOffset is NULL, and a NULL page means that p comes from outside our address space (libraries and so on), so we're not going to translate it.
Comment on attachment 435577 [details] [diff] [review]
linux allocation v2

>+    /* Map in a writable and executable chunk of memory if possible.
>+       Failing that, fall back to mapExec.  */
>+    bool MemMap::map(size_t length, void *&write, void *&exec)
>+    {
>+        write = NULL;
>+        exec = NULL;
>+
>+        pthread_mutex_lock(&fileMutex);
>+
>+        /* Open a map file */
>+        if (!execFileReady()) {
>+            if (!openExecFile()) {
>+                return false;
>+            }
>+        }
>+        /* Map memory */
>+        bool ret = mapExec(length, write, exec);
>+
>+        pthread_mutex_unlock(&fileMutex);
>+
>+        return ret;
>+    }

You haven't unlocked the mutex on the 'return false' exit path.


>+        /* Return a file descriptor of a temporary zero-sized file in a
>+           writable and exexutable filesystem.  */

Nit: s/exexutable/executable/

r=me with those fixed.
Attachment #435577 - Flags: review?(nnethercote) → review+
Hmm, on memmap.cpp I get this warning from GCC:

../nanojit/memmap.cpp: In constructor ‘nanojit::MemMap::MemMap()’:
../nanojit/memmap.cpp:319: warning: extended initializer lists only available with -std=c++0x or -std=gnu++0x
../nanojit/memmap.cpp:319: warning: extended initializer lists only available with -std=c++0x or -std=gnu++0x

The relevant line is this:

        fileMutex = PTHREAD_MUTEX_INITIALIZER;

I don't know how to fix that.

Also:

../nanojit/memmap.cpp: In member function ‘bool nanojit::MemMap::openExecFile()’:
../nanojit/memmap.cpp:247: warning: ignoring return value of ‘int ftruncate(int, __off_t)’, declared with attribute warn_unused_result
../nanojit/memmap.cpp: In member function ‘bool nanojit::MemMap::mapExec(size_t, void*&, void*&)’:
../nanojit/memmap.cpp:274: warning: ignoring return value of ‘int ftruncate(int, __off_t)’, declared with attribute warn_unused_result
../nanojit/memmap.cpp:284: warning: ignoring return value of ‘int ftruncate(int, __off_t)’, declared with attribute warn_unused_result

That can be fixed with:

  int dummy = ftruncate(...);
  (void)dummy;

The second line is needed to avoid a warning about 'dummy' being unused.
Also, it doesn't compile on Mac:

In file included from ../nanojit/avmplus.cpp:50:
../nanojit/memmap.h:65: error: ‘pthread_mutex_t’ does not name a type

I tried adding pthread.h to memmap.h but that results in this:

Undefined symbols:
  "nanojit::MemMap::MemMap()", referenced from:
      __static_initialization_and_destruction_0(int, int)in avmplus.o
  "nanojit::MemMap::~MemMap()", referenced from:
      ___tcf_0 in avmplus.o
  "nanojit::MemMap::map(unsigned long, void*&, void*&)", referenced from:
      nanojit::CodeAlloc::allocCodeChunk(unsigned long, void*&)in avmplus.o
ld: symbol(s) not found

And I haven't tried Windows.
Comment on attachment 435614 [details] [diff] [review]
exec patch v5

Still no NativeMIPS.cpp changes; it's not crucial, but it would be nice to have them before landing (even if untested).


>+    NInsExec* CodeAlloc::getExec(NIns* p, CodeList* page) {
>+        // There are two cases when we don't translate a given pointer:
>+        //
>+        // * p == NULL    - calcOffset() has been called with target = NULL.
>+        //                  
>+        // * page == NULL - calcOffset() has been called with target or source
>+        //                  from outside of our dual-mapped address space.
>+        if (!p || !page) {
>+            return p; 
>+        }
>+
> [...]
>+
>+            // different pages - run the translation machinery
>+            uintptr_t t = uintptr_t(getExec(target, targetPage));
>+            uintptr_t s = uintptr_t(getExec(source, sourcePage));
>+            assert(t != 0 || s != 0);

We never call getExec() with p==NULL;  if we did, this assertion would fail.
So I'd like the !p assertion within getExec(), where it's more obvious.

I ran the TM trace-tests on i386/Linux and X64/Linux and they all passed.
I ran timings on i386/Linux on SS and V8 and saw no noticeable change.
(In reply to comment #73)
> ../nanojit/memmap.cpp: In constructor ‘nanojit::MemMap::MemMap()’:
> ../nanojit/memmap.cpp:319: warning: extended initializer lists only available
> with -std=c++0x or -std=gnu++0x
> ../nanojit/memmap.cpp:319: warning: extended initializer lists only available
> with -std=c++0x or -std=gnu++0x
> 
> The relevant line is this:
> 
>         fileMutex = PTHREAD_MUTEX_INITIALIZER;

Use

   pthread_mutex_init(&fileMutex, NULL);


PTHREAD_MUTEX_INITIALIZER and the other initializer macros are only meant to
initialize global variables statically.
Attached patch memmap v3 (obsolete) — Splinter Review
Fixed the mutex return path, pthread initialization, and ftruncate return values (I hope). Which compiler arguments do you use? With -std=c++0x or -std=gnu++0x I don't get the warning messages about ftruncate (gcc 4.4.3 20100127 / Fedora 12). As for Mac, I don't have such a platform available, but I should be able to find a box with Windows.
Attachment #435577 - Attachment is obsolete: true
Attachment #435844 - Flags: review?(nnethercote)
(In reply to comment #75)
> >+    NInsExec* CodeAlloc::getExec(NIns* p, CodeList* page) {
> >+        // There are who cases when we don't translate given pointer:
> >+        //
> >+        // * p == NULL    - calcOffset() has been called with target = NULL.
> >+        //                  
> >+        // * page == NULL - calcOffset() has been called with target or source
> >+        //                  from outside of our dual-mapped address space.
> >+        if (!p || !page) {
> >+            return p; 
> >+        }
> >+
> > [...]
> >+
> >+            // different pages - run the translation machinery
> >+            uintptr_t t = uintptr_t(getExec(target, targetPage));
> >+            uintptr_t s = uintptr_t(getExec(source, sourcePage));
> >+            assert(t != 0 || s != 0);
> 
> We never call getExec() with p==NULL;  if we did this assertion would fail.
> So I'd like the !p assertion within getExec() where it's more obvious.

Yes, we do. Here is the backtrace if I put NanoAssert(p); in CodeAlloc::getExec():

#0  NanoAssertFail () at ./nanojit/avmplus.cpp:75
#1  0x081b7f6b in nanojit::CodeAlloc::getExec (this=0x826cb48, p=0x0, page=0x0) at ./nanojit/CodeAlloc.cpp:167
#2  0x081b8045 in nanojit::CodeAlloc::calcOffset (this=0x826cb48, target=0x0, source=0xb72ccfe2 "")
    at ./nanojit/CodeAlloc.cpp:207
#3  0x081b6bcc in nanojit::Assembler::calcOffset(unsigned char*, unsigned char*) ()
#4  0x081b314b in nanojit::Assembler::gen (this=0x827dc04, reader=0x82d4f0c) at ./nanojit/Assembler.cpp:1548
#5  0x081b225e in nanojit::Assembler::assemble (this=0x827dc04, frag=0x8281eac, reader=0x82d4f0c)
    at ./nanojit/Assembler.cpp:1039
#6  0x081b1b66 in nanojit::Assembler::compile (this=0x827dc04, frag=0x8281eac, alloc=..., optimize=true, labels=0x8375bdc)
    at ./nanojit/Assembler.cpp:921
#7  0x081766c4 in js::TraceRecorder::compile (this=0x8288530) at jstracer.cpp:4255

It's because:

./nanojit/Assembler.cpp:1548, i386
1538	                    else {
1539	                        // backwards jump
1540	                        handleLoopCarriedExprs(pending_lives);
1541	                        if (!label) {
1542	                            // save empty register state at loop header
1543	                            _labels.add(to, 0, _allocator);
1544	                        }
1545	                        else {
1546	                            intersectRegisterState(label->regs);
1547	                        }
(gdb) 
1548	                        JMP(0); <<<< here we come
1549	                        _patches.put(_nIns, to);
1550	                    }
1551	                    break;
Attached patch exec v6 (obsolete) — Splinter Review
With an attempt at basic MIPS support, untested.
Attachment #435614 - Attachment is obsolete: true
Attachment #435867 - Flags: review?(nnethercote)
Attachment #435614 - Flags: review?(nnethercote)
Can you update to the latest TraceMonkey tip?  Also, you've updated configure.in but you need to update js/src/configure.in.  Thanks.

Once I have a cleanly applying patch I'll try to work out the problem on Mac.
What do you mean by "TraceMonkey tip"? This patch applies cleanly to the latest trunk.
#82: SpiderMonkey development usually happens against the hg.mozilla.org/tracemonkey tree and is then merged into mozilla-central every few days.
Attachment #435844 - Attachment is obsolete: true
Attachment #435844 - Flags: review?(nnethercote)
Attachment #435867 - Attachment is obsolete: true
Attachment #436162 - Flags: review?(nnethercote)
Attachment #435867 - Flags: review?(nnethercote)
Attached patch a different approach, v1, broken (obsolete) — Splinter Review
I worked out the Mac build problem.  In the configure.in file you were only compiling memmap.cpp if it was a Linux machine, but in the C++ code the use of memmap.cpp was decided by AVMPLUS_UNIX, which is true on both Linux and Mac.  Rather than conditionally including memmap.cpp I changed things to always include memmap.cpp, but its entire contents are guarded by "#ifdef AVMPLUS_UNIX".

I did some measurements on Mac, there was a noticeable slowdown, something like 2--3%.  It's not yet clear to me how much of that is due to extra syscalls and how much is due to the extra work looking for pages and offsets.

And that got me thinking.  I can see two ways to clearly improve the patch, and I've attached an in-progress patch that does these things:

- Properly distinguish writable instructions from executable instructions.  We have the types NIns and NInsExec but they are equivalent so there's no meaningful checking.  My attached patch renames NInsExec as NInsX and changes it to a struct containing a single field of the same size.  This makes NIns and NInsX equivalent in terms of layout, but different types.  (It also fixes some problems with the debug output, where the addresses shown were a mix of writable and executable instructions.)

- Record NInsX values instead of NIns values, where appropriate, rather than reconstituting them from the NIns values later on.  This allows all the calcOffset() and getPages() stuff to be completely avoided.

The patch compiles on i386 but I'm currently getting a segfault, I screwed something up.  I really think this is the right way to go, and I'm happy to take it from here, Martin, but I won't have a chance to work on it again until the Tuesday after Easter.
Assignee: stransky → nnethercote
Status: NEW → ASSIGNED
I've found only one bug in the patch so far, missing "execOffset = b->execOffset;" in CodeAlloc::alloc() when availblocks == false. But it still crashes right after start, the exec address seems to be broken.
Depends on: 557483
Depends on: 557705
Depends on: 557991
Attached patch a different approach, v2, works (obsolete) — Splinter Review
This patch is similar in spirit to my previous one but it actually works.
Changes:

- Only working for i386 and X64 so far.  Getting i386 working (along with
  the arch-neutral changes) was difficult but getting X64 working after that
  was very easy -- it worked first go after I fixed all the compile errors.
  So hopefully the other backends won't be too hard.

- NIns is split into NInsW and NInsX.  NInsW is the same as the old NIns,
  just a uint8_t on i386.  NInsX is structurally equivalent, but uses a
  union, which means that you cannot mix NInsW and NInsX.  This is a very
  good thing.  Lots of the new NInsW/NInsX variables now have 'W' or 'X'
  suffixes which clarifies what they are.  Some places need one or the
  other, some need both.

- I split Assembler::_patches into two, one part for normal branches, and
  one for jtbl branches.  Much clearer.  BTW, LIR_jtbl doesn't appear to be
  generated by TR or TM.  Why do we have it?  It's probably broken (even
  before my changes). 

- I overhauled memmap.{h,cpp}:
  - There are now fewer methods and fewer data members
  - The control flow is greatly simplified, esp. for mount points -- no need
    for the 'repeat' field in the LoaderTable
  - For the first mapping, it used to do a test mapping of one page, and if
    that succeeded, threw that away and then did a second mapping of the
    right size.  Now the test mapping is of the right size and if it
    succeeds it's used directly.  This avoids one mapping.

  Martin, I'd appreciate it if you could look over these changes and see
  that I preserved the behaviour.

- There are some tiny TM-only changes, I didn't bother separating them.

Some questions:

- In flushICache() for 64-bit PPC Mac, should the sys_dcache_flush() call
  use 'startW' instead of 'startX'?

- I'm seeing very slight changes to the code generated by TM for SunSpider
  -- in two or three of them some chunk-linking jumps are occurring
  slightly earlier than before.  This is because CodeList has an extra
  field (execOffset), so the space available for code in each chunk is
  reduced by one word.

- allocCodeChunk() is supposed to always succeed, but several (all?) of
  the implementations of it can fail!  This situation predates my patch.

- I think memmap/MemMap is a pretty bad name for the new file and class.
  I'd prefer something like "MapWX", indicating the dual-mapping.

I've tested it on {i386,X64}/{Linux,Mac}.  Further testing will involve
porting it to Tamarin and testing with the Mozilla try servers.  I'm seeing
maybe a 1% slowdown on SunSpider, and a negligible slowdown on V8.
Attachment #436430 - Attachment is obsolete: true
Attachment #437801 - Flags: feedback?(edwsmith)
Comment on attachment 437801 [details] [diff] [review]
a different approach, v2, works

BTW, I've got a little more polishing to do, but I thought I'd put it up for feedback now.
Attachment #437801 - Flags: feedback?(stransky)
The memmap change looks fine, I wonder why I messed with the one-page test mapping. But IMO if mapExec() fails to map executable segment on a given file, you should close the file and try to find another one...not just quit. Some file systems can be mounted as r/w only. So something like:

        int i = 0;
        for(i = 0; i < (sizeof(loaders) / sizeof(loaders[0])); i++) {
            NanoAssert(execFile == FILE_ERROR);

            execFile = (this->*loaders[i].loader)(loaders[i].arg);
            if (execFile != FILE_ERROR) {
                if (mapExec(length, tmpW, tmpX)) {
                    memW = tmpW;
                    memX = tmpX;
                    return true;
                } else {
                    close(execFile);
                    // don't pretend we have a valid execFile
                    execFile = FILE_ERROR;
                }
            }
        }
(In reply to comment #90)
> But IMO if mapExec() fails to map executable segment on a given file,
> you should close the file and try to find another one...not just quit. Some
> file systems can be mounted as r/w only. So something like:

Indeed.  A common, good recommendation is to mount /tmp with noexec (in Linux-speak).  If /tmp were the only place to look for such a mapping, it wouldn't work.  The code should look in other places such as the user's home dir etc.  The list should be configurable.
Does Linux (glibc, mayhap) not have a way to set up this sort of split mapping that can follow whatever the system's preferences are?  Given that it is described as such a critical security improvement, it seems strange to me that there's no support beyond "try a lot of filesystem places and do your own mmap trickery".  Is that really what Linux ISVs are all supposed to do if they generate code (increasingly common for scripting languages, which are themselves increasingly common)?  Surely one of the JVMs that is shipped has already solved this problem as well?

njn: how about TwinMapping as the class name, or TwinMappingWX?  MapWX I would expect to be w+x, and therefore AN ABOMINATION.
www.semantiscope.com/research/BHDC2010/BHDC-2010-Paper.pdf

I'm skeptical this patch matters in a world where "Leon" has a "typewriter" (see Dion's slides).

/be
> - I split Assembler::_patches into two, one part for normal branches, and
>   one for jtbl branches.  Much clearer.  BTW, LIR_jtbl doesn't appear to be
>   generated by TR or TM.  Why do we have it?  It's probably broken (even
>   before my changes). 

TR generates LIR_jtbl when compiling the switch construct, and it works, but i just
noticed a stack overflow bug somewhere near printer->formatIns().  
I'll investigate and post a fix.
(In reply to comment #93)
> www.semantiscope.com/research/BHDC2010/BHDC-2010-Paper.pdf
> 
> I'm skeptical this patch matters in a world where "Leon" has a "typewriter"
> (see Dion's slides).

at least, nanojit does constant folding of xor.  (the paper references flash's old jit, which nanojit replaces).  still, thats not necessarily enough to weaken jit spraying of PIC code.
(In reply to comment #90)
> The memmap change looks fine, I wonder why I messed with the one-page test
> mapping. But IMO if mapExec() fails to map executable segment on a given file,
> you should close the file and try to find another one...not just quit. Some
> file systems can be mounted as r/w only. So something like:
> 
>         int i = 0;
>         for(i = 0; i < (sizeof(loaders) / sizeof(loaders[0])); i++) {
>             NanoAssert(execFile == FILE_ERROR);
> 
>             execFile = (this->*loaders[i].loader)(loaders[i].arg);
>             if (execFile != FILE_ERROR) {
>                 if (mapExec(length, tmpW, tmpX)) {
>                     memW = tmpW;
>                     memX = tmpX;
>                     return true;
>                 } else {
>                     close(execFile);
>                     // don't pretend we have a valid execFile
>                     execFile = FILE_ERROR;
>                 }
>             }
>         }

That code is better and I'll use it, but I don't think it addresses your main point.  IIUC you're talking about the mount case... in my new code if we can successfully open a file on one of the mounts but then fail to map from it, it won't try another mount.  I assume your old code did but the control flow is too complicated for me to tell for sure.

Making this work again will require some fiddling.
(In reply to comment #91)
> (In reply to comment #90)
> > But IMO if mapExec() fails to map executable segment on a given file,
> > you should close the file and try to find another one...not just quit. Some
> > file systems can be mounted as r/w only. So something like:
> 
> Indeed.  A common, good recommendation is to mount /tmp with noexec (in
> Linux-speak).  If /tmp would be the only place to look for such a mapping to
> exist it wouldn't work.  The code should look in other places such as the
> user's home dir etc.  The list should be configurable.

Yes.  If you read the patch you'll see this:

    LoaderTable MemMap::loaders[] = {
        {(FileLoader) (&MemMap::openFileEnv), "TMPDIR"},
        {(FileLoader) (&MemMap::openFileDir), "/tmp"},
        {(FileLoader) (&MemMap::openFileDir), "/var/tmp"},
        {(FileLoader) (&MemMap::openFileDir), "/dev/shm"},
        {(FileLoader) (&MemMap::openFileEnv), "HOME"},
#ifdef HAVE_MNTENT_H
        {(FileLoader) (&MemMap::openFileMnt), "/etc/mtab"},
        {(FileLoader) (&MemMap::openFileMnt), "/proc/mounts"},
#endif // HAVE_MNTENT_H
    };

which should give you a good idea of what's being tried.

(In reply to comment #92)
> Does Linux (glibc, mayhap) not have a way to set up this sort of split mapping
> that can follow whatever the system's preferences are?  Given that it is
> described as such a critical security improvement, it seems strange to me that
> there's no support beyond "try a lot of filesystem places and do your own mmap
> trickery".  Is that really what Linux ISVs are all supposed to do if they
> generate code (increasingly common for scripting languages, which are
> themselves increasingly common)?

It's a good point.  The example at http://people.redhat.com/drepper/selinux-mem.html just tries the home directory, then adds this not-very-useful comment:

"The only requirement for this code to work is that the filesystem on which the file is created must allow execution. Some sites might (wisely) decide to mount filesystems like /tmp and /var/tmp with the noexec option.  One can either detect this ... or actively look for directory with appropriate permissions."

Having said that, if we want to do this on Mac and Windows we can't rely on any glibc magic anyway.

I wonder how useful the mount point attempts are -- how often will the first five attempts fail?  It makes me nervous because the mount code will be used very rarely, which increases its likelihood of being broken.

Another question: is it ok to use pthread_mutex_lock() et al inside Mozilla code?  Or do we have to indirect it to some NSPR thing?
Martin, seems like changing this:

        if (!(hasmntopt(&mnt, "ro") ||
              hasmntopt(&mnt, "noexec") || access(mnt.mnt_dir, W_OK)))

to this:

        if (!(hasmntopt(&mnt, "ro") ||
              hasmntopt(&mnt, "noexec") || access(mnt.mnt_dir, R_OK|W_OK|X_OK)))

might be better? Or maybe the R_OK|X_OK check is subsumed by the hasmntopt() checks?
(In reply to comment #95)
> (In reply to comment #93)
> > www.semantiscope.com/research/BHDC2010/BHDC-2010-Paper.pdf
> > 
> > I'm skeptical this patch matters in a world where "Leon" has a "typewriter"
> > (see Dion's slides).
> 
> at least, nanojit does constant folding of xor.  (the paper references flash's
> old jit, which nanojit replaces).  still, thats not necessarily enough to
> weaken jit spraying of PIC code.

Before we take this patch, I want us to be clear on the security benefits we're going to get. Please, no sermonizing on "privileges" and "rights".

What attacks does this patch prevent, and which ones does it leave open? Let's make the trade-offs extremely clear, so we have a record of our decision, if nothing else.
This version fixes the problems with giving up too early in the mount cases.

Todo list:
- Test it in the browser (I've only done shell testing so far)
- Test it on Windows (I've only done Linux and Mac so far)
- Decide if pthreads can be used as-is
- Decide if __builtin_alloca can be used
- Port it to Tamarin, check it works there
Attachment #436161 - Attachment is obsolete: true
Attachment #436162 - Attachment is obsolete: true
Attachment #437801 - Attachment is obsolete: true
Attachment #438016 - Flags: feedback?(stransky)
Attachment #437801 - Flags: feedback?(stransky)
Attachment #437801 - Flags: feedback?(edwsmith)
Attachment #436161 - Flags: review?(nnethercote)
Attachment #436162 - Flags: review?(nnethercote)
(In reply to comment #98)
> Martin, seems like changing this:
> 
>         if (!(hasmntopt(&mnt, "ro") ||
>               hasmntopt(&mnt, "noexec") || access(mnt.mnt_dir, W_OK)))
> 
> to this:
> 
>         if (!(hasmntopt(&mnt, "ro") ||
>               hasmntopt(&mnt, "noexec") || access(mnt.mnt_dir,
> R_OK|W_OK|X_OK)))
> 
> might be better? Or maybe the R_OK|X_OK check is subsumed by the
> hasmntopt() checks?

man access(2) says:

"access() may not work correctly on NFS file systems with UID mapping enabled, because UID mapping is  done on the server and hidden from the client, which checks permissions." 

so I think the idea is to check general accessibility by access() and the execute flags by hasmntopt().
(In reply to comment #100)
> - Decide if __builtin_alloca can be used

MemMap::openDirFileAndMapExec()/__builtin_alloca() is called only when a mapfile is created, and the file creation takes some time, so IMO malloc() would serve just as well here.
I just saw this on the Linux mmap man page:

  The use of MAP_ANONYMOUS in conjunction  with  MAP_SHARED
  is only supported on Linux since kernel 2.4.

If we are able to rely on MAP_ANONYMOUS|MAP_SHARED working (on all AVMPLUS_UNIX platforms, which includes Linux and Mac) then we could greatly simplify memmap.cpp -- no need to look through $TMPDIR, /tmp, /var/tmp, etc, to find a file that can be marked executable.  Also, it might be faster because we wouldn't need the ftruncate() syscalls.
(In reply to comment #103)
> 
> If we are able to rely on MAP_ANONYMOUS|MAP_SHARED working (on all AVMPLUS_UNIX
> platforms, which includes Linux and Mac) then we could greatly simplify
> memmap.cpp -- no need to look through $TMPDIR, /tmp, /var/tmp, etc, to find a
> file that can be marked executable.  Also, it might be faster because we
> wouldn't need the ftruncate() syscalls.

Is this capability available on SELinux?
(In reply to comment #104)
> 
> Is this capability available on SELinux?

Looking more closely, I think not.  The full excerpt from the man page:

MAP_ANONYMOUS
    The  mapping  is not backed by any file; its contents are initialized to
    zero.  The fd and offset arguments are ignored; however, some
    implementations require fd to be -1  if  MAP_ANONYMOUS  (or MAP_ANON) is
    specified, and portable applications should ensure this.  The use of
    MAP_ANONYMOUS in conjunction with MAP_SHARED is only supported on
    Linux since kernel 2.4.

Also, it doesn't make much sense logically -- without the underlying file,
how do you tie two mmapped regions together?
(In reply to comment #105)

The MAP_ANONYMOUS|MAP_SHARED mapping works fine with SELinux (at least on Fedora 12) but the pages are not linked together, so a change in the writable page is not propagated to the executable one. IMHO MAP_SHARED here just means that the memory can be shared.
Here's the v3 version, synced with the latest tracemonkey trunk. I've updated all arches, although I can check i386 & x86_64 only. Is there anything else I can help with to move it forward?
Attachment #453033 - Flags: review?(nnethercote)
Attachment #438016 - Flags: feedback?(stransky)
Thanks for the update.

(In reply to comment #107)
> Is there anything else I can help
> with to move it forward?

Answers to the questions in comment 99 would help.  Thanks.
(In reply to comment #99)
> What attacks does this patch prevent, and which ones does it leave open? Let's
> make the trade-offs extremely clear, so we have a record of our decision, if
> nothing else.

When SELinux is enabled the flash plug-in can run in its own sandbox (as OOP) so we don't have to care about it.

The issue is the browser itself: even with this patch, an attacker who can (somehow) write his code to the writable page could then execute it, even with SELinux enabled.

Let's assume an attacker somehow delivers his code to the JIT R/W page and tries to execute it. He may:

- write some specific constant to the R/W page and search for it in all executable pages. But we can mmap the executable pages as executable only, without read permission.

- read /proc/PID/maps and look for mapped twins there. For instance I have there:

7ffff7fda000-7ffff7fea000 rw-s 00020000 08:01 3358809                    /tmp/jitU8GOD2 (deleted)
7ffff7fea000-7ffff7ffa000 --xs 00020000 08:01 3358809                    /tmp/jitU8GOD2 (deleted)

where "/tmp/jitU8GOD2 (deleted)" is the temporary file used for mmap, so it gives a clear 7ffff7fda000 => 7ffff7fea000 mapping. But again, access to the process map file can be disabled by SELinux.

So with this small tweak (map the executable memory as executable only, without read permission) I don't see any way an attacker could find an executable page.
blocking2.0: --- → ?
status1.9.2: --- → ?
status2.0: --- → ?
Hey Nick, what's the status of this?
(In reply to comment #110)
> Hey nick, whats the status of this?

The code works, someone needs to decide if it's a good idea and should be landed.  Eg. see comment 99.
(In reply to comment #111)
> The code works, someone needs to decide if it's a good idea and should be
> landed.  Eg. see comment 99.

Well, who can make such a decision?
(In reply to comment #109)
> Let's assume an attacker somehow delivers its code to the JIT R/W page and
> tries to execute it. He may:
>
> [...]

Dumb question: what stops the attacker from reading the same execOffset field that TraceMonkey itself uses to find the other mapping?
(In reply to comment #113)
> Dumb question: what stops the attacker from reading the same execOffset field
> that TraceMonkey itself uses to find the other mapping?

Nothing. But the attacker has to search for page header and if the executable page is executable only, he can't confirm the right executable address.
(In reply to comment #114)
> (In reply to comment #113)
> > Dumb question: what stops the attacker from reading the same execOffset field
> > that TraceMonkey itself uses to find the other mapping?
> 
> Nothing. But the attacker has to search for page header and if the executable
> page is executable only, he can't confirm the right executable address.

See comment 93 and please read the paper. I think this "attacker has to search" point is meaningless.

/be
I read it three times.

> When this executable page is entered at a known offset (0x6A, for the 
> given example code below) will execute stage-0 shellcode that marks the 
> rest of the page RWX and copies the next stage of shellcode 
> from an ActionScript string that can be modified 
> before compilation.

- SELinux does not allow changing page permissions. So once you allocate a page as X, you can't change it to RWX. 
- How do you manage to launch the compiled script at such an offset?

> In the end, it's really just a pointer into the heap. 
> To try and find a use for this pointer, we must understand the Flash heap and
> some of the details of Windows memory architecture.

How is this related to Linux/SELinux? And if there is any relevance (leaving unused executable segments on the heap, for instance) we can easily fix that, can't we?

I'm not a security expert but I didn't see anything relevant in the paper. If you think I'm wrong please provide the details.
An interesting read:

http://www.zdnet.com/blog/security/questions-for-pwn2own-hacker-charlie-miller/2941

"It’s really simple. Safari on the Mac is easier to exploit.  The things that Windows do to make it harder (for an exploit to work), Macs don’t do.  Hacking into Macs is so much easier. You don’t have to jump through hoops and deal with all the anti-exploit mitigations you’d find in Windows.

It’s more about the operating system than the (target) program.  Firefox on Mac is pretty easy too.  The underlying OS doesn’t have anti-exploit stuff built into it.

With my Safari exploit, I put the code into a process and I know exactly where it’s going to be.  There’s no randomization. I know when I jump there, the code is there and I can execute it there.  On Windows, the code might show up but I don’t know where it is.  Even if I get to the code, it’s not executable.  Those are two hurdles that Macs don’t have."

... do we really want to weaken the whole browser just because of one small part which fails to follow modern security standards?
Martin, Charlie was talking about lack of ASLR. This bug does nothing of the kind.

Re comment 116 -- return-oriented programming means you don't need to write code, you let the JIT write code and then find interesting "instructions" (not the ones generated) that can be hooked together by tail jumps and returns. If this is (as I contend) the threat to defend against, I don't see how this bug's patch helps.

/be
(In reply to comment #118)
>Re comment 116 -- return-oriented programming means you don't need 
> to write code, you let the JIT write code and then find 
> interesting "instructions" (not the ones generated) that 
> can be hooked together by tail jumps and returns. If this 
> is (as I contend) the threat to defend against, I don't see 
> how this bug's patch helps.

With this patch and SELinux enabled, you can't switch the pages to RWX, so you have to generate the initial code from those crafted/switched instructions. 

If you want to load any code from an external source, it has to be smaller than the executable page and you have to calculate the proper address in the RW page. (You have to load it somewhere, right? And not into the executable page your code is executed from.)

And if you want to generate the interpage tail jumps and returns that you're referencing, the exploit has to translate the address from write to executable space in the same way the JIT does.

I'm not convinced that an exploit generated from malformed constants can find the page header, read execOffset, recalculate the target, find and load the exploit, execute it, and manage to calculate jumps/returns.

To recap, the exploit scenario with this patch would be:

1) create the crafted JIT code
2) execute it at a given offset (how?)
3) the malformed code finds the execOffset of its page (I don't know how, but let's assume it's possible) or calls toExec() or calcOffset() and calculates the writable twin (how?)
4) loads prepared code from an external source (it has to fit in the executable page; actually it can't rewrite its currently running code, so it may be even smaller)
5) fixes the return point (by execOffset/toExec()/calcOffset()) and launches the main exploit.

BTW, you can see now why I'm talking about "attacker has to search for page header" -- the execOffset is located there.

And we can still make it harder. Those steps would have an impact on performance (frequent memory allocation/deallocation, frequent interpage jumps) so they could be considered optional:

- divide the code into more independent segments. An attacker will be more restricted by the maximum exploit size and will have less space for the write -> executable page recalculation.

- aggressively free unused memory segments and allocate only what we need. An attacker would not find another space where he can load and execute code.
See Also: → 506693
See Also: 506693
Ping. Any update here?
I really don't think this blocks... it didn't block the last release either, so it's not a regression.
blocking2.0: ? → -
Attached patch JM WIP/i386 (obsolete) — Splinter Review
First version of the double-page patch for JM. It still crashes when run outside of gdb (it passes all js trace tests inside gdb on i386). 

I tried another approach: to make it as small/simple as possible and translate the addresses at the lowest level. Performance could be improved by a page cache (I'll do speed tests when I sort out the crash outside of gdb).

I think the same approach could be applied to tracemonkey so that both engines would share the same ExecutableTranslator service.
Attachment #488986 - Flags: feedback?(nnethercote)
Attachment #488986 - Flags: feedback?(brendan)
Comment on attachment 488986 [details] [diff] [review]
JM WIP/i386

Ask a JM lead hacker for review.

/be
Attachment #488986 - Flags: feedback?(brendan) → feedback?(dvander)
If the performance effects are not indistinguishable from noise, a minimally #ifdef'ed patch to let SELinux turn this on would be better -- provided that side of the ifdefs will see testing and use.

/be
Attached patch JM WIP v.2 (obsolete) — Splinter Review
Fixed offset calculation, added memory management for allocated pages, added page cache, range check, page translation statistics. 

It's missing fixes in the trap handler (it's not redirected to the correct return address) and fails the trap-* jit_tests.
Attachment #488986 - Attachment is obsolete: true
Attachment #490903 - Flags: feedback?(nnethercote)
Attachment #490903 - Flags: feedback?(dvander)
Attachment #488986 - Flags: feedback?(nnethercote)
Attachment #488986 - Flags: feedback?(dvander)
Attachment #490903 - Flags: feedback?(nnethercote)
Blocks: 574119
This one combines the patches for nanojit and methodjit, uses a shared memory allocator (MemoryManager), and the dual mapping can be controlled by the --enable-dual-mapping configure switch.

From my measurements, the performance difference between dual- and single-page builds is ~10ms for the SpiderMonkey test with an optimized build (~290ms single, ~300ms dual on an i5 box).
Attachment #453033 - Attachment is obsolete: true
Attachment #490903 - Attachment is obsolete: true
Attachment #497240 - Flags: review?
Attachment #490903 - Flags: feedback?(dvander)
Attachment #453033 - Flags: review?(nnethercote)
Attachment #497240 - Flags: review? → review?(dvander)
Assignee: nnethercote → general
Status: ASSIGNED → NEW
Comment on attachment 497240 [details] [diff] [review]
nanojit & methodjit patch

Canceling old r?s.
Attachment #497240 - Flags: review?(dvander)
Was the security patch applied? If not: why not?
I do not see why this issue depends on bug 555111 and bug 555112.

But those issues seem to somehow depend on this one. Was there no progress regarding this issue due to circular issue dependencies? (an issue deadlock?)
The patches in this bug were very large and it wasn't clear whether there was actually any security benefit (see comment #7, comment #16) to justify the maintenance effort. By now these patches would not be applicable anyway since we have entirely new JITs.

The best path forward for JIT security is sandboxed content processes, which is in progress for both Desktop Firefox and Firefox OS. Sandboxing will greatly reduce the threat of JIT exploits, and it's unlikely any other solution will be as effective. For Desktop, it's part of the Electrolysis effort:
 [1] https://wiki.mozilla.org/FoxInABox
 [2] https://wiki.mozilla.org/Electrolysis
Status: NEW → RESOLVED
Closed: 11 years ago
Resolution: --- → WONTFIX