转自：https://yq.aliyun.com/articles/1723

摘要： 作者：王智通 CC_STACKPROTECT补丁是Tejun Heo在09年给主线kernel提交的一个用来防止内核堆栈溢出的补丁。默认的config是将这个选项关闭的，可以在编译内核的时候，修改.config文件为CONFIG_CC_STACKPROTECTOR=y来启...

作者：王智通

CC_STACKPROTECT补丁是Tejun Heo在09年给主线kernel提交的一个用来防止内核堆栈溢出的补丁。默认的config是将这个选项关闭的，可以在编译内核的时候，修改.config文件为CONFIG_CC_STACKPROTECTOR=y来启用。未来飞天内核可以将这个选项开启来防止利用内核stack溢出的0day攻击。这个补丁的防溢出原理是：在进程启动的时候，在每个buffer的后面放置一个预先设置好的stack canary，你可以把它理解成一个哨兵，当buffer发生缓冲区溢出的时候，肯定会破坏stack canary的值，当stack canary的值被破坏的时候，内核就会直接当机。那么是怎么判断stack canary被覆盖了呢？其实这个事情是gcc来做的，内核在编译的时候给gcc加了个-fstack-protector参数，我们先来研究下这个参数是做什么用的。

先写个简单的有溢出的程序：

[wzt@localhost csaw]$ cat test.c

#include <stdio.h>

#include <stdlib.h>

void test(void)

{

        char buff[64];

        memset(buff, 0x41, 128);     //向64大小的buffer拷贝128字节， 肯定会发生缓冲区溢出。

}

int main(void)

{

        test();

        return 0;

}

[wzt@localhost csaw]$ gcc -o test test.c

[wzt@localhost csaw]$ ./test

段错误

反汇编看看：

[wzt@localhost csaw]$ objdump -d test > hex

08048384 <test>:

 8048384:       55                      push   %ebp

 8048385:       89 e5                   mov    %esp,%ebp

 8048387:       83 ec 58                sub    $0x58,%esp

 804838a:       c7 44 24 08 80 00 00    movl   $0x80,0x8(%esp)

 8048391:       00

 8048392:       c7 44 24 04 41 00 00    movl   $0x41,0x4(%esp)

 8048399:       00

 804839a:       8d 45 c0                lea    0xffffffc0(%ebp),%eax

 804839d:       89 04 24                mov    %eax,(%esp)

 80483a0:       e8 e3 fe ff ff          call   8048288 <memset@plt>

 80483a5:       c9                      leave

 80483a6:       c3                      ret

没什么特别的，我们在加上-fstack-protector参数看看：

[wzt@localhost csaw]$ gcc -o test test.c -fstack-protector

[wzt@localhost csaw]$ ./test

*** stack smashing detected ***: ./test terminated

已放弃

这次程序打印了一条堆栈被溢出的信息，然后就自动退出了。

在反汇编看下：

[wzt@localhost csaw]$ objdump -d test > hex1

080483d4 <test>:

 80483d4:       55                      push   %ebp

 80483d5:       89 e5                   mov    %esp,%ebp

 80483d7:       83 ec 68                sub    $0x68,%esp

 80483da:       65 a1 14 00 00 00       mov    %gs:0x14,%eax

 80483e0:       89 45 fc                mov    %eax,0xfffffffc(%ebp)

 80483e3:       31 c0                   xor    %eax,%eax

 80483e5:       c7 44 24 08 80 00 00    movl   $0x80,0x8(%esp)

 80483ec:       00

 80483ed:       c7 44 24 04 41 00 00    movl   $0x41,0x4(%esp)

 80483f4:       00

 80483f5:       8d 45 bc                lea    0xffffffbc(%ebp),%eax

 80483f8:       89 04 24                mov    %eax,(%esp)

 80483fb:       e8 cc fe ff ff          call   80482cc <memset@plt>

 8048400:       8b 45 fc                mov    0xfffffffc(%ebp),%eax

 8048403:       65 33 05 14 00 00 00    xor    %gs:0x14,%eax

 804840a:       74 05                   je     8048411 <test+0x3d>

 804840c:       e8 db fe ff ff          call   80482ec <__stack_chk_fail@plt>

 8048411:       c9                      leave

 8048412:       c3                      ret

使用-fstack-protector参数后， gcc在函数的开头放置了几条汇编代码：

!sh

 80483d7:       83 ec 68                sub    $0x68,%esp

 80483da:       65 a1 14 00 00 00       mov    %gs:0x14,%eax

 80483e0:       89 45 fc                mov    %eax,0xfffffffc(%ebp)

将代码段gs偏移0×14内存处的值赋值给了ebp-4，也就是第一个变量值的后面。

在call完memeset后，有如下汇编代码：

!sh

 80483fb:       e8 cc fe ff ff          call   80482cc <memset@plt>

 8048400:       8b 45 fc                mov    0xfffffffc(%ebp),%eax

 8048403:       65 33 05 14 00 00 00    xor    %gs:0x14,%eax

 804840a:       74 05                   je     8048411 <test+0x3d>

 804840c:       e8 db fe ff ff          call   80482ec <__stack_chk_fail@plt>

在memset后，gcc要检查这个操作是否发生了堆栈溢出, 将保存在ebp-4的这个值与原来的值对比一下，如果不相同，说明堆栈发生了溢出，那么就会执行stack_chk_fail这个函数，这个函数是glibc实现的，打印出上面看到的信息，然后进程退出。

从这个例子中我们可以看出gcc使用了-fstack-protector参数后，会自动检查堆栈是否发生了溢出，但是有一个前提就是内核要给每个进程提前设置好一个检测值放置在%gs:0×14位置处，这个值称之为stack canary。所以我们可以看到防止堆栈溢出是由内核和gcc共同来完成的。

gcc的任务就是放置几条汇编代码，然后和%gs:0×14位置处的值进行对比即可。主要任务还是内核如何来设置stack canary，也是CC_STACKPROTECTOR补丁要实现的目的，下面我们仔细来看下这个补丁是如何实现的。

既然gcc硬性规定了stack canary必须在%gs的某个偏移位置处，那么内核也必须按着这个规定来设置。

对于32位和64位内核， gs寄存器有着不同的功能。

64位内核gcc要求stack canary是放置在gs段的40偏移处，并且gs寄存器在每cpu变量中是共享的，每cpu变量irq_stack_union的结构如下：

arch/x86/include/asm/processor.h

union irq_stack_union {

        char irq_stack[IRQ_STACK_SIZE];

        /*

         * GCC hardcodes the stack canary as %gs:40.  Since the

         * irq_stack is the object at %gs:0, we reserve the bottom

         * 48 bytes of the irq stack for the canary.

         */

        struct {

                char gs_base[40];

                unsigned long stack_canary;

        };

};

DECLARE_PER_CPU_FIRST(union irq_stack_union, irq_stack_union);

gs_base只是一个40字节的站位空间， stack_canary就紧挨其后。并且在应用程序进出内核的时候，内核会使用swapgs指令自动更换gs寄存器的内容。

32位下就稍微有点复杂了。由于某些处理器在加载不同的段寄存器时很慢，所以内核使用fs段寄存器替换了gs寄存器。但是gcc在使用-fstack-protector的时候，还要用到gs段寄存器，所以内核还要管理gs寄存器，我们要把CONFIG_X86_32_LAZY_GS选项关闭， gs也只在进程切换的时候才改变。 32位用每cpu变量stack_canary保存stack canary。

struct stack_canary {

        char __pad[20];         /* canary at %gs:20 */

        unsigned long canary;

};

DECLARE_PER_CPU_ALIGNED(struct stack_canary, stack_canary);

内核是处于保护模式的，因此gs寄存器就变成了保护模式下的段选子，在GDT表中也要有相应的设置：

diff --git a/arch/x86/include/asm/segment.h b/arch/x86/include/asm/segment.h

index 1dc1b51..14e0ed8 100644 (file)

--- a/arch/x86/include/asm/segment.h

+++ b/arch/x86/include/asm/segment.h

@@ -61,7 +61,7 @@

  *

  *  26 - ESPFIX small SS

  *  27 - per-cpu                       [ offset to per-cpu data area ]

- *  28 - unused

+ *  28 - stack_canary-20               [ for stack protector ]

  *  29 - unused

  *  30 - unused

  *  31 - TSS for double fault handler

@@ -95,6 +95,13 @@

 #define __KERNEL_PERCPU 0

 #endif

+#define GDT_ENTRY_STACK_CANARY         (GDT_ENTRY_KERNEL_BASE + 16)

+#ifdef CONFIG_CC_STACKPROTECTOR

+#define __KERNEL_STACK_CANARY          (GDT_ENTRY_STACK_CANARY * 8)

+#else

+#define __KERNEL_STACK_CANARY          0

+#endif

+

 #define GDT_ENTRY_DOUBLEFAULT_TSS      31

GDT表中的第28个表项用来定为stack canary所在的段。

#define GDT_STACK_CANARY_INIT                                           \

        [GDT_ENTRY_STACK_CANARY] = GDT_ENTRY_INIT(0x4090, 0, 0x18),

GDT_STACK_CANARY_INIT在刚进入保护模式的时候被调用，这个段描述符项被设置为基地址为0，段大小设为24，因为只在基地址为0，偏移为0×14处放置一个4bytes的stack canary，所以24字节正好。不理解的同学可以看看intel保护模式的手册，对着段描述符结构一个个看就行了。

在进入保护模式后， start_kernel()会调用boot_init_stack_canary()来初始话一个stack canary。

/*

 * Initialize the stackprotector canary value.

 *

 * NOTE: this must only be called from functions that never return,

 * and it must always be inlined.

 */

static __always_inline void boot_init_stack_canary(void)

{

        u64 canary;

        u64 tsc;

#ifdef CONFIG_X86_64

        BUILD_BUG_ON(offsetof(union irq_stack_union, stack_canary) != 40);

#endif

        /*

         * We both use the random pool and the current TSC as a source

         * of randomness. The TSC only matters for very early init,

         * there it already has some randomness on most systems. Later

         * on during the bootup the random pool has true entropy too.

         */

        get_random_bytes(&canary, sizeof(canary));

        tsc = __native_read_tsc();

        canary += tsc + (tsc << 32UL);

        current->stack_canary = canary;

#ifdef CONFIG_X86_64

        percpu_write(irq_stack_union.stack_canary, canary);

#else

        percpu_write(stack_canary.canary, canary);

#endif

}

随机出了一个值赋值给每cpu变量， 32位是stack_canary, 64位是irq_stack_union。

内核在进一步初始化cpu的时候，会调用setup_stack_canary_segment()来设置每个cpu的GDT的stack canary描述符项：

start_kernel()->setup_per_cpu_areas()->setup_stack_canary_segment：

static inline void setup_stack_canary_segment(int cpu)

{

#ifdef CONFIG_X86_32

        unsigned long canary = (unsigned long)&per_cpu(stack_canary, cpu);

        struct desc_struct *gdt_table = get_cpu_gdt_table(cpu);

        struct desc_struct desc;

        desc = gdt_table[GDT_ENTRY_STACK_CANARY];

        set_desc_base(&desc, canary);

        write_gdt_entry(gdt_table, GDT_ENTRY_STACK_CANARY, &desc, DESCTYPE_S);

#endif

}

在内核刚进入保护模式的时候, stack canary描述符的基地址被初始化为0，现在在cpu初始化的时候要重新设置为每cpu变量stack_canary的地址，而不是变量保存的值。通过这些设置当内核代码在访问%gs:0×14的时候，就会访问stack canry保存的值。注意：setup_stack_canary_segment是针对32位内核做设置，因为64位内核中的irq_stack_union是每cpu共享的，不用针对每个cpu单独设置。然后就可以调用switch_to_new_gdt(cpu);来加载GDT表和加载gs寄存器。

经过上述初始化过程，在内核代码里访问%gs:0×14就可以定位stack canary的值了，那么每个进程的stack canary是什么时候设置的呢？

在内核启动一个进程的时候，会把gs寄存器的值设为KERNEL_STACK_CANARY

--- a/arch/x86/kernel/process_32.c

+++ b/arch/x86/kernel/process_32.c

@@ -212,6 +212,7 @@ int kernel_thread(int (*fn)(void *), void *arg, unsigned long flags)

        regs.ds = __USER_DS;

        regs.es = __USER_DS;

        regs.fs = __KERNEL_PERCPU;

+       regs.gs = __KERNEL_STACK_CANARY;

        regs.orig_ax = -1;

        regs.ip = (unsigned long) kernel_thread_helper;

        regs.cs = __KERNEL_CS | get_kernel_rpl();

内核在fork一个进程的时候，有如下操作：

static struct task_struct *dup_task_struct(struct task_struct *orig)

{

#ifdef CONFIG_CC_STACKPROTECTOR

        tsk->stack_canary = get_random_int();

#endif

}

随机初始化了一个stack_canary保存在task_struct结构中的stack_canary变量中。当进程在切换的时候，通过switch宏把新进程的stack canary保存在每cpu变量stack_canary中，当前进程的stack_canary也保存在一个每cpu变量中，完成stack canary的切换。

diff --git a/arch/x86/include/asm/system.h b/arch/x86/include/asm/system.h

index 79b98e5..2692ee8 100644 (file)

--- a/arch/x86/include/asm/system.h

+++ b/arch/x86/include/asm/system.h

@@ -23,6 +23,22 @@ struct task_struct *__switch_to(struct task_struct *prev,

 #ifdef CONFIG_X86_32

+#ifdef CONFIG_CC_STACKPROTECTOR

+#define __switch_canary                                                        \

+       "movl "__percpu_arg([current_task])",%%ebx\n\t"                 \

+       "movl %P[task_canary](%%ebx),%%ebx\n\t"                         \

+       "movl %%ebx,"__percpu_arg([stack_canary])"\n\t"

+#define __switch_canary_oparam                                         \

+       , [stack_canary] "=m" (per_cpu_var(stack_canary))

+#define __switch_canary_iparam                                         \

+       , [current_task] "m" (per_cpu_var(current_task))                \

+       , [task_canary] "i" (offsetof(struct task_struct, stack_canary))

+#else  /* CC_STACKPROTECTOR */

+#define __switch_canary

+#define __switch_canary_oparam

+#define __switch_canary_iparam

+#endif /* CC_STACKPROTECTOR */

+

 /*

  * Saving eflags is important. It switches not only IOPL between tasks,

  * it also protects other tasks from NT leaking through sysenter etc.

@@ -46,6 +62,7 @@ do {                                                                  \

                     "pushl %[next_ip]\n\t"     /* restore EIP   */     \

                     "jmp __switch_to\n"        /* regparm call  */     \

                     "1:\t"                                             \

+                    __switch_canary                                    \

                     "popl %%ebp\n\t"           /* restore EBP   */     \

                     "popfl\n"                  /* restore flags */     \

                                                                        \

@@ -58,6 +75,8 @@ do {                                                                  \

                       "=b" (ebx), "=c" (ecx), "=d" (edx),              \

                       "=S" (esi), "=D" (edi)                           \

                                                                        \

+                      __switch_canary_oparam                           \

+                                                                       \

                       /* input parameters: */                          \

                     : [next_sp]  "m" (next->thread.sp),                \

                       [next_ip]  "m" (next->thread.ip),                \

@@ -66,6 +85,8 @@ do {                                                                  \

                       [prev]     "a" (prev),                           \

                       [next]     "d" (next)                            \

                                                                        \

+                      __switch_canary_iparam                           \

+                                                                       \

                     : /* reloaded segment registers */                 \

                        "memory");                                      \

 } while (0)

前面讲过当gcc检测到堆栈溢出的时候，会调用glibc的stack_chk_fail函数，但是当内核堆栈发生溢出的时候，

不能调用glibc的函数，所以内核自己实现了一个stack_chk_fail函数：

kernel/panic.c

#ifdef CONFIG_CC_STACKPROTECTOR

/*

 * Called when gcc's -fstack-protector feature is used, and

 * gcc detects corruption of the on-stack canary value

 */

void __stack_chk_fail(void)

{

        panic("stack-protector: Kernel stack is corrupted in: %p\n",

                __builtin_return_address(0));

}

EXPORT_SYMBOL(__stack_chk_fail);

#endif

当内核堆栈发生溢出的时候，就会执行stack_chk_fail函数，内核当机。

这就是这个补丁的原理，不懂的同学请参考： http://git.kernel.org/?p=linux/kernel/git/next/linux-next.git;a=commitdiff;h=60a5317ff0f42dd313094b88f809f63041568b08

CC_STACKPROTECTOR防内核堆栈溢出补丁分析【转】的更多相关文章

web攻击之八：溢出攻击（nginx服务器防sql注入/溢出攻击/spam及禁User-agents）
一.什么是溢出攻击首先, 溢出,通俗的讲就是意外数据的重新写入,就像装满了水的水桶,继续装水就会溢出,而溢出攻击就是,攻击者可以控制溢出的代码,如果程序的对象是内核级别的,如dll.sys文件等,就 ...
【RTOS】FreeRTOS中的任务堆栈溢出检测机制
目录前言任务堆栈堆栈溢出任务堆栈溢出检测机制 API 两种堆栈溢出检测方式堆栈溢出钩子函数内核何时检测任务堆栈溢出任务堆栈溢出检测存在的局限性前言注意:本笔记发布时可能忘记补充查看d ...
STM32堆栈溢出
在使用STM32读取SD Card的文件时,总是会卡死在读函数那里 res = f_read(&fsrc, gbuffer, sizeof(gbuffer)-1, &br); 而且出现 ...
linux信号机制－用户堆栈和内核堆栈的变化【转】
转自:http://itindex.net/detail/16418-linux-%E4%BF%A1%E5%8F%B7-%E5%A0%86%E6%A0%88 此文只简单分析发送信号给用户程序后,用户堆 ...
VS2013 堆栈溢出调查（0xC00000FD: Stack overflow）
在调试一个代码时,执行过程中会出现如下错误(0xC00000FD: Stack overflow). 很明显是堆栈溢出了. 网上很多方法,都是通过修改设置工程配置,把堆栈调大一些,如下图. 但是堆栈到 ...
Linux内核堆栈使用方法进程0和进程1【转】
转自:http://blog.csdn.net/yihaolovem/article/details/37119971 目录(?)[-] 8 Linux 系统中堆栈的使用方法 81 初始化阶段 82 ...
JPA 一对多双向映射结果对象相互迭代造成堆栈溢出问题方法
问题: JPA 在双向映射时,会相互包含对方的实例,相互引用,造成递归迭代,堆栈溢出(java.lang.StackOverflowError). 分析: 在后端向前端传递的时候会将数据序列化,转为j ...
Linux内核 fork 源码分析
内核版本:linux-4.4.18 源码位置:这里 fork相关的代码最终执行的函数为_do_fork(),下面按照顺序分析下_do_fork(): 首先判断是否需要trace(跟踪)这个进程,这一步 ...
arcgis engine 中出现的内存堆栈溢出问题。
两种解决方案: 1.循环加载mxd文档的时候出现的堆栈溢出,解决办法是每次循环结束时清空FeatureLayer,感觉并不好,但是确实可以实现功能. 2.循环调取featureclass的search ...

随机推荐

DS实验题融合软泥怪-1
题目思路很容易想到第一种做法,用Huffman算法,从森林中找出两个值最小的节点,合并再加入森林,在这个过程中不断记录. 但是每一次需要sort一遍,将最小的两个值节点置于头两个节点,最坏情况下复 ...
PHP 实现数学问题：组合
需求: 有一个数组 ['a', 'b', 'c', 'cd'] 需要从数组中取出任意 m 个不重复的元素,列出所有的可能性(也不需要出现重复的组合例如['a', 'b' ,'c'] 和 ['a', ' ...
【java基础学习】反射
1. 什么是反射 Class.Method.Field.Constructor,它们是反射对象.它们是类.方法.成员变量.构造器,在内存中的形式. 也就是万物皆对象!类是类型.方法是类型.成员变量是类 ...
pyspider 安装时 Could not run curl-config
sudo aptitude install libcurl4-openssl-dev
C++ 支持两种索引的排行榜模板
version 1: #ifndef RANK_TMPL_H_ #define RANK_TMPL_H_ #include <stdio.h> #include <stdint.h& ...
hadoop（2014/0619）
map-reduce :分解任务和合并任务的能力 hdfs: namenode and datanode namenode放置元数据 datanoe放置数据
C# 接口笔记
/* 1. 实现多态的两种方式. * 使用虚方法实现多态. * 使用抽象方法实现多态. * ...
关于CLR、CIL、CTS、CLS、CLI、BCL和FCL
如果要想深入学习.NET平台,那么标题中的这些关键字对你来说并不陌生,这些名词构成了.NET庞大的生态系统,为了宏观认识.NET平台,学些.NET架构体系,针对一些常用常用名词的理解是很有必要的,未必 ...
自定义NSLog
我们在调试程序的时候,往往需要输出一些日志信息,用到NSLog函数,当我们准备发布程序,需要注释掉NSLog代码,这个时候往往会定义一个宏,在调试的时候,会输出日志,在Release正式版本的时候,会 ...
批量修改一张表格的多个sheet名
例如这样的函数: Sub aab() For x = 4 To 5 Sheets(x).Name = "10" & Format(x - 3, "00" ...

CC_STACKPROTECTOR防内核堆栈溢出补丁分析【转】

作者：王智通

CC_STACKPROTECTOR防内核堆栈溢出补丁分析【转】的更多相关文章

随机推荐

热门专题