Proj EULibHarn Paper Reading: IntelliGen: Automatic Driver Synthesis for FuzzTesting

Abstract

本文: IntelliGen
目标: 生成fuzz driver
步骤：

确定entry function, 评价权重
通过层次参数替换(hierarchical parameter replacement)和类型推断生成driver
实验效果:
实验对象: Android Open-Source Project; Google's fuzzer-test-suite; 工业上的其他合作方
竞争软件: FUDGE, FuzzGen
效果:
average 1.08×-2.03× more basic blocks and 1.36×-2.06× more paths
找到10多个bugs

1. Intro

P4: 本文认为生成Driver的两个挑战1. 找到有高价值（code coverage + memcpy)的入口函数 2. 如何生成

Automatic fuzz driver generation
IntelliGen:

FUDGE: 会生成大量的candidate drivers，让用户去自行挑选（但是这里的叙述，包括related work中对signature的描述，还有直接给用户等很令人怀疑）
FuzzGen: 需要高质量的test cases

API Usage Mining

MAPO: 被调用的频率
GrouMiner: 在整个proj的图上找usage pattern
APIExample: 找到具有相似使用例子的函数

Unit-test generation

Kampmann等: 从系统测试执行中抽取带参数的unit tests
Testful: 生成Java类或者方法的test cases
Pacheco等: 借助feedback
GRT: 京动态分析
GenRed: 生成并缩减object-oriented test cases

3. Design

A. Entry Function Locator
文中提到一个好的driver会绕过许多检查直达深层逻辑?
这里强调的三种func分别是:

指针解引用
对内存的分配释放拷贝
如果调用了其他函数，+= func2.priority

B. Fuzz Driver Synthesizer

需要能将fuzz engine生成的输入正确地转化为参数
亮点：
挑战：对于指针类型的参数，常常解引用前都不知道具体期待的类型
本文：到第一次使用才给指针类型赋值(lazy store)

记录全部memory range分配情况
遇到store，将对应的memory range设置为assigned
遇到load，检查是否已经assigned，如果没有，就构建一个目标类型的结构体放到对应的memory range里面

4. Implementation

LLVM, LLVMFuzzerTestOneInput()，生成IR等级的driver+librarycode->fuzzer

Choosing effective entry functions: 用户要自己找适合的entry function
Avoiding redundant memory assignments：只lazy store哪些没有被设定为occupied的区域
Synthesizing complex arguments for the entry function
Assigning appropriate values for arguments: IntelliGen 扫描函数的 IR 以搜索比较指令
Filtering out useless drivers：自动执行下，如果crash就丢掉。此外，为了避免memory leak，IntelliGen还把全部的内存分配释放都hook

5. Evaluation

数据集:

AOSP中的6个libs
fuzzer-test-suite中带有手写driver的9个
3个来自企业合作方的real world projs

指标: basic block coverage, path coverage
通过llvm-cov收集

硬件环境，每个driver运行10次，每次4线程跑6个小时，表中为平均值。
对象：IntelliGen，FuzzGen

疑问: We do not list the bugs found in these libraries since these do not have a standard bug list.

6. Case Study on Real Projects

7. Lesson Learned

Writing a fuzz driver manually seriously hinders the efficiency of fuzz testing.
The quality of fuzz drivers will drastically impact the performance of fuzz testing
The performance of driver synthesis can be improved with more domain knowledge.
The criteria for identifying effective entry functions is largely undetermine

posted @ 2021-12-26 04:01 雪溯阅读(299) 评论(0) 编辑收藏举报

刷新页面返回顶部

雪溯

总之心情不好的话大概就会来这边做两道OJ，此处顺便储存部分笔记

Proj EULibHarn Paper Reading: IntelliGen: Automatic Driver Synthesis for FuzzTesting

Abstract

1. Intro

3. Design

4. Implementation

5. Evaluation

6. Case Study on Real Projects

7. Lesson Learned

公告

雪溯

总之心情不好的话大概就会来这边做两道OJ，此处顺便储存部分笔记

Proj EULibHarn Paper Reading: IntelliGen: Automatic Driver Synthesis for FuzzTesting

Abstract

1. Intro

2. Related Work

3. Design

4. Implementation

5. Evaluation

6. Case Study on Real Projects

7. Lesson Learned

公告