third_party/afl/src/docs/status_screen.txt - Issue 2075883002: Add American Fuzzy Lop (afl) to third_party/afl/

Side by Side Diff: third_party/afl/src/docs/status_screen.txt

Issue 2075883002: Add American Fuzzy Lop (afl) to third_party/afl/ (Closed) Base URL: https://chromium.googlesource.com/chromium/src.git@master

Patch Set: Fix nits Created 4 years, 6 months ago

Use n/p to move between diff chunks; N/P to move between comments. Draft comments are only viewable by you.

Jump to:

View unified diff | Download patch

OLD	NEW
(Empty)
	1 ===============================

	2 Understanding the status screen

	3 ===============================

	4

	5 This document provides an overview of the status screen - plus tips for

	6 troubleshooting any warnings and red text shown in the UI. See README for

	7 the general instruction manual.

	8

	9 0) A note about colors

	10 ----------------------

	11

	12 The status screen and error messages use colors to keep things readable and

	13 attract your attention to the most important details. For example, red almost

	14 always means "consult this doc" :-)

	15

	16 Unfortunately, the UI will render correctly only if your terminal is using

	17 traditional un*x palette (white text on black background) or something close

	18 to that.

	19

	20 If you are using inverse video, you may want to change your settings, say:

	21

	22 - For GNOME Terminal, go to Edit > Profile preferences, select the "colors"

	23 tab, and from the list of built-in schemes, choose "white on black".

	24

	25 - For the MacOS X Terminal app, open a new window using the "Pro" scheme via

	26 the Shell > New Window menu (or make "Pro" your default).

	27

	28 Alternatively, if you really like your current colors, you can edit config.h

	29 to comment out USE_COLORS, then do 'make clean all'.

	30

	31 I'm not aware of any other simple way to make this work without causing

	32 other side effects - sorry about that.

	33

	34 With that out of the way, let's talk about what's actually on the screen...

	35

	36 1) Process timing

	37 -----------------

	38

	39 +----------------------------------------------------+

	40 \| run time : 0 days, 8 hrs, 32 min, 43 sec \|

	41 \| last new path : 0 days, 0 hrs, 6 min, 40 sec \|

	42 \| last uniq crash : none seen yet \|

	43 \| last uniq hang : 0 days, 1 hrs, 24 min, 32 sec \|

	44 +----------------------------------------------------+

	45

	46 This section is fairly self-explanatory: it tells you how long the fuzzer has

	47 been running and how much time has elapsed since its most recent finds. This is

	48 broken down into "paths" (a shorthand for test cases that trigger new execution

	49 patterns), crashes, and hangs.

	50

	51 When it comes to timing: there is no hard rule, but most fuzzing jobs should be

	52 expected to run for days or weeks; in fact, for a moderately complex project, th e

	53 first pass will probably take a day or so. Every now and then, some jobs

	54 will be allowed to run for months.

	55

	56 There's one important thing to watch out for: if the tool is not finding new

	57 paths within several minutes of starting, you're probably not invoking the

	58 target binary correctly and it never gets to parse the input files we're

	59 throwing at it; another possible explanations are that the default memory limit

	60 (-m) is too restrictive, and the program exits after failing to allocate a

	61 buffer very early on; or that the input files are patently invalid and always

	62 fail a basic header check.

	63

	64 If there are no new paths showing up for a while, you will eventually see a big

	65 red warning in this section, too :-)

	66

	67 2) Overall results

	68 ------------------

	69

	70 +-----------------------+

	71 \| cycles done : 0 \|

	72 \| total paths : 2095 \|

	73 \| uniq crashes : 0 \|

	74 \| uniq hangs : 19 \|

	75 +-----------------------+

	76

	77 The first field in this section gives you the count of queue passes done so far

	78 - that is, the number of times the fuzzer went over all the interesting test

	79 cases discovered so far, fuzzed them, and looped back to the very beginning.

	80 Every fuzzing session should be allowed to complete at least one cycle; and

	81 ideally, should run much longer than that.

	82

	83 As noted earlier, the first pass can take a day or longer, so sit back and

	84 relax. If you want to get broader but more shallow coverage right away, try

	85 the -d option - it gives you a more familiar experience by skipping the

	86 deterministic fuzzing steps. It is, however, inferior to the standard mode in

	87 a couple of subtle ways.

	88

	89 To help make the call on when to hit Ctrl-C, the cycle counter is color-coded.

	90 It is shown in magenta during the first pass, progresses to yellow if new finds

	91 are still being made in subsequent rounds, then blue when that ends - and

	92 finally, turns green after the fuzzer hasn't been seeing any action for a

	93 longer while.

	94

	95 The remaining fields in this part of the screen should be pretty obvious:

	96 there's the number of test cases ("paths") discovered so far, and the number of

	97 unique faults. The test cases, crashes, and hangs can be explored in real-time

	98 by browsing the output directory, as discussed in the README.

	99

	100 3) Cycle progress

	101 -----------------

	102

	103 +-------------------------------------+

	104 \| now processing : 1296 (61.86%) \|

	105 \| paths timed out : 0 (0.00%) \|

	106 +-------------------------------------+

	107

	108 This box tells you how far along the fuzzer is with the current queue cycle: it

	109 shows the ID of the test case it is currently working on, plus the number of

	110 inputs it decided to ditch because they were persistently timing out.

	111

	112 The "*" suffix sometimes shown in the first line means that the currently

	113 processed path is not "favored" (a property discussed later on, in section 6).

	114

	115 If you feel that the fuzzer is progressing too slowly, see the note about the

	116 -d option in section 2 of this doc.

	117

	118 4) Map coverage

	119 ---------------

	120

	121 +--------------------------------------+

	122 \| map density : 4763 (29.07%) \|

	123 \| count coverage : 4.03 bits/tuple \|

	124 +--------------------------------------+

	125

	126 The section provides some trivia about the coverage observed by the

	127 instrumentation embedded in the target binary.

	128

	129 The first line in the box tells you how many branch tuples we have already

	130 hit, in proportion to how much the bitmap can hold. Be wary of extremes:

	131

	132 - Absolute numbers below 200 or so suggest one of three things: that the

	133 program is extremely simple; that it is not instrumented properly (e.g.,

	134 due to being linked against a non-instrumented copy of the target

	135 library); or that it is bailing out prematurely on your input test cases.

	136 The fuzzer will try to mark this in pink, just to make you aware.

	137

	138 - Percentages over 70% may very rarely happen with very complex programs

	139 that make heavy use of template-generated code.

	140

	141 Because high bitmap density makes it harder for the fuzzer to reliably

	142 discern new program states, I recommend recompiling the binary with

	143 AFL_INST_RATIO=10 or so and trying again (see env_variables.txt).

	144

	145 The fuzzer will flag high percentages in red. Chances are, you will never

	146 see that unless you're fuzzing extremely hairy software (say, v8, perl,

	147 ffmpeg).

	148

	149 The other line deals with the variability in tuple hit counts seen in the

	150 binary. In essence, if every taken branch is always taken a fixed number of

	151 times for all the inputs we have tried, this will read "1.00". As we manage

	152 to trigger other hit counts for every branch, the needle will start to move

	153 toward "8.00" (every bit in the 8-bit map hit), but will probably never

	154 reach that extreme.

	155

	156 Together, the values can be useful for comparing the coverage of several

	157 different fuzzing jobs that rely on the same instrumented binary.

	158

	159 5) Stage progress

	160 -----------------

	161

	162 +-------------------------------------+

	163 \| now trying : interest 32/8 \|

	164 \| stage execs : 3996/34.4k (11.62%) \|

	165 \| total execs : 27.4M \|

	166 \| exec speed : 891.7/sec \|

	167 +-------------------------------------+

	168

	169 This part gives you an in-depth peek at what the fuzzer is actually doing right

	170 now. It tells you about the current stage, which can be any of:

	171

	172 - calibration - a pre-fuzzing stage where the execution path is examined

	173 to detect anomalies, establish baseline execution speed, and so on. Executed

	174 very briefly whenever a new find is being made.

	175

	176 - trim L/S - another pre-fuzzing stage where the test case is trimmed to the

	177 shortest form that still produces the same execution path. The length (L)

	178 and stepover (S) are chosen in general relationship to file size.

	179

	180 - bitflip L/S - deterministic bit flips. There are L bits toggled at any given

	181 time, walking the input file with S-bit increments. The current L/S variants

	182 are: 1/1, 2/1, 4/1, 8/8, 16/8, 32/8.

	183

	184 - arith L/8 - deterministic arithmetics. The fuzzer tries to subtract or add

	185 small integers to 8-, 16-, and 32-bit values. The stepover is always 8 bits.

	186

	187 - interest L/8 - deterministic value overwrite. The fuzzer has a list of known

	188 "interesting" 8-, 16-, and 32-bit values to try. The stepover is 8 bits.

	189

	190 - extras - deterministic injection of dictionary terms. This can be shown as

	191 "user" or "auto", depending on whether the fuzzer is using a user-supplied

	192 dictionary (-x) or an auto-created one. You will also see "over" or "insert" ,

	193 depending on whether the dictionary words overwrite existing data or are

	194 inserted by offsetting the remaining data to accommodate their length.

	195

	196 - havoc - a sort-of-fixed-length cycle with stacked random tweaks. The

	197 operations attempted during this stage include bit flips, overwrites with

	198 random and "interesting" integers, block deletion, block duplication, plus

	199 assorted dictionary-related operations (if a dictionary is supplied in the

	200 first place).

	201

	202 - splice - a last-resort strategy that kicks in after the first full queue

	203 cycle with no new paths. It is equivalent to 'havoc', except that it first

	204 splices together two random inputs from the queue at some arbitrarily

	205 selected midpoint.

	206

	207 - sync - a stage used only when -M or -S is set (see parallel_fuzzing.txt).

	208 No real fuzzing is involved, but the tool scans the output from other

	209 fuzzers and imports test cases as necessary. The first time this is done,

	210 it may take several minutes or so.

	211

	212 The remaining fields should be fairly self-evident: there's the exec count

	213 progress indicator for the current stage, a global exec counter, and a

	214 benchmark for the current program execution speed. This may fluctuate from

	215 one test case to another, but the benchmark should be ideally over 500 execs/sec

	216 most of the time - and if it stays below 100, the job will probably take very

	217 long.

	218

	219 The fuzzer will explicitly warn you about slow targets, too. If this happens,

	220 see the perf_tips.txt file included with the fuzzer for ideas on how to speed

	221 things up.

	222

	223 6) Findings in depth

	224 --------------------

	225

	226 +--------------------------------------+

	227 \| favored paths : 879 (41.96%) \|

	228 \| new edges on : 423 (20.19%) \|

	229 \| total crashes : 0 (0 unique) \|

	230 \| total hangs : 24 (19 unique) \|

	231 +--------------------------------------+

	232

	233 This gives you several metrics that are of interest mostly to complete nerds.

	234 The section includes the number of paths that the fuzzer likes the most based

	235 on a minimization algorithm baked into the code (these will get considerably

	236 more air time), and the number of test cases that actually resulted in better

	237 edge coverage (versus just pushing the branch hit counters up). There are also

	238 additional, more detailed counters for crashes and hangs.

	239

	240 7) Fuzzing strategy yields

	241 --------------------------

	242

	243 +-----------------------------------------------------+

	244 \| bit flips : 57/289k, 18/289k, 18/288k \|

	245 \| byte flips : 0/36.2k, 4/35.7k, 7/34.6k \|

	246 \| arithmetics : 53/2.54M, 0/537k, 0/55.2k \|

	247 \| known ints : 8/322k, 12/1.32M, 10/1.70M \|

	248 \| dictionary : 9/52k, 1/53k, 1/24k \|

	249 \| havoc : 1903/20.0M, 0/0 \|

	250 \| trim : 20.31%/9201, 17.05% \|

	251 +-----------------------------------------------------+

	252

	253 This is just another nerd-targeted section keeping track of how many paths we

	254 have netted, in proportion to the number of execs attempted, for each of the

	255 fuzzing strategies discussed earlier on. This serves to convincingly validate

	256 assumptions about the usefulness of the various approaches taken by afl-fuzz.

	257

	258 The trim strategy stats in this section are a bit different than the rest.

	259 The first number in this line shows the ratio of bytes removed from the input

	260 files; the second one corresponds to the number of execs needed to achieve this

	261 goal. Finally, the third number shows the proportion of bytes that, although

	262 not possible to remove, were deemed to have no effect and were excluded from

	263 some of the more expensive deterministic fuzzing steps.

	264

	265 8) Path geometry

	266 ----------------

	267

	268 +---------------------+

	269 \| levels : 5 \|

	270 \| pending : 1570 \|

	271 \| pend fav : 583 \|

	272 \| own finds : 0 \|

	273 \| imported : 0 \|

	274 \| variable : 0 \|

	275 +---------------------+

	276

	277 The first field in this section tracks the path depth reached through the

	278 guided fuzzing process. In essence: the initial test cases supplied by the

	279 user are considered "level 1". The test cases that can be derived from that

	280 through traditional fuzzing are considered "level 2"; the ones derived by

	281 using these as inputs to subsequent fuzzing rounds are "level 3"; and so forth.

	282 The maximum depth is therefore a rough proxy for how much value you're getting

	283 out of the instrumentation-guided approach taken by afl-fuzz.

	284

	285 The next field shows you the number of inputs that have not gone through any

	286 fuzzing yet. The same stat is also given for "favored" entries that the fuzzer

	287 really wants to get to in this queue cycle (the non-favored entries may have to

	288 wait a couple of cycles to get their chance).

	289

	290 Next, we have the number of new paths found during this fuzzing section and

	291 imported from other fuzzer instances when doing parallelized fuzzing; and the

	292 number of inputs that produce seemingly variable behavior in the tested binary.

	293

	294 That last bit is actually fairly interesting. There are four quasi-common

	295 explanations for variable behavior of the tested program:

	296

	297 - Use of uninitialized memory in conjunction with some intrinsic sources of

	298 entropy in the tested binary. This can be indicative of a security bug.

	299

	300 - Attempts to create files that were already created during previous runs, or

	301 otherwise interact with some form of persistent state. This is harmless,

	302 but you may want to instruct the targeted program to write to stdout or to

	303 /dev/null to avoid surprises (and disable the creation of temporary files

	304 and similar artifacts, if applicable).

	305

	306 - Hitting functionality that is actually designed to behave randomly. For

	307 example, when fuzzing sqlite, the fuzzer will dutifully detect variable

	308 behavior once the mutation engine generates something like:

	309

	310 select random();

	311

	312 - Multiple threads executing at once in semi-random order. This is usually

	313 just a nuisance, but if the number of variable paths is very high, try the

	314 following options:

	315

	316 - Use afl-clang-fast from llvm_mode/ - it uses a thread-local tracking

	317 model that is less prone to concurrency issues,

	318

	319 - See if the target can be compiled or run without threads. Common

	320 ./configure options include --without-threads, --disable-pthreads, or

	321 --disable-openmp.

	322

	323 - Replace pthreads with GNU Pth (https://www.gnu.org/software/pth/), which

	324 allows you to use a deterministic scheduler.

	325

	326 Less likely causes may include running out of disk space, SHM handles, or other

	327 globally limited resources.

	328

	329 The paths where variable behavior is detected are marked with a matching entry

	330 in the <out_dir>/queue/.state/variable_behavior/ directory, so you can look

	331 them up easily.

	332

	333 If you can't suppress variable behavior and don't want to see these warnings,

	334 simply set AFL_NO_VAR_CHECK=1 in the environment before running afl-fuzz. This

	335 will also dramatically speed up session resumption.

	336

	337 9) CPU load

	338 -----------

	339

	340 [cpu: 25%]

	341

	342 This tiny widget shows the apparent CPU utilization on the local system. It is

	343 calculated by taking the number of processes in the "runnable" state, and then

	344 comparing it to the number of logical cores on the system.

	345

	346 If the value is shown in green, you are using fewer CPU cores than available on

	347 your system and can probably parallelize to improve performance; for tips on

	348 how to do that, see parallel_fuzzing.txt.

	349

	350 If the value is shown in red, your CPU is possibly oversubscribed, and

	351 running additional fuzzers may not give you any benefits.

	352

	353 Of course, this benchmark is very simplistic; it tells you how many processes

	354 are ready to run, but not how resource-hungry they may be. It also doesn't

	355 distinguish between physical cores, logical cores, and virtualized CPUs; the

	356 performance characteristics of each of these will differ quite a bit.

	357

	358 If you want a more accurate measurement, you can run the afl-gotcpu utility

	359 from the command line.

	360

	361 10) Addendum: status and plot files

	362 -----------------------------------

	363

	364 For unattended operation, some of the key status screen information can be also

	365 found in a machine-readable format in the fuzzer_stats file in the output

	366 directory. This includes:

	367

	368 - start_time - unix time indicating the start time of afl-fuzz

	369 - last_update - unix time corresponding to the last update of this file

	370 - fuzzer_pid - PID of the fuzzer process

	371 - cycles_done - queue cycles completed so far

	372 - execs_done - number of execve() calls attempted

	373 - execs_per_sec - current number of execs per second

	374 - paths_total - total number of entries in the queue

	375 - paths_found - number of entries discovered through local fuzzing

	376 - paths_imported - number of entries imported from other instances

	377 - max_depth - number of levels in the generated data set

	378 - cur_path - currently processed entry number

	379 - pending_favs - number of favored entries still waiting to be fuzzed

	380 - pending_total - number of all entries waiting to be fuzzed

	381 - variable_paths - number of test cases showing variable behavior

	382 - unique_crashes - number of unique crashes recorded

	383 - unique_hangs - number of unique hangs encountered

	384

	385 Most of these map directly to the UI elements discussed earlier on.

	386

	387 On top of that, you can also find an entry called 'plot_data', containing a

	388 plottable history for most of these fields. If you have gnuplot installed, you

	389 can turn this into a nice progress report with the included 'afl-plot' tool.

OLD	NEW

« no previous file with comments | « third_party/afl/src/docs/sister_projects.txt ('k') | third_party/afl/src/docs/technical_details.txt » ('j') | no next file with comments »