Issue 2881583002: [fuzzer] Add input validation in the beginning of the parser fuzz target.

Issue 2881583002: [fuzzer] Add input validation in the beginning of the parser fuzz target. (Closed)

Created:
3 years, 7 months ago by mmoroz

Modified:
3 years, 7 months ago

Reviewers:
Martin Barbella, marja

CC:
v8-reviews_googlegroups.com

Target Ref:
refs/heads/master

Project:
v8

Visibility:
Public.

More Reviews

Description

[fuzzer] Add input validation in the beginning of the parser fuzz target. Non-printable characters do not make sense. Inputs with non balanced brackets are mostly useless as well. This validation function makes the fuzzer 15-20x faster. Also use -only_ascii=1 option of libFuzzer: https://codereview.chromium.org/2875933003 BUG=chromium:584819 Review-Url: https://codereview.chromium.org/2881583002 Cr-Commit-Position: refs/heads/master@{#45367} Committed: https://chromium.googlesource.com/v8/v8/+/96628339311653f8b05e7d9658af1ae681b26bea

Patch Set 1 #

Total comments: 4

Patch Set 2 : Fix coding style and apply git cl format #

Created: 3 years, 7 months ago

Download [raw] [tar.bz2]

		Unified diffs	Side-by-side diffs	Delta from patch set	Stats (+44 lines, -0 lines)			Patch
	M	test/fuzzer/parser.cc	View	1	1 chunk	+44 lines, -0 lines	0 comments	Download

Messages

Total messages: 17 (7 generated)

Expand Messages | Collapse Messages | Show Generated Messages | Hide Generated Messages

mmoroz

On 2017/05/12 09:31:08, mmoroz wrote: > mailto:mmoroz@chromium.org changed reviewers: > + mailto:marja@chromium.org, mailto:mbarbella@chromium.org Please take ...

3 years, 7 months ago (2017-05-16 12:28:19 UTC) #3

marja

q: is the current parser fuzzer producing any interesting reports, ie is it ok to ...

3 years, 7 months ago (2017-05-16 12:30:12 UTC) #4

mmoroz

On 2017/05/16 12:30:12, marja wrote: > q: is the current parser fuzzer producing any interesting ...

3 years, 7 months ago (2017-05-16 13:09:21 UTC) #5

marja

On 2017/05/16 13:09:21, mmoroz wrote: > On 2017/05/16 12:30:12, marja wrote: > > q: is ...

3 years, 7 months ago (2017-05-16 13:12:12 UTC) #6

mmoroz

On 2017/05/16 13:12:12, marja wrote: > On 2017/05/16 13:09:21, mmoroz wrote: > > On 2017/05/16 ...

3 years, 7 months ago (2017-05-16 14:14:56 UTC) #7

mmoroz

On 2017/05/16 14:14:56, mmoroz wrote: > On 2017/05/16 13:12:12, marja wrote: > > On 2017/05/16 ...

3 years, 7 months ago (2017-05-17 08:25:41 UTC) #8

On 2017/05/16 14:14:56, mmoroz wrote:
> On 2017/05/16 13:12:12, marja wrote:
> > On 2017/05/16 13:09:21, mmoroz wrote:
> > > On 2017/05/16 12:30:12, marja wrote:
> > > > q: is the current parser fuzzer producing any interesting reports, ie is
> it
> > ok
> > > > to fully repurpose it?
> > > 
> > > I do not run it locally since the electricity outage last week. Looking at
> the
> > > input I've gotten before then, they seem to have potential. At least, they
> are
> > > better than binary trash that is being rejected by parser anyway.
> > > 
> > > I also think that it makes sense to use some -max_len limit. Maybe 48 or
64?
> > > I'll add it to
> > >
> >
>
https://cs.chromium.org/chromium/src/testing/libfuzzer/fuzzers/BUILD.gn?q=est...
> > 
> > So is the plan to fully discard the current parser fuzzer and make it
produce
> > different kind of examples?
> > 
> > 48 / 64 sound really short - they're probably good for the minimal examples
I
> > showed you, but there are other interesting cases which don't fit there.
> 
> I wouldn't say "fully discard", but would say "iteratively upgrade" the fuzzer
> to generate better testcases :)
> 
> Current upgrade resolves two issues:
> a) bad characters in the input
> b) non balanced braces
> 
> These are low hanging fruits, but anyway it seems to be a movement in
right17923
> direction. As a side effect, the fuzzer becomes much faster, as we reject bad
> inputs before starting all the V8 stuff.
> 
> Thanks for clarification on max_len, I won't change it then.

I also checked how the coverage changes if we land this.

Existing fuzzer has shown 16933 edges on a minimized corpus from the cloud

If I replace all bad characters with spaces (' '), that corpus gives 16407 edges

If I add new corpus files (generated during 2-3 days on my desktop), it becomes
17646

I think that we lose a bit of "trashy" coverage, that happens due to the parser
bailing out on invalid inputs, but we get a good increment of "good" coverage
that is produced by somewhat valid-looking inputs.

PTAL

marja

lgtm https://codereview.chromium.org/2881583002/diff/1/test/fuzzer/parser.cc File test/fuzzer/parser.cc (right): https://codereview.chromium.org/2881583002/diff/1/test/fuzzer/parser.cc#newcode26 test/fuzzer/parser.cc:26: if (!(std::isspace(ptr[i]) || std::isprint(ptr[i]))) Coding style nit: if ...

3 years, 7 months ago (2017-05-17 08:56:20 UTC) #9

mmoroz

Thanks Marja! https://codereview.chromium.org/2881583002/diff/1/test/fuzzer/parser.cc File test/fuzzer/parser.cc (right): https://codereview.chromium.org/2881583002/diff/1/test/fuzzer/parser.cc#newcode26 test/fuzzer/parser.cc:26: if (!(std::isspace(ptr[i]) || std::isprint(ptr[i]))) On 2017/05/17 08:56:20, ...

3 years, 7 months ago (2017-05-17 09:59:15 UTC) #11

commit-bot: I haz the power

CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2881583002/20001

3 years, 7 months ago (2017-05-17 09:59:48 UTC) #14

commit-bot: I haz the power

3 years, 7 months ago (2017-05-17 10:36:20 UTC) #17

Message was sent while issue was closed.

Committed patchset #2 (id:20001) as
https://chromium.googlesource.com/v8/v8/+/96628339311653f8b05e7d9658af1ae681b...

Expand Messages | Collapse Messages | Show Generated Messages | Hide Generated Messages