Issue 1834303002: Rework the mime sniffer fuzzer. (Closed)

Created:
4 years, 8 months ago by mmenke

Modified:
4 years, 8 months ago

Reviewers:
eroman

CC:
chromium-reviews, cbentzel+watch_chromium.org, mmoroz

Base URL:
https://chromium.googlesource.com/chromium/src.git@master

Target Ref:
refs/pending/heads/master

Project:
chromium

Visibility:
Public.

Description

Rework the mime sniffer fuzzer.

In particular, make it fuzz the URL and content-type header, and make it check the other top level mime sniffing function. Also, rename it so it's more clearly associated with mime_sniffer.h.

BUG=598397

Committed: https://crrev.com/5552a6a020ac21565f4a92a36d545e8115c56132
Cr-Commit-Position: refs/heads/master@{#383597}

Patch Set 1 #

Patch Set 2 : Update comment #

Total comments: 12

Patch Set 3 : Remove ShouldSniffMimeType #

Patch Set 4 : Response to comments #

Patch Set 5 : Update comment #

Created: 4 years, 8 months ago

Stats: +63 lines, -29 lines

Status  File                                Chunks    Delta                 Comments
M       net/BUILD.gn                        2 chunks  +11 lines, -11 lines  0
A       net/base/mime_sniffer_fuzzer.cc     1 chunk   +52 lines, -0 lines   0
D       net/base/sniff_mime_type_fuzzer.cc  1 chunk   +0 lines, -18 lines   0

Messages

Total messages: 19 (5 generated)

mmenke

Eric: Hrm...Didn't plan to send a bunch of reviews your way, just the way things ...

4 years, 8 months ago (2016-03-28 20:20:59 UTC) #2

mmenke

I'm not planning to do a ton of fuzzers all at once, thinking just 1-2 ...

4 years, 8 months ago (2016-03-28 20:24:18 UTC) #3

eroman

Given that I wrote the original test, sending me the reviews is totally reasonable :)

4 years, 8 months ago (2016-03-28 20:30:35 UTC) #4

mmenke

On 2016/03/28 20:30:35, eroman wrote: > Given that I wrote the original test, sending me ...

4 years, 8 months ago (2016-03-28 20:34:45 UTC) #5

eroman

> Ahh, didn't realize that. Had just assumed it was written by the cluster > ...

4 years, 8 months ago (2016-03-28 21:00:26 UTC) #7

mmenke

On 2016/03/28 21:00:26, eroman wrote: > > Ahh, didn't realize that. Had just assumed it ...

4 years, 8 months ago (2016-03-28 21:22:04 UTC) #8

On 2016/03/28 21:00:26, eroman wrote:
> > Ahh, didn't realize that.  Had just assumed it was written by the cluster
> > fuzz team...
> 
> Did you assume that because you thought it was a weak test written by someone
> without insight into how the network stack code works.... or because it was
> awesome and wow-ed you?

It was actually the file name mismatch...Which was awesome and wow-ed me?

https://codereview.chromium.org/1834303002/diff/20001/net/base/mime_sniffer_f...
File net/base/mime_sniffer_fuzzer.cc (right):

https://codereview.chromium.org/1834303002/diff/20001/net/base/mime_sniffer_f...
net/base/mime_sniffer_fuzzer.cc:17: // Finds the line break in |string_piece|,
removes every up to and including the
On 2016/03/28 21:00:25, eroman wrote:
> This comment is missing something.
> 
> removes everything, or removes "every one"

Oops.  Done.  Rewrote this method just a few times.  Too many chefs, and all
that...Even when I'm the only chef.  :)

https://codereview.chromium.org/1834303002/diff/20001/net/base/mime_sniffer_f...
net/base/mime_sniffer_fuzzer.cc:18: // line break from |string_piece|, and
returns all the
On 2016/03/28 21:00:26, eroman wrote:
> Almost like you ended your thought before -
> 
> 
> 
> then the pterodactyl safely landed Optimus Prime onto the Millennium Falcon.

That's just silly.  The pterodactyl was a Decepticon, and would never save
Optimus Prime!

https://codereview.chromium.org/1834303002/diff/20001/net/base/mime_sniffer_f...
net/base/mime_sniffer_fuzzer.cc:19: std::string
GetNextArgument(base::StringPiece* string_piece) {
On 2016/03/28 21:00:25, eroman wrote:
> not a fan of name "string_piece"
> 
> |input| would be more descriptive

Done.
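
Note for readers following along: a minimal standalone sketch of the helper under discussion, using the |input| name suggested above. It is not the code from the patch. It uses std::string_view in place of base::StringPiece so it compiles outside Chromium, and its behavior when no line break is present (consume the whole remainder) is an assumption, since the comment quoted in this review is cut off.

```cpp
#include <cassert>
#include <string>
#include <string_view>

// Hypothetical stand-in for the GetNextArgument() helper being reviewed.
// Finds the first line break in |input|, removes everything up to and
// including that line break from |input|, and returns the text before it.
std::string GetNextArgument(std::string_view* input) {
  assert(input != nullptr);
  const std::string_view::size_type line_end = input->find('\n');
  if (line_end == std::string_view::npos) {
    // Assumption: with no line break left, the remainder is the argument.
    std::string argument(*input);
    *input = std::string_view();
    return argument;
  }
  std::string argument(input->substr(0, line_end));
  input->remove_prefix(line_end + 1);  // Also drops the line break itself.
  return argument;
}
```

Called twice on fuzzer input, a helper like this would peel off the URL line and the mime type hint line, leaving the rest of |input| as the content to sniff.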

https://codereview.chromium.org/1834303002/diff/20001/net/base/mime_sniffer_f...
net/base/mime_sniffer_fuzzer.cc:47: net::ShouldSniffMimeType(url,
mime_type_hint);
On 2016/03/28 21:00:26, eroman wrote:
> This function has a dependence on the URL scheme, however we are only passing
> in https:// URLs here.
> 
> Would it be better to feed it a URL whose scheme can vary?

I've switched this to take entire URL.

I was taking the path just to reduce the GURL search space (No IDN, no file://
magic, for instance, which has some different rules), as I was horrified how
many of the generated test cases were just probing GURL's logic.  Even with just
the path, we get hundreds of them.  I don't think mime sniffing should ever
depend on scheme in the future - we've pushed back against adding any more
logic, and I think more dependencies on the URL is something we even more
strongly don't want, but that could change, I suppose.

I've also removed the call to ShouldSniffMimeType, and now just check the other
two.  Happy to add it back, it just doesn't seem to do anything exciting enough.

https://codereview.chromium.org/1834303002/diff/20001/net/base/mime_sniffer_f...
net/base/mime_sniffer_fuzzer.cc:50: net::SniffMimeType(input.data(),
input.length(), url, mime_type_hint,
On 2016/03/28 21:00:26, eroman wrote:
> I presume it may be the case that |!url.is_valid()|. Is this going to hit any
> DCHECKs in how |url| is consumed?

So I don't think that case will currently be hit, and the mime type logic
doesn't care, anyways.  Changing URL to be an arbitrary string would cause us to
hit that case, but it still seems to be fine.

mmenke

https://codereview.chromium.org/1834303002/diff/20001/net/base/mime_sniffer_fuzzer.cc File net/base/mime_sniffer_fuzzer.cc (right): https://codereview.chromium.org/1834303002/diff/20001/net/base/mime_sniffer_fuzzer.cc#newcode47 net/base/mime_sniffer_fuzzer.cc:47: net::ShouldSniffMimeType(url, mime_type_hint); On 2016/03/28 21:22:04, mmenke wrote: > On ...

4 years, 8 months ago (2016-03-28 21:57:53 UTC) #9

https://codereview.chromium.org/1834303002/diff/20001/net/base/mime_sniffer_f...
File net/base/mime_sniffer_fuzzer.cc (right):

https://codereview.chromium.org/1834303002/diff/20001/net/base/mime_sniffer_f...
net/base/mime_sniffer_fuzzer.cc:47: net::ShouldSniffMimeType(url,
mime_type_hint);
On 2016/03/28 21:22:04, mmenke wrote:
> On 2016/03/28 21:00:26, eroman wrote:
> > This function has a dependence on the URL scheme, however we are only
> > passing in https:// URLs here.
> > 
> > Would it be better to feed it a URL whose scheme can vary?
> 
> I've switched this to take entire URL.
> 
> I was taking the path just to reduce the GURL search space (No IDN, no file://
> magic, for instance, which has some different rules), as I was horrified how
> many of the generated test cases were just probing GURL's logic.  Even with
> just the path, we get hundreds of them.  I don't think mime sniffing should ever
> depend on scheme in the future - we've pushed back against adding any more
> logic, and I think more dependencies on the URL is something we even more
> strongly don't want, but that could change, I suppose.
> 
> I've also removed the call to ShouldSniffMimeType, and now just check the other
> two.  Happy to add it back, it just doesn't seem to do anything exciting
> enough.

And just to confirm, running it now: over 90% of the test cases it saves come
from exploring the file:// and filesystem:// URL spaces, with only one line of
data.  We could force there to be at least two line feeds, which would mean that
as it explores the URL space, there's a higher chance of it exploring the other
arguments as well.  I'm not sure it would get us much, though.

It has discovered that .crx is magic, and one of the mime-type hints, at least,
so it's not completely wasting its time.

I find it interesting to see what it comes up with.  Clearly, I need more
excitement in my life.  Maybe I'll take up watching youtube videos of people
bungee jumping.
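
Note for readers following along: the "at least two line feeds" idea above could look roughly like the sketch below. It is only an illustration of the idea, not something that was added to the patch. HasAtLeastTwoLineFeeds is a hypothetical helper name, and the sniffing calls themselves are left as a comment so the example compiles outside Chromium.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>

// Hypothetical helper: true if the raw fuzzer input contains enough line
// feeds to hold a URL line, a mime type hint line, and some content.
bool HasAtLeastTwoLineFeeds(const uint8_t* data, size_t size) {
  return std::count(data, data + size, static_cast<uint8_t>('\n')) >= 2;
}

// libFuzzer entry point.
extern "C" int LLVMFuzzerTestOneInput(const uint8_t* data, size_t size) {
  // Skip inputs that would only exercise URL parsing.
  if (!HasAtLeastTwoLineFeeds(data, size))
    return 0;
  // The URL, mime type hint, and content would be split out and passed to
  // the mime sniffing functions here.
  return 0;
}
```

Whether an early return like this actually steers the fuzzer toward the other arguments is untested here, which matches the doubt expressed in the message.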

eroman

> It was actually the file name mismatch. I named the file sniff_mime_type_fuzzer.cc because it ...

4 years, 8 months ago (2016-03-28 22:28:09 UTC) #10

mmenke

On 2016/03/28 22:28:09, eroman wrote: > > It was actually the file name mismatch. > ...

4 years, 8 months ago (2016-03-28 22:33:51 UTC) #11

commit-bot: I haz the power

CQ is trying da patch.
Follow status at https://chromium-cq-status.appspot.com/patch-status/1834303002/80001
View timeline at https://chromium-cq-status.appspot.com/patch-timeline/1834303002/80001

4 years, 8 months ago (2016-03-28 22:35:32 UTC) #13

eroman

I agree on both counts. This is kind of a crazy idea, but we could ...

4 years, 8 months ago (2016-03-28 22:40:03 UTC) #14

mmenke

On 2016/03/28 22:28:09, eroman wrote: > > It was actually the file name mismatch. > ...

4 years, 8 months ago (2016-03-28 22:47:41 UTC) #15

commit-bot: I haz the power

4 years, 8 months ago (2016-03-28 23:13:51 UTC) #19

Message was sent while issue was closed.

Patchset 5 (id:??) landed as
https://crrev.com/5552a6a020ac21565f4a92a36d545e8115c56132
Cr-Commit-Position: refs/heads/master@{#383597}
