Issue 2961373002: Improve Zip File Scanning on Mac

mortonm

Patchset #7 (id:120001) has been deleted

3 years, 5 months ago (2017-07-11 17:05:40 UTC) #1

mortonm

Description was changed from ========== trying stuff out BUG=600392 ========== to ========== This CL fixes ...

3 years, 5 months ago (2017-07-12 17:33:30 UTC) #2

mortonm

mortonm@google.com changed reviewers: + jialiul@chromium.org

3 years, 5 months ago (2017-07-12 17:33:30 UTC) #3

Jialiu Lin

Looking good :-) https://codereview.chromium.org/2961373002/diff/260001/chrome/common/safe_browsing/zip_analyzer.cc File chrome/common/safe_browsing/zip_analyzer.cc (right): https://codereview.chromium.org/2961373002/diff/260001/chrome/common/safe_browsing/zip_analyzer.cc#newcode70 chrome/common/safe_browsing/zip_analyzer.cc:70: LOG(ERROR) << "macho?: " << (bytes.compare("\xcf\xfa\xed\xfe") ...

3 years, 5 months ago (2017-07-12 17:57:11 UTC) #11

mortonm

https://codereview.chromium.org/2961373002/diff/260001/chrome/common/safe_browsing/zip_analyzer.cc File chrome/common/safe_browsing/zip_analyzer.cc (right): https://codereview.chromium.org/2961373002/diff/260001/chrome/common/safe_browsing/zip_analyzer.cc#newcode70 chrome/common/safe_browsing/zip_analyzer.cc:70: LOG(ERROR) << "macho?: " << (bytes.compare("\xcf\xfa\xed\xfe") == 0); On ...

3 years, 5 months ago (2017-07-12 18:15:06 UTC) #12

mortonm

Description was changed from ========== This CL fixes two aspects of broken ZIP processing on ...

3 years, 5 months ago (2017-07-12 18:16:33 UTC) #13

mortonm

On 2017/07/12 18:15:06, mortonm wrote: > https://codereview.chromium.org/2961373002/diff/260001/chrome/common/safe_browsing/zip_analyzer.cc > File chrome/common/safe_browsing/zip_analyzer.cc (right): > > https://codereview.chromium.org/2961373002/diff/260001/chrome/common/safe_browsing/zip_analyzer.cc#newcode70 > ...

3 years, 5 months ago (2017-07-12 18:55:51 UTC) #14

On 2017/07/12 18:15:06, mortonm wrote:
>
https://codereview.chromium.org/2961373002/diff/260001/chrome/common/safe_bro...
> File chrome/common/safe_browsing/zip_analyzer.cc (right):
> 
>
https://codereview.chromium.org/2961373002/diff/260001/chrome/common/safe_bro...
> chrome/common/safe_browsing/zip_analyzer.cc:70: LOG(ERROR) << "macho?: " <<
> (bytes.compare("\xcf\xfa\xed\xfe") == 0);
> On 2017/07/12 17:57:10, Jialiu Lin wrote:
> > You can convert this LOG(ERROR) into a DCHECK
> 
> Yeah, sorry I should probably just take this out. This is only one example of
a
> valid MachO magic value (see line 79). I just left it in for my own debugging
> purposes but it doesn't really belong.
> 
>
https://codereview.chromium.org/2961373002/diff/260001/chrome/common/safe_bro...
> chrome/common/safe_browsing/zip_analyzer.cc:72: if (bytes.length() !=
> sizeof(uint32_t))
> On 2017/07/12 17:57:10, Jialiu Lin wrote:
> > Maybe this should be a DCHECK too. 
> 
> I changed it to < instead of !=, so the function can be called with longer
> strings and it will still check only the first 4 bytes. I wanted this check in
> the code so that the memcpy() doesn't access invalid memory if the string is
> less than 4 bytes long.
> 
>
https://codereview.chromium.org/2961373002/diff/260001/chrome/common/safe_bro...
> chrome/common/safe_browsing/zip_analyzer.cc:170: results->has_executable =
true;
> On 2017/07/12 17:57:10, Jialiu Lin wrote:
> > Do we need to set has_executable = true here? When we go into the .app
> directory
> > and encounter a Mach-O file, we can set this value then.
> 
> Yeah, I was debating this as well. I figured I'd leave it as is since this
> mirrors the pre-existing behavior of the code and I didn't want to break any
> tests. I'll just take it out and hopefully no tests start failing.

Wow, didn't realize that 'CIGAM' was 'MAGIC' backwards so ignore the explanation
for my first comment :) I guess there is only one magic value but those
constants check for different endianness and integer width of the architecture.

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-13 16:10:04 UTC) #15

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/420001

3 years, 5 months ago (2017-07-13 16:10:13 UTC) #16

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 5 months ago (2017-07-13 16:28:13 UTC) #17

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: chromeos_amd64-generic_chromium_compile_only_ng on master.tryserver.chromium.linux (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.linux/builders/chromeos_amd64-generic_chromium_compile_only_ng/builds/379238) linux_chromium_chromeos_ozone_rel_ng on ...

3 years, 5 months ago (2017-07-13 16:28:14 UTC) #18

mortonm

mortonm@google.com changed reviewers: + satorux@chromium.org

3 years, 5 months ago (2017-07-13 16:41:13 UTC) #19

mortonm

adding satorux@ as OWNER for: third_party/zlib/google/zip_reader.cc third_party/zlib/google/zip_reader.h

3 years, 5 months ago (2017-07-13 16:42:51 UTC) #25

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-13 16:49:43 UTC) #26

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/460001

3 years, 5 months ago (2017-07-13 16:49:50 UTC) #27

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-13 17:07:24 UTC) #28

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/480001

3 years, 5 months ago (2017-07-13 17:07:34 UTC) #29

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 5 months ago (2017-07-13 17:53:15 UTC) #30

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: win_chromium_x64_rel_ng on master.tryserver.chromium.win (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.win/builders/win_chromium_x64_rel_ng/builds/469876)

3 years, 5 months ago (2017-07-13 17:53:16 UTC) #31

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-13 18:18:54 UTC) #32

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/490001

3 years, 5 months ago (2017-07-13 18:19:03 UTC) #33

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 5 months ago (2017-07-13 19:01:01 UTC) #34

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: win_chromium_rel_ng on master.tryserver.chromium.win (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.win/builders/win_chromium_rel_ng/builds/488751)

3 years, 5 months ago (2017-07-13 19:01:02 UTC) #35

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-13 19:45:57 UTC) #36

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/510001

3 years, 5 months ago (2017-07-13 19:46:12 UTC) #37

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 5 months ago (2017-07-13 20:34:15 UTC) #38

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: linux_chromium_chromeos_ozone_rel_ng on master.tryserver.chromium.linux (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.linux/builders/linux_chromium_chromeos_ozone_rel_ng/builds/428359)

3 years, 5 months ago (2017-07-13 20:34:16 UTC) #39

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-13 20:57:29 UTC) #40

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/530001

3 years, 5 months ago (2017-07-13 20:57:46 UTC) #41

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 5 months ago (2017-07-13 22:05:01 UTC) #42

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: win_chromium_rel_ng on master.tryserver.chromium.win (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.win/builders/win_chromium_rel_ng/builds/488968)

3 years, 5 months ago (2017-07-13 22:05:02 UTC) #43

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-13 22:22:36 UTC) #44

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/550001

3 years, 5 months ago (2017-07-13 22:23:05 UTC) #45

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 5 months ago (2017-07-14 00:27:40 UTC) #46

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: mac_chromium_rel_ng on master.tryserver.chromium.mac (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.mac/builders/mac_chromium_rel_ng/builds/499965)

3 years, 5 months ago (2017-07-14 00:27:41 UTC) #47

satorux1

satorux@chromium.org changed reviewers: + palmer@chromium.org

3 years, 5 months ago (2017-07-14 01:00:52 UTC) #48

satorux1

+palmer Does this scanning code run in the browser process or in a separate process? ...

3 years, 5 months ago (2017-07-14 01:00:53 UTC) #49

mortonm

All of the code in zip_analyzer.cc, including calling zip_reader functionality, runs in a sandboxed utility ...

3 years, 5 months ago (2017-07-14 17:08:45 UTC) #50

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-14 18:18:52 UTC) #51

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/590001

3 years, 5 months ago (2017-07-14 18:19:13 UTC) #52

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 5 months ago (2017-07-14 19:10:46 UTC) #53

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: win_chromium_rel_ng on master.tryserver.chromium.win (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.win/builders/win_chromium_rel_ng/builds/489700)

3 years, 5 months ago (2017-07-14 19:10:48 UTC) #54

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-14 20:09:48 UTC) #55

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/610001

3 years, 5 months ago (2017-07-14 20:10:00 UTC) #56

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-14 21:15:18 UTC) #57

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/610001

3 years, 5 months ago (2017-07-14 21:15:28 UTC) #58

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-14 22:20:46 UTC) #59

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/630001

3 years, 5 months ago (2017-07-14 22:21:00 UTC) #60

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-14 22:47:51 UTC) #61

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/650001

3 years, 5 months ago (2017-07-14 22:48:09 UTC) #62

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 5 months ago (2017-07-15 00:46:20 UTC) #63

commit-bot: I haz the power

Dry run: This issue passed the CQ dry run.

3 years, 5 months ago (2017-07-15 00:46:21 UTC) #64

palmer

https://codereview.chromium.org/2961373002/diff/650001/third_party/zlib/google/zip_reader.cc File third_party/zlib/google/zip_reader.cc (right): https://codereview.chromium.org/2961373002/diff/650001/third_party/zlib/google/zip_reader.cc#newcode362 third_party/zlib/google/zip_reader.cc:362: total_num_bytes_read += base::checked_cast<size_t>(num_bytes_read); Should |total_num_bytes_read| be a |base::CheckedNumeric<size_t>| ?

3 years, 5 months ago (2017-07-17 18:44:49 UTC) #70

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-17 20:30:40 UTC) #71

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/670001

3 years, 5 months ago (2017-07-17 20:30:48 UTC) #72

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-17 20:40:49 UTC) #73

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/690001

3 years, 5 months ago (2017-07-17 20:41:07 UTC) #74

mortonm

https://codereview.chromium.org/2961373002/diff/650001/third_party/zlib/google/zip_reader.cc File third_party/zlib/google/zip_reader.cc (right): https://codereview.chromium.org/2961373002/diff/650001/third_party/zlib/google/zip_reader.cc#newcode362 third_party/zlib/google/zip_reader.cc:362: total_num_bytes_read += base::checked_cast<size_t>(num_bytes_read); On 2017/07/17 18:44:49, palmer wrote: > ...

3 years, 5 months ago (2017-07-17 20:41:45 UTC) #75

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 5 months ago (2017-07-17 22:38:19 UTC) #76

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: linux_android_rel_ng on master.tryserver.chromium.android (JOB_FAILED, https://build.chromium.org/p/tryserver.chromium.android/builders/linux_android_rel_ng/builds/340144)

3 years, 5 months ago (2017-07-17 22:38:21 UTC) #77

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-17 23:11:15 UTC) #78

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/690001

3 years, 5 months ago (2017-07-17 23:11:25 UTC) #79

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 5 months ago (2017-07-18 06:06:04 UTC) #80

commit-bot: I haz the power

Dry run: This issue passed the CQ dry run.

3 years, 5 months ago (2017-07-18 06:06:05 UTC) #81

satorux1

sorry for the belated response. https://codereview.chromium.org/2961373002/diff/690001/third_party/zlib/google/zip_reader.cc File third_party/zlib/google/zip_reader.cc (right): https://codereview.chromium.org/2961373002/diff/690001/third_party/zlib/google/zip_reader.cc#newcode328 third_party/zlib/google/zip_reader.cc:328: bool ZipReader::ExtractPartOfCurrentEntry(WriterDelegate* delegate, This ...

3 years, 5 months ago (2017-07-18 07:46:38 UTC) #82

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-18 19:55:58 UTC) #83

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/710001

3 years, 5 months ago (2017-07-18 19:56:17 UTC) #84

mortonm

https://codereview.chromium.org/2961373002/diff/690001/third_party/zlib/google/zip_reader.cc File third_party/zlib/google/zip_reader.cc (right): https://codereview.chromium.org/2961373002/diff/690001/third_party/zlib/google/zip_reader.cc#newcode328 third_party/zlib/google/zip_reader.cc:328: bool ZipReader::ExtractPartOfCurrentEntry(WriterDelegate* delegate, On 2017/07/18 07:46:37, satorux1 wrote: > ...

3 years, 5 months ago (2017-07-18 19:58:49 UTC) #85

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 5 months ago (2017-07-18 21:12:20 UTC) #86

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: android_n5x_swarming_rel on master.tryserver.chromium.android (JOB_FAILED, https://build.chromium.org/p/tryserver.chromium.android/builders/android_n5x_swarming_rel/builds/223463)

3 years, 5 months ago (2017-07-18 21:12:22 UTC) #87

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-19 15:34:24 UTC) #88

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/710001

3 years, 5 months ago (2017-07-19 15:34:37 UTC) #89

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 5 months ago (2017-07-19 16:36:52 UTC) #90

commit-bot: I haz the power

Dry run: This issue passed the CQ dry run.

3 years, 5 months ago (2017-07-19 16:36:54 UTC) #91

satorux1

sorry for the belated response again https://codereview.chromium.org/2961373002/diff/710001/chrome/common/safe_browsing/zip_analyzer.cc File chrome/common/safe_browsing/zip_analyzer.cc (right): https://codereview.chromium.org/2961373002/diff/710001/chrome/common/safe_browsing/zip_analyzer.cc#newcode143 chrome/common/safe_browsing/zip_analyzer.cc:143: reader.ExtractFromBeginningOfCurrentEntryToString(sizeof(uint32_t), false, Sorry ...

3 years, 5 months ago (2017-07-21 05:27:02 UTC) #92

mortonm

https://codereview.chromium.org/2961373002/diff/710001/chrome/common/safe_browsing/zip_analyzer.cc File chrome/common/safe_browsing/zip_analyzer.cc (right): https://codereview.chromium.org/2961373002/diff/710001/chrome/common/safe_browsing/zip_analyzer.cc#newcode143 chrome/common/safe_browsing/zip_analyzer.cc:143: reader.ExtractFromBeginningOfCurrentEntryToString(sizeof(uint32_t), false, On 2017/07/21 05:27:02, satorux1 wrote: > Sorry ...

3 years, 5 months ago (2017-07-21 16:29:57 UTC) #93

https://codereview.chromium.org/2961373002/diff/710001/chrome/common/safe_bro...
File chrome/common/safe_browsing/zip_analyzer.cc (right):

https://codereview.chromium.org/2961373002/diff/710001/chrome/common/safe_bro...
chrome/common/safe_browsing/zip_analyzer.cc:143:
reader.ExtractFromBeginningOfCurrentEntryToString(sizeof(uint32_t), false,
On 2017/07/21 05:27:02, satorux1 wrote:
> Sorry for not taking a look at the caller earlier, but I'm trying to
understand
> why you needed to introduce this new function.
> 
> Seems to me that the following would be OK:
> 
>   reader.ExtractCurrentEntryToString(sizeof(uint32_t), &magic);
> 
> If the content is larger than 4GB, the function will stop reading at the
maximum
> size. It'll return false but |magic| should contain the 4GB of data.

In this case, we only want to read the first 4 bytes of the current entry to
determine whether it is a Mac executable (and thus has the MAGIC header for
MachO files). We don't want to do an in-memory copy of a large file into a
string just to be able to check the first 4 bytes. The existing functionality in
zip_reader only allows for reading an entire file while specifying the max
length to extract, but we can't just specify |4| here because for all files
larger than 4 bytes the extraction would simply fail.

In the current implementation, when extraction fails,
ExtractCurrentEntryToString() does not copy any data to the output string. Note
that this function returns without copying data to output:
https://cs.chromium.org/chromium/src/third_party/zlib/google/zip_reader.cc?rc...

Moreover, the change isn't as simple as just copying the data to the output even
when extraction fails, due to the way ExtractCurrentEntry() is written as well
as the WriteBytes function for StringWriterDelegate:
https://cs.chromium.org/chromium/src/third_party/zlib/google/zip_reader.cc?rc...

Unless i'm missing something, I think the new function is necessary for reading
the first 4 bytes of an entry

https://codereview.chromium.org/2961373002/diff/710001/third_party/zlib/googl...
File third_party/zlib/google/zip_reader.h (right):

https://codereview.chromium.org/2961373002/diff/710001/third_party/zlib/googl...
third_party/zlib/google/zip_reader.h:162: // Extracts entirety of the current
entry in chunks to |delegate|.
On 2017/07/21 05:27:02, satorux1 wrote:
> While you are at it, could you add documentation about the return value?

Done. Didn't add an explanation here, but there are a variety of cases that
could cause this function to return false, including not being able to read the
current entry, not being able to write to the delegate, etc

satorux1

https://codereview.chromium.org/2961373002/diff/710001/chrome/common/safe_browsing/zip_analyzer.cc File chrome/common/safe_browsing/zip_analyzer.cc (right): https://codereview.chromium.org/2961373002/diff/710001/chrome/common/safe_browsing/zip_analyzer.cc#newcode143 chrome/common/safe_browsing/zip_analyzer.cc:143: reader.ExtractFromBeginningOfCurrentEntryToString(sizeof(uint32_t), false, On 2017/07/21 16:29:56, mortonm wrote: > On ...

3 years, 5 months ago (2017-07-21 23:00:09 UTC) #94

https://codereview.chromium.org/2961373002/diff/710001/chrome/common/safe_bro...
File chrome/common/safe_browsing/zip_analyzer.cc (right):

https://codereview.chromium.org/2961373002/diff/710001/chrome/common/safe_bro...
chrome/common/safe_browsing/zip_analyzer.cc:143:
reader.ExtractFromBeginningOfCurrentEntryToString(sizeof(uint32_t), false,
On 2017/07/21 16:29:56, mortonm wrote:
> On 2017/07/21 05:27:02, satorux1 wrote:
> > Sorry for not taking a look at the caller earlier, but I'm trying to
> understand
> > why you needed to introduce this new function.
> > 
> > Seems to me that the following would be OK:
> > 
> >   reader.ExtractCurrentEntryToString(sizeof(uint32_t), &magic);
> > 
> > If the content is larger than 4GB, the function will stop reading at the
> maximum
> > size. It'll return false but |magic| should contain the 4GB of data.
> 
> In this case, we only want to read the first 4 bytes of the current entry to
> determine whether it is a Mac executable (and thus has the MAGIC header for
> MachO files). We don't want to do an in-memory copy of a large file into a
> string just to be able to check the first 4 bytes. The existing functionality
in
> zip_reader only allows for reading an entire file while specifying the max
> length to extract, but we can't just specify |4| here because for all files
> larger than 4 bytes the extraction would simply fail.
> 
> In the current implementation, when extraction fails,
> ExtractCurrentEntryToString() does not copy any data to the output string.
Note
> that this function returns without copying data to output:
>
https://cs.chromium.org/chromium/src/third_party/zlib/google/zip_reader.cc?rc...
> 
> Moreover, the change isn't as simple as just copying the data to the output
even
> when extraction fails, due to the way ExtractCurrentEntry() is written as well
> as the WriteBytes function for StringWriterDelegate:
>
https://cs.chromium.org/chromium/src/third_party/zlib/google/zip_reader.cc?rc...
> 
> Unless i'm missing something, I think the new function is necessary for
reading
> the first 4 bytes of an entry

My bad. sizeof(uint32_t) is of course 4, not 4GB. :( Sorry for the confusion.

What I meant was that the following would work:

  reader.ExtractCurrentEntryToString(sizeof(sizeof(uint32_t), &magic);   //
sizeof(sizeof(uint32_t) == 4

But you are right that it doesn't work currently. I'd rather think that fixing
the behavior is better than introducing a new function that looks similar but
behaves slightly different.

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-24 17:45:47 UTC) #95

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/750001

3 years, 5 months ago (2017-07-24 17:45:51 UTC) #96

mortonm

https://codereview.chromium.org/2961373002/diff/710001/chrome/common/safe_browsing/zip_analyzer.cc File chrome/common/safe_browsing/zip_analyzer.cc (right): https://codereview.chromium.org/2961373002/diff/710001/chrome/common/safe_browsing/zip_analyzer.cc#newcode143 chrome/common/safe_browsing/zip_analyzer.cc:143: reader.ExtractFromBeginningOfCurrentEntryToString(sizeof(uint32_t), false, On 2017/07/21 23:00:09, satorux1 wrote: > On ...

3 years, 5 months ago (2017-07-24 17:51:44 UTC) #97

https://codereview.chromium.org/2961373002/diff/710001/chrome/common/safe_bro...
File chrome/common/safe_browsing/zip_analyzer.cc (right):

https://codereview.chromium.org/2961373002/diff/710001/chrome/common/safe_bro...
chrome/common/safe_browsing/zip_analyzer.cc:143:
reader.ExtractFromBeginningOfCurrentEntryToString(sizeof(uint32_t), false,
On 2017/07/21 23:00:09, satorux1 wrote:
> On 2017/07/21 16:29:56, mortonm wrote:
> > On 2017/07/21 05:27:02, satorux1 wrote:
> > > Sorry for not taking a look at the caller earlier, but I'm trying to
> > understand
> > > why you needed to introduce this new function.
> > > 
> > > Seems to me that the following would be OK:
> > > 
> > >   reader.ExtractCurrentEntryToString(sizeof(uint32_t), &magic);
> > > 
> > > If the content is larger than 4GB, the function will stop reading at the
> > maximum
> > > size. It'll return false but |magic| should contain the 4GB of data.
> > 
> > In this case, we only want to read the first 4 bytes of the current entry to
> > determine whether it is a Mac executable (and thus has the MAGIC header for
> > MachO files). We don't want to do an in-memory copy of a large file into a
> > string just to be able to check the first 4 bytes. The existing
functionality
> in
> > zip_reader only allows for reading an entire file while specifying the max
> > length to extract, but we can't just specify |4| here because for all files
> > larger than 4 bytes the extraction would simply fail.
> > 
> > In the current implementation, when extraction fails,
> > ExtractCurrentEntryToString() does not copy any data to the output string.
> Note
> > that this function returns without copying data to output:
> >
>
https://cs.chromium.org/chromium/src/third_party/zlib/google/zip_reader.cc?rc...
> > 
> > Moreover, the change isn't as simple as just copying the data to the output
> even
> > when extraction fails, due to the way ExtractCurrentEntry() is written as
well
> > as the WriteBytes function for StringWriterDelegate:
> >
>
https://cs.chromium.org/chromium/src/third_party/zlib/google/zip_reader.cc?rc...
> > 
> > Unless i'm missing something, I think the new function is necessary for
> reading
> > the first 4 bytes of an entry
> 
> My bad. sizeof(uint32_t) is of course 4, not 4GB. :( Sorry for the confusion.
> 
> What I meant was that the following would work:
> 
>   reader.ExtractCurrentEntryToString(sizeof(sizeof(uint32_t), &magic);   //
> sizeof(sizeof(uint32_t) == 4
> 
> But you are right that it doesn't work currently. I'd rather think that fixing
> the behavior is better than introducing a new function that looks similar but
> behaves slightly different.

Done. Required adding an extra parameter to each of the existing functions, but
this seemed like the most straightforward way.

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 5 months ago (2017-07-24 20:04:21 UTC) #98

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: linux_android_rel_ng on master.tryserver.chromium.android (JOB_FAILED, https://build.chromium.org/p/tryserver.chromium.android/builders/linux_android_rel_ng/builds/345517)

3 years, 5 months ago (2017-07-24 20:04:22 UTC) #99

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-24 20:54:18 UTC) #100

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/770001

3 years, 5 months ago (2017-07-24 20:54:24 UTC) #101

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 5 months ago (2017-07-24 22:50:58 UTC) #102

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: linux_android_rel_ng on master.tryserver.chromium.android (JOB_FAILED, https://build.chromium.org/p/tryserver.chromium.android/builders/linux_android_rel_ng/builds/345813)

3 years, 5 months ago (2017-07-24 22:51:00 UTC) #103

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-24 23:17:37 UTC) #104

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/780001

3 years, 5 months ago (2017-07-24 23:17:48 UTC) #105

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 5 months ago (2017-07-25 02:03:11 UTC) #106

commit-bot: I haz the power

Dry run: This issue passed the CQ dry run.

3 years, 5 months ago (2017-07-25 02:03:13 UTC) #107

satorux1

Thank you for revising the patch. I think it's getting close. I have a request ...

3 years, 5 months ago (2017-07-25 07:55:15 UTC) #108

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-25 17:44:48 UTC) #109

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/800001

3 years, 5 months ago (2017-07-25 17:45:01 UTC) #110

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-25 17:53:56 UTC) #111

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/820001

3 years, 5 months ago (2017-07-25 17:54:04 UTC) #112

mortonm

https://codereview.chromium.org/2961373002/diff/780001/third_party/zlib/google/zip_reader.h File third_party/zlib/google/zip_reader.h (right): https://codereview.chromium.org/2961373002/diff/780001/third_party/zlib/google/zip_reader.h#newcode220 third_party/zlib/google/zip_reader.h:220: // |max_read_bytes|. Otherwise if |read_entire_file| is false, |num_bytes| On ...

3 years, 5 months ago (2017-07-25 18:10:16 UTC) #113

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 5 months ago (2017-07-25 22:03:10 UTC) #114

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: win_chromium_x64_rel_ng on master.tryserver.chromium.win (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.win/builders/win_chromium_x64_rel_ng/builds/478039)

3 years, 5 months ago (2017-07-25 22:03:11 UTC) #115

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 5 months ago (2017-07-25 22:59:55 UTC) #116

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/820001

3 years, 5 months ago (2017-07-25 23:00:14 UTC) #117

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 5 months ago (2017-07-26 01:43:02 UTC) #118

commit-bot: I haz the power

Dry run: This issue passed the CQ dry run.

3 years, 5 months ago (2017-07-26 01:43:04 UTC) #119

satorux1

https://codereview.chromium.org/2961373002/diff/820001/third_party/zlib/google/zip_reader.cc File third_party/zlib/google/zip_reader.cc (right): https://codereview.chromium.org/2961373002/diff/820001/third_party/zlib/google/zip_reader.cc#newcode302 third_party/zlib/google/zip_reader.cc:302: bool entire_file_extracted = true; start with false and set ...

3 years, 4 months ago (2017-07-27 07:05:32 UTC) #120

https://codereview.chromium.org/2961373002/diff/820001/third_party/zlib/googl...
File third_party/zlib/google/zip_reader.cc (right):

https://codereview.chromium.org/2961373002/diff/820001/third_party/zlib/googl...
third_party/zlib/google/zip_reader.cc:302: bool entire_file_extracted = true;
start with false and set it to true only when the entire file is read?

https://codereview.chromium.org/2961373002/diff/820001/third_party/zlib/googl...
third_party/zlib/google/zip_reader.cc:335: if (total_num_bytes_read ==
num_bytes_to_extract) {
Special-casing this case seems tricky. Maybe we can get rid of this by something
like below?

while (true) {
  const int num_bytes_read =
    unzReadCurrentFile(zip_file_, buf.get(), internal::kZipBufSize);

  if (num_bytes == 0) {
    entire_file_extracted = true;
    break;
  } else if (num_bytes_read < 0) {
    // If num_bytes_read < 0, then it's a specific UNZ_* error code.
    break;
  } else if (num_bytes_read > 0) {
    // Some data is read.
    auto unread_bytes = base::CheckedNumeric<uint64_t>(num_bytes_to_extract) -
                        total_num_bytes_read;
    uint_64 num_bytes_to_write = std::min(unread_bytes.ValueOrDie(),
                                          num_bytes_read);
    if (!delegate->WriteBytes(buf.get(), num_bytes_to_write))
      break;
  }
}

https://codereview.chromium.org/2961373002/diff/820001/third_party/zlib/googl...
third_party/zlib/google/zip_reader.cc:448: bool
ZipReader::ExtractCurrentEntryToString(uint64_t num_bytes,
num_bytes -> max_read_bytes to be consistent with the header file

https://codereview.chromium.org/2961373002/diff/820001/third_party/zlib/googl...
third_party/zlib/google/zip_reader.cc:475: // There was an error in extracting
entry.
I was a bit confused at first. Maybe add something like below?

There was an error. If the current entry is smaller than max_read_bytes,
ExtractCurrentEntry() should read the whole content and return true.

satorux1

https://codereview.chromium.org/2961373002/diff/820001/third_party/zlib/google/zip_reader.cc File third_party/zlib/google/zip_reader.cc (right): https://codereview.chromium.org/2961373002/diff/820001/third_party/zlib/google/zip_reader.cc#newcode335 third_party/zlib/google/zip_reader.cc:335: if (total_num_bytes_read == num_bytes_to_extract) { On 2017/07/27 07:05:31, satorux1 ...

3 years, 4 months ago (2017-07-27 07:10:28 UTC) #121

mortonm

https://codereview.chromium.org/2961373002/diff/820001/third_party/zlib/google/zip_reader.cc File third_party/zlib/google/zip_reader.cc (right): https://codereview.chromium.org/2961373002/diff/820001/third_party/zlib/google/zip_reader.cc#newcode302 third_party/zlib/google/zip_reader.cc:302: bool entire_file_extracted = true; On 2017/07/27 07:05:31, satorux1 wrote: ...

3 years, 4 months ago (2017-07-27 18:32:01 UTC) #123

https://codereview.chromium.org/2961373002/diff/820001/third_party/zlib/googl...
File third_party/zlib/google/zip_reader.cc (right):

https://codereview.chromium.org/2961373002/diff/820001/third_party/zlib/googl...
third_party/zlib/google/zip_reader.cc:302: bool entire_file_extracted = true;
On 2017/07/27 07:05:31, satorux1 wrote:
> start with false and set it to true only when the entire file is read?

Done.

https://codereview.chromium.org/2961373002/diff/820001/third_party/zlib/googl...
third_party/zlib/google/zip_reader.cc:335: if (total_num_bytes_read ==
num_bytes_to_extract) {
On 2017/07/27 07:10:28, satorux1 wrote:
> On 2017/07/27 07:05:31, satorux1 wrote:
> > Special-casing this case seems tricky. Maybe we can get rid of this by
> something
> > like below?
> > 
> > while (true) {
> >   const int num_bytes_read =
> >     unzReadCurrentFile(zip_file_, buf.get(), internal::kZipBufSize);
> > 
> >   if (num_bytes == 0) {
> >     entire_file_extracted = true;
> >     break;
> >   } else if (num_bytes_read < 0) {
> >     // If num_bytes_read < 0, then it's a specific UNZ_* error code.
> >     break;
> >   } else if (num_bytes_read > 0) {
> >     // Some data is read.
> >     auto unread_bytes = base::CheckedNumeric<uint64_t>(num_bytes_to_extract)
-
> >                         total_num_bytes_read;
> >     uint_64 num_bytes_to_write = std::min(unread_bytes.ValueOrDie(),
> >                                           num_bytes_read);
> 
> we'd also need
> 
>   if (num_bytes_to_write == 0) 
>     break;
> 
> >     if (!delegate->WriteBytes(buf.get(), num_bytes_to_write))
> >       break;
> >   }
> > }
> >  
> 

Done.

https://codereview.chromium.org/2961373002/diff/820001/third_party/zlib/googl...
third_party/zlib/google/zip_reader.cc:335: if (total_num_bytes_read ==
num_bytes_to_extract) {
On 2017/07/27 07:05:31, satorux1 wrote:
> Special-casing this case seems tricky. Maybe we can get rid of this by
something
> like below?
> 
> while (true) {
>   const int num_bytes_read =
>     unzReadCurrentFile(zip_file_, buf.get(), internal::kZipBufSize);
> 
>   if (num_bytes == 0) {
>     entire_file_extracted = true;
>     break;
>   } else if (num_bytes_read < 0) {
>     // If num_bytes_read < 0, then it's a specific UNZ_* error code.
>     break;
>   } else if (num_bytes_read > 0) {
>     // Some data is read.
>     auto unread_bytes = base::CheckedNumeric<uint64_t>(num_bytes_to_extract) -
>                         total_num_bytes_read;
>     uint_64 num_bytes_to_write = std::min(unread_bytes.ValueOrDie(),
>                                           num_bytes_read);
>     if (!delegate->WriteBytes(buf.get(), num_bytes_to_write))
>       break;
>   }
> }
>  

Done.

https://codereview.chromium.org/2961373002/diff/820001/third_party/zlib/googl...
third_party/zlib/google/zip_reader.cc:448: bool
ZipReader::ExtractCurrentEntryToString(uint64_t num_bytes,
On 2017/07/27 07:05:31, satorux1 wrote:
> num_bytes -> max_read_bytes to be consistent with the header file

Done.

https://codereview.chromium.org/2961373002/diff/820001/third_party/zlib/googl...
third_party/zlib/google/zip_reader.cc:475: // There was an error in extracting
entry.
On 2017/07/27 07:05:31, satorux1 wrote:
> I was a bit confused at first. Maybe add something like below?
> 
> There was an error. If the current entry is smaller than max_read_bytes,
> ExtractCurrentEntry() should read the whole content and return true.

Done. The revised comment now explains the situation more clearly.

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 4 months ago (2017-07-27 18:32:05 UTC) #124

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/860001

3 years, 4 months ago (2017-07-27 18:32:20 UTC) #125

Jialiu Lin

Thanks satorux1@ for your thorough review! Really appreciate your effort. BTW, do we still need ...

3 years, 4 months ago (2017-07-27 18:34:41 UTC) #126

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 4 months ago (2017-07-27 19:45:51 UTC) #128

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: linux_chromium_rel_ng on master.tryserver.chromium.linux (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.linux/builders/linux_chromium_rel_ng/builds/511428)

3 years, 4 months ago (2017-07-27 19:45:52 UTC) #129

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 4 months ago (2017-07-27 21:24:01 UTC) #130

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/860001

3 years, 4 months ago (2017-07-27 21:24:10 UTC) #131

satorux1

https://codereview.chromium.org/2961373002/diff/860001/third_party/zlib/google/zip_reader.cc File third_party/zlib/google/zip_reader.cc (right): https://codereview.chromium.org/2961373002/diff/860001/third_party/zlib/google/zip_reader.cc#newcode329 third_party/zlib/google/zip_reader.cc:329: unzReadCurrentFile(zip_file_, buf.get(), 1) == 0) { I thought we ...

3 years, 4 months ago (2017-07-27 23:18:10 UTC) #132

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 4 months ago (2017-07-27 23:57:04 UTC) #133

commit-bot: I haz the power

Dry run: This issue passed the CQ dry run.

3 years, 4 months ago (2017-07-27 23:57:06 UTC) #134

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 4 months ago (2017-07-28 18:02:10 UTC) #135

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/900001

3 years, 4 months ago (2017-07-28 18:02:21 UTC) #136

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 4 months ago (2017-07-28 21:33:59 UTC) #137

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: android_n5x_swarming_rel on master.tryserver.chromium.android (JOB_FAILED, https://build.chromium.org/p/tryserver.chromium.android/builders/android_n5x_swarming_rel/builds/231733)

3 years, 4 months ago (2017-07-28 21:34:00 UTC) #138

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 4 months ago (2017-07-31 16:21:09 UTC) #139

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/900001

3 years, 4 months ago (2017-07-31 16:21:19 UTC) #140

mortonm

Hi satorux1@, thanks for the help on this CL! I am close to finishing my ...

3 years, 4 months ago (2017-07-31 18:13:04 UTC) #141

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 4 months ago (2017-07-31 18:51:22 UTC) #142

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: linux_chromium_rel_ng on master.tryserver.chromium.linux (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.linux/builders/linux_chromium_rel_ng/builds/513436)

3 years, 4 months ago (2017-07-31 18:51:23 UTC) #143

satorux1

https://codereview.chromium.org/2961373002/diff/860001/third_party/zlib/google/zip_reader.cc File third_party/zlib/google/zip_reader.cc (right): https://codereview.chromium.org/2961373002/diff/860001/third_party/zlib/google/zip_reader.cc#newcode329 third_party/zlib/google/zip_reader.cc:329: unzReadCurrentFile(zip_file_, buf.get(), 1) == 0) { On 2017/07/31 18:13:04, ...

3 years, 4 months ago (2017-08-01 08:01:09 UTC) #144

https://codereview.chromium.org/2961373002/diff/860001/third_party/zlib/googl...
File third_party/zlib/google/zip_reader.cc (right):

https://codereview.chromium.org/2961373002/diff/860001/third_party/zlib/googl...
third_party/zlib/google/zip_reader.cc:329: unzReadCurrentFile(zip_file_,
buf.get(), 1) == 0) {
On 2017/07/31 18:13:04, mortonm wrote:
> On 2017/07/27 23:18:10, satorux1 wrote:
> > I thought we could get rid of the call to unzReadCurrentFile() here, since
the
> > while loop will end at the first |if| conditional in the loop, if the entire
> > file is read:
> > 
> > 310   if (num_bytes_read == 0) {
> > 311     entire_file_extracted = true;
> > 312     break;
> > 313   }
> > 
> > Here, can we go with something like below?
> > 
> >   auto unread_bytes = ...;
> >   if (unread_bytes.ValueOrDie() == 0)
> >     break;
> > 
> >   uint64_t uint64_t num_bytes_to_write = ...;
> >   if (!delegate->WriteBytes(buf.get(), num_bytes_to_write)
> >     break;
> 
> Yeah, I think I originally wrote the function this way as well. The problem
with
> this approach is that it will cause the function to return false if the
> specified |num_bytes_to_extract| parameter equals the size of the entry being
> extracted. 

Ah you are right. Sorry for missing that point.

> The check on line 325 checks to see if the |num_bytes_to_extract| cap
> has been hit while performing extraction. If so, we usually want to return
> false, except when |num_bytes_to_extract| equals the size of the entry, in
which
> case we want to return true. However, in line with your comment, I've
re-written
> the code to avoid doing the second call to unzReadCurrentFile().

I think the new version with two additional booleans is more complicated than
the previous version. :)

I think your previous version could be a bit simplified if we keep track of the
remaining capacity of the delegate than the total bytes extracted:

  uint64_t remaining_capacity = num_bytes_to_extract;
  bool entire_file_extracted = false;

  while (remaining_capacity > 0) {
    const int num_bytes_read =
        unzReadCurrentFile(zip_file_, buf.get(), internal::kZipBufSize);

    if (num_bytes_read == 0) {
      entire_file_extracted = true;
      break;
    }
    if (num_bytes_read < 0) {
      // If num_bytes_read < 0, then it's a specific UNZ_* error code.          

      break;
    } else if (num_bytes_read > 0) {
      uint64_t num_bytes_to_write =
          std::min<uint64_t>(remaining_capacity,
                             base::checked_cast<uint64_t>(num_bytes_read));
      if (!delegate->WriteBytes(buf.get(), num_bytes_to_write))
         break;
      if (remaining_capacity == base::checked_cast<uint64_t>(num_bytes_read)) {
        // Ensures function returns true if the entire file has been read.
        entire_file_extracted =
            (unzReadCurrentFile(zip_file_, buf.get(), 1) == 0);
      }
      CHECK_GE(remaining_capacity, num_bytes_to_write);
      remaining_capacity -= num_bytes_to_write;
    }
  }

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
File third_party/zlib/google/zip_reader_unittest.cc (right):

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:580: EXPECT_EQ(0,
memcmp(contents.c_str(), "0123456", i));
EXPECT_EQ(base::StringPiece("0123456", i).as__string(), contents) ? then you can
get rid of EXPECT_EQ(i, contents.size());

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:586: EXPECT_EQ(0,
memcmp(contents.c_str(), "0123456", i));
ditto

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:605: EXPECT_EQ(0,
memcmp(contents.c_str(), "", 0));
EXPECT_EQ("", contents)?

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:607: EXPECT_EQ(0,
memcmp(contents.c_str(), "", 0));
EXPECT_EQ("", contents)?

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:612: EXPECT_EQ(0,
memcmp(contents.c_str(), "", 0));
EXPECT_EQ("", contents)?

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:614: EXPECT_EQ(0,
memcmp(contents.c_str(), "0", 1));
EXPECT_EQ("0", contents)?

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:616: EXPECT_EQ(0,
memcmp(contents.c_str(), "0", 1));
EXPECT_EQ("0", contents)?

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:620:
EXPECT_TRUE(reader.ExtractCurrentEntryToString(0, &contents));
EXPECT_EQ("", contents)?

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:623: EXPECT_EQ(0,
memcmp(contents.c_str(), "01", 2));
EXPECT_EQ("01", contents)?

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:625: EXPECT_EQ(0,
memcmp(contents.c_str(), "0123", 4));
EXPECT_EQ("0123", contents)?

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:627: EXPECT_EQ(0,
memcmp(contents.c_str(), "0123", 0));
0 here does not look correct.

EXPECT_EQ("0123", contents)?

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 4 months ago (2017-08-01 16:16:06 UTC) #145

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/920001

3 years, 4 months ago (2017-08-01 16:16:20 UTC) #146

mortonm

https://codereview.chromium.org/2961373002/diff/860001/third_party/zlib/google/zip_reader.cc File third_party/zlib/google/zip_reader.cc (right): https://codereview.chromium.org/2961373002/diff/860001/third_party/zlib/google/zip_reader.cc#newcode329 third_party/zlib/google/zip_reader.cc:329: unzReadCurrentFile(zip_file_, buf.get(), 1) == 0) { On 2017/08/01 08:01:08, ...

3 years, 4 months ago (2017-08-01 16:16:47 UTC) #147

https://codereview.chromium.org/2961373002/diff/860001/third_party/zlib/googl...
File third_party/zlib/google/zip_reader.cc (right):

https://codereview.chromium.org/2961373002/diff/860001/third_party/zlib/googl...
third_party/zlib/google/zip_reader.cc:329: unzReadCurrentFile(zip_file_,
buf.get(), 1) == 0) {
On 2017/08/01 08:01:08, satorux1 wrote:
> On 2017/07/31 18:13:04, mortonm wrote:
> > On 2017/07/27 23:18:10, satorux1 wrote:
> > > I thought we could get rid of the call to unzReadCurrentFile() here, since
> the
> > > while loop will end at the first |if| conditional in the loop, if the
entire
> > > file is read:
> > > 
> > > 310   if (num_bytes_read == 0) {
> > > 311     entire_file_extracted = true;
> > > 312     break;
> > > 313   }
> > > 
> > > Here, can we go with something like below?
> > > 
> > >   auto unread_bytes = ...;
> > >   if (unread_bytes.ValueOrDie() == 0)
> > >     break;
> > > 
> > >   uint64_t uint64_t num_bytes_to_write = ...;
> > >   if (!delegate->WriteBytes(buf.get(), num_bytes_to_write)
> > >     break;
> > 
> > Yeah, I think I originally wrote the function this way as well. The problem
> with
> > this approach is that it will cause the function to return false if the
> > specified |num_bytes_to_extract| parameter equals the size of the entry
being
> > extracted. 
> 
> Ah you are right. Sorry for missing that point.
> 
> > The check on line 325 checks to see if the |num_bytes_to_extract| cap
> > has been hit while performing extraction. If so, we usually want to return
> > false, except when |num_bytes_to_extract| equals the size of the entry, in
> which
> > case we want to return true. However, in line with your comment, I've
> re-written
> > the code to avoid doing the second call to unzReadCurrentFile().
> 
> I think the new version with two additional booleans is more complicated than
> the previous version. :)
> 
> I think your previous version could be a bit simplified if we keep track of
the
> remaining capacity of the delegate than the total bytes extracted:
> 
>   uint64_t remaining_capacity = num_bytes_to_extract;
>   bool entire_file_extracted = false;
> 
>   while (remaining_capacity > 0) {
>     const int num_bytes_read =
>         unzReadCurrentFile(zip_file_, buf.get(), internal::kZipBufSize);
> 
>     if (num_bytes_read == 0) {
>       entire_file_extracted = true;
>       break;
>     }
>     if (num_bytes_read < 0) {
>       // If num_bytes_read < 0, then it's a specific UNZ_* error code.        
 
>            
>       break;
>     } else if (num_bytes_read > 0) {
>       uint64_t num_bytes_to_write =
>           std::min<uint64_t>(remaining_capacity,
>                              base::checked_cast<uint64_t>(num_bytes_read));
>       if (!delegate->WriteBytes(buf.get(), num_bytes_to_write))
>          break;
>       if (remaining_capacity == base::checked_cast<uint64_t>(num_bytes_read))
{
>         // Ensures function returns true if the entire file has been read.
>         entire_file_extracted =
>             (unzReadCurrentFile(zip_file_, buf.get(), 1) == 0);
>       }
>       CHECK_GE(remaining_capacity, num_bytes_to_write);
>       remaining_capacity -= num_bytes_to_write;
>     }
>   }
> 
> 
> 

Done.

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
File third_party/zlib/google/zip_reader_unittest.cc (right):

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:580: EXPECT_EQ(0,
memcmp(contents.c_str(), "0123456", i));
On 2017/08/01 08:01:09, satorux1 wrote:
> EXPECT_EQ(base::StringPiece("0123456", i).as__string(), contents) ? then you
can
> get rid of EXPECT_EQ(i, contents.size());

Done.

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:586: EXPECT_EQ(0,
memcmp(contents.c_str(), "0123456", i));
On 2017/08/01 08:01:09, satorux1 wrote:
> ditto

Done.

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:605: EXPECT_EQ(0,
memcmp(contents.c_str(), "", 0));
On 2017/08/01 08:01:09, satorux1 wrote:
> EXPECT_EQ("", contents)? 

Done.

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:607: EXPECT_EQ(0,
memcmp(contents.c_str(), "", 0));
On 2017/08/01 08:01:09, satorux1 wrote:
> EXPECT_EQ("", contents)? 

Done.

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:612: EXPECT_EQ(0,
memcmp(contents.c_str(), "", 0));
On 2017/08/01 08:01:09, satorux1 wrote:
> EXPECT_EQ("", contents)? 

Done.

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:614: EXPECT_EQ(0,
memcmp(contents.c_str(), "0", 1));
On 2017/08/01 08:01:09, satorux1 wrote:
> EXPECT_EQ("0", contents)? 

Done.

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:616: EXPECT_EQ(0,
memcmp(contents.c_str(), "0", 1));
On 2017/08/01 08:01:09, satorux1 wrote:
> EXPECT_EQ("0", contents)? 

Done.

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:620:
EXPECT_TRUE(reader.ExtractCurrentEntryToString(0, &contents));
On 2017/08/01 08:01:09, satorux1 wrote:
> EXPECT_EQ("", contents)? 

Done.

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:623: EXPECT_EQ(0,
memcmp(contents.c_str(), "01", 2));
On 2017/08/01 08:01:09, satorux1 wrote:
> EXPECT_EQ("01", contents)? 

Done.

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:625: EXPECT_EQ(0,
memcmp(contents.c_str(), "0123", 4));
On 2017/08/01 08:01:09, satorux1 wrote:
> EXPECT_EQ("0123", contents)? 

Done.

https://codereview.chromium.org/2961373002/diff/900001/third_party/zlib/googl...
third_party/zlib/google/zip_reader_unittest.cc:627: EXPECT_EQ(0,
memcmp(contents.c_str(), "0123", 0));
On 2017/08/01 08:01:09, satorux1 wrote:
> 0 here does not look correct.
> 
> EXPECT_EQ("0123", contents)? 

The file is only 4 characters long. I am checking that when |max_read_bytes| is
larger than the file, the function still returns true and returns the entire
file in the string. I've added a comment noting this.

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 4 months ago (2017-08-01 17:42:32 UTC) #148

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: linux_android_rel_ng on master.tryserver.chromium.android (JOB_FAILED, https://build.chromium.org/p/tryserver.chromium.android/builders/linux_android_rel_ng/builds/352623)

3 years, 4 months ago (2017-08-01 17:42:34 UTC) #149

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 4 months ago (2017-08-01 21:42:33 UTC) #150

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/920001

3 years, 4 months ago (2017-08-01 21:42:48 UTC) #151

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 4 months ago (2017-08-02 01:08:44 UTC) #152

commit-bot: I haz the power

Dry run: This issue passed the CQ dry run.

3 years, 4 months ago (2017-08-02 01:08:45 UTC) #153

satorux1

zlib stuff lgtm with some requests: https://codereview.chromium.org/2961373002/diff/920001/third_party/zlib/google/zip_reader.cc File third_party/zlib/google/zip_reader.cc (right): https://codereview.chromium.org/2961373002/diff/920001/third_party/zlib/google/zip_reader.cc#newcode315 third_party/zlib/google/zip_reader.cc:315: if (num_bytes_read < ...

3 years, 4 months ago (2017-08-03 08:36:48 UTC) #154

satorux1

https://codereview.chromium.org/2961373002/diff/920001/third_party/zlib/google/zip_reader_unittest.cc File third_party/zlib/google/zip_reader_unittest.cc (right): https://codereview.chromium.org/2961373002/diff/920001/third_party/zlib/google/zip_reader_unittest.cc#newcode743 third_party/zlib/google/zip_reader_unittest.cc:743: } On 2017/08/03 08:36:48, satorux1 wrote: > I's suggest ...

3 years, 4 months ago (2017-08-03 08:47:03 UTC) #155

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 4 months ago (2017-08-03 15:00:16 UTC) #163

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/940001

3 years, 4 months ago (2017-08-03 15:00:27 UTC) #164

mortonm

Thanks! https://codereview.chromium.org/2961373002/diff/920001/third_party/zlib/google/zip_reader.cc File third_party/zlib/google/zip_reader.cc (right): https://codereview.chromium.org/2961373002/diff/920001/third_party/zlib/google/zip_reader.cc#newcode315 third_party/zlib/google/zip_reader.cc:315: if (num_bytes_read < 0) { On 2017/08/03 08:36:48, ...

3 years, 4 months ago (2017-08-03 15:01:23 UTC) #165

Robert Sesek

rsesek@chromium.org changed reviewers: + rsesek@chromium.org

3 years, 4 months ago (2017-08-03 15:43:33 UTC) #167

Robert Sesek

Note on the CL description: The "Subject" in Rietveld does not get included in the ...

3 years, 4 months ago (2017-08-03 15:43:36 UTC) #168

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 4 months ago (2017-08-03 17:05:01 UTC) #169

commit-bot: I haz the power

Dry run: This issue passed the CQ dry run.

3 years, 4 months ago (2017-08-03 17:05:03 UTC) #170

mortonm

https://codereview.chromium.org/2961373002/diff/940001/chrome/browser/safe_browsing/sandboxed_zip_analyzer_unittest.cc File chrome/browser/safe_browsing/sandboxed_zip_analyzer_unittest.cc (right): https://codereview.chromium.org/2961373002/diff/940001/chrome/browser/safe_browsing/sandboxed_zip_analyzer_unittest.cc#newcode104 chrome/browser/safe_browsing/sandboxed_zip_analyzer_unittest.cc:104: ASSERT_EQ(data.is_signed, binary.has_signature()); On 2017/08/03 15:43:35, Robert Sesek wrote: > ...

3 years, 4 months ago (2017-08-03 17:09:13 UTC) #171

Robert Sesek

Description was changed from ========== This CL fixes two aspects of broken ZIP processing on ...

3 years, 4 months ago (2017-08-03 18:35:21 UTC) #172

Robert Sesek

LGTM https://codereview.chromium.org/2961373002/diff/960001/chrome/common/safe_browsing/zip_analyzer.cc File chrome/common/safe_browsing/zip_analyzer.cc (right): https://codereview.chromium.org/2961373002/diff/960001/chrome/common/safe_browsing/zip_analyzer.cc#newcode21 chrome/common/safe_browsing/zip_analyzer.cc:21: #include "chrome/common/safe_browsing/mach_o_image_reader_mac.h" I think this will probably need ...

3 years, 4 months ago (2017-08-03 18:36:31 UTC) #173

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 4 months ago (2017-08-03 18:44:56 UTC) #174

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/980001

3 years, 4 months ago (2017-08-03 18:45:11 UTC) #175

mortonm

https://codereview.chromium.org/2961373002/diff/960001/chrome/common/safe_browsing/zip_analyzer.cc File chrome/common/safe_browsing/zip_analyzer.cc (right): https://codereview.chromium.org/2961373002/diff/960001/chrome/common/safe_browsing/zip_analyzer.cc#newcode21 chrome/common/safe_browsing/zip_analyzer.cc:21: #include "chrome/common/safe_browsing/mach_o_image_reader_mac.h" On 2017/08/03 18:36:31, Robert Sesek wrote: > ...

3 years, 4 months ago (2017-08-03 18:45:22 UTC) #176

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 4 months ago (2017-08-03 21:05:25 UTC) #177

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: mac_chromium_rel_ng on master.tryserver.chromium.mac (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.mac/builders/mac_chromium_rel_ng/builds/516398)

3 years, 4 months ago (2017-08-03 21:05:27 UTC) #178

mortonm

The CQ bit was checked by mortonm@google.com to run a CQ dry run

3 years, 4 months ago (2017-08-03 23:03:38 UTC) #179

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/980001

3 years, 4 months ago (2017-08-03 23:03:45 UTC) #180

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

3 years, 4 months ago (2017-08-04 00:14:41 UTC) #181

commit-bot: I haz the power

Dry run: This issue passed the CQ dry run.

3 years, 4 months ago (2017-08-04 00:14:43 UTC) #182

mortonm

The patchset sent to the CQ was uploaded after l-g-t-m from palmer@chromium.org, jialiul@chromium.org, satorux@chromium.org, rsesek@chromium.org ...

3 years, 4 months ago (2017-08-04 14:53:51 UTC) #184

commit-bot: I haz the power

CQ is trying da patch. Follow status at: https://chromium-cq-status.appspot.com/v2/patch-status/codereview.chromium.org/2961373002/980001

3 years, 4 months ago (2017-08-04 14:53:57 UTC) #185

commit-bot: I haz the power

CQ is committing da patch. Bot data: {"patchset_id": 980001, "attempt_start_ts": 1501858430463320, "parent_rev": "c4c3f6423d629ca803771737becb94ccf9073b61", "commit_rev": "034ecb569929e1202f39cf744a32e8deeade06c8"}

3 years, 4 months ago (2017-08-04 14:57:45 UTC) #186

commit-bot: I haz the power

Description was changed from ========== Improve Zip File Scanning on Mac This CL fixes two ...

3 years, 4 months ago (2017-08-04 14:58:11 UTC) #187

commit-bot: I haz the power

3 years, 4 months ago (2017-08-04 14:58:14 UTC) #188

Message was sent while issue was closed.

Committed patchset #26 (id:980001) as
https://chromium.googlesource.com/chromium/src/+/034ecb569929e1202f39cf744a32...

Issue 2961373002: Improve Zip File Scanning on Mac (Closed)

Description

Patch Set 1 : crude magic check working #

Patch Set 2 : gonna implement cap first before moving on #

Patch Set 3 : removed temp dir stuff #

Patch Set 4 : addressing comments #

Patch Set 5 : added makefile #

Patch Set 6 : changing name of executable in zip #

Patch Set 7 : ready for review #

Patch Set 8 : improving readability in zip_analyzer.cc #

Patch Set 9 : removing check for .app on windows #

Patch Set 10 : addressing comments #

Patch Set 11 : addressing comments #

Patch Set 12 : addressing comments #

Patch Set 13 : refactoring in zip_reader #

Patch Set 14 : comment on return value #

Patch Set 15 : incorporating new zip_reader functionality into existing functions #

Patch Set 16 : adding debug statements for android #

Patch Set 17 : debugging int64_t on android #

Patch Set 18 : removing argument from ExtractCurrentEntryToString() #

Patch Set 19 : minor #

Patch Set 20 : slight refactoring in zip_reader #

Patch Set 21 : testing #

Patch Set 22 : avoiding multiple calls to unzReadCurrentFile #

Patch Set 23 : addressing comments #

Patch Set 24 : final mods #

Patch Set 25 : comments #

Patch Set 26 : minor #

Messages