Issue 454283002: Fix the issue 'SEGV on unknown address in CPDF_DataAvail::GetObjectSize'

Issue 454283002: Fix the issue 'SEGV on unknown address in CPDF_DataAvail::GetObjectSize' (Closed)

Created:
6 years, 4 months ago by jun_fang

Modified:
5 years, 9 months ago

Reviewers:
Tom Sepez, palmer, Bo Xu

Base URL:
https://pdfium.googlesource.com/pdfium.git@master

Visibility:
Public.

Description

Fix the issue 'SEGV on unknown address in CPDF_DataAvail::GetObjectSize' BUG=387983 R=tsepez@chromium.org Committed: https://pdfium.googlesource.com/pdfium/+/c655167

Patch Set 2 : #

Created: 6 years, 4 months ago

Download [raw] [tar.bz2]

		Unified diffs	Side-by-side diffs	Delta from patch set	Stats (+138 lines, -133 lines)			Patch
	M	core/include/fpdfapi/fpdf_parser.h	View	1	1 chunk	+114 lines, -116 lines	0 comments	Download
	M	core/src/fpdfapi/fpdf_parser/fpdf_parser_parser.cpp	View	1	3 chunks	+24 lines, -17 lines	0 comments	Download

Messages

Total messages: 12 (0 generated)

Expand Messages | Collapse Messages

Tom Sepez

As an outside observer to this code, I'm a little confused by the logic and ...

6 years, 4 months ago (2014-08-09 17:55:36 UTC) #2

jun_fang

https://codereview.chromium.org/454283002/diff/1/core/src/fpdfapi/fpdf_parser/fpdf_parser_parser.cpp File core/src/fpdfapi/fpdf_parser/fpdf_parser_parser.cpp (right): https://codereview.chromium.org/454283002/diff/1/core/src/fpdfapi/fpdf_parser/fpdf_parser_parser.cpp#newcode4110 core/src/fpdfapi/fpdf_parser/fpdf_parser_parser.cpp:4110: if (!m_pFileAvail->IsDataAvail(m_dwLastXRefOffset, (FX_DWORD)(m_dwFileLen - m_dwLastXRefOffset))) { On 2014/08/09 17:55:36, ...

6 years, 4 months ago (2014-08-18 06:58:00 UTC) #3

https://codereview.chromium.org/454283002/diff/1/core/src/fpdfapi/fpdf_parser...
File core/src/fpdfapi/fpdf_parser/fpdf_parser_parser.cpp (right):

https://codereview.chromium.org/454283002/diff/1/core/src/fpdfapi/fpdf_parser...
core/src/fpdfapi/fpdf_parser/fpdf_parser_parser.cpp:4110: if
(!m_pFileAvail->IsDataAvail(m_dwLastXRefOffset, (FX_DWORD)(m_dwFileLen -
m_dwLastXRefOffset))) {
On 2014/08/09 17:55:36, Tom Sepez wrote:
> Is this going to change on subsequent calls?  In other words, if we try this
> once and it fails, is there ever a possibility it might succeed? It doesn't
seem
> that way (unless the file is being created underneath us) yet we end up
> repeating it. Should this move under the if (!m_bmainXrefLoad) branch below?

If IsDataAvail returns FALSE, that means that data are not available completely
for parsing them into XRefTable which is very important data structure in PDF.
So it just needs to return FALSE rather than loading XRefTable because data for
parsing are not available.

https://codereview.chromium.org/454283002/diff/1/core/src/fpdfapi/fpdf_parser...
core/src/fpdfapi/fpdf_parser/fpdf_parser_parser.cpp:4115: if (!m_bMainXRefLoad)
{
On 2014/08/09 17:55:36, Tom Sepez wrote:
> does m_bMainXRefLoad mean "main xref load succeeded" or "main xref load was
> tried once already"?  I think its the later, and if so, the name should be
> m_bMainXrefLoadTried.

if |m_bMainXRefLoad| is true, it means that xref is loaded successfully not the
second meaning you mentioned.

https://codereview.chromium.org/454283002/diff/1/core/src/fpdfapi/fpdf_parser...
core/src/fpdfapi/fpdf_parser/fpdf_parser_parser.cpp:4123: m_bMainXRefLoadedOK =
TRUE;
On 2014/08/09 17:55:36, Tom Sepez wrote:
> What's the difference between m_bMainXrefLoadedOK and m_bLinearizedDataOK?  Is
> it ever the case that one is true without the other?

They are different. |m_bMainXrefLoadedOK is true| means that XRef table is
loaded successfully. |m_bLinearizedDataOK is true| means that data are
downloaded and available for being parsed.

Tom Sepez

On 2014/08/18 06:58:00, jun_fang wrote: > https://codereview.chromium.org/454283002/diff/1/core/src/fpdfapi/fpdf_parser/fpdf_parser_parser.cpp > File core/src/fpdfapi/fpdf_parser/fpdf_parser_parser.cpp (right): > > https://codereview.chromium.org/454283002/diff/1/core/src/fpdfapi/fpdf_parser/fpdf_parser_parser.cpp#newcode4110 > ...

6 years, 4 months ago (2014-08-18 17:07:34 UTC) #4

On 2014/08/18 06:58:00, jun_fang wrote:
>
https://codereview.chromium.org/454283002/diff/1/core/src/fpdfapi/fpdf_parser...
> File core/src/fpdfapi/fpdf_parser/fpdf_parser_parser.cpp (right):
> 
>
https://codereview.chromium.org/454283002/diff/1/core/src/fpdfapi/fpdf_parser...
> core/src/fpdfapi/fpdf_parser/fpdf_parser_parser.cpp:4110: if
> (!m_pFileAvail->IsDataAvail(m_dwLastXRefOffset, (FX_DWORD)(m_dwFileLen -
> m_dwLastXRefOffset))) {
> On 2014/08/09 17:55:36, Tom Sepez wrote:
> > Is this going to change on subsequent calls?  In other words, if we try this
> > once and it fails, is there ever a possibility it might succeed? It doesn't
> seem
> > that way (unless the file is being created underneath us) yet we end up
> > repeating it. Should this move under the if (!m_bmainXrefLoad) branch below?
> 
> If IsDataAvail returns FALSE, that means that data are not available
completely
> for parsing them into XRefTable which is very important data structure in PDF.
> So it just needs to return FALSE rather than loading XRefTable because data
for
> parsing are not available.
> 
> 
Right.  So why retry the operation rather than remembering that it failed and
return FALSE?

https://codereview.chromium.org/454283002/diff/1/core/src/fpdfapi/fpdf_parser...
> core/src/fpdfapi/fpdf_parser/fpdf_parser_parser.cpp:4115: if
(!m_bMainXRefLoad)
> {
> On 2014/08/09 17:55:36, Tom Sepez wrote:
> > does m_bMainXRefLoad mean "main xref load succeeded" or "main xref load was
> > tried once already"?  I think its the later, and if so, the name should be
> > m_bMainXrefLoadTried.
> 
> if |m_bMainXRefLoad| is true, it means that xref is loaded successfully not
the
> second meaning you mentioned.
> 
Right.  But wouldn't the first one be more useful?  Since we could avoid a retry
and return FALSE?  We'd have to remember the status the last time around ...

>
https://codereview.chromium.org/454283002/diff/1/core/src/fpdfapi/fpdf_parser...
> core/src/fpdfapi/fpdf_parser/fpdf_parser_parser.cpp:4123: m_bMainXRefLoadedOK
=
> TRUE;
> On 2014/08/09 17:55:36, Tom Sepez wrote:
> > What's the difference between m_bMainXrefLoadedOK and m_bLinearizedDataOK? 
Is
> > it ever the case that one is true without the other?
> 
> They are different. |m_bMainXrefLoadedOK is true| means that XRef table is
> loaded successfully. |m_bLinearizedDataOK is true| means that data are
> downloaded and available for being parsed.
Ok.

Tom Sepez

> > So it just needs to return FALSE rather than loading XRefTable because data ...

6 years, 4 months ago (2014-08-18 17:09:44 UTC) #5

palmer

I guess we just need some clarifying documentation for these fields. https://codereview.chromium.org/454283002/diff/1/core/src/fpdfapi/fpdf_parser/fpdf_parser_parser.cpp File core/src/fpdfapi/fpdf_parser/fpdf_parser_parser.cpp (right): ...

6 years, 4 months ago (2014-08-19 20:26:41 UTC) #6

Tom Sepez

> Wait, then what does |m_bMainXRefLoad| mean that is distinct from > |m_bMainXRefLoadedOK|? Good question. ...

6 years, 4 months ago (2014-08-19 21:56:09 UTC) #7

jun_fang

On 2014/08/19 21:56:09, Tom Sepez wrote: > > Wait, then what does |m_bMainXRefLoad| mean that ...

6 years, 4 months ago (2014-08-21 21:24:06 UTC) #8

Tom Sepez

OK, this is starting to make some sense. When we're done here, we'll need to ...

6 years, 4 months ago (2014-08-21 21:58:23 UTC) #9

jun_fang

6 years, 4 months ago (2014-08-23 00:07:01 UTC) #12

Message was sent while issue was closed.

Committed patchset #2 manually as c655167 (presubmit successful).

Expand Messages | Collapse Messages