Issue 539323002: [Cast] Sanity-check for unbounded queue growth in PacedSender.

Issue 539323002: [Cast] Sanity-check for unbounded queue growth in PacedSender. (Closed)

Created:
6 years, 3 months ago by miu

Modified:
6 years, 3 months ago

Reviewers:
Alpha Left Google

CC:
chromium-reviews, hclam+watch_chromium.org, cbentzel+watch_chromium.org, imcheng+watch_chromium.org, hguihot+watch_chromium.org, jasonroberts+watch_google.com, avayvod+watch_chromium.org, pwestin+watch_google.com, feature-media-reviews_chromium.org, miu+watch_chromium.org, hubbe+watch_chromium.org, mikhal+watch_chromium.org

Base URL:
https://chromium.googlesource.com/chromium/src.git@master

Project:
chromium

Visibility:
Public.

More Reviews

Description

[Cast] Sanity-check for unbounded queue growth in PacedSender. Removes the spammy warning message, and replaces it with a sanity- check that the queue has not grown past 10 seconds' worth of packets. If the sanity-check trips, a process crash dump will be sent to make us aware, via crash reports, that this is a problem in-the-wild. Committed: https://crrev.com/1904e8682930a06460ffc323cc8b1c88c5f4966b Cr-Commit-Position: refs/heads/master@{#294928}

Patch Set 1 #

Patch Set 2 : Generate crash report if impossible situation is reached. #

Patch Set 3 : No packet dropping. #

Created: 6 years, 3 months ago

Download [raw] [tar.bz2]

		Unified diffs	Side-by-side diffs	Delta from patch set	Stats (+19 lines, -8 lines)			Patch
	M	media/cast/net/pacing/paced_sender.h	View	1 2	1 chunk	+2 lines, -0 lines	0 comments	Download
	M	media/cast/net/pacing/paced_sender.cc	View	1 2	5 chunks	+17 lines, -8 lines	0 comments	Download

Messages

Total messages: 18 (2 generated)

Expand Messages | Collapse Messages | Show Generated Messages | Hide Generated Messages

Alpha Left Google

What bug is this going to fix? We should never RTCP packets, in fact the ...

6 years, 3 months ago (2014-09-04 22:39:52 UTC) #3

miu

On 2014/09/04 22:39:52, Alpha wrote: > What bug is this going to fix? We should ...

6 years, 3 months ago (2014-09-05 18:38:01 UTC) #4

Alpha Left Google

> IMO, this is the right approach, since 10 seconds' worth of packets is a ...

6 years, 3 months ago (2014-09-05 18:48:29 UTC) #5

miu

On 2014/09/05 18:48:29, Alpha wrote: > > IMO, this is the right approach, since 10 ...

6 years, 3 months ago (2014-09-05 20:50:34 UTC) #6

On 2014/09/05 18:48:29, Alpha wrote:
> > IMO, this is the right approach, since 10 seconds' worth of packets is a
huge
> > upper-bound.  It can easily be 50+ MB of data held in memory.  If this point
> is
> > reached, it is a strong signal that the queue is growing faster than it is
> > emptying.  Also, it's perfectly fine to not send anything because packets
are
> > dropped in downstream code anyway (or as UDP packets on the network).  The
> > callers of the Send/Resend methods will simply attempt to re-send packets
when
> > the ACKs are not received.
> 
> Would the easier to just to raise the limit?

I don't understand.  Raise what limit?

> If we do accumulate this much data it seems worse not to send it (and silently
> drop it). Receiver doesn't know there's subsequent data. And really have this
> much data queued indicates something is wrong and if it seems worse developer
> cannot detect it.

The point is, if this much data accumulates in the PacedSender, it will NEVER be
able to empty the queue.  I totally agree that having this much data queued
means something is wrong.  That's why I've left in WARNING messages.  I can
change to LOG(DFATAL) if you feel that's more appropriate.

BTW--It's not true that the receiver doesn't know about subsequent data.  Two
reasons:

1. FrameSender::ResendForKickstart().

2. After some packets are sent, PacedSender will once again allow packets to be
enqueued.  The caller of SendPackets() will send packets for later frames, and
then the receiver will notice that some packets were dropped in-between.

> Currently the only possible way to lose a packet is socket error or lost in
the
> network. I would prefer not to add other ways that packets can be dropped.

Packets are never going to be dropped, except when the 10-second queue length
limit is reached.  That should never happen unless we have a bug in our
implementation.  And if we do have a bug, at least we won't gobble-up memory
indefinitely until Chrome crashes.  ;-)

Alpha Left Google

On 2014/09/05 20:50:34, miu wrote: > On 2014/09/05 18:48:29, Alpha wrote: > > > IMO, ...

6 years, 3 months ago (2014-09-08 18:26:28 UTC) #7

On 2014/09/05 20:50:34, miu wrote:
> On 2014/09/05 18:48:29, Alpha wrote:
> > > IMO, this is the right approach, since 10 seconds' worth of packets is a
> huge
> > > upper-bound.  It can easily be 50+ MB of data held in memory.  If this
point
> > is
> > > reached, it is a strong signal that the queue is growing faster than it is
> > > emptying.  Also, it's perfectly fine to not send anything because packets
> are
> > > dropped in downstream code anyway (or as UDP packets on the network).  The
> > > callers of the Send/Resend methods will simply attempt to re-send packets
> when
> > > the ACKs are not received.
> > 
> > Would the easier to just to raise the limit?
> 
> I don't understand.  Raise what limit?

https://code.google.com/p/chromium/codesearch#chromium/src/media/cast/net/pac...

The limit is 100, wouldn't it be simple to raise it to something like
kMaxPacketsAllowedEnqueued. If the goal is to have developer notice such a case
and not be annoying.

> 
> > If we do accumulate this much data it seems worse not to send it (and
silently
> > drop it). Receiver doesn't know there's subsequent data. And really have
this
> > much data queued indicates something is wrong and if it seems worse
developer
> > cannot detect it.
> 
> The point is, if this much data accumulates in the PacedSender, it will NEVER
be
> able to empty the queue.  I totally agree that having this much data queued
> means something is wrong.  That's why I've left in WARNING messages.  I can
> change to LOG(DFATAL) if you feel that's more appropriate.
> 
> BTW--It's not true that the receiver doesn't know about subsequent data.  Two
> reasons:
> 
> 1. FrameSender::ResendForKickstart().
> 
> 2. After some packets are sent, PacedSender will once again allow packets to
be
> enqueued.  The caller of SendPackets() will send packets for later frames, and
> then the receiver will notice that some packets were dropped in-between.
> 
> > Currently the only possible way to lose a packet is socket error or lost in
> the
> > network. I would prefer not to add other ways that packets can be dropped.
> 
> Packets are never going to be dropped, except when the 10-second queue length
> limit is reached.  That should never happen unless we have a bug in our
> implementation.  And if we do have a bug, at least we won't gobble-up memory
> indefinitely until Chrome crashes.  ;-)

I'm just not sure how dropping packets if exceed kMaxPacketsAllowedEnqueued can
help, because the situation we're talking about is our implementation is
critically broken. Having it running continuously undetected is not ideal. What
about having PacedSender flow the error to the Javascript API such that we can
terminate the session with an error logged in the extension?

hubbe

On 2014/09/08 18:26:28, Alpha wrote: > On 2014/09/05 20:50:34, miu wrote: > > On 2014/09/05 ...

6 years, 3 months ago (2014-09-08 20:33:54 UTC) #8

On 2014/09/08 18:26:28, Alpha wrote:
> On 2014/09/05 20:50:34, miu wrote:
> > On 2014/09/05 18:48:29, Alpha wrote:
> > > > IMO, this is the right approach, since 10 seconds' worth of packets is a
> > huge
> > > > upper-bound.  It can easily be 50+ MB of data held in memory.  If this
> point
> > > is
> > > > reached, it is a strong signal that the queue is growing faster than it
is
> > > > emptying.  Also, it's perfectly fine to not send anything because
packets
> > are
> > > > dropped in downstream code anyway (or as UDP packets on the network). 
The
> > > > callers of the Send/Resend methods will simply attempt to re-send
packets
> > when
> > > > the ACKs are not received.
> > > 
> > > Would the easier to just to raise the limit?
> > 
> > I don't understand.  Raise what limit?
> 
>
https://code.google.com/p/chromium/codesearch#chromium/src/media/cast/net/pac...
> 
> The limit is 100, wouldn't it be simple to raise it to something like
> kMaxPacketsAllowedEnqueued. If the goal is to have developer notice such a
case
> and not be annoying.
> 
> > 
> > > If we do accumulate this much data it seems worse not to send it (and
> silently
> > > drop it). Receiver doesn't know there's subsequent data. And really have
> this
> > > much data queued indicates something is wrong and if it seems worse
> developer
> > > cannot detect it.
> > 
> > The point is, if this much data accumulates in the PacedSender, it will
NEVER
> be
> > able to empty the queue.  I totally agree that having this much data queued
> > means something is wrong.  That's why I've left in WARNING messages.  I can
> > change to LOG(DFATAL) if you feel that's more appropriate.
> > 
> > BTW--It's not true that the receiver doesn't know about subsequent data. 
Two
> > reasons:
> > 
> > 1. FrameSender::ResendForKickstart().
> > 
> > 2. After some packets are sent, PacedSender will once again allow packets to
> be
> > enqueued.  The caller of SendPackets() will send packets for later frames,
and
> > then the receiver will notice that some packets were dropped in-between.
> > 
> > > Currently the only possible way to lose a packet is socket error or lost
in
> > the
> > > network. I would prefer not to add other ways that packets can be dropped.
> > 
> > Packets are never going to be dropped, except when the 10-second queue
length
> > limit is reached.  That should never happen unless we have a bug in our
> > implementation.  And if we do have a bug, at least we won't gobble-up memory
> > indefinitely until Chrome crashes.  ;-)
> 
> I'm just not sure how dropping packets if exceed kMaxPacketsAllowedEnqueued
can
> help, because the situation we're talking about is our implementation is
> critically broken. Having it running continuously undetected is not ideal.
What
> about having PacedSender flow the error to the Javascript API such that we can
> terminate the session with an error logged in the extension?

Undetected problems are horrible. I suggest that in addition to dropping packets
if
we reach kMaxPacketsAllowedEnqueued, we should also call DumpWithoutCrashing().
That
way we will know if the impossible is less impossible than we thought.

     /Hubbe

miu

PTAL. Patch Set 2 incorporates fixes, plus Hubbe's idea about crash reporting when the "impossible ...

6 years, 3 months ago (2014-09-08 22:01:29 UTC) #9

Alpha Left Google

> What I don't agree with is that you see to be arguing we do ...

6 years, 3 months ago (2014-09-11 01:04:16 UTC) #10

Alpha Left Google

On 2014/09/11 01:04:16, Alpha wrote: > > What I don't agree with is that you ...

6 years, 3 months ago (2014-09-11 01:10:00 UTC) #11

miu

New patch set (#3). PTAL. I took out all the packet dropping. All this change ...

6 years, 3 months ago (2014-09-15 21:29:20 UTC) #12

Alpha Left Google

On 2014/09/15 21:29:20, miu wrote: > New patch set (#3). PTAL. > > I took ...

6 years, 3 months ago (2014-09-15 21:30:56 UTC) #13

miu

On 2014/09/15 21:30:56, Alpha wrote: > On 2014/09/15 21:29:20, miu wrote: > > New patch ...

6 years, 3 months ago (2014-09-15 21:33:15 UTC) #14

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/patchset/539323002/40001

6 years, 3 months ago (2014-09-15 22:04:19 UTC) #16

commit-bot: I haz the power

Committed patchset #3 (id:40001) as d07112e514112b45a62c7f2685332099d4d4ffa3

6 years, 3 months ago (2014-09-15 23:46:04 UTC) #17

commit-bot: I haz the power

6 years, 3 months ago (2014-09-15 23:48:35 UTC) #18

Message was sent while issue was closed.

Patchset 3 (id:??) landed as
https://crrev.com/1904e8682930a06460ffc323cc8b1c88c5f4966b
Cr-Commit-Position: refs/heads/master@{#294928}

Expand Messages | Collapse Messages | Show Generated Messages | Hide Generated Messages