Issue 1315713007: gpu: Reduce GL context switches used to check pending queries.

reveman

reveman@chromium.org changed reviewers: + sievers@chromium.org

5 years, 3 months ago (2015-08-27 20:18:59 UTC) #1

reveman

The CQ bit was checked by reveman@chromium.org to run a CQ dry run

5 years, 3 months ago (2015-08-27 23:28:37 UTC) #3

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/patch-status/1315713007/20001 View timeline at https://chromium-cq-status.appspot.com/patch-timeline/1315713007/20001

5 years, 3 months ago (2015-08-27 23:29:12 UTC) #4

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

5 years, 3 months ago (2015-08-27 23:48:07 UTC) #5

commit-bot: I haz the power

Dry run: Try jobs failed on following builders: linux_android_rel_ng on tryserver.chromium.linux (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.linux/builders/linux_android_rel_ng/builds/61824)

5 years, 3 months ago (2015-08-27 23:48:08 UTC) #6

dshwang

dongseong.hwang@intel.com changed reviewers: + dongseong.hwang@intel.com

5 years, 3 months ago (2015-09-01 08:56:10 UTC) #7

dshwang

Thank you for fixing this issue. looks very good. some nits. https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_command_buffer_stub.cc File content/common/gpu/gpu_command_buffer_stub.cc (right): ...

5 years, 3 months ago (2015-09-01 08:56:11 UTC) #8

dshwang

I checked if this CL resolves the root issue. Unfortunately, not. I measure FPS on ...

5 years, 3 months ago (2015-09-01 10:52:35 UTC) #9

reveman

On 2015/09/01 at 10:52:35, dongseong.hwang wrote: > I checked if this CL resolves the root ...

5 years, 3 months ago (2015-09-01 15:54:11 UTC) #10

On 2015/09/01 at 10:52:35, dongseong.hwang wrote:
> I checked if this CL resolves the root issue. Unfortunately, not.
> 
> I measure FPS on http://webglsamples.org/aquarium/aquarium.html on daisy_sping
> I applied this CL and then check with and without quirk:
https://codereview.chromium.org/1221433002/
> with quirk              without quirk
> 50: 58                        40
> 250: 50                       33
> 1k: 34                        24
> 2k: 22                        19
> 4k: 12                        12
> 
> It's same result I reported in
https://code.google.com/p/chromium/issues/detail?id=522903
> I guess 
> 1. EGL driver is smart enough to do nothing on redundant context change.
> 2. The root issue probably is preemption by scheduler

I was afraid that this would not be enough. We're still doing a lot of context
switches. This just removed a few obviously unnecessary ones. It think this
patch is a good first step. The next step is to start using SignalQuery in cc/
as that would allow us to reduce the number context switches significantly
because it's only really when the compositor has run out of staging buffers that
we need to poll. We can use a pending SignalQuery as an indicator that polling
frequency needs to be relatively high. If there's no pending SignalQuery then
checking queries ones per frame might be enough.

https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_...
File content/common/gpu/gpu_command_buffer_stub.cc (right):

https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_...
content/common/gpu/gpu_command_buffer_stub.cc:341:
task_runner_->PostDelayedTask(
On 2015/09/01 at 08:56:10, dshwang_ooo_5.9-27.9 wrote:
> do we have to delay one more?
> PollWork() is already called via PostDelayedTask considering
|process_delayed_work_time_|.
> In addition, I'm afraid that twice post-tasks delays to perform pending tasks
for unnecessarily long time.

That's what this patch is supposed to improve. If "process_delayed_work_time_ >
current_time" then that means we've checked for pending queries since the task
was posted so performing work here would cause us to do that more frequently
than desired. We need to delay the PerformWork more to avoid that.

https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_...
content/common/gpu/gpu_command_buffer_stub.cc:428:
scheduler_->HasMoreIdleWork()) {
On 2015/09/01 at 08:56:10, dshwang_ooo_5.9-27.9 wrote:
> Don't we need to check (scheduler_->HasPendingQueries() ||
scheduler_->HasMoreIdleWork())?

This code is only for handling idle work. Pending queries should not affect it.

https://codereview.chromium.org/1315713007/diff/40001/gpu/command_buffer/serv...
File gpu/command_buffer/service/gles2_cmd_decoder.cc (right):

https://codereview.chromium.org/1315713007/diff/40001/gpu/command_buffer/serv...
gpu/command_buffer/service/gles2_cmd_decoder.cc:11900: if
(!query_manager_.get())
On 2015/09/01 at 08:56:10, dshwang_ooo_5.9-27.9 wrote:
> if (!HasPendingQueries())

It's up to the caller to do that check if needed. This way the code is more
consistent with PerformIdleWork.

dshwang

https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_command_buffer_stub.cc File content/common/gpu/gpu_command_buffer_stub.cc (right): https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_command_buffer_stub.cc#newcode341 content/common/gpu/gpu_command_buffer_stub.cc:341: task_runner_->PostDelayedTask( On 2015/09/01 15:54:11, reveman wrote: > On 2015/09/01 ...

5 years, 3 months ago (2015-09-01 16:15:50 UTC) #11

dshwang

On 2015/09/01 15:54:11, reveman wrote: > On 2015/09/01 at 10:52:35, dongseong.hwang wrote: > > I ...

5 years, 3 months ago (2015-09-01 16:16:43 UTC) #12

reveman

https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_command_buffer_stub.cc File content/common/gpu/gpu_command_buffer_stub.cc (right): https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_command_buffer_stub.cc#newcode341 content/common/gpu/gpu_command_buffer_stub.cc:341: task_runner_->PostDelayedTask( On 2015/09/01 at 16:15:50, dshwang_ooo_5.9-27.9 wrote: > On ...

5 years, 3 months ago (2015-09-01 16:25:18 UTC) #13

dshwang

https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_command_buffer_stub.cc File content/common/gpu/gpu_command_buffer_stub.cc (right): https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_command_buffer_stub.cc#newcode341 content/common/gpu/gpu_command_buffer_stub.cc:341: task_runner_->PostDelayedTask( On 2015/09/01 16:25:17, reveman wrote: > On 2015/09/01 ...

5 years, 3 months ago (2015-09-01 16:32:18 UTC) #14

dshwang

https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_command_buffer_stub.cc File content/common/gpu/gpu_command_buffer_stub.cc (right): https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_command_buffer_stub.cc#newcode341 content/common/gpu/gpu_command_buffer_stub.cc:341: task_runner_->PostDelayedTask( On 2015/09/01 16:25:17, reveman wrote: > ScheduleDelayedWork is ...

5 years, 3 months ago (2015-09-01 16:37:00 UTC) #15

reveman

https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_command_buffer_stub.cc File content/common/gpu/gpu_command_buffer_stub.cc (right): https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_command_buffer_stub.cc#newcode341 content/common/gpu/gpu_command_buffer_stub.cc:341: task_runner_->PostDelayedTask( On 2015/09/01 at 16:37:00, dshwang_ooo_5.9-27.9 wrote: > On ...

5 years, 3 months ago (2015-09-01 17:02:07 UTC) #16

no sievers

You have to rebase on top of https://codereview.chromium.org/1308913004/. It touches this code but shouldn't really ...

5 years, 3 months ago (2015-09-16 21:04:35 UTC) #18

reveman

The CQ bit was checked by reveman@chromium.org to run a CQ dry run

5 years, 3 months ago (2015-09-16 23:11:29 UTC) #19

reveman

ptal https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_command_buffer_stub.cc File content/common/gpu/gpu_command_buffer_stub.cc (right): https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_command_buffer_stub.cc#newcode381 content/common/gpu/gpu_command_buffer_stub.cc:381: scheduler_->ProcessPendingQueries(); On 2015/09/16 at 21:04:35, sievers wrote: > ...

5 years, 3 months ago (2015-09-16 23:12:08 UTC) #20

ptal

https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_...
File content/common/gpu/gpu_command_buffer_stub.cc (right):

https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_...
content/common/gpu/gpu_command_buffer_stub.cc:381:
scheduler_->ProcessPendingQueries();
On 2015/09/16 at 21:04:35, sievers wrote:
> Why do we really treat pending queries different from idle work?
> Apart from the fact that we already check queries after each handled message
and after flush and finish.

I'm not that familiar with the current use cases for idle work. It was created
for async uploads but is used for other things today. Would be nice to remove
the idle work concept if possible.

> 
> I'm wondering if we can even avoid the MakeCurrent() above in line 355 if we
are not idle. It means we will handle a message very soon and check pending
queries anyhow.

I'm not sure processing of queries as idle work is going to work well. I think
that would result in even worse stalls when running out of staging buffers than
we have today.

While flush and finish will often be enough to process pending queries, it's
important that we check them frequently in situations where the compositor
depends on them to make progress when initializing tiles. e.g. when raster is
fast enough that we run out of staging buffers.

My plan for reducing processing of queries when not needed but increasing the
interval when needed is to use SignalQuery API from the compositor when we run
out of staging buffers to tell the GPU service that it's now important to
process queries. I'd prefer to keep PerformIdleWork and ProcessPendingQueries
separated for now.

https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_...
content/common/gpu/gpu_command_buffer_stub.cc:409: process_delayed_work_time_ =
current_time + delay;
On 2015/09/16 at 21:04:35, sievers wrote:
> This keeps pushing out the time though if called repeatedly, so it's a bit
counterintuitive if you just look at ScheduleDelayedWork() in isolation (the
previous implementation has an early return if already scheduled). Is this
because calling this *should* imply that you also probably just did PollWork()
or weren't idle (i.e. handled a message and also at least processed queries
then)?
> 
> I'm just ever so slightly worried that it could be misconceived and somebody
adds a call to this somewhere with the intention of 'making sure work is
scheduled'. Maybe a comment above the function is all it needs.

Improved the comment in the header file to make this more clear.

commit-bot: I haz the power

Dry run: CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/patch-status/1315713007/60001 View timeline at https://chromium-cq-status.appspot.com/patch-timeline/1315713007/60001

5 years, 3 months ago (2015-09-16 23:12:55 UTC) #21

no sievers

lgtm https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_command_buffer_stub.cc File content/common/gpu/gpu_command_buffer_stub.cc (right): https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_command_buffer_stub.cc#newcode381 content/common/gpu/gpu_command_buffer_stub.cc:381: scheduler_->ProcessPendingQueries(); On 2015/09/16 23:12:08, reveman wrote: > On ...

5 years, 3 months ago (2015-09-17 00:34:38 UTC) #22

lgtm

https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_...
File content/common/gpu/gpu_command_buffer_stub.cc (right):

https://codereview.chromium.org/1315713007/diff/40001/content/common/gpu/gpu_...
content/common/gpu/gpu_command_buffer_stub.cc:381:
scheduler_->ProcessPendingQueries();
On 2015/09/16 23:12:08, reveman wrote:
> On 2015/09/16 at 21:04:35, sievers wrote:
> > Why do we really treat pending queries different from idle work?
> > Apart from the fact that we already check queries after each handled message
> and after flush and finish.
> 
> I'm not that familiar with the current use cases for idle work. It was created
> for async uploads but is used for other things today. Would be nice to remove
> the idle work concept if possible.
> 
> > 
> > I'm wondering if we can even avoid the MakeCurrent() above in line 355 if we
> are not idle. It means we will handle a message very soon and check pending
> queries anyhow.
> 
> I'm not sure processing of queries as idle work is going to work well. I think
> that would result in even worse stalls when running out of staging buffers
than
> we have today.
> 
> While flush and finish will often be enough to process pending queries, it's
> important that we check them frequently in situations where the compositor
> depends on them to make progress when initializing tiles. e.g. when raster is
> fast enough that we run out of staging buffers.
> 
> My plan for reducing processing of queries when not needed but increasing the
> interval when needed is to use SignalQuery API from the compositor when we run
> out of staging buffers to tell the GPU service that it's now important to
> process queries. I'd prefer to keep PerformIdleWork and ProcessPendingQueries
> separated for now.
> 

ok sounds good. i was just thinking that because we already process queries
above after each message being handled, that i was wondering how much value it
adds to do it here for |!is_idle|. it seems that would occur when we have
messages pending (!is_idle) but are not processing them because we are
descheduled. so probably important enough to consider.

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

5 years, 3 months ago (2015-09-17 00:56:27 UTC) #23

commit-bot: I haz the power

Dry run: This issue passed the CQ dry run.

5 years, 3 months ago (2015-09-17 00:56:28 UTC) #24

reveman

The patchset sent to the CQ was uploaded after l-g-t-m from dongseong.hwang@intel.com Link to the ...

5 years, 3 months ago (2015-09-17 19:55:55 UTC) #26

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/patch-status/1315713007/60001 View timeline at https://chromium-cq-status.appspot.com/patch-timeline/1315713007/60001

5 years, 3 months ago (2015-09-17 19:56:18 UTC) #27

commit-bot: I haz the power

The CQ bit was unchecked by commit-bot@chromium.org

5 years, 3 months ago (2015-09-17 20:08:36 UTC) #28

commit-bot: I haz the power

Try jobs failed on following builders: chromium_presubmit on tryserver.chromium.linux (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.linux/builders/chromium_presubmit/builds/101128)

5 years, 3 months ago (2015-09-17 20:08:37 UTC) #29

reveman

reveman@chromium.org changed reviewers: + boliu@chromium.org

5 years, 3 months ago (2015-09-17 20:30:53 UTC) #30

boliu

So I guess this is just a rename in InProcessCommandBuffer, but there isn't actually any ...

5 years, 3 months ago (2015-09-17 20:44:06 UTC) #32

reveman

On 2015/09/17 at 20:44:06, boliu wrote: > So I guess this is just a rename ...

5 years, 3 months ago (2015-09-17 21:33:38 UTC) #33

boliu

On 2015/09/17 21:33:38, reveman wrote: > On 2015/09/17 at 20:44:06, boliu wrote: > > So ...

5 years, 3 months ago (2015-09-17 21:36:13 UTC) #34

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/patch-status/1315713007/60001 View timeline at https://chromium-cq-status.appspot.com/patch-timeline/1315713007/60001

5 years, 3 months ago (2015-09-17 21:38:05 UTC) #36

commit-bot: I haz the power

5 years, 3 months ago (2015-09-17 21:46:48 UTC) #38

Message was sent while issue was closed.

Patchset 4 (id:??) landed as
https://crrev.com/87580eb4e9a648eff277e8e2883f9c1616877285
Cr-Commit-Position: refs/heads/master@{#349502}

Issue 1315713007: gpu: Reduce GL context switches used to check pending queries. (Closed)

Description

Patch Set 1 #

Patch Set 2 : Fix InProcessCommandBuffer #

Patch Set 3 : webview fix #

Patch Set 4 : rebase #

Messages