Description
content: perf tests for GpuMemoryBuffer mapping and data coherency
The idea is to benchmark how the hardware performs, across different
GpuMemoryBuffer usages, when the native buffer object (bo) is mapped for CPU
access. In particular, this test aims to capture the effects of data coherency
and to answer the following:
- how does memory mapping of GpuMemoryBuffer perform with shared memory
(the fallback case) compared with the native implementation?
- what if the Renderer process (client) only writes into the buffer object?
- what is the effect of reading from write-combining (WC) memory? Can we
avoid readbacks?
- should it be mapped UC and/or WC to get faster access?
- what is the effect of clients doing sequential versus non-sequential writes?
In the latter case, a WC mapping may end up being very slow.
Besides, this CL mimics the Chrome OS vgem_fb_test.c, but uses CC and content to
get close to real-world behavior. It is important to stress native buffers, so
GL and the rest need to work together with a real (non-fake) 3D output surface.
These tests run on top of the content/ architecture.
(Note that the content/child directory is not a strong requirement; I only
placed this perf test there because its foundations were taken from
child_thread_impl_browsertest.cc.)
TEST=content_browsertests --ozone-platform=gbm --ozone-use-surfaceless --no-sandbox --gtest_filter=ChildThreadImplGpuMemoryBufferPerfTests*
BUG=475633
Patch Set 1
Patch Set 2
Total comments: 13
Patch Set 3 : separate tests for each memory operation
Patch Set 4 : add description, move from DRAFT to a real CL
Total comments: 6
Messages
Total messages: 13 (2 generated)