Description

Allocate stack item for fragment context elements once
Instead of allocating stack items for fragment context elements on
demand, do it once when creating the fragment context and re-use the
stack item during parsing. This avoids excessive calls to malloc when
parsing for innerHTML.
BUG=318711
Committed: https://src.chromium.org/viewvc/blink?view=rev&revision=162047
Patch Set 1
Patch Set 2: Allocate stack item in fragment constructor
Total comments: 2
Patch Set 3: Get rid of PassRefPtr
Messages
Total messages: 18 (0 generated)
Eric, was it something like this you had in mind for avoiding the malloc just for namespace checks? I tried to verify that this did indeed affect http://jsperf.com/parse-html-type but didn't come up with any conclusive numbers; the difference seemed to be in the noise for me. But looking at the results from 'perf', and annotating time spent inside HTMLTreeBuilder::constructTree, this absolutely looks like an improvement.
Side-note: HTMLElementStack::hasOnlyOneElement looks like a candidate for inlining (now).
This is OK, but I think it would be better to just keep m_adjustedCurrentStackItemForFragment as a member, and (lazily?) allocate it only for fragments. Then this can return a reference (instead of a RefPtr!) and all of these allocs go away.
Thanks for writing this up. Sorry I was slow to review.
On 2013/11/14 01:49:43, eseidel wrote:
> This is OK, but I think it would be better to just keep
> m_adjustedCurrentStackItemForFragment as a member, and (lazily?) allocate it
> only for fragments.
>
> Then this can return a reference (instead of a RefPtr!) and all of these allocs
> go away.

That actually sounds like the first patch I wrote, but I worried that it wouldn't get rid of the malloc, just push it earlier. It does sound like the right solution, though. I'll update the patch.
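For reference, a rough sketch of the member-cached approach being discussed (illustrative only, not the committed patch; the exact constructor arguments and the ItemForContextElement tag are assumed here):

    // Sketch: build the HTMLStackItem for the fragment's context element once,
    // when the fragment parsing context is created, instead of calling
    // HTMLStackItem::create() for every namespace check during parsing.
    class FragmentParsingContext {
    public:
        FragmentParsingContext(DocumentFragment* fragment, Element* contextElement)
            : m_fragment(fragment)
            , m_contextElement(contextElement)
            // One allocation, up front, reused for the whole parse.
            , m_contextElementStackItem(HTMLStackItem::create(contextElement, HTMLStackItem::ItemForContextElement))
        {
        }

        // Non-owning accessor: callers only inspect the item, so there is no
        // per-call allocation or ref-count churn.
        HTMLStackItem* contextElementStackItem() const { return m_contextElementStackItem.get(); }

    private:
        DocumentFragment* m_fragment;
        Element* m_contextElement;
        RefPtr<HTMLStackItem> m_contextElementStackItem;
    };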
We could also just include a stack-allocated adjusted stack item? Or can't we just return m_stack[0]? Won't that always be the context element? Or I guess that's the <html>?
Keeping innerHTML down to the minimum number of mallocs is actually important. When we very first refactored the HTML parser for threading we added a bunch of mallocs to the null-case (innerHTML='') and it was a severe perf regression. These days we special case innerHTML='' to clear fast, and at one point we special cased innerHTML='text' or talked about it. innerHTML of just a couple tags can be quite common and will be dominated by all the rest of the parser overhead if we're not careful.
On 2013/11/14 06:46:20, eseidel wrote:
> [...] at one point we special cased innerHTML='text' or talked about it.

Indeed we special case it (and the empty string case). https://code.google.com/p/chromium/codesearch#chromium/src/third_party/WebKit...
We could do better than that, though. We could short-cut even earlier by scanning the innerHTML string for the characters '<' (tags), '&' (entities) and '\0' (which the parser would have to strip), and if none are present, just cut over to setInnerText/setTextContent instead.
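A rough sketch of that kind of short-cut (hypothetical helper names, simplified signatures; not existing Blink code):

    // If the markup contains no tags, no entities and no NUL characters, it is
    // plain text and can go through setTextContent() instead of the HTML parser.
    static bool isPlainTextMarkup(const String& markup)
    {
        for (unsigned i = 0; i < markup.length(); ++i) {
            UChar c = markup[i];
            if (c == '<' || c == '&' || c == '\0')
                return false;
        }
        return true;
    }

    void setInnerHTMLFast(Element& element, const String& markup)
    {
        if (isPlainTextMarkup(markup)) {
            // Skips the tokenizer and tree builder entirely.
            element.setTextContent(markup);
            return;
        }
        // Otherwise fall back to the regular fragment-parsing path.
        element.setInnerHTML(markup);
    }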
On 2013/11/14 06:44:47, eseidel wrote:
> We could also just include a stack-allocated adjusted stack item? Or can't we
> just return m_stack[0]? Won't that always be the context element? Or I guess
> that's the <html>?

Right, a new empty <html> seems to be what the spec says, even though we seem to have a slight variation on this: https://code.google.com/p/chromium/codesearch#chromium/src/third_party/WebKit... So not usable for the context element, I believe.
Yeah, that's one example of the crazy we had to add in order to keep innerHTML fast after refactoring the parser. :)
I've conducted some performance measurements comparing before/after this patch.

The short test is:

    <script>
    var testString = "<span>apa</span>";
    var elm = document.createElement('div');
    t = function() { elm.innerHTML = testString; };
    for (var i = 0; i < 10000; i++) { t(); }
    console.log('done');
    </script>

or the long test, which is identical except for the testString line:

    var testString = (new Array(500)).join("<span>apa</span>");

I measure using 'perf record -f $content_shell_binary --single-process <url-to-test>'

(Excuse the cropped lines...)

The top of the profile stack, before patch, long test:

    7.34%  content_shell-n  content_shell-no-patch  [.] WebCore::HTMLTokenizer::nextToken(WebCore:
    5.71%  content_shell-n  content_shell-no-patch  [.] WebCore::HTMLConstructionSite::executeQueu
    5.48%  content_shell-n  content_shell-no-patch  [.] WebCore::HTMLTreeBuilder::constructTree(We
    3.95%  content_shell-n  content_shell-no-patch  [.] tc_malloc
    3.87%  content_shell-n  content_shell-no-patch  [.] WebCore::HTMLTreeBuilder::processStartTagF
    2.67%  content_shell-n  content_shell-no-patch  [.] tc_free

The top of the profile stack, after patch, long test:

    7.43%  content_shell  content_shell  [.] WebCore::HTMLConstructionSite::executeQu
    7.10%  content_shell  content_shell  [.] WebCore::HTMLTokenizer::nextToken(WebCor
    4.01%  content_shell  content_shell  [.] WebCore::HTMLTreeBuilder::processStartTa
    3.01%  content_shell  content_shell  [.] WebCore::HTMLTreeBuilder::constructTree(
    2.83%  content_shell  content_shell  [.] WebCore::HTMLDocumentParser::constructTr
    2.65%  content_shell  content_shell  [.] WTF::HashTableAddResult<WTF::HashTableIt
    2.65%  content_shell  content_shell  [.] WebCore::ContainerNode::removeChildren()
    2.48%  content_shell  content_shell  [.] tc_malloc

Observation: less malloc pressure, constructTree down from 5.48% to 3.01%, and more of the time goes to executeQueuedTasks and nextToken.

The top of the profile stack, before patch, short test:

    6.79%  content_shell-n  content_shell-no-patch  [.] WebCore::HTMLTokenizer::nextToken(WebCore::
    5.88%  content_shell-n  content_shell-no-patch  [.] WebCore::HTMLTreeBuilder::constructTree(Web
    5.69%  content_shell-n  content_shell-no-patch  [.] WebCore::HTMLConstructionSite::executeQueue
    4.49%  content_shell-n  content_shell-no-patch  [.] tc_malloc
    3.94%  content_shell-n  content_shell-no-patch  [.] WebCore::HTMLTreeBuilder::processStartTagFo
    2.81%  content_shell-n  content_shell-no-patch  [.] tc_free
    2.68%  content_shell-n  content_shell-no-patch  [.] WebCore::ContainerNode::removeChildren()

The top of the profile stack, after patch, short test:

    7.24%  content_shell  content_shell  [.] WebCore::HTMLTokenizer::nextToken(WebCore
    7.08%  content_shell  content_shell  [.] WebCore::HTMLConstructionSite::executeQue
    4.42%  content_shell  content_shell  [.] WebCore::HTMLTreeBuilder::processStartTag
    2.84%  content_shell  content_shell  [.] WebCore::HTMLTreeBuilder::constructTree(W
    2.71%  content_shell  content_shell  [.] WTF::HashTableAddResult<WTF::HashTableIte
    2.47%  content_shell  content_shell  [.] WebCore::ContainerNode::removeChildren()
    2.47%  content_shell  content_shell  [.] WebCore::HTMLDocumentParser::constructTre
    2.26%  content_shell  content_shell  [.] tc_malloc

Observations: an even more dramatic drop for malloc and constructTree; otherwise in line with the results above.

(Feedback on the testing procedure appreciated. There are probably many with much more experience than me doing these sorts of tests.)
lgtm

Definitely better.

https://codereview.chromium.org/68893014/diff/100001/Source/core/html/parser/...
File Source/core/html/parser/HTMLTreeBuilder.cpp (left):

https://codereview.chromium.org/68893014/diff/100001/Source/core/html/parser/...
Source/core/html/parser/HTMLTreeBuilder.cpp:343: , m_contextElement(0)
Oh, I love it.

https://codereview.chromium.org/68893014/diff/100001/Source/core/html/parser/...
File Source/core/html/parser/HTMLTreeBuilder.h (right):

https://codereview.chromium.org/68893014/diff/100001/Source/core/html/parser/...
Source/core/html/parser/HTMLTreeBuilder.h:205: PassRefPtr<HTMLStackItem> contextElementStackItem() const { ASSERT(m_fragment); return m_contextElementStackItem; }
I think this can just be HTMLStackItem*; PassRefPtr is for when you expect the caller to take ownership. In this case, you expect contextElementStackItem() to live exactly as long as HTMLTreeBuilder does, hence the raw ptr.
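A minimal before/after illustration of that ownership point (simplified; roughly what "Get rid of PassRefPtr" in patch set 3 amounts to):

    // Before: PassRefPtr implies the caller takes shared ownership and adds
    // ref-count churn on every call.
    PassRefPtr<HTMLStackItem> contextElementStackItem() const { ASSERT(m_fragment); return m_contextElementStackItem; }

    // After: the item is owned by the tree builder and outlives every caller,
    // so a raw pointer (or a reference) is the idiomatic return type.
    HTMLStackItem* contextElementStackItem() const { ASSERT(m_fragment); return m_contextElementStackItem.get(); }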
Testing procedure sgtm. You invented your own microbenchmark, because you're fixing something very microbenchmarkable -- sounds like a great strategy. I would expect nextToken to be top of the stack, but as much as possible we should get rid of all the mallocs on that stack. Generally when doing this sort of work I count any malloc at 3x its reported cost. 1x for the malloc call itself, 1x for the free and ~1x for the pressure of having the malloc'd memory lying around. Total rough estimate, but helps me remember to focus on removing mallocs first when looking at profiles like this.
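As a back-of-the-envelope illustration of that heuristic against the before-patch long-test profile above: tc_malloc shows 3.95% and tc_free 2.67%, so allocation already accounts for roughly 6.6% of samples directly; adding about another malloc's worth for memory pressure, per the 3x rule of thumb, puts the estimated total cost of allocation somewhere around 10% of the profile.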
CQ is trying da patch. Follow status at https://chromium-status.appspot.com/cq/davve@opera.com/68893014/210001
On 2013/11/14 15:39:41, eseidel wrote:
> lgtm
>
> Definitely better.

Thanks for your help! It has definitely been a learning experience touching innerHTML and the HTML parser.
Message was sent while issue was closed.
Change committed as 162047