Issue 178323002: Simply SVGSVGElement::getElementById method (Closed)

Created:
6 years, 10 months ago by maheshkk

Modified:
6 years, 10 months ago

Reviewers:
pdr., rwlbuis, f(malita), Inactive, Paweł Hajdan Jr.

CC:
blink-reviews, krit, fs, ed+blinkwatch_opera.com, gyuyoung.kim_webkit.org, Stephen Chennney

Base URL:
https://chromium.googlesource.com/chromium/blink.git@master

Visibility:
Public.

Description

Simplify SVGSVGElement::getElementById method

Simplify the SVGSVGElement::getElementById method. By using getAllElementsById, which caches elements, we also avoid traversing the SVG subtree to look for a matching element every time.

Committed: https://src.chromium.org/viewvc/blink?view=rev&revision=167890

Patch Set 1 #

Total comments: 2

Patch Set 2 : using containsMultipleElementsWithId for single tag use case #

Total comments: 3

Patch Set 3 : incorporate review comments #

Total comments: 1

Created: 6 years, 10 months ago

Stats: +13 lines, -11 lines
M Source/core/svg/SVGSVGElement.cpp: 1 chunk, +13 lines, -11 lines, 1 comment

Messages

Total messages: 29 (0 generated)

pdr.

LGTM https://codereview.chromium.org/178323002/diff/1/Source/core/svg/SVGSVGElement.cpp File Source/core/svg/SVGSVGElement.cpp (right): https://codereview.chromium.org/178323002/diff/1/Source/core/svg/SVGSVGElement.cpp#newcode772 Source/core/svg/SVGSVGElement.cpp:772: for (Vector<Element*>::iterator it = elements.begin(); it != elements.end(); ...

6 years, 10 months ago (2014-02-24 18:39:43 UTC) #2

maheshkk

On 2014/02/24 18:39:43, pdr wrote: > LGTM > > https://codereview.chromium.org/178323002/diff/1/Source/core/svg/SVGSVGElement.cpp > File Source/core/svg/SVGSVGElement.cpp (right): > ...

6 years, 10 months ago (2014-02-24 18:46:08 UTC) #3

rwlbuis

Hi Mahesh, On 2014/02/24 18:46:08, maheshkk wrote: > On 2014/02/24 18:39:43, pdr wrote: > > ...

6 years, 10 months ago (2014-02-24 18:57:23 UTC) #4

Inactive

https://codereview.chromium.org/178323002/diff/1/Source/core/svg/SVGSVGElement.cpp File Source/core/svg/SVGSVGElement.cpp (right): https://codereview.chromium.org/178323002/diff/1/Source/core/svg/SVGSVGElement.cpp#newcode770 Source/core/svg/SVGSVGElement.cpp:770: Vector<Element*> elements = treeScope().getAllElementsById(id); This may cause a performance ...

6 years, 10 months ago (2014-02-24 20:12:41 UTC) #5

maheshkk

On 2014/02/24 20:12:41, Chris Dumez wrote: > https://codereview.chromium.org/178323002/diff/1/Source/core/svg/SVGSVGElement.cpp > File Source/core/svg/SVGSVGElement.cpp (right): > > https://codereview.chromium.org/178323002/diff/1/Source/core/svg/SVGSVGElement.cpp#newcode770 ...

6 years, 10 months ago (2014-02-24 22:10:07 UTC) #6

Inactive

https://codereview.chromium.org/178323002/diff/100001/Source/core/svg/SVGSVGElement.cpp File Source/core/svg/SVGSVGElement.cpp (right): https://codereview.chromium.org/178323002/diff/100001/Source/core/svg/SVGSVGElement.cpp#newcode774 Source/core/svg/SVGSVGElement.cpp:774: } else { you need to return 0 before ...

6 years, 10 months ago (2014-02-24 22:14:03 UTC) #7

Inactive

Also please s/Simply/Simplify everywhere in your changelog :)

6 years, 10 months ago (2014-02-24 22:24:09 UTC) #8

maheshkk

On 2014/02/24 22:24:09, Chris Dumez wrote: > Also please s/Simply/Simplify everywhere in your changelog :) ...

6 years, 10 months ago (2014-02-24 22:36:37 UTC) #9

Inactive

LGTM but please make sure pdr is happy with this as well before landing.

6 years, 10 months ago (2014-02-24 22:41:27 UTC) #10

f(malita)

https://codereview.chromium.org/178323002/diff/120001/Source/core/svg/SVGSVGElement.cpp File Source/core/svg/SVGSVGElement.cpp (right): https://codereview.chromium.org/178323002/diff/120001/Source/core/svg/SVGSVGElement.cpp#newcode771 Source/core/svg/SVGSVGElement.cpp:771: Element* element = treeScope().getElementById(id); Isn't this introducing an extra ...

6 years, 10 months ago (2014-02-24 23:47:24 UTC) #11

maheshkk

On 2014/02/24 23:47:24, Florin Malita wrote: > https://codereview.chromium.org/178323002/diff/120001/Source/core/svg/SVGSVGElement.cpp > File Source/core/svg/SVGSVGElement.cpp (right): > > https://codereview.chromium.org/178323002/diff/120001/Source/core/svg/SVGSVGElement.cpp#newcode771 ...

6 years, 10 months ago (2014-02-24 23:54:58 UTC) #12

pdr.

6 years, 10 months ago (2014-02-25 00:05:14 UTC) #13

f(malita)

6 years, 10 months ago (2014-02-25 00:18:15 UTC) #14

On 2014/02/24 23:54:58, maheshkk wrote:
> On 2014/02/24 23:47:24, Florin Malita wrote:
> > https://codereview.chromium.org/178323002/diff/120001/Source/core/svg/SVGSVGE...
> > File Source/core/svg/SVGSVGElement.cpp (right):
> >
> > https://codereview.chromium.org/178323002/diff/120001/Source/core/svg/SVGSVGE...
> > Source/core/svg/SVGSVGElement.cpp:771: Element* element = treeScope().getElementById(id);
> > Isn't this introducing an extra lookup for the common case?
> >
> > (one for containsMultipleElementsWithId() and another one for getElementById())
> >
> > Whereas before, the fast path only took one lookup: getElementById().
>
> Florin, yes it is an extra lookup, However containsMultipleElementsWithId() is
> map lookup and always O(1).

Well, it's constant but not free :) Then there's the key hash cost to consider
(not sure whether we're caching that, maybe we are).

> IMO adding getAllElementsById() much better solution than existing subtree tree
> traversal solution.

It certainly looks better, but note that now we're always checking
isDescendant(), whereas before we didn't need to do that. Since isDescendant()
does an O(log(N)) ancestor crawl, I'm not sure this is really saving much
perf-wise. But it does look better :)

> Let me know if I can improve this solution.

Why do we need a special case for
!containsMultipleElementsWithId()/getElementById() at all? I think we can simply
get all ID hits and iterate searching for a descendant, no?

Element* SVGSVGElement::getElementById(const AtomicString& id) const
{
    // If duplicate IDs are there, return the first descendant of the svg element.
    const Vector<Element*>& elements = treeScope().getAllElementsById(id);
    Vector<Element*>::const_iterator end = elements.end();
    for (Vector<Element*>::const_iterator it = elements.begin(); it != end; ++it) {
        if ((*it)->isDescendantOf(this))
            return *it;
    }

    return 0;
}

Inactive

6 years, 10 months ago (2014-02-25 00:27:48 UTC) #15

On 2014/02/25 00:18:15, Florin Malita wrote:
> On 2014/02/24 23:54:58, maheshkk wrote:
> > On 2014/02/24 23:47:24, Florin Malita wrote:
> > > https://codereview.chromium.org/178323002/diff/120001/Source/core/svg/SVGSVGE...
> > > File Source/core/svg/SVGSVGElement.cpp (right):
> > >
> > > https://codereview.chromium.org/178323002/diff/120001/Source/core/svg/SVGSVGE...
> > > Source/core/svg/SVGSVGElement.cpp:771: Element* element = treeScope().getElementById(id);
> > > Isn't this introducing an extra lookup for the common case?
> > >
> > > (one for containsMultipleElementsWithId() and another one for getElementById())
> > >
> > > Whereas before, the fast path only took one lookup: getElementById().
> >
> > Florin, yes it is an extra lookup, However containsMultipleElementsWithId() is
> > map lookup and always O(1).
>
> Well, it's constant but not free :) Then there's the key hash cost to consider
> (not sure whether we're caching that, maybe we are).
>
> > IMO adding getAllElementsById() much better solution than existing subtree tree
> > traversal solution.
>
> It certainly looks better, but note that now we're always checking
> isDescendant(), whereas before we didn't need to do that. Since isDescendant()
> does an O(log(N)) ancestor crawl, I'm not sure this is really saving much
> perf-wise. But it does look better :)
>
> > Let me know if I can improve this solution.
>
> Why do we need a special case for
> !containsMultipleElementsWithId()/getElementById() at all? I think we can simply
> get all ID hits and iterate searching for a descendant, no?
>
> Element* SVGSVGElement::getElementById(const AtomicString& id) const
> {
>     // If duplicate IDs are there, return the first descendant of the svg element.
>     const Vector<Element*>& elements = treeScope().getAllElementsById(id);
>     Vector<Element*>::const_iterator end = elements.end();
>     for (Vector<Element*>::const_iterator it = elements.begin(); it != end; ++it) {
>         if ((*it)->isDescendantOf(this))
>             return *it;
>     }
>
>     return 0;
> }

Mahesh actually did this in his original patch. I advised against it because I
did not want to take the risk of making the common case (no duplicate id) slower
due to calling a more complex method and needlessly constructing a Vector in
this case.

The approach I proposed (and in the latest version of the CL) is the one I used
in SelectorQuery.cpp. I proposed that one because this particular SelectorQuery
code is well covered by performance tests so we know it performs well.

As you mention, there is still an extra hash lookup in the regular case, but
this did not cause any trouble in the SelectorQuery code. However, using
getAllElementsById() in the multiple id case had a significant performance
impact in SelectorQuery (which is why I introduced getAllElementsById() in the
first place).

We could do something different here than in SelectorQuery but then we better
have this covered by performance tests (not sure we do currently).
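
For readers following the thread, here is a minimal, self-contained sketch of the two-path lookup described above: a cheap duplicate check first, the single-element lookup on the common path, and the full list only when duplicates exist. It is not the Blink code. `Node`, `IdIndex`, and `findInSubtree` are hypothetical stand-ins, and the index is assumed to hold elements in document order.

```cpp
#include <cassert>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for a DOM element; only the parent link matters here.
struct Node {
    Node* parent = nullptr;

    // Walks up the parent chain looking for `ancestor`.
    bool isDescendantOf(const Node* ancestor) const {
        for (const Node* n = parent; n; n = n->parent) {
            if (n == ancestor)
                return true;
        }
        return false;
    }
};

// Hypothetical stand-in for the tree scope's id index.
// Assumption: elements are added in document order.
struct IdIndex {
    std::unordered_map<std::string, std::vector<Node*>> byId;

    void add(const std::string& id, Node* node) { byId[id].push_back(node); }

    bool containsMultipleElementsWithId(const std::string& id) const {
        auto it = byId.find(id);
        return it != byId.end() && it->second.size() > 1;
    }

    Node* getElementById(const std::string& id) const {
        auto it = byId.find(id);
        return (it == byId.end() || it->second.empty()) ? nullptr : it->second.front();
    }

    const std::vector<Node*>& getAllElementsById(const std::string& id) const {
        static const std::vector<Node*> empty;
        auto it = byId.find(id);
        return it == byId.end() ? empty : it->second;
    }
};

// Returns the first element with `id` that is a descendant of `root`, or nullptr.
Node* findInSubtree(const IdIndex& index, const Node* root, const std::string& id) {
    assert(root);
    // Common path: at most one element has this id, so no list is needed.
    if (!index.containsMultipleElementsWithId(id)) {
        Node* node = index.getElementById(id);
        return (node && node->isDescendantOf(root)) ? node : nullptr;
    }
    // Duplicate ids: scan all matches and return the first one under `root`.
    for (Node* node : index.getAllElementsById(id)) {
        if (node->isDescendantOf(root))
            return node;
    }
    return nullptr;
}
```

Note that both paths still pay for the `isDescendantOf` check; the sketch only avoids touching the list of matches when there is no duplicate.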

f(malita)

On 2014/02/25 00:27:48, Chris Dumez wrote: > Mahesh actually did this in his original patch. ...

6 years, 10 months ago (2014-02-25 00:41:26 UTC) #16

f(malita)

On 2014/02/25 00:41:26, Florin Malita wrote: > Something doesn't click for me, help me understand: ...

6 years, 10 months ago (2014-02-25 00:50:33 UTC) #17

Inactive

On 2014/02/25 00:41:26, Florin Malita wrote: > Something doesn't click for me, help me understand: ...

6 years, 10 months ago (2014-02-25 00:58:39 UTC) #18

Inactive

6 years, 10 months ago (2014-02-25 01:02:53 UTC) #19

On 2014/02/25 00:58:39, Chris Dumez wrote:
> On 2014/02/25 00:41:26, Florin Malita wrote:
> > Something doesn't click for me, help me understand: if getAllElementsById() is
> > expensive (presumably on first call for a given ID, when building the cached
> > vector), why isn't containsMultipleElementsWithId() just as expensive? Looking
> > at DocumentOrderedMap::containsMultiple(), it seems it's not really building the
> > vector and only looking at already cached values - but then, to turn the
> > question around, is containsMultipleElementsWithId() really giving the correct
> > answer?
>
> What I am saying is that getAllElementsById() is more costly for the more
> general case (no duplicate id) than getElementById(). There are several reasons
> for this:
> 1. getAllElementsById() needs to construct a Vector (at least the first time it
> is called after cache invalidation)
> 2. getAllElementsById() is a bit more complex than getElementById(), with some
> extra checks that we don't really need if there is no duplicate id
> 3. We need to iterate over the returned Vector instead of dealing with a simple
> Element*. Sure, the for loop will break quite fast but we will still
> unnecessarily evaluate the end-loop condition twice.
>
> The reason containsMultipleElementsWithId() isn't very expensive is that it is
> inlined and it does:
> - A simple hash lookup
> - An integer comparison (count > 1).
>
> containsMultipleElementsWithId() does not cause any tree traversal because the
> "count" of Elements with a given id is always valid. What is lazily created is
> the Vector containing all the Elements with a given id (See
> DocumentOrderedMap.*).

Oh, and I forgot to mention that when DocumentOrderedMap::getElementById() is
called, its implementation does not construct a Vector with one element. We have
an Element pointer for this fast / common use case and we lazily populate the
Vector only when getAllElementsById() is called. This is why the values in the
DocumentOrderedMap hashmap look like:
    struct MapEntry {
        Element* element; // Cache for getElementById
        unsigned count; // Maintained to be valid at all times
        Vector<Element*> orderedList; // Cache for getAllElementsById
    };
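
To illustrate why the count check is cheap while the list is not, here is a small sketch of an entry with an always-valid count and a lazily built list. It is only an approximation of the idea, not DocumentOrderedMap itself. `IdMap`, `rebuilds()`, and the `all_` vector (standing in for a document-order tree walk) are hypothetical, and the sketch assumes elements are added in document order and never removed.

```cpp
#include <cassert>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for a DOM element.
struct Element {
    std::string id;
};

class IdMap {
public:
    void add(Element* e) {
        all_.push_back(e);
        Entry& entry = map_[e->id];
        if (entry.count == 0)
            entry.element = e;        // first element in document order (by assumption)
        ++entry.count;                // kept valid on every mutation
        entry.orderedList.clear();    // invalidate the lazily built list
    }

    // One hash lookup plus an integer comparison; never walks the tree.
    bool containsMultiple(const std::string& id) const {
        auto it = map_.find(id);
        return it != map_.end() && it->second.count > 1;
    }

    // Served from the single-element cache; never builds a list.
    Element* getElementById(const std::string& id) const {
        auto it = map_.find(id);
        return it == map_.end() ? nullptr : it->second.element;
    }

    // Builds the ordered list on first use after invalidation, then reuses it.
    const std::vector<Element*>& getAllElementsById(const std::string& id) {
        static const std::vector<Element*> empty;
        auto it = map_.find(id);
        if (it == map_.end())
            return empty;
        Entry& entry = it->second;
        if (entry.orderedList.empty()) {
            ++rebuilds_;
            for (Element* e : all_) {  // stands in for the tree traversal
                if (e->id == id)
                    entry.orderedList.push_back(e);
            }
        }
        assert(entry.orderedList.size() == entry.count);
        return entry.orderedList;
    }

    unsigned rebuilds() const { return rebuilds_; }

private:
    struct Entry {
        Element* element = nullptr;         // cache for getElementById
        unsigned count = 0;                 // valid at all times
        std::vector<Element*> orderedList;  // cache for getAllElementsById
    };

    std::unordered_map<std::string, Entry> map_;
    std::vector<Element*> all_;
    unsigned rebuilds_ = 0;
};
```

In this sketch, calling `containsMultiple` or `getElementById` leaves `rebuilds()` at zero; only `getAllElementsById` triggers the traversal, and only once until the next mutation.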

f(malita)

On 2014/02/25 01:02:53, Chris Dumez wrote: > Oh, and I forgot to mention that when ...

6 years, 10 months ago (2014-02-25 01:08:42 UTC) #20

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-status.appspot.com/cq/mahesh.kk@samsung.com/178323002/120001

6 years, 10 months ago (2014-02-25 20:53:32 UTC) #22

maheshkk

Thank you all for the review! I will commit this now and will continue experimenting ...

6 years, 10 months ago (2014-02-25 20:53:52 UTC) #23

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-status.appspot.com/cq/mahesh.kk@samsung.com/178323002/120001

6 years, 10 months ago (2014-02-25 23:03:08 UTC) #24

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-status.appspot.com/cq/mahesh.kk@samsung.com/178323002/120001

6 years, 10 months ago (2014-02-25 23:22:44 UTC) #25

Paweł Hajdan Jr.

The CQ bit was unchecked by phajdan.jr@chromium.org

6 years, 10 months ago (2014-02-26 05:45:52 UTC) #26

Paweł Hajdan Jr.

The CQ bit was checked by phajdan.jr@chromium.org

6 years, 10 months ago (2014-02-26 06:01:35 UTC) #27

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-status.appspot.com/cq/mahesh.kk@samsung.com/178323002/120001

6 years, 10 months ago (2014-02-26 06:03:13 UTC) #28

Message was sent while issue was closed.

Change committed as 167890
