Issue 49633004: Add --optimize_for_size to tests flag matrix.

This change requires https://codereview.chromium.org/47743007/ and https://codereview.chromium.org/47023003/ to land before the tests will run successfully with ...

7 years, 1 month ago (2013-10-29 12:35:39 UTC) #1

Sven Panne

NOT LGTM. https://codereview.chromium.org/49633004/diff/1/tools/run-tests.py File tools/run-tests.py (right): https://codereview.chromium.org/49633004/diff/1/tools/run-tests.py#newcode57 tools/run-tests.py:57: ["--optimize-for-size"], I am not sure if this ...

7 years, 1 month ago (2013-10-29 12:43:31 UTC) #2

rmcilroy

https://codereview.chromium.org/49633004/diff/1/tools/run-tests.py File tools/run-tests.py (right): https://codereview.chromium.org/49633004/diff/1/tools/run-tests.py#newcode57 tools/run-tests.py:57: ["--optimize-for-size"], On 2013/10/29 12:43:31, Sven Panne wrote: > I ...

7 years, 1 month ago (2013-10-29 14:02:54 UTC) #3

Sven Panne

On 2013/10/29 14:02:54, rmcilroy wrote: > I realize this will increase test time by about ...

7 years, 1 month ago (2013-10-29 14:18:33 UTC) #4

Michael Achenbach

I agree to Sven's suggestions. I can set up some builders on the waterfall to ...

7 years, 1 month ago (2013-10-29 14:38:02 UTC) #5

rmcilroy

On 2013/10/29 14:38:02, machenbach wrote: > I agree to Sven's suggestions. I can set up ...

7 years, 1 month ago (2013-10-29 15:45:53 UTC) #6

On 2013/10/29 14:38:02, machenbach wrote:
> I agree to Sven's suggestions. I can set up some builders on the waterfall to
> run a special config, once it is ready (just give me a note).
> 
> The "arm" builder that runs on real hardware is still quite fast, so I suggest
> we split it into two, one with and one without the flag (and optionally one
> being a release and the other a debug builder).
> 
> Then I just "randomly" select one builder of each other platform to run with
> that flag on.
> 
> Alternative: When passing in via --extra-flags=--optimize-for-size, every test
> variant will have the optimize-for-size flag set. Another option would be to
> extend the run-tests script with a new optimize-for-size parameter that, only
> when set, adds a variant with just optimize-for-size to the other three
> variants. Then we would have +33% on a few selected builders, while also
keeping
> the old test configurations.
> 
> What way to go depends on if you want the cross product of that flag with the
> other variants or not.

I would be happy with either of these suggestions, I'll chat you tomorrow to
decide on the best approach and work out the fine details of what needs to be
done to get some of the runners running with this flag.

> I know that this is annoying, and of course everybody likes to get good test
> coverage for his own new feature, but it is important to keep test times
> manageable. Recently there was a very visible tendency that people obviously
> didn't run the test suite at all before committing, and we should actively
fight
> that tendency...

This isn't just testing of a feature.  The flag is used in production devices
(low-end Android phones), so I think it is important to have continuous testing
of this since it can interact in unexpected ways with seemingly unrelated
changes in code generation.  I totally agree with you that we should keep the
test runner quick to encourage people to run it before committing changes -
hopefully the suggestions yourself and Michael have made will provide the best
of both worlds.

Thanks.

Jakob Kummerow

Adding new slave definitions increases cycle time quite considerably when two or more slaves have ...

7 years, 1 month ago (2013-10-29 15:52:31 UTC) #7

rmcilroy

7 years, 1 month ago (2013-10-29 16:17:01 UTC) #8

On 2013/10/29 15:52:31, Jakob wrote:
> Adding new slave definitions increases cycle time quite considerably when two
or
> more slaves have to share a VM. Instead, I'd suggest we model this as an
> additional step that can be added to a few bots.

SGTM.

> Ross, any input on which test suites are useful to run with the flag? mjsunit
> probably, maybe test262, mozilla, or benchmarks. cctests, messages, and
> preparser probably don't make much sense.

Actually it was cctests which found the previous two issues (bad interactions
when SetFunctionEntryHook was non-null, and when the debugger was active), so I
think it would make sense to include that.  I'm not sure on the other tests, but
I agree messages and preparser are probably less interesting for this step.

Expand Messages | Collapse Messages