Issue 6698020: ichspi.c: lower down the ICH7 chipset CDS timeout from 60s to 1ms.

Issue 6698020: ichspi.c: lower down the ICH7 chipset CDS timeout from 60s to 1ms. (Closed)

Created:
9 years, 9 months ago by Louis

Modified:
9 years, 4 months ago

Reviewers:
kliegs, dhendrix, stefanreinauer, Sean Paul, hailfinger

CC:
chromium-os-reviews_chromium.org

Base URL:
ssh://git@gitrw.chromium.org:9222/flashrom.git@master

Visibility:
Public.

Description

ichspi.c: lower down the ICH7 chipset CDS timeout from 60s to 1ms. The loooong time is not necessary actually. Change to 30 secs for a full chip erase. Also exit early if an error happens. The long time would make flashrom look like hung for forever especially when probing hundreds of flashes chip on ICH7M. Remove this can get failed message less than 1 sec in total. We were good on EEEPC and Lenovo S10-3t because we never launched 2 or more flashrom in boot time (one hung for forever is fine as long as not in boot path). But now, activate_date and dump_vpd_log hit this issue. Change-Id: I5a13f80673b452159c8dcf3fa95ea2d5a4944b9c R=reinauer@chromium.org,dhendrix@chromium.org BUG=chromium-os:13092 TEST=Tested all following combinations: {on Lenovo S10-t3, EEEPC, Alex and Mario} for {BIOS and EC} flashes with {-r / -w} commands. Committed: http://chrome-svn/viewvc/chromeos?view=rev&revision=d2129f1

Patch Set 1 #

Total comments: 2

Patch Set 2 : increase timeout to 30s for full erase. a early exit if error appears. #

Created: 9 years, 9 months ago

Download [raw] [tar.bz2]

		Unified diffs	Side-by-side diffs	Delta from patch set	Stats (+8 lines, -3 lines)			Patch
	M	ichspi.c	View	1	1 chunk	+8 lines, -3 lines	0 comments	Download

Messages

Total messages: 16 (0 generated)

Expand Messages | Collapse Messages

Louis

Hi David and Stefan, Waterloo folks cannot boot up their machines due to this issue, ...

9 years, 9 months ago (2011-03-15 13:07:50 UTC) #2

kliegs

PTAL. This CL is blocking development and needs to be resolved. On 2011/03/15 13:07:50, Louis ...

9 years, 9 months ago (2011-03-15 17:27:18 UTC) #3

dhendrix

On 2011/03/15 17:27:18, kliegs wrote: > PTAL. This CL is blocking development and needs to ...

9 years, 9 months ago (2011-03-15 18:51:56 UTC) #4

kliegs

As the machines are currently in an unusable state should we submit this anyways as ...

9 years, 9 months ago (2011-03-15 19:02:12 UTC) #5

dhendrix

While I am in favor of reducing the delay, I'm very reluctant to let this ...

9 years, 9 months ago (2011-03-15 19:30:38 UTC) #6

dhendrix

On 2011/03/15 19:02:12, kliegs wrote: > As the machines are currently in an unusable state ...

9 years, 9 months ago (2011-03-15 19:33:14 UTC) #7

dhendrix

On 2011/03/15 19:33:14, dhendrix wrote: > On 2011/03/15 19:02:12, kliegs wrote: > > As the ...

9 years, 9 months ago (2011-03-15 19:36:10 UTC) #8

dhendrix

Some erase commands also take a long time. On Alex, I am seeing timeouts when ...

9 years, 9 months ago (2011-03-16 01:07:32 UTC) #9

Louis

Most look reasonable to me. But erase commands doesn't make sense to me because the ...

9 years, 9 months ago (2011-03-16 01:56:31 UTC) #10

dhendrix

LGTM with patch set 2 A short explanation -- Louis and I determined that the ...

9 years, 9 months ago (2011-03-16 04:24:30 UTC) #11

LGTM with patch set 2

A short explanation -- Louis and I determined that the "Blocked Access Status"
bit is also useful for determining if access was successful at all. In kliegs'
case, access to the ROM attached to the SPI controller will fail because it
doesn't exist, and we are assuming that the blocked access status bit will
reflect that.

We tested this out and our assumption seems to be true. So to un-block people,
we're going to go ahead and commit this. We may have to come up with another
different approach if the Intel FAE tells us we're wrong...

On 2011/03/16 01:56:31, Louis wrote:
> Most look reasonable to me. But erase commands doesn't make sense to me
because
> the behavior should be:
> 1. flashrom issues erase command (thru ICH) and ICH returns immediately.
> 2. flashrom issues read status command (thru ICH) and loop until BUSY bit is
> cleared.
> 
> That means the ich7_run_opcode() should not timeout. Also, the flashrom -w
works
> fine on Mario, but -E gets timeout. That looks suspicious to me because they
> should use the same mechanism I assume. 
> 
> Looking at this now.
> 
> On 2011/03/16 01:07:32, dhendrix wrote:
> > Some erase commands also take a long time. On Alex, I am seeing timeouts
when
> it
> > attempts sector erase operations (the problem gets worse with block erase
> > operations of larger size).
> > 
> > I added some print statements to help instrument the time for a few
> instructions
> > on Alex. Here is what I have found so far:
> > 0x9f (Read Device ID): ~10us
> > 0x90 (Read Manufacturer and Device ID): ~10us
> > 0x03 (Byte read): <50us
> > 0x02 (Write byte): ~200us
> > 0x05 (Read status register): ~200us
> > 0x20 ((4KB) sector erase): ~30ms
> > 0xd8 (64KB block erase): ~350ms
> > 0x60 (chip erase): ~10s
> > 
> > 
> > So I think we definitely need a long timeout here to account for the various
> > erase instructions.
> > 
> > The instruction that we care about for the purposes of failing quickly is
> RDID.
> > If we want to get hacky, we can set the timeout to something very short
> > specifically for RDID (and maybe other instructions, like REMS).

Louis

Thanks David. We come out a good idea for this. Please see the new uploaded ...

9 years, 9 months ago (2011-03-16 04:26:04 UTC) #12

dhendrix

On 2011/03/16 04:26:04, Louis wrote: > Thanks David. We come out a good idea for ...

9 years, 9 months ago (2011-03-16 05:50:37 UTC) #13

hailfinger

Two comments: The changelog and the comment in the code are incorrect: The patch actually ...

9 years, 9 months ago (2011-03-16 22:27:05 UTC) #14

Louis

Thanks for pointing out. I will find a chance to fix it. The pending CL ...

9 years, 9 months ago (2011-03-17 09:51:19 UTC) #15

Louis

9 years, 9 months ago (2011-03-17 09:56:07 UTC) #16

Sorry, I send the mail out too fast.

60ms is not enough for some actions, like full chip erase. In theory, the
ICH can complete the erase command in very short time, like hundred of us,
but for erase commands, they take more longer than we expect. David
experimented a chip full erase and took about 10s to get CDS from ICH. This
should be tested before this is committed.

On Thu, Mar 17, 2011 at 5:50 PM, Louis Yung-Chieh Lo <yjlou@chromium.org>wrote:

> Thanks for pointing out. I will find a chance to fix it.
>
> The pending CL looks good to me. We are asking Intel FAE if the FCERR
> behavior consists on all platform, for example, NM10 and ICH7-M.
>
>
> On Thu, Mar 17, 2011 at 6:27 AM, <hailfinger@gmail.com> wrote:
>
>> Two comments:
>>
>> The changelog and the comment in the code are incorrect: The patch
>> actually
>> reduces the delay from 60 s to 30 s.
>>
>> Were you aware of the following pending (since Nov. 2010) patch?
>> http://patchwork.coreboot.org/patch/2356/
>> I will repost an updated version against current flashrom svn trunk and
>> add you
>> to CC for review.
>>
>>
>> http://codereview.chromium.org/6698020/
>>
>
>

Expand Messages | Collapse Messages