third_party/google/protobuf/internal/decoder.py - Issue 1153333003: Added tools to retrieve CQ builders from a CQ config

Side by Side Diff: third_party/google/protobuf/internal/decoder.py

Issue 1153333003: Added tools to retrieve CQ builders from a CQ config (Closed) Base URL: https://chromium.googlesource.com/chromium/tools/depot_tools.git@master

Patch Set: Addressed comments Created 5 years, 6 months ago

Use n/p to move between diff chunks; N/P to move between comments. Draft comments are only viewable by you.

Jump to:

View unified diff | Download patch

OLD	NEW
(Empty)
	1 # Protocol Buffers - Google's data interchange format

	2 # Copyright 2008 Google Inc. All rights reserved.

	3 # http://code.google.com/p/protobuf/

	4 #

	5 # Redistribution and use in source and binary forms, with or without

	6 # modification, are permitted provided that the following conditions are

	7 # met:

	8 #

	9 # * Redistributions of source code must retain the above copyright

	10 # notice, this list of conditions and the following disclaimer.

	11 # * Redistributions in binary form must reproduce the above

	12 # copyright notice, this list of conditions and the following disclaimer

	13 # in the documentation and/or other materials provided with the

	14 # distribution.

	15 # * Neither the name of Google Inc. nor the names of its

	16 # contributors may be used to endorse or promote products derived from

	17 # this software without specific prior written permission.

	18 #

	19 # THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS

	20 # "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT

	21 # LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR

	22 # A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT

	23 # OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,

	24 # SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT

	25 # LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE,

	26 # DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY

	27 # THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT

	28 # (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE

	29 # OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

	30

	31 #PY25 compatible for GAE.

	32 #

	33 # Copyright 2009 Google Inc. All Rights Reserved.

	34

	35 """Code for decoding protocol buffer primitives.

	36

	37 This code is very similar to encoder.py -- read the docs for that module first.

	38

	39 A "decoder" is a function with the signature:

	40 Decode(buffer, pos, end, message, field_dict)

	41 The arguments are:

	42 buffer: The string containing the encoded message.

	43 pos: The current position in the string.

	44 end: The position in the string where the current message ends. May be

	45 less than len(buffer) if we're reading a sub-message.

	46 message: The message object into which we're parsing.

	47 field_dict: message._fields (avoids a hashtable lookup).

	48 The decoder reads the field and stores it into field_dict, returning the new

	49 buffer position. A decoder for a repeated field may proactively decode all of

	50 the elements of that field, if they appear consecutively.

	51

	52 Note that decoders may throw any of the following:

	53 IndexError: Indicates a truncated message.

	54 struct.error: Unpacking of a fixed-width field failed.

	55 message.DecodeError: Other errors.

	56

	57 Decoders are expected to raise an exception if they are called with pos > end.

	58 This allows callers to be lax about bounds checking: it's fineto read past

	59 "end" as long as you are sure that someone else will notice and throw an

	60 exception later on.

	61

	62 Something up the call stack is expected to catch IndexError and struct.error

	63 and convert them to message.DecodeError.

	64

	65 Decoders are constructed using decoder constructors with the signature:

	66 MakeDecoder(field_number, is_repeated, is_packed, key, new_default)

	67 The arguments are:

	68 field_number: The field number of the field we want to decode.

	69 is_repeated: Is the field a repeated field? (bool)

	70 is_packed: Is the field a packed field? (bool)

	71 key: The key to use when looking up the field within field_dict.

	72 (This is actually the FieldDescriptor but nothing in this

	73 file should depend on that.)

	74 new_default: A function which takes a message object as a parameter and

	75 returns a new instance of the default value for this field.

	76 (This is called for repeated fields and sub-messages, when an

	77 instance does not already exist.)

	78

	79 As with encoders, we define a decoder constructor for every type of field.

	80 Then, for every field of every message class we construct an actual decoder.

	81 That decoder goes into a dict indexed by tag, so when we decode a message

	82 we repeatedly read a tag, look up the corresponding decoder, and invoke it.

	83 """

	84

	85 __author__ = 'kenton@google.com (Kenton Varda)'

	86

	87 import struct

	88 import sys ##PY25

	89 _PY2 = sys.version_info[0] < 3 ##PY25

	90 from google.protobuf.internal import encoder

	91 from google.protobuf.internal import wire_format

	92 from google.protobuf import message

	93

	94

	95 # This will overflow and thus become IEEE-754 "infinity". We would use

	96 # "float('inf')" but it doesn't work on Windows pre-Python-2.6.

	97 _POS_INF = 1e10000

	98 _NEG_INF = -_POS_INF

	99 _NAN = _POS_INF * 0

	100

	101

	102 # This is not for optimization, but rather to avoid conflicts with local

	103 # variables named "message".

	104 _DecodeError = message.DecodeError

	105

	106

	107 def _VarintDecoder(mask, result_type):

	108 """Return an encoder for a basic varint value (does not include tag).

	109

	110 Decoded values will be bitwise-anded with the given mask before being

	111 returned, e.g. to limit them to 32 bits. The returned decoder does not

	112 take the usual "end" parameter -- the caller is expected to do bounds checking

	113 after the fact (often the caller can defer such checking until later). The

	114 decoder returns a (value, new_pos) pair.

	115 """

	116

	117 local_ord = ord

	118 py2 = _PY2 ##PY25

	119 ##!PY25 py2 = str is bytes

	120 def DecodeVarint(buffer, pos):

	121 result = 0

	122 shift = 0

	123 while 1:

	124 b = local_ord(buffer[pos]) if py2 else buffer[pos]

	125 result \|= ((b & 0x7f) << shift)

	126 pos += 1

	127 if not (b & 0x80):

	128 result &= mask

	129 result = result_type(result)

	130 return (result, pos)

	131 shift += 7

	132 if shift >= 64:

	133 raise _DecodeError('Too many bytes when decoding varint.')

	134 return DecodeVarint

	135

	136

	137 def _SignedVarintDecoder(mask, result_type):

	138 """Like _VarintDecoder() but decodes signed values."""

	139

	140 local_ord = ord

	141 py2 = _PY2 ##PY25

	142 ##!PY25 py2 = str is bytes

	143 def DecodeVarint(buffer, pos):

	144 result = 0

	145 shift = 0

	146 while 1:

	147 b = local_ord(buffer[pos]) if py2 else buffer[pos]

	148 result \|= ((b & 0x7f) << shift)

	149 pos += 1

	150 if not (b & 0x80):

	151 if result > 0x7fffffffffffffff:

	152 result -= (1 << 64)

	153 result \|= ~mask

	154 else:

	155 result &= mask

	156 result = result_type(result)

	157 return (result, pos)

	158 shift += 7

	159 if shift >= 64:

	160 raise _DecodeError('Too many bytes when decoding varint.')

	161 return DecodeVarint

	162

	163 # We force 32-bit values to int and 64-bit values to long to make

	164 # alternate implementations where the distinction is more significant

	165 # (e.g. the C++ implementation) simpler.

	166

	167 _DecodeVarint = _VarintDecoder((1 << 64) - 1, long)

	168 _DecodeSignedVarint = _SignedVarintDecoder((1 << 64) - 1, long)

	169

	170 # Use these versions for values which must be limited to 32 bits.

	171 _DecodeVarint32 = _VarintDecoder((1 << 32) - 1, int)

	172 _DecodeSignedVarint32 = _SignedVarintDecoder((1 << 32) - 1, int)

	173

	174

	175 def ReadTag(buffer, pos):

	176 """Read a tag from the buffer, and return a (tag_bytes, new_pos) tuple.

	177

	178 We return the raw bytes of the tag rather than decoding them. The raw

	179 bytes can then be used to look up the proper decoder. This effectively allows

	180 us to trade some work that would be done in pure-python (decoding a varint)

	181 for work that is done in C (searching for a byte string in a hash table).

	182 In a low-level language it would be much cheaper to decode the varint and

	183 use that, but not in Python.

	184 """

	185

	186 py2 = _PY2 ##PY25

	187 ##!PY25 py2 = str is bytes

	188 start = pos

	189 while (ord(buffer[pos]) if py2 else buffer[pos]) & 0x80:

	190 pos += 1

	191 pos += 1

	192 return (buffer[start:pos], pos)

	193

	194

	195 # --------------------------------------------------------------------

	196

	197

	198 def _SimpleDecoder(wire_type, decode_value):

	199 """Return a constructor for a decoder for fields of a particular type.

	200

	201 Args:

	202 wire_type: The field's wire type.

	203 decode_value: A function which decodes an individual value, e.g.

	204 _DecodeVarint()

	205 """

	206

	207 def SpecificDecoder(field_number, is_repeated, is_packed, key, new_default):

	208 if is_packed:

	209 local_DecodeVarint = _DecodeVarint

	210 def DecodePackedField(buffer, pos, end, message, field_dict):

	211 value = field_dict.get(key)

	212 if value is None:

	213 value = field_dict.setdefault(key, new_default(message))

	214 (endpoint, pos) = local_DecodeVarint(buffer, pos)

	215 endpoint += pos

	216 if endpoint > end:

	217 raise _DecodeError('Truncated message.')

	218 while pos < endpoint:

	219 (element, pos) = decode_value(buffer, pos)

	220 value.append(element)

	221 if pos > endpoint:

	222 del value[-1] # Discard corrupt value.

	223 raise _DecodeError('Packed element was truncated.')

	224 return pos

	225 return DecodePackedField

	226 elif is_repeated:

	227 tag_bytes = encoder.TagBytes(field_number, wire_type)

	228 tag_len = len(tag_bytes)

	229 def DecodeRepeatedField(buffer, pos, end, message, field_dict):

	230 value = field_dict.get(key)

	231 if value is None:

	232 value = field_dict.setdefault(key, new_default(message))

	233 while 1:

	234 (element, new_pos) = decode_value(buffer, pos)

	235 value.append(element)

	236 # Predict that the next tag is another copy of the same repeated

	237 # field.

	238 pos = new_pos + tag_len

	239 if buffer[new_pos:pos] != tag_bytes or new_pos >= end:

	240 # Prediction failed. Return.

	241 if new_pos > end:

	242 raise _DecodeError('Truncated message.')

	243 return new_pos

	244 return DecodeRepeatedField

	245 else:

	246 def DecodeField(buffer, pos, end, message, field_dict):

	247 (field_dict[key], pos) = decode_value(buffer, pos)

	248 if pos > end:

	249 del field_dict[key] # Discard corrupt value.

	250 raise _DecodeError('Truncated message.')

	251 return pos

	252 return DecodeField

	253

	254 return SpecificDecoder

	255

	256

	257 def _ModifiedDecoder(wire_type, decode_value, modify_value):

	258 """Like SimpleDecoder but additionally invokes modify_value on every value

	259 before storing it. Usually modify_value is ZigZagDecode.

	260 """

	261

	262 # Reusing _SimpleDecoder is slightly slower than copying a bunch of code, but

	263 # not enough to make a significant difference.

	264

	265 def InnerDecode(buffer, pos):

	266 (result, new_pos) = decode_value(buffer, pos)

	267 return (modify_value(result), new_pos)

	268 return _SimpleDecoder(wire_type, InnerDecode)

	269

	270

	271 def _StructPackDecoder(wire_type, format):

	272 """Return a constructor for a decoder for a fixed-width field.

	273

	274 Args:

	275 wire_type: The field's wire type.

	276 format: The format string to pass to struct.unpack().

	277 """

	278

	279 value_size = struct.calcsize(format)

	280 local_unpack = struct.unpack

	281

	282 # Reusing _SimpleDecoder is slightly slower than copying a bunch of code, but

	283 # not enough to make a significant difference.

	284

	285 # Note that we expect someone up-stack to catch struct.error and convert

	286 # it to _DecodeError -- this way we don't have to set up exception-

	287 # handling blocks every time we parse one value.

	288

	289 def InnerDecode(buffer, pos):

	290 new_pos = pos + value_size

	291 result = local_unpack(format, buffer[pos:new_pos])[0]

	292 return (result, new_pos)

	293 return _SimpleDecoder(wire_type, InnerDecode)

	294

	295

	296 def _FloatDecoder():

	297 """Returns a decoder for a float field.

	298

	299 This code works around a bug in struct.unpack for non-finite 32-bit

	300 floating-point values.

	301 """

	302

	303 local_unpack = struct.unpack

	304 b = (lambda x:x) if _PY2 else lambda x:x.encode('latin1') ##PY25

	305

	306 def InnerDecode(buffer, pos):

	307 # We expect a 32-bit value in little-endian byte order. Bit 1 is the sign

	308 # bit, bits 2-9 represent the exponent, and bits 10-32 are the significand.

	309 new_pos = pos + 4

	310 float_bytes = buffer[pos:new_pos]

	311

	312 # If this value has all its exponent bits set, then it's non-finite.

	313 # In Python 2.4, struct.unpack will convert it to a finite 64-bit value.

	314 # To avoid that, we parse it specially.

	315 if ((float_bytes[3:4] in b('\x7F\xFF')) ##PY25

	316 ##!PY25 if ((float_bytes[3:4] in b'\x7F\xFF')

	317 and (float_bytes[2:3] >= b('\x80'))): ##PY25

	318 ##!PY25 and (float_bytes[2:3] >= b'\x80')):

	319 # If at least one significand bit is set...

	320 if float_bytes[0:3] != b('\x00\x00\x80'): ##PY25

	321 ##!PY25 if float_bytes[0:3] != b'\x00\x00\x80':

	322 return (_NAN, new_pos)

	323 # If sign bit is set...

	324 if float_bytes[3:4] == b('\xFF'): ##PY25

	325 ##!PY25 if float_bytes[3:4] == b'\xFF':

	326 return (_NEG_INF, new_pos)

	327 return (_POS_INF, new_pos)

	328

	329 # Note that we expect someone up-stack to catch struct.error and convert

	330 # it to _DecodeError -- this way we don't have to set up exception-

	331 # handling blocks every time we parse one value.

	332 result = local_unpack('<f', float_bytes)[0]

	333 return (result, new_pos)

	334 return _SimpleDecoder(wire_format.WIRETYPE_FIXED32, InnerDecode)

	335

	336

	337 def _DoubleDecoder():

	338 """Returns a decoder for a double field.

	339

	340 This code works around a bug in struct.unpack for not-a-number.

	341 """

	342

	343 local_unpack = struct.unpack

	344 b = (lambda x:x) if _PY2 else lambda x:x.encode('latin1') ##PY25

	345

	346 def InnerDecode(buffer, pos):

	347 # We expect a 64-bit value in little-endian byte order. Bit 1 is the sign

	348 # bit, bits 2-12 represent the exponent, and bits 13-64 are the significand.

	349 new_pos = pos + 8

	350 double_bytes = buffer[pos:new_pos]

	351

	352 # If this value has all its exponent bits set and at least one significand

	353 # bit set, it's not a number. In Python 2.4, struct.unpack will treat it

	354 # as inf or -inf. To avoid that, we treat it specially.

	355 ##!PY25 if ((double_bytes[7:8] in b'\x7F\xFF')

	356 ##!PY25 and (double_bytes[6:7] >= b'\xF0')

	357 ##!PY25 and (double_bytes[0:7] != b'\x00\x00\x00\x00\x00\x00\xF0')):

	358 if ((double_bytes[7:8] in b('\x7F\xFF')) ##PY25

	359 and (double_bytes[6:7] >= b('\xF0')) ##PY25

	360 and (double_bytes[0:7] != b('\x00\x00\x00\x00\x00\x00\xF0'))): ##PY25

	361 return (_NAN, new_pos)

	362

	363 # Note that we expect someone up-stack to catch struct.error and convert

	364 # it to _DecodeError -- this way we don't have to set up exception-

	365 # handling blocks every time we parse one value.

	366 result = local_unpack('<d', double_bytes)[0]

	367 return (result, new_pos)

	368 return _SimpleDecoder(wire_format.WIRETYPE_FIXED64, InnerDecode)

	369

	370

	371 def EnumDecoder(field_number, is_repeated, is_packed, key, new_default):

	372 enum_type = key.enum_type

	373 if is_packed:

	374 local_DecodeVarint = _DecodeVarint

	375 def DecodePackedField(buffer, pos, end, message, field_dict):

	376 value = field_dict.get(key)

	377 if value is None:

	378 value = field_dict.setdefault(key, new_default(message))

	379 (endpoint, pos) = local_DecodeVarint(buffer, pos)

	380 endpoint += pos

	381 if endpoint > end:

	382 raise _DecodeError('Truncated message.')

	383 while pos < endpoint:

	384 value_start_pos = pos

	385 (element, pos) = _DecodeSignedVarint32(buffer, pos)

	386 if element in enum_type.values_by_number:

	387 value.append(element)

	388 else:

	389 if not message._unknown_fields:

	390 message._unknown_fields = []

	391 tag_bytes = encoder.TagBytes(field_number,

	392 wire_format.WIRETYPE_VARINT)

	393 message._unknown_fields.append(

	394 (tag_bytes, buffer[value_start_pos:pos]))

	395 if pos > endpoint:

	396 if element in enum_type.values_by_number:

	397 del value[-1] # Discard corrupt value.

	398 else:

	399 del message._unknown_fields[-1]

	400 raise _DecodeError('Packed element was truncated.')

	401 return pos

	402 return DecodePackedField

	403 elif is_repeated:

	404 tag_bytes = encoder.TagBytes(field_number, wire_format.WIRETYPE_VARINT)

	405 tag_len = len(tag_bytes)

	406 def DecodeRepeatedField(buffer, pos, end, message, field_dict):

	407 value = field_dict.get(key)

	408 if value is None:

	409 value = field_dict.setdefault(key, new_default(message))

	410 while 1:

	411 (element, new_pos) = _DecodeSignedVarint32(buffer, pos)

	412 if element in enum_type.values_by_number:

	413 value.append(element)

	414 else:

	415 if not message._unknown_fields:

	416 message._unknown_fields = []

	417 message._unknown_fields.append(

	418 (tag_bytes, buffer[pos:new_pos]))

	419 # Predict that the next tag is another copy of the same repeated

	420 # field.

	421 pos = new_pos + tag_len

	422 if buffer[new_pos:pos] != tag_bytes or new_pos >= end:

	423 # Prediction failed. Return.

	424 if new_pos > end:

	425 raise _DecodeError('Truncated message.')

	426 return new_pos

	427 return DecodeRepeatedField

	428 else:

	429 def DecodeField(buffer, pos, end, message, field_dict):

	430 value_start_pos = pos

	431 (enum_value, pos) = _DecodeSignedVarint32(buffer, pos)

	432 if pos > end:

	433 raise _DecodeError('Truncated message.')

	434 if enum_value in enum_type.values_by_number:

	435 field_dict[key] = enum_value

	436 else:

	437 if not message._unknown_fields:

	438 message._unknown_fields = []

	439 tag_bytes = encoder.TagBytes(field_number,

	440 wire_format.WIRETYPE_VARINT)

	441 message._unknown_fields.append(

	442 (tag_bytes, buffer[value_start_pos:pos]))

	443 return pos

	444 return DecodeField

	445

	446

	447 # --------------------------------------------------------------------

	448

	449

	450 Int32Decoder = _SimpleDecoder(

	451 wire_format.WIRETYPE_VARINT, _DecodeSignedVarint32)

	452

	453 Int64Decoder = _SimpleDecoder(

	454 wire_format.WIRETYPE_VARINT, _DecodeSignedVarint)

	455

	456 UInt32Decoder = _SimpleDecoder(wire_format.WIRETYPE_VARINT, _DecodeVarint32)

	457 UInt64Decoder = _SimpleDecoder(wire_format.WIRETYPE_VARINT, _DecodeVarint)

	458

	459 SInt32Decoder = _ModifiedDecoder(

	460 wire_format.WIRETYPE_VARINT, _DecodeVarint32, wire_format.ZigZagDecode)

	461 SInt64Decoder = _ModifiedDecoder(

	462 wire_format.WIRETYPE_VARINT, _DecodeVarint, wire_format.ZigZagDecode)

	463

	464 # Note that Python conveniently guarantees that when using the '<' prefix on

	465 # formats, they will also have the same size across all platforms (as opposed

	466 # to without the prefix, where their sizes depend on the C compiler's basic

	467 # type sizes).

	468 Fixed32Decoder = _StructPackDecoder(wire_format.WIRETYPE_FIXED32, '<I')

	469 Fixed64Decoder = _StructPackDecoder(wire_format.WIRETYPE_FIXED64, '<Q')

	470 SFixed32Decoder = _StructPackDecoder(wire_format.WIRETYPE_FIXED32, '<i')

	471 SFixed64Decoder = _StructPackDecoder(wire_format.WIRETYPE_FIXED64, '<q')

	472 FloatDecoder = _FloatDecoder()

	473 DoubleDecoder = _DoubleDecoder()

	474

	475 BoolDecoder = _ModifiedDecoder(

	476 wire_format.WIRETYPE_VARINT, _DecodeVarint, bool)

	477

	478

	479 def StringDecoder(field_number, is_repeated, is_packed, key, new_default):

	480 """Returns a decoder for a string field."""

	481

	482 local_DecodeVarint = _DecodeVarint

	483 local_unicode = unicode

	484

	485 def _ConvertToUnicode(byte_str):

	486 try:

	487 return local_unicode(byte_str, 'utf-8')

	488 except UnicodeDecodeError, e:

	489 # add more information to the error message and re-raise it.

	490 e.reason = '%s in field: %s' % (e, key.full_name)

	491 raise

	492

	493 assert not is_packed

	494 if is_repeated:

	495 tag_bytes = encoder.TagBytes(field_number,

	496 wire_format.WIRETYPE_LENGTH_DELIMITED)

	497 tag_len = len(tag_bytes)

	498 def DecodeRepeatedField(buffer, pos, end, message, field_dict):

	499 value = field_dict.get(key)

	500 if value is None:

	501 value = field_dict.setdefault(key, new_default(message))

	502 while 1:

	503 (size, pos) = local_DecodeVarint(buffer, pos)

	504 new_pos = pos + size

	505 if new_pos > end:

	506 raise _DecodeError('Truncated string.')

	507 value.append(_ConvertToUnicode(buffer[pos:new_pos]))

	508 # Predict that the next tag is another copy of the same repeated field.

	509 pos = new_pos + tag_len

	510 if buffer[new_pos:pos] != tag_bytes or new_pos == end:

	511 # Prediction failed. Return.

	512 return new_pos

	513 return DecodeRepeatedField

	514 else:

	515 def DecodeField(buffer, pos, end, message, field_dict):

	516 (size, pos) = local_DecodeVarint(buffer, pos)

	517 new_pos = pos + size

	518 if new_pos > end:

	519 raise _DecodeError('Truncated string.')

	520 field_dict[key] = _ConvertToUnicode(buffer[pos:new_pos])

	521 return new_pos

	522 return DecodeField

	523

	524

	525 def BytesDecoder(field_number, is_repeated, is_packed, key, new_default):

	526 """Returns a decoder for a bytes field."""

	527

	528 local_DecodeVarint = _DecodeVarint

	529

	530 assert not is_packed

	531 if is_repeated:

	532 tag_bytes = encoder.TagBytes(field_number,

	533 wire_format.WIRETYPE_LENGTH_DELIMITED)

	534 tag_len = len(tag_bytes)

	535 def DecodeRepeatedField(buffer, pos, end, message, field_dict):

	536 value = field_dict.get(key)

	537 if value is None:

	538 value = field_dict.setdefault(key, new_default(message))

	539 while 1:

	540 (size, pos) = local_DecodeVarint(buffer, pos)

	541 new_pos = pos + size

	542 if new_pos > end:

	543 raise _DecodeError('Truncated string.')

	544 value.append(buffer[pos:new_pos])

	545 # Predict that the next tag is another copy of the same repeated field.

	546 pos = new_pos + tag_len

	547 if buffer[new_pos:pos] != tag_bytes or new_pos == end:

	548 # Prediction failed. Return.

	549 return new_pos

	550 return DecodeRepeatedField

	551 else:

	552 def DecodeField(buffer, pos, end, message, field_dict):

	553 (size, pos) = local_DecodeVarint(buffer, pos)

	554 new_pos = pos + size

	555 if new_pos > end:

	556 raise _DecodeError('Truncated string.')

	557 field_dict[key] = buffer[pos:new_pos]

	558 return new_pos

	559 return DecodeField

	560

	561

	562 def GroupDecoder(field_number, is_repeated, is_packed, key, new_default):

	563 """Returns a decoder for a group field."""

	564

	565 end_tag_bytes = encoder.TagBytes(field_number,

	566 wire_format.WIRETYPE_END_GROUP)

	567 end_tag_len = len(end_tag_bytes)

	568

	569 assert not is_packed

	570 if is_repeated:

	571 tag_bytes = encoder.TagBytes(field_number,

	572 wire_format.WIRETYPE_START_GROUP)

	573 tag_len = len(tag_bytes)

	574 def DecodeRepeatedField(buffer, pos, end, message, field_dict):

	575 value = field_dict.get(key)

	576 if value is None:

	577 value = field_dict.setdefault(key, new_default(message))

	578 while 1:

	579 value = field_dict.get(key)

	580 if value is None:

	581 value = field_dict.setdefault(key, new_default(message))

	582 # Read sub-message.

	583 pos = value.add()._InternalParse(buffer, pos, end)

	584 # Read end tag.

	585 new_pos = pos+end_tag_len

	586 if buffer[pos:new_pos] != end_tag_bytes or new_pos > end:

	587 raise _DecodeError('Missing group end tag.')

	588 # Predict that the next tag is another copy of the same repeated field.

	589 pos = new_pos + tag_len

	590 if buffer[new_pos:pos] != tag_bytes or new_pos == end:

	591 # Prediction failed. Return.

	592 return new_pos

	593 return DecodeRepeatedField

	594 else:

	595 def DecodeField(buffer, pos, end, message, field_dict):

	596 value = field_dict.get(key)

	597 if value is None:

	598 value = field_dict.setdefault(key, new_default(message))

	599 # Read sub-message.

	600 pos = value._InternalParse(buffer, pos, end)

	601 # Read end tag.

	602 new_pos = pos+end_tag_len

	603 if buffer[pos:new_pos] != end_tag_bytes or new_pos > end:

	604 raise _DecodeError('Missing group end tag.')

	605 return new_pos

	606 return DecodeField

	607

	608

	609 def MessageDecoder(field_number, is_repeated, is_packed, key, new_default):

	610 """Returns a decoder for a message field."""

	611

	612 local_DecodeVarint = _DecodeVarint

	613

	614 assert not is_packed

	615 if is_repeated:

	616 tag_bytes = encoder.TagBytes(field_number,

	617 wire_format.WIRETYPE_LENGTH_DELIMITED)

	618 tag_len = len(tag_bytes)

	619 def DecodeRepeatedField(buffer, pos, end, message, field_dict):

	620 value = field_dict.get(key)

	621 if value is None:

	622 value = field_dict.setdefault(key, new_default(message))

	623 while 1:

	624 value = field_dict.get(key)

	625 if value is None:

	626 value = field_dict.setdefault(key, new_default(message))

	627 # Read length.

	628 (size, pos) = local_DecodeVarint(buffer, pos)

	629 new_pos = pos + size

	630 if new_pos > end:

	631 raise _DecodeError('Truncated message.')

	632 # Read sub-message.

	633 if value.add()._InternalParse(buffer, pos, new_pos) != new_pos:

	634 # The only reason _InternalParse would return early is if it

	635 # encountered an end-group tag.

	636 raise _DecodeError('Unexpected end-group tag.')

	637 # Predict that the next tag is another copy of the same repeated field.

	638 pos = new_pos + tag_len

	639 if buffer[new_pos:pos] != tag_bytes or new_pos == end:

	640 # Prediction failed. Return.

	641 return new_pos

	642 return DecodeRepeatedField

	643 else:

	644 def DecodeField(buffer, pos, end, message, field_dict):

	645 value = field_dict.get(key)

	646 if value is None:

	647 value = field_dict.setdefault(key, new_default(message))

	648 # Read length.

	649 (size, pos) = local_DecodeVarint(buffer, pos)

	650 new_pos = pos + size

	651 if new_pos > end:

	652 raise _DecodeError('Truncated message.')

	653 # Read sub-message.

	654 if value._InternalParse(buffer, pos, new_pos) != new_pos:

	655 # The only reason _InternalParse would return early is if it encountered

	656 # an end-group tag.

	657 raise _DecodeError('Unexpected end-group tag.')

	658 return new_pos

	659 return DecodeField

	660

	661

	662 # --------------------------------------------------------------------

	663

	664 MESSAGE_SET_ITEM_TAG = encoder.TagBytes(1, wire_format.WIRETYPE_START_GROUP)

	665

	666 def MessageSetItemDecoder(extensions_by_number):

	667 """Returns a decoder for a MessageSet item.

	668

	669 The parameter is the _extensions_by_number map for the message class.

	670

	671 The message set message looks like this:

	672 message MessageSet {

	673 repeated group Item = 1 {

	674 required int32 type_id = 2;

	675 required string message = 3;

	676 }

	677 }

	678 """

	679

	680 type_id_tag_bytes = encoder.TagBytes(2, wire_format.WIRETYPE_VARINT)

	681 message_tag_bytes = encoder.TagBytes(3, wire_format.WIRETYPE_LENGTH_DELIMITED)

	682 item_end_tag_bytes = encoder.TagBytes(1, wire_format.WIRETYPE_END_GROUP)

	683

	684 local_ReadTag = ReadTag

	685 local_DecodeVarint = _DecodeVarint

	686 local_SkipField = SkipField

	687

	688 def DecodeItem(buffer, pos, end, message, field_dict):

	689 message_set_item_start = pos

	690 type_id = -1

	691 message_start = -1

	692 message_end = -1

	693

	694 # Technically, type_id and message can appear in any order, so we need

	695 # a little loop here.

	696 while 1:

	697 (tag_bytes, pos) = local_ReadTag(buffer, pos)

	698 if tag_bytes == type_id_tag_bytes:

	699 (type_id, pos) = local_DecodeVarint(buffer, pos)

	700 elif tag_bytes == message_tag_bytes:

	701 (size, message_start) = local_DecodeVarint(buffer, pos)

	702 pos = message_end = message_start + size

	703 elif tag_bytes == item_end_tag_bytes:

	704 break

	705 else:

	706 pos = SkipField(buffer, pos, end, tag_bytes)

	707 if pos == -1:

	708 raise _DecodeError('Missing group end tag.')

	709

	710 if pos > end:

	711 raise _DecodeError('Truncated message.')

	712

	713 if type_id == -1:

	714 raise _DecodeError('MessageSet item missing type_id.')

	715 if message_start == -1:

	716 raise _DecodeError('MessageSet item missing message.')

	717

	718 extension = extensions_by_number.get(type_id)

	719 if extension is not None:

	720 value = field_dict.get(extension)

	721 if value is None:

	722 value = field_dict.setdefault(

	723 extension, extension.message_type._concrete_class())

	724 if value._InternalParse(buffer, message_start,message_end) != message_end:

	725 # The only reason _InternalParse would return early is if it encountered

	726 # an end-group tag.

	727 raise _DecodeError('Unexpected end-group tag.')

	728 else:

	729 if not message._unknown_fields:

	730 message._unknown_fields = []

	731 message._unknown_fields.append((MESSAGE_SET_ITEM_TAG,

	732 buffer[message_set_item_start:pos]))

	733

	734 return pos

	735

	736 return DecodeItem

	737

	738 # --------------------------------------------------------------------

	739 # Optimization is not as heavy here because calls to SkipField() are rare,

	740 # except for handling end-group tags.

	741

	742 def _SkipVarint(buffer, pos, end):

	743 """Skip a varint value. Returns the new position."""

	744 # Previously ord(buffer[pos]) raised IndexError when pos is out of range.

	745 # With this code, ord(b'') raises TypeError. Both are handled in

	746 # python_message.py to generate a 'Truncated message' error.

	747 while ord(buffer[pos:pos+1]) & 0x80:

	748 pos += 1

	749 pos += 1

	750 if pos > end:

	751 raise _DecodeError('Truncated message.')

	752 return pos

	753

	754 def _SkipFixed64(buffer, pos, end):

	755 """Skip a fixed64 value. Returns the new position."""

	756

	757 pos += 8

	758 if pos > end:

	759 raise _DecodeError('Truncated message.')

	760 return pos

	761

	762 def _SkipLengthDelimited(buffer, pos, end):

	763 """Skip a length-delimited value. Returns the new position."""

	764

	765 (size, pos) = _DecodeVarint(buffer, pos)

	766 pos += size

	767 if pos > end:

	768 raise _DecodeError('Truncated message.')

	769 return pos

	770

	771 def _SkipGroup(buffer, pos, end):

	772 """Skip sub-group. Returns the new position."""

	773

	774 while 1:

	775 (tag_bytes, pos) = ReadTag(buffer, pos)

	776 new_pos = SkipField(buffer, pos, end, tag_bytes)

	777 if new_pos == -1:

	778 return pos

	779 pos = new_pos

	780

	781 def _EndGroup(buffer, pos, end):

	782 """Skipping an END_GROUP tag returns -1 to tell the parent loop to break."""

	783

	784 return -1

	785

	786 def _SkipFixed32(buffer, pos, end):

	787 """Skip a fixed32 value. Returns the new position."""

	788

	789 pos += 4

	790 if pos > end:

	791 raise _DecodeError('Truncated message.')

	792 return pos

	793

	794 def _RaiseInvalidWireType(buffer, pos, end):

	795 """Skip function for unknown wire types. Raises an exception."""

	796

	797 raise _DecodeError('Tag had invalid wire type.')

	798

	799 def _FieldSkipper():

	800 """Constructs the SkipField function."""

	801

	802 WIRETYPE_TO_SKIPPER = [

	803 _SkipVarint,

	804 _SkipFixed64,

	805 _SkipLengthDelimited,

	806 _SkipGroup,

	807 _EndGroup,

	808 _SkipFixed32,

	809 _RaiseInvalidWireType,

	810 _RaiseInvalidWireType,

	811 ]

	812

	813 wiretype_mask = wire_format.TAG_TYPE_MASK

	814

	815 def SkipField(buffer, pos, end, tag_bytes):

	816 """Skips a field with the specified tag.

	817

	818 \|pos\| should point to the byte immediately after the tag.

	819

	820 Returns:

	821 The new position (after the tag value), or -1 if the tag is an end-group

	822 tag (in which case the calling loop should break).

	823 """

	824

	825 # The wire type is always in the first byte since varints are little-endian.

	826 wire_type = ord(tag_bytes[0:1]) & wiretype_mask

	827 return WIRETYPE_TO_SKIPPER[wire_type](buffer, pos, end)

	828

	829 return SkipField

	830

	831 SkipField = _FieldSkipper()

OLD	NEW

« no previous file with comments | « third_party/google/protobuf/internal/cpp_message.py ('k') | third_party/google/protobuf/internal/encoder.py » ('j') | no next file with comments »