chrome/test/functional/dataset_converter.py - Issue 6246147: Test Autofill's ability to merge duplicate profiles and...

Side by Side Diff: chrome/test/functional/dataset_converter.py

Issue 6246147: Test Autofill's ability to merge duplicate profiles and... (Closed) Base URL: svn://chrome-svn/chrome/trunk/src/

Patch Set: '' Created 9 years, 10 months ago

Use n/p to move between diff chunks; N/P to move between comments. Draft comments are only viewable by you.

Jump to:

View unified diff | Download patch | Annotate | Revision Log

OLD	NEW
(Empty)
	1 #!/usr/bin/python

	2 # Copyright (c) 2011 The Chromium Authors. All rights reserved.

	3 # Use of this source code is governed by a BSD-style license that can be

	4 # found in the LICENSE file.

	5

	6 """Takes in a dataset profiles file and outputs to a dictionary list format
	dennisjeffrey 2011/02/11 00:53:17 The first line of this comment should be a 1-line The first line of this comment should be a 1-line summary of this module. Currently it's using 2 lines. dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > The first line of this comment should be a 1-line summary of this module. > Currently it's using 2 lines. Done.
	7 for converting Autofill profile datasets.

	8

	9 Used for test autofill.AutoFillTest.testMergeDuplicateProfilesInAutofill.

	10 """

	11

	12 import re

	13 import codecs

	14 import sys

	15 import os
	dennisjeffrey 2011/02/11 00:53:17 These should be specified in alphabetical order. These should be specified in alphabetical order. dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > These should be specified in alphabetical order. Done.
	16

	17

	18 class DatasetConverter(object):

	19 def __init__(self, input_filename, output_filename = None,

	20 display_nothing = True, display_input_lines = False,

	21 display_converted_lines = False):
	dennisjeffrey 2011/02/11 00:53:17 Don't put spaces around the "=" when you're defini Don't put spaces around the "=" when you're defining the default argument values. dennisjeffrey 2011/02/11 00:53:17 Using the "logging" module with different verbosit Using the "logging" module with different verbosity levels might be a better approach, as opposed to having different parameters to describe what should or should not be displayed. dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > Using the "logging" module with different verbosity levels might be a better > approach, as opposed to having different parameters to describe what should or > should not be displayed. Done.
	22 """Constructs a dataset converter object.

	23

	24 Full input pattern:

	25 '(?P<NAME_FIRST>.?)\\|(?P<MIDDLE_NAME>.?)\\|(?P<NAME_LAST>.*?)\\|

	26 (?P<EMAIL_ADDRESS>.?)\\|(?P<COMPANY_NAME>.?)\\|(?P<ADDRESS_HOME_LINE1>.*?)

	27 \\|(?P<ADDRESS_HOME_LINE2>.?)\\|(?P<ADDRESS_HOME_CITY>.?)\\|

	28 (?P<ADDRESS_HOME_STATE>.?)\\|(?P<ADDRESS_HOME_ZIP>.?)\\|

	29 (?P<ADDRESS_HOME_COUNTRY>.*?)\\|

	30 (?P<PHONE_HOME_WHOLE_NUMBER>.?)\\|(?P<PHONE_FAX_WHOLE_NUMBER>.?)$'

	31

	32 Full ouput pattern:

	33 "{u'NAME_FIRST': u'%s', u'NAME_MIDDLE': u'%s', u'NAME_LAST': u'%s',

	34 u'EMAIL_ADDRESS': u'%s', u'COMPANY_NAME': u'%s', u'ADDRESS_HOME_LINE1':

	35 u'%s', u'ADDRESS_HOME_LINE2': u'%s', u'ADDRESS_HOME_CITY': u'%s',

	36 u'ADDRESS_HOME_STATE': u'%s', u'ADDRESS_HOME_ZIP': u'%s',

	37 u'ADDRESS_HOME_COUNTRY': u'%s', u'PHONE_HOME_WHOLE_NUMBER': u'%s',

	38 u'PHONE_FAX_WHOLE_NUMBER': u'%s',},"

	39

	40 The pattern is a regular expression which has named parenthesis groups
	Nirnimesh 2011/02/11 19:39:54 I think the input/output pattern above is illustra I think the input/output pattern above is illustrative enough. The long description seems overly detailed to me -- and seems like describing the code. dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 19:39:54, Nirnimesh wrote: > I think the input/output pattern above is illustrative enough. > The long description seems overly detailed to me -- and seems like describing > the code. Done.
	41 like this (?P<name>...) in order to match the '\|' separated fields.

	42 If we had only the NAME_FIRST and NAME_MIDDLE fields (e.g 'Jared\|JV') our

	43 pattern would be: "(?P<NAME_FIRST>.?)\\|(?P<NAME_MIDDLE>.?)$"

	44

	45 This means that '(?P<NAME_FIRST> regexp)\\|' matches whatever regular

	46 expression is inside the parentheses, and indicates the start and end of a

	47 group; the contents of a group can be retrieved after a match has been

	48 performed using the symbolic group name 'NAME_FIRST'.

	49

	50 The regexp is '.?'. '.' which means to match 0 or more repetitions of any

	51 character. The following '?' makes the regexp non-greedy meaning it will

	52 stop at the first occurrence of the '\|' character (escaped in the pattern).

	53

	54 For '(?P<NAME_MIDDLE>.*?)$' there is no '\|' at the end, so we have '$' to

	55 indicate the end of the line.

	56

	57 From the full pattern, we construct once from the FIELDS list.

	58

	59 The out_line_pattern for one field: "{u'NAME_FIRST': u'%s',"

	60 is ready to accept the value for the 'NAME_FIRST' field once it is extracted

	61 from an input line using the above group pattern.

	62

	63 'pattern' is used in CreateDictionaryFromRecord(line) to construct and

	64 return a dictionary from a line.

	65

	66 'out_line_pattern' is used in 'convert()' to construct the final dataset

	67 line that will be printed to the output file.

	68

	69 Args:

	70 input_filename: name and path of the input dataset.

	71 output_filename: name and path of the converted file, default is None.

	72 display_nothing: output display on the screen, default is True.

	73 display_input_lines: output display of the inpute file, default is False.

	74 display_converted_lines: output display of the converted file,

	75 default is False.

	76 """

	77 self._fields = [

	78 u'NAME_FIRST',

	79 u'NAME_MIDDLE',

	80 u'NAME_LAST',

	81 u'EMAIL_ADDRESS',

	82 u'COMPANY_NAME',

	83 u'ADDRESS_HOME_LINE1',

	84 u'ADDRESS_HOME_LINE2',

	85 u'ADDRESS_HOME_CITY',

	86 u'ADDRESS_HOME_STATE',

	87 u'ADDRESS_HOME_ZIP',

	88 u'ADDRESS_HOME_COUNTRY',

	89 u'PHONE_HOME_WHOLE_NUMBER',

	90 u'PHONE_FAX_WHOLE_NUMBER',

	91 ]
	dennisjeffrey 2011/02/11 00:53:17 Since _fields is just a constant array, would it b Since _fields is just a constant array, would it be better declared as a class attribute rather than a data attribute (since all objects of the DatasetConverter class would have the same set of _fields anyway, right?). dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > Since _fields is just a constant array, would it be better declared as a class > attribute rather than a data attribute (since all objects of the > DatasetConverter class would have the same set of _fields anyway, right?). Done.
	92 self._output_pattern = u"{"
	Nirnimesh 2011/02/11 19:39:54 prefer single quote char ' prefer single quote char ' dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 19:39:54, Nirnimesh wrote: > prefer single quote char ' Done.
	93 for key in self._fields:

	94 self._output_pattern += u"u'%s': u'%s', " %(key, "%s")
	dennisjeffrey 2011/02/11 00:53:17 I think this could be re-written like this: self. I think this could be re-written like this: self._output_pattern += u"u'%s': u'%%s', " % key dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > I think this could be re-written like this: > > self._output_pattern += u"u'%s': u'%%s', " % key Done.
	95 self._output_pattern = self._output_pattern[:-1] + "},\n"

	96

	97 self._input_filename = input_filename
	dennisjeffrey 2011/02/11 00:53:17 We should probably check to ensure that input_file We should probably check to ensure that input_filename refers to a valid file and raise an exception if not. dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > We should probably check to ensure that input_filename refers to a valid file > and raise an exception if not. Done.
	98 self._output_filename = output_filename

	99 self._display_nothing = display_nothing

	100 self._display_input_lines = display_input_lines

	101 self._display_converted_lines = display_converted_lines

	102 self._record_length = len(self._fields)
	dennisjeffrey 2011/02/11 00:53:17 Perhaps we could remove this variable and just rep Perhaps we could remove this variable and just replace its two uses below with "len(self._fields)" itself? dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > Perhaps we could remove this variable and just replace its two uses below with > "len(self._fields)" itself? Done.
	103

	104 def CreateDictionaryFromRecord(self, line):
	dennisjeffrey 2011/02/11 00:53:17 If this function is only used by the _Convert() fu If this function is only used by the _Convert() function below, then should the function name here also be preceded by an underscore? dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > If this function is only used by the _Convert() function below, then should the > function name here also be preceded by an underscore? Done.
	105 """Constructs and returns a dictionary from a record in the dataset file.

	106 Escapes single quotation first and uses split('\|') to separate values.
	dennisjeffrey 2011/02/11 00:53:17 This first line of the comment should be a 1-line This first line of the comment should be a 1-line summary of the method. dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > This first line of the comment should be a 1-line summary of the method. Done.
	107

	108 Example:

	109 Take an argument as a string u'John\|Doe\|Mountain View'

	110 and returns a dictionary

	111 {

	112 u'NAME_FIRST': u'John',

	113 u'NAME_LAST': u'Doe',

	114 u'ADDRESS_HOME_CITY': u'Mountain View',

	115 }

	116

	117 Arg:
	dennisjeffrey 2011/02/11 00:53:17 "Arg" --> "Args" (I think it should be "Args" eve "Arg" --> "Args" (I think it should be "Args" even if there's only one). dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > "Arg" --> "Args" > > (I think it should be "Args" even if there's only one). Done.
	118 line: row of record from the dataset file.
	dennisjeffrey 2011/02/11 00:53:17 Since this method returns something, you should ha Since this method returns something, you should have a "Returns:" section after the "Args:" section. dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > Since this method returns something, you should have a "Returns:" section after > the "Args:" section. Done.
	119 """

	120 # Ignore irrelevant record lines such as comment lines.
	dennisjeffrey 2011/02/11 00:53:17 Besides comment lines, what other lines are consid Besides comment lines, what other lines are considered irrelevant? dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > Besides comment lines, what other lines are considered irrelevant? Done.
	121 if not '\|' in line:
	dennisjeffrey 2011/02/11 00:53:17 What if a comment contains a "\|" character? Then What if a comment contains a "\|" character? Then will the code below erroneously try to process that comment line? dyu1 2011/02/16 03:17:31 No, I have a check in place (line 129) where it ch No, I have a check in place (line 129) where it checks if the pipe separates 13 fields (12 pipes/13 fields) and if not then it just displays and error line and return nothing (on the fly it will ignore that line completely and move to the next line). On 2011/02/11 00:53:17, dennisjeffrey wrote: Show quoted text > What if a comment contains a "\|" character? Then will the code below > erroneously try to process that comment line? dennis_jeffrey 2011/02/16 19:43:29 Oh, ok. I didn't realize that each line is expect Show quoted text On 2011/02/16 03:17:31, dyu1 wrote: > No, I have a check in place (line 129) where it checks if the pipe separates 13 > fields (12 pipes/13 fields) and if not then it just displays and error line and > return nothing (on the fly it will ignore that line completely and move to the > next line). > > On 2011/02/11 00:53:17, dennisjeffrey wrote: > > What if a comment contains a "\|" character? Then will the code below > > erroneously try to process that comment line? > Oh, ok. I didn't realize that each line is expected to be exactly 13 fields separated by 12 pipes.
	122 return
	dennisjeffrey 2011/02/11 00:53:17 Is it possible to have a valid line that does not Is it possible to have a valid line that does not contain any "\|", for example, if the line only contains a single value? dyu1 2011/02/16 03:17:31 Well the dataset given to me is in the following f Well the dataset given to me is in the following format john\|\|doe\|john.doe@gmail.com\|\|1950 Amphitheatre Ave #2\|\|\|\|14888\|US\|4195551234\| and contains total of 12 pipes and that's what I'm parsing for and converting it a dictionary list output on the fly. It might still be valid but the 243 records given doesn't seem to be of that format you asked. On 2011/02/11 00:53:17, dennisjeffrey wrote: Show quoted text > Is it possible to have a valid line that does not contain any "\|", for example, > if the line only contains a single value? dennis_jeffrey 2011/02/16 19:43:29 Ok, I see. I was thinking that in general, a reco Show quoted text On 2011/02/16 03:17:31, dyu1 wrote: > Well the dataset given to me is in the following format > john\|\|doe\|john.doe@gmail.com\|\|1950 Amphitheatre Ave #2\|\|\|\|14888\|US\|4195551234\| > > and contains total of 12 pipes and that's what I'm parsing for and converting it > a dictionary list output on the fly. It might still be valid but the 243 records > given doesn't seem to be of that format you asked. > On 2011/02/11 00:53:17, dennisjeffrey wrote: > > Is it possible to have a valid line that does not contain any "\|", for > example, > > if the line only contains a single value? > Ok, I see. I was thinking that in general, a record with a single field may have no '\|' characters. I didn't realize that you're parsing only a particular record format here with 13 fields and 12 pipes.
	123 re_pattern = re.compile("'", re.UNICODE)

	124 line = re_pattern.sub(r"\'", line)
	dennisjeffrey 2011/02/11 00:53:17 You might want to add a comment to describe what y You might want to add a comment to describe what you're doing in these two lines. It looks weird to call "compile" on a quote character, and then just substitute that for the line. dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > You might want to add a comment to describe what you're doing in these two > lines. It looks weird to call "compile" on a quote character, and then just > substitute that for the line. Done. dennis_jeffrey 2011/02/16 19:43:29 Oops, sorry - Now that I see your comment, I reali Show quoted text On 2011/02/16 03:17:31, dyu1 wrote: > On 2011/02/11 00:53:17, dennisjeffrey wrote: > > You might want to add a comment to describe what you're doing in these two > > lines. It looks weird to call "compile" on a quote character, and then just > > substitute that for the line. > > Done. Oops, sorry - Now that I see your comment, I realize that I misread the original line (See, the comments can help! :-D).
	125

	126 line_list = line.split('\|')

	127 if line_list:

	128 # Check for case when a line may have more or less fields than expected.

	129 if len(line_list) != self._record_length:

	130 print >> sys.stderr, "Error: a '\|' seperated line has %d fields \

	131 instead of %d" % (len(line_list), self._record_length)

	132 print >> sys.stderr, "\t%s" % line

	133 return
	dennisjeffrey 2011/02/11 00:53:17 How about raising an exception rather than just re How about raising an exception rather than just returning nothing? Also, you might want to consider using the "logging" module rather than using "print". dyu1 2011/02/16 03:17:31 Done for logging. If I raise an exception here th Done for logging. If I raise an exception here then the script will stop rather than continuing on and just ignoring the line, which is what I need it to do when creating the list of dictionaries on the fly in my pyauto test. On 2011/02/11 00:53:17, dennisjeffrey wrote: Show quoted text > How about raising an exception rather than just returning nothing? Also, you > might want to consider using the "logging" module rather than using "print". dennis_jeffrey 2011/02/16 19:43:29 Ok, I think a logging.warning like what you do now Show quoted text On 2011/02/16 03:17:31, dyu1 wrote: > Done for logging. > > If I raise an exception here then the script will stop rather than continuing on > and just ignoring the line, which is what I need it to do when creating the list > of dictionaries on the fly in my pyauto test. > > On 2011/02/11 00:53:17, dennisjeffrey wrote: > > How about raising an exception rather than just returning nothing? Also, you > > might want to consider using the "logging" module rather than using "print". > Ok, I think a logging.warning like what you do now is a good solution in this case.
	134 out_record = {}

	135 i = 0

	136 for key in self._fields:

	137 out_record[key] = line_list[i]

	138 i += 1
	dennisjeffrey 2011/02/11 00:53:17 It looks like here, you're assuming that the order It looks like here, you're assuming that the order in which entries are specified in line_list, matches the order in which keys are considered when you iterate through self._fields. Is that really ok to assume? dyu1 2011/02/16 03:17:31 Yes, since the order of the keys from the order in Yes, since the order of the keys from the order in the _fields list. The ordering will not change and it's the order of the entries in the line_list as they appear. On 2011/02/11 00:53:17, dennisjeffrey wrote: Show quoted text > It looks like here, you're assuming that the order in which entries are > specified in line_list, matches the order in which keys are considered when you > iterate through self._fields. Is that really ok to assume?
	139 return out_record

	140

	141 def _Convert(self, input_file, output_file):

	142 """The real conversion takes place here.
	dennisjeffrey 2011/02/11 00:53:17 I think it would be more useful to say what's bein I think it would be more useful to say what's being converted in this comment. dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > I think it would be more useful to say what's being converted in this comment. Done.
	143

	144 Args:

	145 input_file: dataset input file.

	146 output_file: the converted dictionary list output file.
	dennisjeffrey 2011/02/11 00:53:17 Since this function returns something, you need a Since this function returns something, you need a "Returns:" section after the "Args:" section. dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > Since this function returns something, you need a "Returns:" section after the > "Args:" section. Done.
	147 """

	148 list_of_dict = []

	149 i = 0

	150 if output_file:

	151 output_file.write("[")

	152 output_file.write(os.linesep)

	153 for line in input_file.readlines():

	154 line = line.strip()

	155 if not line:

	156 continue

	157 line = unicode(line, 'UTF-8')

	158 output_record = self.CreateDictionaryFromRecord(line)

	159 if output_record:

	160 i += 1

	161 list_of_dict.append(output_record)

	162 output_line = self._output_pattern %tuple(
	dennisjeffrey 2011/02/11 00:53:17 Put a space after the "%". Put a space after the "%". dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > Put a space after the "%". Done.
	163 [output_record[key] for key in self._fields])

	164 if output_file:

	165 output_file.write(output_line)

	166 output_file.write(os.linesep)

	167 if not self._display_nothing:

	168 if self._display_input_lines:

	169 print "\n%d: %s" %(i, line.encode(sys.stdout.encoding, 'ignore'))
	dennisjeffrey 2011/02/11 00:53:17 Put a space after the "%". Put a space after the "%". dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > Put a space after the "%". Done.
	170 if self._display_converted_lines:

	171 print "\tconverted to: %s" %output_line.encode(
	dennisjeffrey 2011/02/11 00:53:17 You may want to consider using the "logging" modul You may want to consider using the "logging" module rather than "print". dennisjeffrey 2011/02/11 00:53:17 Put a space after the "%". Put a space after the "%". dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > Put a space after the "%". Done.
	172 sys.stdout.encoding, 'ignore')

	173 else:

	174 if not self._display_input_lines and not i % 10:

	175 print "\t%d lines converted so far!" %i
	dennisjeffrey 2011/02/11 00:53:17 Put a space after the "%". Put a space after the "%". dennisjeffrey 2011/02/11 00:53:17 I assume all lines should be converted nearly inst I assume all lines should be converted nearly instantaneously from the perspective of a human user (unless input files can be huge). Is it really helpful to print a message after every 10 lines are converted?
	176 if output_file:

	177 output_file.write("]")

	178 output_file.write(os.linesep)

	179 if not self._display_nothing:

	180 print

	181 print "%d lines converted SUCCESSFULLY!" %i
	dennisjeffrey 2011/02/11 00:53:17 Put a space after the "%". Put a space after the "%". dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > Put a space after the "%". Done.
	182 print "--- FINISHED ---"

	183 print
	dennisjeffrey 2011/02/11 00:53:17 Again, consider using "logging" instead of "print" Again, consider using "logging" instead of "print". That way, you can specify verbosity levels for the different printed messages, and let a user specify what level of verbosity they want to see as output. dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > Again, consider using "logging" instead of "print". That way, you can specify > verbosity levels for the different printed messages, and let a user specify what > level of verbosity they want to see as output. Done.
	184 return list_of_dict

	185

	186 def Convert(self):

	187 """Takes arguments of two file names and creates two file objects, then
	dennisjeffrey 2011/02/11 00:53:17 This method actually doesn't take any parameter ar This method actually doesn't take any parameter arguments. It just uses the values of two data attributes of the current object. dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > This method actually doesn't take any parameter arguments. It just uses the > values of two data attributes of the current object. Done.
	188 calls _Convert() with these two file objects to do the real conversion."""
	dennisjeffrey 2011/02/11 00:53:17 The first comment line should be a 1-line summary The first comment line should be a 1-line summary of this method. dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > The first comment line should be a 1-line summary of this method. Done.
	189 with open(self._input_filename) as input_file:

	190 if self._output_filename:

	191 with codecs.open(self._output_filename, mode = 'wb',

	192 encoding = 'utf-8-sig') as output_file:
	dennisjeffrey 2011/02/11 00:53:17 Remove the spaces around the "=" when specifying t Remove the spaces around the "=" when specifying the named parameter values. dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > Remove the spaces around the "=" when specifying the named parameter values. Done.
	193 return self._Convert(input_file, output_file)

	194 else:

	195 return self._Convert(input_file, None)

	196
	dennisjeffrey 2011/02/11 00:53:17 Should have an extra blank line here: the style gu Should have an extra blank line here: the style guide says to put 2 blank lines before top-level definitions. dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > Should have an extra blank line here: the style guide says to put 2 blank lines > before top-level definitions. Done.
	197 def main():

	198 c = DatasetConverter(r'../data/autofill/dataset.txt',
	dennisjeffrey 2011/02/11 00:53:17 Is it better to hard-code the input filename and o Is it better to hard-code the input filename and output filename here, or would it be better to specify these as command-line inputs to the main() function? dyu1 2011/02/16 03:17:31 Well command-line input would be find for the stan Well command-line input would be find for the standalone version of this script but what about creating the dictionary list on the fly in the pyAuto test? On 2011/02/11 00:53:17, dennisjeffrey wrote: Show quoted text > Is it better to hard-code the input filename and output filename here, or would > it be better to specify these as command-line inputs to the main() function? dennis_jeffrey 2011/02/16 19:43:29 When this module is invoked via the PyAuto test, t Show quoted text On 2011/02/16 03:17:31, dyu1 wrote: > Well command-line input would be find for the standalone version of this script > but what about creating the dictionary list on the fly in the pyAuto test? > > On 2011/02/11 00:53:17, dennisjeffrey wrote: > > Is it better to hard-code the input filename and output filename here, or > would > > it be better to specify these as command-line inputs to the main() function? > When this module is invoked via the PyAuto test, the input and output filenames will be passed as input to this module when class DatasetConverter is instantiated (just as you're already doing). However, when running this module as a standalone program, right now it uses hard-coded values for the input and output filenames. I was just wondering whether it might be more useful to allow a user, when invoking this module as a standalone program, to specify the desired input and output filenames (perhaps the existing hardcoded filenames could be used as defaults, but at least it seems useful to allow the user to override these defaults if desired). Also, since you're now using the logging module, then when this module is invoked from a PyAuto test, you should probably have the PyAuto test pass as input (to the class DatasetConverter constructor) the desired verbosity level. Next, in the event this module is invoked as a standalone program, you may want to allow the user to specify the desired verbosity level too, as a command-line argument to this program.
	199 r'../data/autofill/dataset_duplicate-profiles.txt')
	dennisjeffrey 2011/02/11 00:53:17 The second argument should line up underneath the The second argument should line up underneath the first argument. dyu1 2011/02/16 03:17:31 Done. Show quoted text On 2011/02/11 00:53:17, dennisjeffrey wrote: > The second argument should line up underneath the first argument. Done.
	200 c.Convert()

	201

	202 if __name__ == '__main__':

	203 main()

OLD	NEW

« chrome/test/functional/autofill.py ('K') | « chrome/test/functional/autofill.py ('k') | no next file » | no next file with comments »