Chromium Code Reviews

Side by Side Diff: net/base/sdch_manager.h

Issue 423813002: Sdch view for net-internals (Closed) Base URL: https://chromium.googlesource.com/chromium/src.git@master
Patch Set: Create Dictionary JSON directly in SdchManager::SdchInfoToValue Created 6 years, 4 months ago
OLD NEW
1 // Copyright (c) 2011 The Chromium Authors. All rights reserved. 1 // Copyright (c) 2011 The Chromium Authors. All rights reserved.
2 // Use of this source code is governed by a BSD-style license that can be 2 // Use of this source code is governed by a BSD-style license that can be
3 // found in the LICENSE file. 3 // found in the LICENSE file.
4 4
5 // Provides global database of differential decompression dictionaries for the 5 // Provides global database of differential decompression dictionaries for the
6 // SDCH filter (processes sdch enconded content). 6 // SDCH filter (processes sdch enconded content).
7 7
8 // Exactly one instance of SdchManager is built, and all references are made 8 // Exactly one instance of SdchManager is built, and all references are made
9 // into that collection. 9 // into that collection.
10 // 10 //
11 // The SdchManager maintains a collection of memory resident dictionaries. It 11 // The SdchManager maintains a collection of memory resident dictionaries. It
12 // can find a dictionary (based on a server specification of a hash), store a 12 // can find a dictionary (based on a server specification of a hash), store a
13 // dictionary, and make judgements about what URLs can use, set, etc. a 13 // dictionary, and make judgements about what URLs can use, set, etc. a
14 // dictionary. 14 // dictionary.
15 15
16 // These dictionaries are acquired over the net, and include a header 16 // These dictionaries are acquired over the net, and include a header
17 // (containing metadata) as well as a VCDIFF dictionary (for use by a VCDIFF 17 // (containing metadata) as well as a VCDIFF dictionary (for use by a VCDIFF
18 // module) to decompress data. 18 // module) to decompress data.
19 19
20 #ifndef NET_BASE_SDCH_MANAGER_H_ 20 #ifndef NET_BASE_SDCH_MANAGER_H_
21 #define NET_BASE_SDCH_MANAGER_H_ 21 #define NET_BASE_SDCH_MANAGER_H_
22 22
23 #include <map> 23 #include <map>
24 #include <set> 24 #include <set>
25 #include <string> 25 #include <string>
26 #include <vector>
Randy Smith (Not in Mondays) 2014/08/13 17:35:46 Is this still needed?
baranovich 2014/08/13 19:13:48 Done.
26 27
27 #include "base/gtest_prod_util.h" 28 #include "base/gtest_prod_util.h"
28 #include "base/memory/ref_counted.h" 29 #include "base/memory/ref_counted.h"
29 #include "base/memory/scoped_ptr.h" 30 #include "base/memory/scoped_ptr.h"
30 #include "base/threading/non_thread_safe.h" 31 #include "base/threading/non_thread_safe.h"
31 #include "base/time/time.h" 32 #include "base/time/time.h"
32 #include "net/base/net_export.h" 33 #include "net/base/net_export.h"
34 #include "net/base/net_log.h"
Randy Smith (Not in Mondays) 2014/08/13 17:35:45 Can't this be in the cc file? Maybe with a forwar
baranovich 2014/08/13 19:13:48 Done.
33 #include "url/gurl.h" 35 #include "url/gurl.h"
34 36
37 namespace base {
38 class Value;
39 class DictionaryValue;
Randy Smith (Not in Mondays) 2014/08/13 17:35:45 Is this still needed?
baranovich 2014/08/13 19:13:48 Done.
40 }
41
35 namespace net { 42 namespace net {
36 43
37 //------------------------------------------------------------------------------ 44 //------------------------------------------------------------------------------
38 // Create a public interface to help us load SDCH dictionaries. 45 // Create a public interface to help us load SDCH dictionaries.
39 // The SdchManager class allows registration to support this interface. 46 // The SdchManager class allows registration to support this interface.
40 // A browser may register a fetcher that is used by the dictionary managers to 47 // A browser may register a fetcher that is used by the dictionary managers to
41 // get data from a specified URL. This allows us to use very high level browser 48 // get data from a specified URL. This allows us to use very high level browser
42 // functionality in this base (when the functionaity can be provided). 49 // functionality in this base (when the functionaity can be provided).
43 class NET_EXPORT SdchFetcher { 50 class NET_EXPORT SdchFetcher {
44 public: 51 public:
(...skipping 13 matching lines...)
58 DISALLOW_COPY_AND_ASSIGN(SdchFetcher); 65 DISALLOW_COPY_AND_ASSIGN(SdchFetcher);
59 }; 66 };
60 67
61 //------------------------------------------------------------------------------ 68 //------------------------------------------------------------------------------
62 69
63 class NET_EXPORT SdchManager : public NON_EXPORTED_BASE(base::NonThreadSafe) { 70 class NET_EXPORT SdchManager : public NON_EXPORTED_BASE(base::NonThreadSafe) {
64 public: 71 public:
65 // A list of errors that appeared and were either resolved, or used to turn 72 // A list of errors that appeared and were either resolved, or used to turn
66 // off sdch encoding. 73 // off sdch encoding.
67 enum ProblemCodes { 74 enum ProblemCodes {
68 MIN_PROBLEM_CODE, 75 PROBLEM_CODE_OK = 0,
69 76
70 // Content-encoding correction problems. 77 #define SDCH_PROBLEM_CODE(label, value) label = value,
71 ADDED_CONTENT_ENCODING = 1, 78 #include "net/base/sdch_problem_code_list.h"
72 FIXED_CONTENT_ENCODING = 2, 79 #undef SDCH_PROBLEM_CODE
73 FIXED_CONTENT_ENCODINGS = 3,
74
75 // Content decoding errors.
76 DECODE_HEADER_ERROR = 4,
77 DECODE_BODY_ERROR = 5,
78
79 // More content-encoding correction problems.
80 OPTIONAL_GUNZIP_ENCODING_ADDED = 6,
81
82 // Content encoding correction when we're not even tagged as HTML!?!
83 BINARY_ADDED_CONTENT_ENCODING = 7,
84 BINARY_FIXED_CONTENT_ENCODING = 8,
85 BINARY_FIXED_CONTENT_ENCODINGS = 9,
86
87 // Dictionary selection for use problems.
88 DICTIONARY_FOUND_HAS_WRONG_DOMAIN = 10,
89 DICTIONARY_FOUND_HAS_WRONG_PORT_LIST = 11,
90 DICTIONARY_FOUND_HAS_WRONG_PATH = 12,
91 DICTIONARY_FOUND_HAS_WRONG_SCHEME = 13,
92 DICTIONARY_HASH_NOT_FOUND = 14,
93 DICTIONARY_HASH_MALFORMED = 15,
94
95 // Dictionary saving problems.
96 DICTIONARY_HAS_NO_HEADER = 20,
97 DICTIONARY_HEADER_LINE_MISSING_COLON = 21,
98 DICTIONARY_MISSING_DOMAIN_SPECIFIER = 22,
99 DICTIONARY_SPECIFIES_TOP_LEVEL_DOMAIN = 23,
100 DICTIONARY_DOMAIN_NOT_MATCHING_SOURCE_URL = 24,
101 DICTIONARY_PORT_NOT_MATCHING_SOURCE_URL = 25,
102 DICTIONARY_HAS_NO_TEXT = 26,
103 DICTIONARY_REFERER_URL_HAS_DOT_IN_PREFIX = 27,
104
105 // Dictionary loading problems.
106 DICTIONARY_LOAD_ATTEMPT_FROM_DIFFERENT_HOST = 30,
107 DICTIONARY_SELECTED_FOR_SSL = 31,
108 DICTIONARY_ALREADY_LOADED = 32,
109 DICTIONARY_SELECTED_FROM_NON_HTTP = 33,
110 DICTIONARY_IS_TOO_LARGE= 34,
111 DICTIONARY_COUNT_EXCEEDED = 35,
112 DICTIONARY_ALREADY_SCHEDULED_TO_DOWNLOAD = 36,
113 DICTIONARY_ALREADY_TRIED_TO_DOWNLOAD = 37,
114
115 // Failsafe hack.
116 ATTEMPT_TO_DECODE_NON_HTTP_DATA = 40,
117
118
119 // Content-Encoding problems detected, with no action taken.
120 MULTIENCODING_FOR_NON_SDCH_REQUEST = 50,
121 SDCH_CONTENT_ENCODE_FOR_NON_SDCH_REQUEST = 51,
122
123 // Dictionary manager issues.
124 DOMAIN_BLACKLIST_INCLUDES_TARGET = 61,
125
126 // Problematic decode recovery methods.
127 META_REFRESH_RECOVERY = 70, // Dictionary not found.
128 // defunct = 71, // Almost the same as META_REFRESH_UNSUPPORTED.
129 // defunct = 72, // Almost the same as CACHED_META_REFRESH_UNSUPPORTED.
130 // defunct = 73, // PASSING_THROUGH_NON_SDCH plus DISCARD_TENTATIVE_SDCH.
131 META_REFRESH_UNSUPPORTED = 74, // Unrecoverable error.
132 CACHED_META_REFRESH_UNSUPPORTED = 75, // As above, but pulled from cache.
133 PASSING_THROUGH_NON_SDCH = 76, // Tagged sdch but missing dictionary-hash.
134 INCOMPLETE_SDCH_CONTENT = 77, // Last window was not completely decoded.
135 PASS_THROUGH_404_CODE = 78, // URL not found message passing through.
136
137 // This next report is very common, and not really an error scenario, but
138 // it exercises the error recovery logic.
139 PASS_THROUGH_OLD_CACHED = 79, // Back button got pre-SDCH cached content.
140
141 // Common decoded recovery methods.
142 META_REFRESH_CACHED_RECOVERY = 80, // Probably startup tab loading.
143 DISCARD_TENTATIVE_SDCH = 81, // Server decided not to use sdch.
144
145 // Non SDCH problems, only accounted for to make stat counting complete
146 // (i.e., be able to be sure all dictionary advertisements are accounted
147 // for).
148
149 UNFLUSHED_CONTENT = 90, // Possible error in filter chaining.
150 // defunct = 91, // MISSING_TIME_STATS (Should never happen.)
151 CACHE_DECODED = 92, // No timing stats recorded.
152 // defunct = 93, // OVER_10_MINUTES (No timing stats recorded.)
153 UNINITIALIZED = 94, // Filter never even got initialized.
154 PRIOR_TO_DICTIONARY = 95, // We hadn't even parsed a dictionary selector.
155 DECODE_ERROR = 96, // Something went wrong during decode.
156
157 // Problem during the latency test.
158 LATENCY_TEST_DISALLOWED = 100, // SDCH now failing, but it worked before!
159
160 MAX_PROBLEM_CODE // Used to bound histogram. 80 MAX_PROBLEM_CODE // Used to bound histogram.
161 }; 81 };
162 82
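The new side of the enum replaces the hand-written list with an X-macro: SDCH_PROBLEM_CODE is defined, the list file is included, and the macro is undefined again. The following is a minimal sketch of that pattern, not the code from this patch. Because net/base/sdch_problem_code_list.h is not shown in the diff, the sketch stands in a hypothetical wrapper macro, SKETCH_PROBLEM_CODE_LIST, holding three entries copied from the old enum; ProblemCodeName is likewise a hypothetical helper that shows how one list can also generate label strings.

```cpp
#include <string>

// Hypothetical stand-in for the contents of
// net/base/sdch_problem_code_list.h (the real file is not part of this diff).
// The three entries and their values are copied from the old enum.
#define SKETCH_PROBLEM_CODE_LIST(V) \
  V(ADDED_CONTENT_ENCODING, 1)      \
  V(DECODE_HEADER_ERROR, 4)         \
  V(DICTIONARY_HASH_NOT_FOUND, 14)

enum ProblemCodes {
  PROBLEM_CODE_OK = 0,

// Expand each list entry into "label = value,".
#define SDCH_PROBLEM_CODE(label, value) label = value,
  SKETCH_PROBLEM_CODE_LIST(SDCH_PROBLEM_CODE)
#undef SDCH_PROBLEM_CODE

  MAX_PROBLEM_CODE  // One past the last listed value.
};

// Hypothetical helper: reuse the same list to map a code to its label.
inline std::string ProblemCodeName(ProblemCodes code) {
  switch (code) {
    case PROBLEM_CODE_OK:
      return "PROBLEM_CODE_OK";
#define SDCH_PROBLEM_CODE(label, value) \
  case label:                           \
    return #label;
    SKETCH_PROBLEM_CODE_LIST(SDCH_PROBLEM_CODE)
#undef SDCH_PROBLEM_CODE
    case MAX_PROBLEM_CODE:
      break;
  }
  return "UNKNOWN";
}
```

Keeping the list in one place means the enum and any name table cannot drift apart, which is presumably useful when the codes need to be rendered by name in a debugging view.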
163 // Use the following static limits to block DOS attacks until we implement 83 // Use the following static limits to block DOS attacks until we implement
164 // a cached dictionary evicition strategy. 84 // a cached dictionary evicition strategy.
165 static const size_t kMaxDictionarySize; 85 static const size_t kMaxDictionarySize;
166 static const size_t kMaxDictionaryCount; 86 static const size_t kMaxDictionaryCount;
167 87
168 // There is one instance of |Dictionary| for each memory-cached SDCH 88 // There is one instance of |Dictionary| for each memory-cached SDCH
169 // dictionary. 89 // dictionary.
(...skipping 15 matching lines...)
185 const std::string& client_hash, 105 const std::string& client_hash,
186 const GURL& url, 106 const GURL& url,
187 const std::string& domain, 107 const std::string& domain,
188 const std::string& path, 108 const std::string& path,
189 const base::Time& expiration, 109 const base::Time& expiration,
190 const std::set<int>& ports); 110 const std::set<int>& ports);
191 ~Dictionary(); 111 ~Dictionary();
192 112
193 const GURL& url() const { return url_; } 113 const GURL& url() const { return url_; }
194 const std::string& client_hash() const { return client_hash_; } 114 const std::string& client_hash() const { return client_hash_; }
115 const std::string& domain() const { return domain_; }
116 const std::string& path() const { return path_; }
117 const base::Time& expiration() const { return expiration_; }
118 const std::set<int>& ports() const { return ports_; }
195 119
196 // Security method to check if we can advertise this dictionary for use 120 // Security method to check if we can advertise this dictionary for use
197 // if the |target_url| returns SDCH compressed data. 121 // if the |target_url| returns SDCH compressed data.
198 bool CanAdvertise(const GURL& target_url); 122 bool CanAdvertise(const GURL& target_url) const;
199 123
200 // Security methods to check if we can establish a new dictionary with the 124 // Security methods to check if we can establish a new dictionary with the
201 // given data, that arrived in response to get of dictionary_url. 125 // given data, that arrived in response to get of dictionary_url.
Randy Smith (Not in Mondays) 2014/08/13 17:35:46 Specify return value interpretation? (And below.)
baranovich 2014/08/13 18:38:58 If SDCH is disabled (or SDCH over https is disable
Randy Smith (Not in Mondays) 2014/08/19 19:00:53 Right, thank you for the summary. Thinking this t
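For readers following the return-value discussion: the patch changes several checks from bool to ProblemCodes, with PROBLEM_CODE_OK meaning "allowed", and AddSdchDictionary keeps its bool result while reporting the reason through a ProblemCodes* out-parameter. The sketch below illustrates that convention only. CanSetSketch and TryAddDictionarySketch are hypothetical names, plain strings replace GURL, and the exact-equality domain test is a deliberate simplification of the real DomainMatch logic.

```cpp
#include <string>

// Subset of the problem codes, with values taken from the old enum.
enum ProblemCodes {
  PROBLEM_CODE_OK = 0,
  DICTIONARY_MISSING_DOMAIN_SPECIFIER = 22,
  DICTIONARY_DOMAIN_NOT_MATCHING_SOURCE_URL = 24,
};

// Hypothetical, simplified stand-in for Dictionary::CanSet. Instead of
// returning false for every failure, it reports which check failed.
inline ProblemCodes CanSetSketch(const std::string& domain,
                                 const std::string& dictionary_host) {
  if (domain.empty())
    return DICTIONARY_MISSING_DOMAIN_SPECIFIER;
  // Assumption: exact host equality; the real code uses DomainMatch().
  if (domain != dictionary_host)
    return DICTIONARY_DOMAIN_NOT_MATCHING_SOURCE_URL;
  return PROBLEM_CODE_OK;
}

// Hypothetical caller in the style of AddSdchDictionary: bool result for
// control flow, |problem| for the specific reason.
inline bool TryAddDictionarySketch(const std::string& domain,
                                   const std::string& dictionary_host,
                                   ProblemCodes* problem) {
  *problem = CanSetSketch(domain, dictionary_host);
  return *problem == PROBLEM_CODE_OK;
}
```

A caller that only needs a yes/no answer compares against PROBLEM_CODE_OK; a caller that logs or histograms failures can forward the code unchanged.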
202 static bool CanSet(const std::string& domain, const std::string& path, 126 static ProblemCodes CanSet(const std::string& domain,
203 const std::set<int>& ports, const GURL& dictionary_url); 127 const std::string& path,
128 const std::set<int>& ports,
129 const GURL& dictionary_url);
204 130
205 // Security method to check if we can use a dictionary to decompress a 131 // Security method to check if we can use a dictionary to decompress a
206 // target that arrived with a reference to this dictionary. 132 // target that arrived with a reference to this dictionary.
207 bool CanUse(const GURL& referring_url); 133 ProblemCodes CanUse(const GURL& referring_url) const;
208 134
209 // Compare paths to see if they "match" for dictionary use. 135 // Compare paths to see if they "match" for dictionary use.
210 static bool PathMatch(const std::string& path, 136 static bool PathMatch(const std::string& path,
211 const std::string& restriction); 137 const std::string& restriction);
212 138
213 // Compare domains to see if the "match" for dictionary use. 139 // Compare domains to see if the "match" for dictionary use.
214 static bool DomainMatch(const GURL& url, const std::string& restriction); 140 static bool DomainMatch(const GURL& url, const std::string& restriction);
215 141
216
217 // The actual text of the dictionary. 142 // The actual text of the dictionary.
218 std::string text_; 143 std::string text_;
219 144
220 // Part of the hash of text_ that the client uses to advertise the fact that 145 // Part of the hash of text_ that the client uses to advertise the fact that
221 // it has a specific dictionary pre-cached. 146 // it has a specific dictionary pre-cached.
222 std::string client_hash_; 147 std::string client_hash_;
223 148
224 // The GURL that arrived with the text_ in a URL request to specify where 149 // The GURL that arrived with the text_ in a URL request to specify where
225 // this dictionary may be used. 150 // this dictionary may be used.
226 const GURL url_; 151 const GURL url_;
(...skipping 60 matching lines...)
287 // Check to see if SDCH is enabled (globally), and the given URL is in a 212 // Check to see if SDCH is enabled (globally), and the given URL is in a
288 // supported domain (i.e., not blacklisted, and either the specific supported 213 // supported domain (i.e., not blacklisted, and either the specific supported
289 // domain, or all domains were assumed supported). If it is blacklist, reduce 214 // domain, or all domains were assumed supported). If it is blacklist, reduce
290 // by 1 the number of times it will be reported as blacklisted. 215 // by 1 the number of times it will be reported as blacklisted.
291 bool IsInSupportedDomain(const GURL& url); 216 bool IsInSupportedDomain(const GURL& url);
292 217
293 // Schedule the URL fetching to load a dictionary. This will always return 218 // Schedule the URL fetching to load a dictionary. This will always return
294 // before the dictionary is actually loaded and added. 219 // before the dictionary is actually loaded and added.
295 // After the implied task does completes, the dictionary will have been 220 // After the implied task does completes, the dictionary will have been
296 // cached in memory. 221 // cached in memory.
297 void FetchDictionary(const GURL& request_url, const GURL& dictionary_url); 222 ProblemCodes FetchDictionary(const GURL& request_url,
223 const GURL& dictionary_url);
298 224
299 // Security test function used before initiating a FetchDictionary. 225 // Security test function used before initiating a FetchDictionary.
300 // Return true if fetch is legal. 226 // Return PROBLEM_CODE_OK if fetch is legal.
301 bool CanFetchDictionary(const GURL& referring_url, 227 ProblemCodes CanFetchDictionary(const GURL& referring_url,
302 const GURL& dictionary_url) const; 228 const GURL& dictionary_url) const;
303 229
304 // Add an SDCH dictionary to our list of availible dictionaries. This addition 230 // Add an SDCH dictionary to our list of availible dictionaries. This addition
305 // will fail (return false) if addition is illegal (data in the dictionary is 231 // will fail (return false) if addition is illegal (data in the dictionary is
306 // not acceptable from the dictionary_url; dictionary already added, etc.). 232 // not acceptable from the dictionary_url; dictionary already added, etc.).
233 // |problem| will contain related problem code.
307 bool AddSdchDictionary(const std::string& dictionary_text, 234 bool AddSdchDictionary(const std::string& dictionary_text,
308 const GURL& dictionary_url); 235 const GURL& dictionary_url,
236 ProblemCodes* problem);
309 237
310 // Find the vcdiff dictionary (the body of the sdch dictionary that appears 238 // Find the vcdiff dictionary (the body of the sdch dictionary that appears
311 // after the meta-data headers like Domain:...) with the given |server_hash| 239 // after the meta-data headers like Domain:...) with the given |server_hash|
312 // to use to decompreses data that arrived as SDCH encoded content. Check to 240 // to use to decompreses data that arrived as SDCH encoded content. Check to
313 // be sure the returned |dictionary| can be used for decoding content supplied 241 // be sure the returned |dictionary| can be used for decoding content supplied
314 // in response to a request for |referring_url|. 242 // in response to a request for |referring_url|.
315 // Return null in |dictionary| if there is no matching legal dictionary. 243 // Return null in |dictionary| if there is no matching legal dictionary.
316 void GetVcdiffDictionary(const std::string& server_hash, 244 // Returns PROBLEM_CODE_OK if dictionary is not found, SDCH(-over-https) is
317 const GURL& referring_url, 245 // disabled, or if matching legal dictionary exists. Otherwise returns the
318 scoped_refptr<Dictionary>* dictionary); 246 // corresponding problem code.
247 ProblemCodes GetVcdiffDictionary(const std::string& server_hash,
248 const GURL& referring_url,
249 scoped_refptr<Dictionary>* dictionary);
319 250
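The new comment on GetVcdiffDictionary describes a contract that is easy to misread: PROBLEM_CODE_OK is returned both when a legal dictionary is found and when no dictionary is found at all, so callers have to inspect |dictionary| as well as the return code. Here is a rough sketch of that contract under stated assumptions: LookupSketch, DictionarySketch and DictionaryMapSketch are hypothetical, std::shared_ptr stands in for scoped_refptr, a host string stands in for the GURL, and the "SDCH disabled" branch is omitted.

```cpp
#include <map>
#include <memory>
#include <string>

// Subset of the problem codes, with values taken from the old enum.
enum ProblemCodes {
  PROBLEM_CODE_OK = 0,
  DICTIONARY_FOUND_HAS_WRONG_DOMAIN = 10,
};

// Hypothetical, reduced dictionary record.
struct DictionarySketch {
  std::string domain;
};

using DictionaryMapSketch =
    std::map<std::string, std::shared_ptr<DictionarySketch>>;

// Hypothetical lookup following the documented contract:
//   not found         -> PROBLEM_CODE_OK, |dictionary| left null
//   found but illegal -> specific problem code, |dictionary| left null
//   found and legal   -> PROBLEM_CODE_OK, |dictionary| set
inline ProblemCodes LookupSketch(
    const DictionaryMapSketch& dictionaries,
    const std::string& server_hash,
    const std::string& referring_host,
    std::shared_ptr<DictionarySketch>* dictionary) {
  dictionary->reset();
  auto it = dictionaries.find(server_hash);
  if (it == dictionaries.end())
    return PROBLEM_CODE_OK;
  // Assumption: exact host equality replaces the real CanUse() checks.
  if (it->second->domain != referring_host)
    return DICTIONARY_FOUND_HAS_WRONG_DOMAIN;
  *dictionary = it->second;
  return PROBLEM_CODE_OK;
}
```

Under this contract a null |dictionary| with PROBLEM_CODE_OK means "nothing to report", while a null |dictionary| with any other code means a matching dictionary existed but was rejected.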
320 // Get list of available (pre-cached) dictionaries that we have already loaded 251 // Get list of available (pre-cached) dictionaries that we have already loaded
321 // into memory. The list is a comma separated list of (client) hashes per 252 // into memory. The list is a comma separated list of (client) hashes per
322 // the SDCH spec. 253 // the SDCH spec.
323 void GetAvailDictionaryList(const GURL& target_url, std::string* list); 254 void GetAvailDictionaryList(const GURL& target_url, std::string* list);
324 255
325 // Construct the pair of hashes for client and server to identify an SDCH 256 // Construct the pair of hashes for client and server to identify an SDCH
326 // dictionary. This is only made public to facilitate unit testing, but is 257 // dictionary. This is only made public to facilitate unit testing, but is
327 // otherwise private 258 // otherwise private
328 static void GenerateHash(const std::string& dictionary_text, 259 static void GenerateHash(const std::string& dictionary_text,
329 std::string* client_hash, std::string* server_hash); 260 std::string* client_hash, std::string* server_hash);
330 261
331 // For Latency testing only, we need to know if we've succeeded in doing a 262 // For Latency testing only, we need to know if we've succeeded in doing a
332 // round trip before starting our comparative tests. If ever we encounter 263 // round trip before starting our comparative tests. If ever we encounter
333 // problems with SDCH, we opt-out of the test unless/until we perform a 264 // problems with SDCH, we opt-out of the test unless/until we perform a
334 // complete SDCH decoding. 265 // complete SDCH decoding.
335 bool AllowLatencyExperiment(const GURL& url) const; 266 bool AllowLatencyExperiment(const GURL& url) const;
336 267
337 void SetAllowLatencyExperiment(const GURL& url, bool enable); 268 void SetAllowLatencyExperiment(const GURL& url, bool enable);
338 269
270 base::Value* SdchInfoToValue() const;
271
339 private: 272 private:
273 bool IsInBlacklistedDomain(const GURL& url);
274
275 static bool IsSdchEnabledForUrl(const GURL& url);
276
340 typedef std::map<std::string, int> DomainCounter; 277 typedef std::map<std::string, int> DomainCounter;
341 typedef std::set<std::string> ExperimentSet; 278 typedef std::set<std::string> ExperimentSet;
342 279
343 // A map of dictionaries info indexed by the hash that the server provides. 280 // A map of dictionaries info indexed by the hash that the server provides.
344 typedef std::map<std::string, scoped_refptr<Dictionary> > DictionaryMap; 281 typedef std::map<std::string, scoped_refptr<Dictionary> > DictionaryMap;
345 282
346 // Support SDCH compression, by advertising in headers. 283 // Support SDCH compression, by advertising in headers.
347 static bool g_sdch_enabled_; 284 static bool g_sdch_enabled_;
348 285
349 // Support SDCH compression for HTTPS requests and responses. When supported, 286 // Support SDCH compression for HTTPS requests and responses. When supported,
(...skipping 19 matching lines...)
369 // List of hostnames for which a latency experiment is allowed (because a 306 // List of hostnames for which a latency experiment is allowed (because a
370 // round trip test has recently passed). 307 // round trip test has recently passed).
371 ExperimentSet allow_latency_experiment_; 308 ExperimentSet allow_latency_experiment_;
372 309
373 DISALLOW_COPY_AND_ASSIGN(SdchManager); 310 DISALLOW_COPY_AND_ASSIGN(SdchManager);
374 }; 311 };
375 312
376 } // namespace net 313 } // namespace net
377 314
378 #endif // NET_BASE_SDCH_MANAGER_H_ 315 #endif // NET_BASE_SDCH_MANAGER_H_