EP1971930A2 - Site acceleration with content prefetching enabled through customer-specific configurations - Google Patents
Site acceleration with content prefetching enabled through customer-specific configurationsInfo
- Publication number
- EP1971930A2 EP1971930A2 EP06848289A EP06848289A EP1971930A2 EP 1971930 A2 EP1971930 A2 EP 1971930A2 EP 06848289 A EP06848289 A EP 06848289A EP 06848289 A EP06848289 A EP 06848289A EP 1971930 A2 EP1971930 A2 EP 1971930A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- content
- prefetching
- prefetch
- edge server
- given
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/957—Browsing optimisation, e.g. caching or content distillation
- G06F16/9574—Browsing optimisation, e.g. caching or content distillation of access to content, e.g. by caching
Definitions
- the present invention relates generally to content delivery in distributed networks. Brief Description of the Related Art
- a company's Web site represents its public face. It is often the initial point of contact for obtaining access to the company's information or doing business with the company.
- Public facing Web sites are used for many purposes. They can be used to transact commerce, where end consumers evaluate and buy products and services, and they are often linked to revenue generation and satisfying customer requests. They can be used as news and information portals for supplying the latest content for consumers.
- a company's Web site can be used as a customer self-service venue, where customer satisfaction is critical to loyalty in getting customers to return to the Web site. These are merely representative examples, of course.
- Web sites increasingly become a key component of a company's business and its external communications. As such, the capability and flexibility of the supporting Internet infrastructure for the Web site becomes mission-critical. In particular, the infrastructure must provide good performance for all end user consumers, regardless of their location. The site must scale to handle high traffic load during peak usage periods. It must remain available 24x7, regardless of conditions on the Internet. When performance, reliability, or scalability problems do occur, Web site adoption and usage can be negatively impacted, resulting in greater costs, decreased revenue, and customer satisfaction issues.
- a “distributed system” of this type typically refers to a collection of autonomous computers linked by a network or networks, together with the software, systems, protocols and techniques designed to facilitate various services, such as content delivery or the support of outsourced site infrastructure.
- content delivery means the storage, caching, or transmission of content, streaming media and applications on behalf of content providers, including ancillary technologies used therewith including, without limitation, DNS request handling, provisioning, data monitoring and reporting, content targeting, personalization, and business intelligence.
- ancillary technologies used therewith including, without limitation, DNS request handling, provisioning, data monitoring and reporting, content targeting, personalization, and business intelligence.
- the term “outsourced site infrastructure” means the distributed systems and associated technologies that enable an entity to operate and/or manage a third party's Web site infrastructure, in whole or in part, on the third party's behalf.
- FIGS 1-2 illustrate a known CDN infrastructure for managing content delivery on behalf of participating content providers.
- computer system 100 is configured as a CDN and is managed by a service provider.
- the CDN is assumed to have a set of machines 102a-n distributed around the Internet, and some or even all of these machines may be located in data centers owned or operated by third parties. Typically, most of the machines are servers located near the edge of the Internet, i.e., at or adjacent end user access networks.
- a Network Operations Command Center (NOCC) 104 may be used to administer and manage operations of the various machines in the system.
- NOCC Network Operations Command Center
- Third party content sites such as Web site 106, offload delivery of content (e.g., HTML, embedded page objects, streaming media, software downloads, and the like) to the distributed computer system 100 and, in particular, to "edge" servers. Typically, this service is provided for a fee.
- CDN content provider customers offload their content delivery by aliasing (e.g., by a DNS canonical name) given content provider domains or sub-domains to domains that are managed by the service provider's authoritative domain name service. End users that desire such content may be directed to the distributed computer system to obtain that content more reliably and efficiently.
- the distributed computer system typically also includes other infrastructure, such as a distributed data collection system 108 that collects usage and other data from the edge servers, aggregates that data across a region or set of regions, and passes that data to other back-end systems 110, 112, 114 and 116 to facilitate monitoring, logging, alerts, billing, management and other operational and administrative functions.
- Distributed network agents 118 monitor the network as well as the server loads and provide network, traffic and load data to a DNS query handling mechanism 115, which is authoritative for content domains being managed by the CDN.
- a distributed data transport mechanism 120 may be used to distribute control information (e.g., metadata to manage content, to facilitate load balancing, and the like) to the edge servers.
- a given machine 200 comprises commodity hardware (e.g., an Intel Pentium processor) 202 running an operating system kernel (such as Linux or variant) 204 that supports one or more applications 206a-n.
- operating system kernel such as Linux or variant
- given machines typically run a set of applications, such as an HTTP Web proxy 207, a name server 208, a local monitoring process 210, a distributed data collection process 212, and the like.
- the machine typically includes one or more media servers, such as a Windows Media Server (WMS) or Flash 2.0 server, as required by the supported media formats.
- WMS Windows Media Server
- the CDN may be configured to provide certain advanced content delivery functionality, for example, in the case where the edge server does not have the requested content (e.g., the content is not present, the content is present but is stale, the content is "dynamic" and must be created on the origin server, and the like). In such circumstances, the edge server must "go forward” to obtain the requested content.
- An enhanced CDN often provides the capability to facilitate this "go forward” process.
- a subset of the edge servers in the CDN is organized as a cache hierarchy, so that a given edge server in an edge region has an associated "parent" region that may store an authoritative copy of certain requested content.
- a cache hierarchy of this type is then controlled at a fine-grain level using edge server and parent server configuration rules that are provided through the distributed data transport mechanism.
- U.S. Patent No. 7,133,905 which is assigned to the assignee of the present application, describes this scheme.
- Another advanced function that may be implemented is quite useful when an edge server has to go forward to an origin server for dynamic or non-cacheable content.
- the CDN is configured so that a given edge server has the option of going forward (to the origin) using intermediate.
- CDN edge nodes instead of relying upon default BGP routing.
- the CDN performs tests to determine a set of alternative best paths between a given edge server and the origin server, and it makes those paths known to the edge server dynamically, typically in the form of a map.
- the edge server When the edge server needs to go forward, it examines the map to determine whether to go forward using default BGP or one of the alternate paths through an intermediate CDN node.
- This path optimization process is quite useful when the content in question must be generated dynamically, although the process can be used whenever it is necessary for a given edge server to obtain given content from a given source.
- This performance-based path optimization scheme is described in U.S. Publication No. 2002/0163882, which is also assigned to the assignee of the present application.
- a CDN edge server is configured to provide one or more extended content delivery features on a domain-specific, customer-specific basis, preferably using configuration files that are distributed to the edge servers using a configuration system.
- a given configuration file preferably is XML-based and includes a set of content handling rules and directives that facilitate one or more advanced content handling features, such as content prefetching.
- the edge server retrieves objects (such as images and scripts) embedded in pages (normally HTML content) at the same time it serves the page to the browser rather than waiting for the browser's request for these objects. This can significantly decrease the overall rendering time of the page and improve the user experience of a Web site.
- prefetching can be applied to either cacheable or uncacheable content.
- prefetching When prefetching is used for cacheable content, and the object to be prefetched is already in cache, the object is moved from disk into memory so that it is ready to be served.
- prefetching When prefetching is used for uncacheable content, the retrieved objects are uniquely associated with the client browser request that triggered the prefetch so that these objects cannot be served to a different end user.
- prefetching can be combined with tiered distribution and other edge server configuration options to further improve the speed of delivery and/or to protect the origin server from bursts of prefetching requests.
- Figure 1 is a representative prior art content delivery network in which the present invention may be implemented
- Figure 2 is a representative edge server of the content delivery network of Figure 1;
- Figure 3 is a portion of the CDN of Figure 1 in which content prefetching is enabled according to the present invention
- Figure 4 is a table of HTML elements that may be prefetched based on settings in a customer-specific configuration file; and Figure 5 is a representative default set of metadata to enable the prefetching feature according to the present invention.
- a CDN customer has off-loaded all or some of its content delivery requirements to the CDN service provider in a well-known manner.
- the CDN customer operates a site at the origin server 316.
- An Internet-accessible client 300 e.g., an end user client machine having a browser and media player
- CDN authoritative DNS mechanism 302 has been directed by CDN authoritative DNS mechanism 302 to a nearby edge server 304.
- Edge server 304 may be configured as described above and illustrated in Figure 2. This server includes a management process that provides the content prefetching functionality of the present invention, as will be described in more detail below.
- the given edge server 304 may be located in a set (or "region") of edge servers that are co-located at a given Internet-accessible data center. For convenience, only one edge server per region is shown.
- Content handling rules are configured into each edge server, preferably via a metadata configuration system 306.
- the configuration system provides edge server content control metadata via links 318, which themselves may include other infrastructure (servers, and the like).
- U.S. Patent No. 7,111,057 illustrates a useful infrastructure for delivering and managing edge server content control information, and this and other edge server control information can be provisioned by the CDN service provider itself, or (via an extranet or the like) the content provider customer who operates the origin server 316.
- edge servers 304 and 310 may comprise a cache hierarchy so that edge server 304 may "go forward" to a CDN parent instead of to the origin server as needed.
- This tiered distribution is described in U.S. Patent No. 7,133,905, as noted above.
- the CDN may provide overlay path routing to enable the edge server 304 to go forward to the origin server 316 through an alternate CDN path, such as the path through edge server 308, or through edge server 312, depending on whether one of these alternative paths provides better performance than a default BGP path.
- this performance-based overlay path delivery scheme is described in U.S. Publication No. 2002/0163882. The disclosures of each of the above-identified references are incorporated herein by reference.
- an edge server 314 may be co-located with the customer origin server, although this is not required.
- a given CDN edge server is configured to provide one or more extended content delivery features.
- the CDN edge servers are configurable to provide these delivery features on a customer-specific, customer domain- specific, preferably using XML-based configuration files that are distributed to the edge servers using a metadata configuration system such as described above.
- a given XML-based configuration file includes a set of content handling rules and directives that facilitate one or more advanced content handling features.
- an edge server management process receives a request for content, it searches an index file for a match on a customer hostname associated with the request. If there is no match, the edge server process rejects the request. If there is a match, the edge server process loads metadata from the configuration file to determine how it will handle the request.
- the metadata for the hostname may indicate whether to serve the request from cache or from the origin. If the metadata indicates that the request is associated with cached or cacheable content, the information may then direct the edge server to look for the content in a local cache or, failing that, to fetch the content from a CDN cache hierarchy parent node. If content is cacheable, the metadata may instruct the edge server process to apply given content handling directives.
- One such set of directives implement a content prefetch function, which will be explained in detail below.
- the metadata may indicate how the edge server should go forward, e.g., using path optimization to try to reach the origin using intermediate CDN paths.
- Other metadata may control how a given edge server establishes and maintains connections with one or more other edge servers or other machines, or how the edge server should deliver the content to the requesting end user browser once it has been obtained.
- a set of content handling directives are set forth in the XML configuration file for a given customer domain and used to control the edge server to provide these advanced functions.
- an XML-based configuration file controls an edge server to provide an enhanced content delivery function, namely, content prefetching, on a per-customer, per customer domain-basis.
- an edge server can set up (directly or with the assistance of the CDN service provider) an edge server handling configuration for all or part of the customer's Web site or other content to be delivered over the CDN.
- a participating content provider can be a formal Web publisher, or the content in question can be user-generated content (UGC).
- the edge server is shared among participating content providers, and one or more of such providers establish a prefetching configuration (by default, or as a custom configuration) that is enabled and enforced on the edge server.
- the edge server retrieves images and scripts embedded in pages (normally HTML content) at the same time it serves the page to the browser rather than waiting for the browser's request for these objects. This operation can significantly decrease the overall rendering time of the page and improve the user experience of a Web site.
- edge server functions that are also defined in such customer-specific, domain-specific configurations.
- These functions include, without limitation, path optimization (e.g., for non-cacheable or dynamic content), client-server (e.g., edge server-to-edge server) TCP connection optimizations, content compression, and the like.
- path optimization e.g., for non-cacheable or dynamic content
- client-server e.g., edge server-to-edge server
- TCP connection optimizations e.g., content compression, and the like.
- TCP connection optimization involves adjusting one or more TCP settings (e.g., congestion window size, retransmit timeout, packet reordering, and the like), which reduces edge server-to-edge server communication latency, as does content compression.
- TCP settings e.g., congestion window size, retransmit timeout, packet reordering, and the like
- prefetching can be applied to either cacheable or uncacheable content.
- prefetching When prefetching is used for cacheable content, and the object to be prefetched is already in cache, the object is moved from disk into memory so that it is ready to be served.
- prefetching When prefetching is used for uncacheable content, the retrieved objects are uniquely associated with the client browser request that triggered the prefetch so that these objects cannot be served to a different end user.
- prefetching can be combined with tiered distribution to further improve the speed of object delivery and to protect the origin server from bursts of prefetching requests.
- the edge server applies prefetching only to responses with a content-type header that begins with certain extensions (e.g. text/html, or some other given format), only responses with HTTP status codes of 200 or 404 are scanned for prefetchable objects; objects to be prefetched are referenced using the same protocol (HTTP or HTTPS) as the client used to request the original page; and object references use the same hostname as the original request.
- HTTP or HTTPS HyperText Transfer Protocol
- object references use the same hostname as the original request.
- the edge server requests content from the origin server (or a parent edge server) only when it receives a request for the content from an end client (browser). This means that images referenced by a page are not retrieved until the end user's browser has received and read the page and requested those images.
- the normal request flow is as follows: the browser requests the page from the edge server, the edge server retrieves the page from cache (or from the origin server if the page is not already in cache), the edge server returns the page to the browser, the browser scans the contents of the page and requests the objects referenced by the page, the edge server retrieves the images and other content from cache (or from the origin server if the objects are not already in cache), and the edge server returns the requested objects to the browser.
- the edge server actively scans the page for embedded images and scripts and retrieves these objects before they are requested by the end-user's browser.
- the new request flow is as follows.
- the end-user client requests a page from the edge server; the edge server retrieves the page from cache (or from the origin server if the page is not already in cache).
- the edge server then scans the page (usually HTML) for referenced images and scripts at the same time it serves the page to the browser. Note that it is not required that every response is scanned.
- the conditions that determine whether a response is scanned are described below.
- the edge server retrieves the referenced images and scripts from cache (or from the origin server if the objects are not already in cache), and the browser scans the page and requests the objects referenced by it.
- the edge server returns the requested objects to the client browser.
- the edge server scans appropriate responses from the origin. Not every response is required to be scanned.
- the edge server scans the response and begins prefetching embedded objects if one or more of the following conditions are true (and preferably all of them must be): a prefetching status is "on" for this request, an HTTP status code of the response sent to the client is 200 or 404, a response content-type starts with one of a set of configured strings (by default, the edge server scans the response only if the content type starts with texthtml), no preset limit (a configurable threshold) on an average number of prefetch requests per unit time has been reached, a prefetch-on-hit metadata tag is "on” if the page is already in cache, an indirect-only metadata tag is "off if the edge server is an edge server that connects directly to the origin, a push, status metadata tag is "off if the edge server is connecting to a cache hierarchy parent, and the edge server received a special request header from a child peer
- the edge server When the edge server identifies a response that should trigger prefetching, it parses the page (usually HTML) as it sends the response to the client. Each time the edge server encounters a tag in the page that is a candidate for prefetching (with two exceptions noted below), it creates a dummy request for the URL for this object and applies metadata to the request. Preferably, this is done only for the first instance of an object reference, so that multiple references to the same object within the page do not result in multiple requests for that object.
- HTML elements that generate candidates for prefetching are ⁇ IMG> and ⁇ SCRIPT>, although optionally a given configuration file can be used to configure prefetching of objects referenced by any of the elements listed in the table in Figure 4.
- any type of content object may be prefetched and not just images and scripts.'
- the edge server decides to prefetch an object if the following are true: prefetchabie-object metadata is "on" for this object, the object is not found in cache (if the object is found in cache, preferably it is moved to a hot object cache), and no prefetching limit has been reached.
- a regular expression parser in the edge server allows scanning of any text file for prefetch candidate URLs. This is particularly useful when the response is JavaScript. If the regular expressions processor is used, the configuration file must enclose the metadata within a match on a given content-type or file extension of the object to avoid having these settings apply to other parsing.
- the request When the edge server creates the requests to prefetch embedded objects, the request typically contains a number of components. These include, for example, a header x-cdnsp- Pref etch-object, which is used to prevent these requests from triggering further prefetching.
- the header preferably is also sent to the origin, and it may include a current level of recursion when recursive prefetching is enabled (as described below).
- the request may also include the request headers from the base page (including all cookies, regardless of whether the path specified for a cookie matches the path for the embedded object).
- the request also includes cookies created from any Set-Cookie headers in the response page, where the domain of the Set-Cookie matches the hostname for the request.
- the edge server ignores path and secure parameters of the Set-Cookie headers. If necessary, using the configuration file the edge server can be controlled to select which cookies are used in the prefetch request. By default, preferably all cookies and Set-Cookies that match the hostname are included, although metadata can be set to include or ignore cookies by name. By default, when prefetching is enabled, the edge server will prefetch objects that are non-cacheable along with those that are cacheable. The server may also prefetch objects that have a zero second TTL assigned to them.
- the edge server holds these objects in a separate buffer; they expire after a short time, can be served only once, and can be served only to the same user that requested the page that references them (e.g., based on the browser-ID). Further, preferably the edge server can prefetch two kinds of noncacheable objects: objects with "no-store” or "bypass-cache” metadata, and objects that are non-cacheable based on their response headers (for example, a Vary: header). Using metadata in the configuration file, however, the edge server can be configured to not prefetch no-store or bypass-cache content and not store uncacheable objects that were prefetched.
- the browser-ID buffer may include cacheable objects with very short TTLs. This may be desirable if a TTL is only a few seconds, and the number of objects included in an HTML page is large enough to prevent the browser from requesting those objects before then expire in cache. If a prefetched object's TTL expires before the browser requests it, the edge server must re-request the object when the browser request finally arrives, and the benefit of prefetching is lost for this object.
- the browser-ID is applied to objects with short TTL's, preferably they are handled just like uncacheable objects.
- the objects preferably are associated with a "browser-ID" that uniquely identifies the user.
- the edge server computes a "browser-ID" based on the IP address, cookies, and request headers of the user request. By default, all cookies are used in the computation along with the User- Agent and Authorization headers. Special rules may be applied for users behind proxies or with no user session cookies, as in such case there is no guarantee the browser-ID will identify a unique user.
- the browser-ID is a value that is generated by applying a given function to a concatenation, e.g., the IP address of the browser (as seen by the edge server), a hash calculated over one or more cookie values, and a hash of one or more header values.
- a browser requests content from a domain, it will open only a few connections to the server and make its requests across that limited number of connections.
- An edge server does not limit its forward connections the way a browser does.
- the edge server maintains persistent connections, and it reuses those established connections as they are available. If there are more requests to satisfy then there are connections available, the edge server preferably opens new connections. It relies on the origin server to limit the number of connections.
- an edge server performed prefetching for an HTML page that contained many references to content not in the edge server's cache, that edge server could attempt to open many connections with the origin server. If this occurred across the network of edge servers simultaneously, the origin server could be overloaded with requests from the edge servers. To avoid this problem, preferably some degree of rate limiting is applied to the prefetching functions. To this end, there is a set of metadata tags, fon limiting.. , the number of prefetch requests that can be in process to the origin (or other) server at one time.
- These tags control the time period over which to measure the average number of requests, an upper bound for prefetch requests in process (beyond this point prefetch requests are blocked), a lower bound for prefetch requests in process (after prefetching has been blocked, it can restart once the number of requests in process drops to this level), a maximum number of URLs to scan for prefetching within a single page, and a maximum number of prefetch requests to generate for a single page.
- prefetching function has assumed that the objects referenced within the base page do not themselves contain prefetchable objects. This is not always the case.
- an HTML page could include another HTML section through use of an ⁇ iframe> tag, and the included HTML might make reference to an image.
- the base page cannot be fully rendered by the browser until the second HTML section and its embedded image have both been fetched.
- it might be advantageous to prefetch recursively That is, to prefetch objects referenced by prefetched HTML.
- recursive prefetching can be enabled, including controlling how many level of prefetching are performed.
- the recursion feature can also be set to be used only on URLs found in HTML tags that define links (i.e., A, AREA, LINK or FORM tags), as it is not desirable to prefetch links recursively.
- a request header x-cdnsp-Pref etch is used between cache hierarchy edge server peers.
- the parent edge server process When enabled, however, the parent edge server process issues an cdnsp_PREFETCH_pusH request to a requesting edge server process as soon as the response comes in to the parent edge server.
- the protocol HTTP or HTTPS
- the request includes request headers that are sent as early as possible (preferably even before the parent edge server goes forward for the embedded object).
- the response headers and body of the prefetched object are sent as soon as the parent receives them from the origin.
- the parent edge server decides to abort the request, it sends an HTTP 500 response to the edge server, which will preserve the persistent connection.
- these requests always use persistent connections (i.e., they either have a body content-length or use chunking, and they do not get closed by the edge server process).
- the edge server processes involved in such prefetching authenticate one another to prevent malicious users from forcing arbitrary objects into the edge server cache.
- the cdnsp_PREFETCH_PUSH request also may contain a line with x-csnsp- Prefetch-Browser-iD.
- the PUT body of this request contains first the response headers from the original object, then the response body. After the edge server process parses the request headers, it will "plug" the parsing of the rest of the response on the already existing code on the forward side.
- a given CDN customer can establish a custom prefetching configuration for given site or other content, preferably using an XML- based configuration file that is delivered to and used by a given edge server to implement a prefetching directive.
- an XML- based configuration file that is delivered to and used by a given edge server to implement a prefetching directive.
- the same configuration file is delivered to all of the edge servers, and this configuration file may be changed dynamically using the metadata configuration system.
- a configuration file includes directives that identify the appropriate content for prefetching and how the feature should be enabled for that content.
- the following section provides additional detail regarding the metadata related to configuring the prefetch feature.
- all metadata used for prefetching starts with a given tag such as edgeservices :pref etch. While there are many metadata tags for use in tuning the configuration, most of these tags are not necessary in the customer's configuration file, as default settings (as described below) may be adequate.
- a minimal set of metadata to enable prefetching for a given content provider domain is shown in Figure 5. The meaning of this metadata (among other prefetching directives) is described below.
- prefetch metadata
- prefetching is enabled through a single metadata tag:
- the corresponding response will be scanned for prefetchable objects if the response has an HTTP 200 or a 404 status code and a content-Type header that begins with "text/html" or other content types defined in edgeservices .-prefetch . content-types .
- edgeservices .-prefetch .
- content-types By default, only responses of type "text/html" are scanned for prefetchable objects.
- Other content types can be added using the following tag. Note that a wildcard should be used if the content-type header contains more than the simple type definition. The tag takes a space separated list of content-type strings: ⁇ edgeservices:prefetch. content-types>text/html* ⁇ /edgeservices: prefetch. content- types>
- the edge server scans responses using an SGML parser to identify HTML tags that contain candidate URLs for prefetching.
- an SGML parser it may be desired to scan the page as plaintext using a regular expression parser.
- the parser-type is set to regex using the following tag:
- a regular expression processor is turned on and is provided a list of rules.
- a set of regular expression rules are defined for the parser to use. Whenever parser-type is set to regex the entire page is scanned using these rules. If the parser-type is set to html the rules are used only within the ⁇ script> tags, and the rest of the page body is scanned using the SGML processor. Note also that regardless of the parser type setting, the content-type of the response must be listed in the prefetch . content-types tag for the edge server to parse the response.
- these tags are inside a match on the response content-type, or inside a match on uri extension of the object, wherein Prefetch . regex . rule is a listable tag set that contains a perl-compatible regular expression along with a string and flags:
- the following metadata tag indicates whether or not an object is prefetchable. This tag should be set explicitly, as the default value preferably is of f :
- edgeservices prefetch.prefetchable-object>on ⁇ /edgeservices:prefetch, prefetchable-object> If this flag is on, no prefetching occurs if the edge server goes directly to the origin:
- the edge server does not prefetch if the request for the HTML is a hit (an in-memory hit, an ICP hit, or an IMS hit). If the flag is on, the edge server prefetches from cached HTML (i.e., the server goes forward for non-cached or expired objects, and moves cached objects from disk into hot object cache). If the cache hit happens on the edge server, then the prefetching is done from the edge server even if push, status is true (because the parent would not get any request):
- a pref etch-on-304 flag controls whether the edge server will prefetch if the response to the client is an HTTP 304. The default is "off," so the server will scan the page and prefetch embedded objects only when the response to the client is an HTTP 200 for the base page. If this flag is on, the edge server also prefetches when a 304 is returned to the client. When this flag is on, prefetch-on-hit should be on as well: ⁇ edgeservices:prefetch.prefetch-on-304>off ⁇ /edgeservices:prefetch.prefetch-on-304>
- the edge server will prefetch after processing dynamic content assembly (DCA) requests. This means that only objects that are referenced in the HTML response served to the client browser are actually prefetched. If it is desired to prefetch all objects referenced separately by DCA fragments and containers, the following tag preferably is turned off:
- the edge server When the browser-id status flag is on (the default setting), the edge server temporarily keeps prefetched non-cacheable objects (and objects with a zero-second TTL) in memory. These objects are associated with the user's browser-id, and they can only be served once.
- the non-cacheable objects preferably preferably are never stored on disk:
- the browser-ID can be calculated using a default calculation, or a set of metadata tags can be used to define what components (e.g., IP address mask size, cookies and headers) are used for the calculation.
- a fetch separator controls the actual retrieval of content through prefetching using metadata tags that are now described.
- a first tag is used to define a list of cookies that get sent to the origin when the edge server prefetches embedded objects.
- the list is *, so all cookies are sent.
- the edge server also takes into account the "Set-Cookie" headers from the HTML response: ⁇ edgeservices : prefetch. fetch. pass-cookies>* ⁇ /edgeservices:prefetch. fetch.pass- cookies>
- a second tag is used to define a list of cookies that do not get sent to the origin when the edge server prefetches embedded objects. By default, this list is empty:
- the following tag is used to define a maximum number of distinct URLs the edge server will consider prefetching inside the HTML page. For each URL, the edge server creates a request object, applies metadata, checks that the prefetchabie-object flag is set, checks that the object is not already in memory, and then fetches it if necessary: ⁇ edgeservices:prefetch. fetch. max-urls-per-page>20 ⁇ /edgeservices:prefetch. fetch.max- urls-per-page>
- the following tag is used to define a maximum number of embedded objects for which the edge server is going forward:
- edgeservices prefetch. fetch.max-prefetches-per- page>15 ⁇ /edgeservices:prefetch. fetch.max-prefetches-per-page>
- the following tag is a space-separated list of HTML elements.
- the possible values are A, IMG, SCRIPT, FRAME, IFRAME, LINK, OBJECT, AREA, APPLET, EMBED, INPUT, OFORM, BODY, TABLE, TD, TH, BASE and INCLUDE.
- This is just a representative list, and the list may be expanded to include any tags that can reference external objects.
- the edge server only considers prefetching objects inside these elements:
- edge server When the HTML returned to the user is compressed, the edge server must unzip it internally to parse it. If the following flag is on, the server will not try to uncompress the HTML and therefore will not be able to prefetch the embedded objects.
- prefetching will be allowed from domains other than the domain of the original request for the HTML, provided (for example) that those domains use the same CDN map. For example, if the client requests a page from html . example . com and that page contains references to images at images . example . com, and both these domains are aliased (through a DNS CNAME) to axxx . g . cdnsp . net then the prefetch requests can go forward.
- an identifier e.g., a serial number
- the edge server can allow prefetch requests regardless of the serial number in the map name by turning off the tag:
- This metadata is used to define a maximum number of objects the edge server will try to prefetch without going back to a main processing loop (no prefetching):
- the edge server will stop loading embedded objects from disk after it reads more than a threshold set by the following metadata: ⁇ edgeservices:prefetch. fetch. disk-abort- threshold>10KB ⁇ /edgeservices : prefetch. fetch. disk-abort-threshold>
- the edge server will stop loading embedded objects from the network after it reads more than a threshold set by the following metadata (this will close the forward connection if no user request has been received yet): ⁇ edgeservices : prefetch . fetch . network-abort- threshold>lMB ⁇ /edgeservices:pref etch. f etch. network-abort-threshold>
- This tag specifies a maximum allowed depth for recursive prefetching.
- a default value is "1,” which means that recursive prefetching is disabled.
- a minimum value is “1,", and the absolute maximum value is “5.”
- the value "0" is not valid; to disable prefetching, the ⁇ prefetch.status> tag is set to "off' instead.
- recursive prefetching can be applied to tags that define links using the following tag: edgeservices :prefetch . recursion. allow-link-recursion>off ⁇ / . .. >
- a default value is "off.”
- tags A, AREA, LINK, FORM are prefetched at a first level (the client-requested HTML) if declared in the list of tags to prefetch but are not prefetched at the recursive levels below.
- the edge server When the edge server makes a forward request to prefetch an object, it adds the request header X-Cdnsp- Prefetched-Object.
- the value preferably is a current level of recursion (starts at 1, and goes until ⁇ recursion.depth>).
- the following metadata is used to apply rate limiting to prefetch requests. It can restrict prefetching based on either or both the raw number of requests (the “count”) and the amount of bandwidth the requests have used (the “bandwidth”). Further, the metadata can be set for specific types of prefetch requests (for example, requests to the origin) or for "all" prefetch requests:
- ⁇ edgeservices :prefetch. fetch. rate-control> ⁇ status>on ⁇ /status> ⁇ type> ⁇ /type> ⁇ high-co ⁇ mt>0 ⁇ /high-count> ⁇ low-cou ⁇ t>0 ⁇ /low-count> ⁇ high-bandwidth>OB ⁇ /high-bandwidth> ⁇ low-bandwidth>OB ⁇ /low-bandwidth> ⁇ tirae-scale>30s ⁇ /time-scale> ⁇ /edgeservices :prefetch. fetch. rate-control>
- the high-bandwidth tag specifies the point at which rate limiting will be applied based on bandwidth usage of the prefetch requests.
- the high-count tag specifies the point at which rate limiting will be applied based on the number of prefetch requests of the given type.
- the low-bandwidth tag is the point at which prefetching can resume. When the bandwidth consumption drops to this level, new prefetch requests can be generated.
- the low-count is the point at which prefetching can resume. When the number of outstanding prefetch requests drops to this level, new prefetch requests can be generated.
- the status tag controls whether the prefetch rate limiting feature is used.
- the time-scale tag defines the time scale over which the moving average of prefetch requests (count) or bytes used (bandwidth) are measured. So 3 for example, with the default setting, if the high-count number of prefetch requests were generated in 30 seconds, no more requests could be generated until the iow- count number of prefetch requests was reached within a 30 second measurement window.
- the type tag specifies the type of request for which rate controls should be imposed. Valid values may include all (applies to all prefetch related requests, and no other separate rate control settings apply), disk (requests resulting in a disk hit), cache-h (requests resulting in cache-h or path optimization fetch or push), and origin (requests resulting in a forward request to the origin).
- tags can be used:
- these limits are based on the number of prefetch requests per forward hostname, and they are available to be overridden at the level of customer configuration. Also, preferably only prefetch requests that must go forward to the origin are counted against the limit. The edge server will stop prefetching if the average number of prefetching requests per second reaches this watermark. A value of 0 means that there is no watermark:
- edgeservices prefetch. fetch. limits . requests-high- watermark>10 ⁇ /edgeservices .-prefetch, fetch, limits. requests-high-watermark>
- edgeservices prefetch . fetch . limits . requests-low-watermark>5 ⁇ /edgeservices : prefetch. fetch. limits . requests-low-watermark>
- the edge server will stop prefetching if the average number of bytes per second reaches this threshold.
- a value of 0 means that there is no watermark:
- edge server prefetch. fetch. limits.bandwidth-high- watermark>100KB ⁇ /edgeservices :prefetch. fetch. limits .bandwidth-high-watermark>
- fetch limits preferably the edge server will stop prefetching until an average number of bytes per second reaches this threshold. This setting should never be zero when limits are applied, otherwise the server could stop prefetching for a considerable period of time:
- the following metadata is the time scale used in the computation of the exponentially- weighted moving average for the number of requests per second and for the bandwidth. The larger it is, the slower the moving average will vary:
- This baseline tag is used to temporarily stop prefetching if a given edge server CPU utilization percent is above this threshold: ⁇ edgeservices :prefetch.percent-threshold>90 ⁇ /edgeservices :prefetch.percent- threshold>
- prefetching can be combined with other edge server features, such as path optimization, TCP connection optimization, content compression optimizations, and the like.
- path optimization e.g., TCP connection optimization, content compression optimizations, and the like.
- the customer-specific, domain- specific configuration file may include a path optimization directive such as the following:
- the status attribute turns the function on for the domain, and the selection-method sets the method by which the edge server identifies its parents.
- the edge server uses the map name and the ma ⁇ -pa rents to form a hostname that resolves through DNS to the appropriate IP addresses of the alternative edge servers that are used for the go forward request.
- the policy sets the order in which the edge server will contact the cache parents.
- the edge server When set to performance the edge server will use test object races to order the parents and the origin based on the speed of responses to a set of race requests.
- the map sets the base hostname the edge server will use to construct the final hostname it looks up in DNS.
- the max-pa rents attribute sets how many indirect routes the edge server will use. These are representative settings.
- the following metadata can be set in the configuration file, once again on a customer-specific, domain-specific, basis.
- the controls for changing the TCP settings are in a separator: network : tcp . transport .
- network tcp . transport
- the nodes may include: network: tcp. transport, size network: tcp. transport. timeout
- these nodes are the same. They each contain a status (to turn the node on or off, a parameter (the name of the parameter to be set), a direction (to define which connection this setting will control), and a value (the value to set for the parameter).
- the parameter may be one of: cwnd_init (initial congestion window), cwnd_ssinc (slow start increase), cwnd_cainc (congestion avoidance rate), cwr_dec (congestion reduction rate), and many others.
- the direction defines which connection this setting will control. The possible values are: edge-to-user, edge-to-origin, edge-to-parent, and edge-to-child.
- the following metadata illustrates how to adjust the TCP settings used for edge server- to-edge server communication.
- the first setting is for the edge-to-child direction, and it adjusts the initial congestion window.
- the initial congestion window is also adjusted for the edge-to-parent direction so that the child advertises an appropriately large window and can use that larger window for POST transactions: ⁇ network:tcp. transport. size> ⁇ status>on ⁇ /status> ⁇ value>6 ⁇ /value>
- the following metadata can be included in the configuration file to facilitate content compression (e.g., from the edge server to the browser):
- the various edge server routines that manage the metadata tag handling are implemented in software running on commodity hardware.
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Description
Claims
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US75517605P | 2005-12-30 | 2005-12-30 | |
US11/647,750 US8447837B2 (en) | 2005-12-30 | 2006-12-29 | Site acceleration with content prefetching enabled through customer-specific configurations |
PCT/US2006/049502 WO2007079192A2 (en) | 2005-12-30 | 2006-12-30 | Site acceleration with content prefetching enabled through customer-specific configurations |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1971930A2 true EP1971930A2 (en) | 2008-09-24 |
EP1971930A4 EP1971930A4 (en) | 2012-03-28 |
Family
ID=38228858
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP20060848289 Ceased EP1971930A4 (en) | 2005-12-30 | 2006-12-30 | Site acceleration with content prefetching enabled through customer-specific configurations |
Country Status (2)
Country | Link |
---|---|
EP (1) | EP1971930A4 (en) |
WO (1) | WO2007079192A2 (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9396130B2 (en) | 2012-08-18 | 2016-07-19 | Qualcomm Technologies, Inc. | System translation look-aside buffer integrated in an interconnect |
CN109995671B (en) * | 2018-01-02 | 2022-12-02 | 中国移动通信有限公司研究院 | Resource allocation method of virtualized content distribution network and network service equipment |
US11843682B1 (en) * | 2022-08-31 | 2023-12-12 | Adobe Inc. | Prepopulating an edge server cache |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6393526B1 (en) * | 1997-10-28 | 2002-05-21 | Cache Plan, Inc. | Shared cache parsing and pre-fetch |
EP1398715A2 (en) * | 2002-09-11 | 2004-03-17 | Hughes Electronics Corporation | System and method for pre-fetching content in a proxy architecture |
US20040258053A1 (en) * | 2003-06-16 | 2004-12-23 | Mentat Inc. | Pre-fetch communication systems and methods |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7054935B2 (en) * | 1998-02-10 | 2006-05-30 | Savvis Communications Corporation | Internet content delivery network |
WO2002071242A1 (en) * | 2001-03-01 | 2002-09-12 | Akamai Technologies, Inc. | Optimal route selection in a content delivery network |
-
2006
- 2006-12-30 WO PCT/US2006/049502 patent/WO2007079192A2/en active Application Filing
- 2006-12-30 EP EP20060848289 patent/EP1971930A4/en not_active Ceased
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6393526B1 (en) * | 1997-10-28 | 2002-05-21 | Cache Plan, Inc. | Shared cache parsing and pre-fetch |
EP1398715A2 (en) * | 2002-09-11 | 2004-03-17 | Hughes Electronics Corporation | System and method for pre-fetching content in a proxy architecture |
US20040258053A1 (en) * | 2003-06-16 | 2004-12-23 | Mentat Inc. | Pre-fetch communication systems and methods |
Non-Patent Citations (1)
Title |
---|
See also references of WO2007079192A2 * |
Also Published As
Publication number | Publication date |
---|---|
EP1971930A4 (en) | 2012-03-28 |
WO2007079192A3 (en) | 2008-04-10 |
WO2007079192A2 (en) | 2007-07-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8447837B2 (en) | Site acceleration with content prefetching enabled through customer-specific configurations | |
US7395355B2 (en) | Method for caching and delivery of compressed content in a content delivery network | |
US6959318B1 (en) | Method of proxy-assisted predictive pre-fetching with transcoding | |
US10986159B2 (en) | Client side cache visibility with TLS session tickets | |
KR100571059B1 (en) | Distributed Systems and Methods for Prefetching | |
KR100791430B1 (en) | Method and system for network caching | |
US8484319B2 (en) | Content delivery network (CDN) content server request handling mechanism with metadata framework support | |
US7987239B2 (en) | Method and system for caching role-specific fragments | |
JP5874146B2 (en) | Extension of content distribution network (CDN) to mobile or wired networks | |
US20040128346A1 (en) | Bandwidth savings and qos improvement for www sites by catching static and dynamic content on a distributed network of caches | |
WO2009144688A2 (en) | System, method and device for locally caching data | |
JP2010108508A (en) | Satellite anticipatory bandwidth acceleration | |
EP1311957A2 (en) | BANDWIDTH SAVINGS AND QoS IMPROVEMENT FOR WWW SITES BY CATCHING STATIC AND DYNAMIC CONTENT ON A DISTRIBUTED NETWORK OF CACHES | |
US10154068B2 (en) | Self-adjusting tiered caching system to optimize traffic performance and origin offload | |
EP1971930A2 (en) | Site acceleration with content prefetching enabled through customer-specific configurations | |
Appliance | Internet content adaptation protocol (icap) | |
Rangarajan et al. | User specific request redirection in a content delivery network | |
WO2003083612A2 (en) | System and method for optimizing internet applications | |
Candan et al. | Integration of Database and Internet Technologies for Scalable End-to-end E-commerce Systems |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20080730 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR |
|
AX | Request for extension of the european patent |
Extension state: AL BA HR MK RS |
|
A4 | Supplementary search report drawn up and despatched |
Effective date: 20120223 |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G06F 17/30 20060101AFI20120217BHEP |
|
DAX | Request for extension of the european patent (deleted) | ||
17Q | First examination report despatched |
Effective date: 20130322 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R003 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED |
|
18R | Application refused |
Effective date: 20151008 |