US20080147981A1 - Recommendations for intelligent data caching - Google Patents
Recommendations for intelligent data caching Download PDFInfo
- Publication number
- US20080147981A1 US20080147981A1 US12/030,316 US3031608A US2008147981A1 US 20080147981 A1 US20080147981 A1 US 20080147981A1 US 3031608 A US3031608 A US 3031608A US 2008147981 A1 US2008147981 A1 US 2008147981A1
- Authority
- US
- United States
- Prior art keywords
- data
- caching
- recommendations
- processing system
- executable instructions
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/957—Browsing optimisation, e.g. caching or content distillation
- G06F16/9574—Browsing optimisation, e.g. caching or content distillation of access to content, e.g. by caching
Definitions
- the present invention relates generally to data caching of web content on a computer system on a network and, more specifically, for making automated recommendations toward intelligent data caching of dynamic web page content.
- the Internet and the World Wide Web provide intra-enterprise connectivity, inter-enterprise connectivity and application hosting on a larger scale than ever before.
- WWW World Wide Web
- Each web site normally further provides a plurality of web pages to be served to the local computer systems upon request.
- Each local computer system may access the remote web sites with web browser software.
- the WWW is a collection of servers on an IP (Internet Protocol) network, such as the Internet, an Intranet or an Extranet, that utilize the Hypertext Transfer Protocol (HTTP).
- IP Internet Protocol
- Internet will be used to refer to any IP network.
- HTTP is a known application protocol that provides users with access to files, which can be in different formats, such as text, graphics, images, sound, and video, using a standard page description language known as Hypertext Markup Language (HTML).
- HTML Hypertext Markup Language
- HTML Hypertext Markup Language
- HTML Hypertext Markup Language
- HTML Hypertext Markup Language
- HTML Hypertext Markup Language
- HTML Hyperlinks
- Hyperlinks commonly are displayed as highlighted text or other graphical image on the web page. Selection of a hyperlink with a pointing device, such as a computer mouse, causes the local computer to download the HTML associated with the web page from a remote server. The browser then renders the HTML into the displayed web page.
- Web pages accessed over the Internet are commonly downloaded into the volatile cache of a local computer system.
- the volatile cache is a high-speed buffer that temporarily stores web pages from accessed remote web sites. The volatile cache thus enables a user to quickly review web pages that were already downloaded, thereby eliminating the need to repeat the relatively slow process of traversing the Internet to access previously viewed web pages. This is called local caching.
- the first web servers were merely HTTP servers that resolved universal resource locators (URLs) by extracting literally from the URL the path to a file that contained the needed page, and transmitting the page back to the browser.
- URLs universal resource locators
- a “static” page is a page which, each time it is requested and served to a requester, has the same byte content. That is, it does not depend upon which requester is requesting the page, when the requester is requesting the page, etc.; the byte content of that page remains the same.
- a “dynamic page” is a page which has byte content that may very well change depending upon the particular requestor, when the page is being requested, etc. This will be discussed further below. It is important that web pages be served as quickly as possible, both to reduce the response time to a single user, and to increase the number of users that can be served concurrently. To improve the response time, the Web server uses caches. Web server caches are used to store web page responses in a readily accessible memory location so that when the web page is requested by a user, a previously cached web page response can be retrieved from cache and served quickly to the user.
- Caching web page responses by the web server works quite well for web page responses having static content, i.e., content that doesn't change frequently.
- An example of a static web page is one, at a company's web site, comprising a compilation of text and graphics objects describing that company's history.
- classic web servers cache static pages quite effectively.
- classic web servers serve web page responses, some of which are static, namely, responses comprising HTML from the file system.
- Each of the static responses has a last modified date associated with it that is maintained by the file system. The contents of the response and its associated last modified date are simply stored in the cache for possible future use by the web server.
- the server requests the latest modification date for that page from the file system and compares the latest modification date with the last modified date associated with the candidate cached response. If the latest modification date is the same as the last modified date associated with the candidate cached response, the candidate cached response is considered to be “fresh” and is served to the request (i.e., to the requesting user).
- the candidate cached response is considered “stale” and a “fresh” response is retrieved and built by the web server for serving to the requesting user.
- the fresh response, along with its associated last modified date, is cached to replace the stale response.
- newer web servers provide not only static web pages but also dynamic web pages, i.e., a page having byte content that may very well change depending upon the particular requester, when the page is being requested, etc.
- dynamic web pages are pages containing content from a number of different sources or pages having computed content.
- a page may contain macros that compute content for the page, i.e., the page has “computable content”. These macros may change the page content each time the page is accessed. This makes it difficult to cache that page using the classic caching method described above.
- Macros or formulas are expressions that perform a function, such as determining field values, defining which documents appear in a view, or calculating values for a column.
- the page may contain information from a number of different sources, and that information may or may not have associated last modified dates making it difficult, if not impossible, to cache using the classic caching method.
- the page may comprise a composite of a number of “parts” including: other documents, designs from databases, content from databases, the present user's identity, the current time, the current environment, etc. Some of these parts are actual entities in the system, e.g., documents, databases, etc. Some parts though are “virtual” and are used to model the effects of the execution of macros or scripts, for example, the user's identity may be accessed via one of a number of macros for performing specialized.
- the server After building a web page response, the server must determine whether the response that it is preparing to serve the requesting user is cacheable (i.e., determining its cacheability). Second, the server, upon receiving a request for a web page whose previous response has been cached, must determine whether the cached response is valid (i.e., determining its validity) and applicable (i.e., determining its applicability). For instance, web page responses containing macros that are time-dependent may not be cacheable at all.
- a page includes a macro for providing the current time, then every access of the page is unique and the page cannot be cached in memory at all.
- a cached page is valid for serving to some users but not others. For instance, if the page includes a macro for the user's name, then the page can be cached for serving to that particular user, but not for serving to others.
- HTML representing a document is specific to a user if macros are dependent on user name or user roles. Using this user data, some data may be made visible based on which user is requesting it.
- DHTML Dynamic HTML
- DHTML Dynamic HTML
- the dynamic HTML may comprise scripts that run on the browser to change the appearance of the web page such as by displaying a button that, if pushed, displays additional text or graphics.
- dynamic in the DHTML sense refers to the browser, not the server.
- a DHTML page may still be “static” in that the byte content may be the same each time the page is requested, so for the purposes of this invention, a DHTML page may be “static” or “dynamic” in the sense of the invention.
- the content is not dependent on any thing, e.g., the properties of the request, such as the identity of the particular user, the time of day that the request is made, etc.
- “Dynamic” content refers to content that has such dependencies. Thus, “dynamic” in the DHTML sense is not related to “dynamic” in the sense of the embodiments of the present invention.
- the problems may be further expressed as that of not knowing which dynamic content such as that found in JSP (Java server page) and ASP (active server page) technology need to be cached and then how to cache effectively. Further when implementing a manual cache configuration incorrect assumptions may cause system performance degradation. This problem arises when developers construct dynamic pages but do not cache them appropriately; which may then lead to poor system performance. Additional decisions based on the various dynamic properties of the page are required regarding whether to not cache a particular dynamic page at all or to cache it as a page fragment or a composite page (including sub-fragments). The drawback of these solutions is the need for significant knowledge of the dynamic content design and the system's dynamic content caching infrastructure.
- a computer system, method and apparatus for making intelligent recommendations for dynamic content caching there is provided.
- a Scanner component that takes a list of dynamic pages as input and based on a dynamic page model (JSP, ASP, etc.) and XML definition file outputs the dynamic page content as entities-relationships data described in an XML file.
- This data is then fed into an Analyser component to be analyzed using the certainty factor model and rule based expert system implementation with user-defined object weighting schemes based on an Object Dependency Graph (ODG) model.
- JSP dynamic page model
- ASP ASP
- ODG Object Dependency Graph
- the Analyzer then generates caching recommendations in an analysis report with Certainty Factors for each hypothesis, such as cached as a page fragment, cached as a composite page or not cached at all.
- the Analyser's output is in the form of an enhanced ODG with additional attributes described by an XML file.
- This enhanced ODG is then used by a Generator component to generate a cache policy for the computer system based on the computer system's cache policy model.
- the enhanced ODG may optionally be passed to a Visualizer component providing a graphical view of the cached objects and their dependencies.
- There is also a cache advisor report which may be viewed through the visualizer component.
- Embodiments of the present invention may be used to minimize requirements for significant knowledge of page design and cache infrastructure of a system. It may also be used in place of using intuition to determine what needs to be cached, as the certainty factor and rule based expert system provides intelligent recommendations based on supporting evidence to optimize caching opportunities. Further by design all components are logically separated affording greater flexibility to plug in each component, such as the use of a different Scanner, Analyzer, Generator, or Visualizer component. Use of a tool such as that which may be an implementation of an embodiment of the present invention may also provide for automatic generation of the cache policy. Further instead of reading text or code, all object dependencies may be visualized in a user-friendly graphical user interface.
- a computer implemented method for generating intelligent caching recommendations related to dynamic web content for use on a caching system comprising: extracting data associated with the dynamic content of interest in accordance with a predetermined data model; analyzing the extracted data in accordance with plurality of certainty factors and a rules based expert system; and generating a set of caching recommendations from the analyzed data suitable for use by the caching system.
- a computer system for generating intelligent caching recommendations related to dynamic web content for use on a caching system comprising: a means for extracting data associated with the dynamic content of interest in accordance with a predetermined data model; a means for analyzing the extracted data in accordance with plurality of certainty factors and a rules based expert system; and a means for generating a set of caching recommendations from the analyzed data suitable for use by the caching system.
- an article of manufacture for directing a data processing system generate intelligent caching recommendations related to dynamic web content for use on a caching system
- the article of manufacture comprising: a computer usable medium embodying one or more instructions executable by the data processing system, the one or more instructions comprising: data processing system executable instructions for extracting data associated with the dynamic content of interest in accordance with a predetermined data model; data processing system executable instructions for analyzing the extracted data in accordance with plurality of certainty factors and a rules based expert system; and data processing system executable instructions for generating a set of caching recommendations from the analyzed data suitable for use by the caching system.
- FIG. 1 is a block diagram of a typical computer system in which an embodiment of the present invention may be implemented
- FIG. 2 is block diagram showing the components of an embodiment of the present invention of FIG. 1 ;
- FIG. 3 is component flow diagram depicting data flow relationships among components of the embodiment of FIG. 2 ;
- FIG. 4 is a block diagram depicting interaction among components of a system of FIG. 1 and the embodiment of FIG. 2 .
- FIG. 1 depicts, in a simplified block diagram, a computer system 100 suitable for implementing embodiments of the present invention.
- Computer system 100 has a central processing unit (CPU) 110 , which is a programmable processor for executing programmed instructions, such as instructions contained in memory 108 .
- Memory 108 can also include hard disk, tape or other storage media. While a single CPU is depicted in FIG. 1 , it is understood that other forms of computer systems can be used to implement the invention, including multiple CPUs.
- the present invention can be implemented in a distributed computing environment having a plurality of computers communicating via a suitable network 119 , such as the Internet.
- CPU 110 is connected to memory 108 either through a dedicated system bus 105 and/or a general system bus 106 .
- Memory 108 can be a random access semiconductor.
- Memory 108 is depicted conceptually as a single monolithic entity but it is well known that memory 108 can be arranged in a hierarchy of caches and other memory devices.
- FIG. 1 illustrates that operating system 120 , may reside in memory 108 .
- components of an embodiment of the present invention such as that of cache advisor tool 200 of FIG. 2 .
- Operating system 120 provides functions such as device interfaces, memory management, multiple task management, and the like as known in the art.
- CPU 110 can be suitably programmed to read, load, and execute instructions of operating system 120 and those of cache advisor tool 200 .
- Computer system 100 has the necessary subsystems and functional components to implement testing of files as will be discussed later.
- Other programs include server software applications in which network adapter 118 interacts with the server software application to enable computer system 100 to function as a network server via network 119 as well as to provide data from remote instances supporting embodiments of cache advisor tool 200 and dynamic content 225 .
- General system bus 106 supports transfer of data, commands, and other information between various subsystems of computer system 100 . While shown in simplified form as a single bus, bus 106 can be structured as multiple buses arranged in hierarchical form.
- Display adapter 114 supports video display device 115 , which is a cathode-ray tube display or a display based upon other suitable display technology that may be used to depict data.
- Output device 295 of FIG. 2 may be one of a family of such devices as device 115 .
- the Input/output adapter 112 supports devices suited for input and output, such as keyboard or mouse device 113 , and a disk drive unit (not shown).
- Storage adapter 142 supports one or more data storage devices 144 , which could include a magnetic hard disk drive or CD-ROM drive although other types of data storage devices can be used, including removable media for storing dynamic content 225 data as well as intermediate data such as files used to aid in processing of such data and for storing output in the form of reports and caching recommendations as in cachespec.xml 325 .
- data storage devices 144 could include a magnetic hard disk drive or CD-ROM drive although other types of data storage devices can be used, including removable media for storing dynamic content 225 data as well as intermediate data such as files used to aid in processing of such data and for storing output in the form of reports and caching recommendations as in cachespec.xml 325 .
- Adapter 117 is used for operationally connecting many types of peripheral computing devices to computer system 100 via bus 106 , such as printers, bus adapters, and other computers using one or more protocols including Token Ring, LAN connections, as known in the art.
- Network adapter 118 provides a physical interface to a suitable network 119 , such as the Internet.
- Network adapter 118 includes a modem that can be connected to a telephone line for accessing network 119 .
- Computer system 100 can be connected to another network server via a local area network using an appropriate network protocol and the network server can in turn be connected to the Internet.
- FIG. 1 is intended as an exemplary representation of computer system 100 by which embodiments of the present invention can be implemented. It is understood that in other computer systems, many variations in system configuration are possible in addition to those mentioned here.
- FIG. 2 is a block diagram depicting the components and their relationships of an embodiment of the present invention which may be supported within a system configuration of FIG. 1 .
- Scanner 201 extracts information from the dynamic content pages based on a selected dynamic page model, such as JSP or ASP. Extraction uses information regarding the dynamic page model from dynamic content model 205 in conjunction with data contained in XML file 206 . The relationships found are then documented as entities-relationships 210 supported by another XML file 215 and related DTD 220 .
- XML mapping 206 which represents dynamic content model 205 in an XML format is used by scanner 201 to parse different entity-relationships data 210 .
- Output of the translated data is in a XML format to be interpreted by analyser 230 .
- Entities-relationships 210 is then translated through analyser 230 into an enhanced ODG (object dependency graph) model with cacheability data 250 .
- Analyser 230 analyzes extracted dynamic page entities-relationships data 210 and extends ODG model 245 by attaching attributes to ODG nodes, obtained through analysis of the ODG 245 and the entire web application (with data from XML file 215 and related DTD 220 ).
- Analyser 230 also applies certainty factor model (probabilistic model) and expert system (heuristics) used for caching dynamic web content in conjunction with weighting scheme and XML based rules 240 for the expert system.
- the rules may be derived from various different cacheability indicators or evidence. Typical cacheability indicators are as follows:
- JSP JavaServer Pages
- An example may be provided as: CF[CwoF
- HAR & HRR] 0.9.
- Prior probability is the probability of any page having HV without any evidence.
- Cacheability also takes into account a weighting scheme based on user input, allowing an expert developer to fine tune the algorithm.
- Prior probabilities for each evidence and values such as MAXnp in the examples above can be defined in a configuration file hence allowing the expert user knowledge to play a role in the cacheability calculation of different systems.
- Analyser 230 produces output of the analyzed data in a XML format to be interpreted by Generator 260 for producing a cache policy 270 XML file. Output from analyser 230 may also be sent as enhanced ODG 250 with or without CA report 255 (cache advisor) to Visualizer 280 for a visual representation. Visualizer 280 creates viewable objects 285 for use with application view 290 to be seen on output device 295 .
- cache policy Generator 260 generates a cache policy based on the analysis report of enhanced ODG 250 and information combined with cache policy model 265 .
- the IBM product WebSphere Application Server Dynamic Cache uses a cache policy in the form of an XML file (cachespec.xml) and its cache policy model is the data type definition file (cachespec.dtd) for the cachespec.xml.
- cache policy XML 270 may be generated based on a given confidence level.
- Visualizer 280 provides a user with a graphical view of object dependencies determined through analysis using a colouring scheme to highlight relationships. For example when displaying objects and dependencies, objects that appear in blue may be JSPs, those in red may be Java beans, pink may be for tag libraries, and HTML objects may be in grey, while unknown object may appear in black. Similarly dependencies could be shown as green to indicate they are dynamic, while cyan could indicate static includes, pink for tag library links and grey for HTML anchors. Further any object having an arrow pointing to it by another object is a parent of that object while an object having a pointer to another object is the child of that object. In the usual manner clicking on an object or dependency will cause the properties of that object or dependency to be displayed. In this way the visualizer 280 provides the ability to view object information at the nodes and at the edges (if dependencies and attributes exist) of the enhanced ODG 250 and maps the analysis report to highlight the important objects with priorities.
- Cache advisor report 255 is presented as a textual display having rows of information related to the relationships and resulting recommendations.
- FIG. 3 is an overview of the data flow of an embodiment of the present invention as may be used in a tool.
- this embodiment of the present invention in the form of tool, there may be seen three configuration files: a mapfile.xml, a config.properties and a rules.xml. All three files are configurable by the user of the tool. It would be expected that one skilled in the art having knowledge of the system before would be able to modify the config.properties and rules.xml as these files may be provided with default values.
- mapfileSample.xml may be provided to allow a user to use the content within mapfileSample.xml as is by renaming the file to mapfile.xml.
- the tool may be used rather quickly and allowing further configuration or customization to suit the intended use situation.
- the file mapfile.xml is used to map unknown objects to user known objects, for example where some dynamically included page can become unknown, such as the case where the incfile below can be any value depending on the value of the string includeDir which doesn't get determined until runtime.
- a user defined mapping file to map unknown objects to user known objects is provided.
- ⁇ maplist> is the root element that consists of a list of ⁇ mapping> elements
- mapfile.xml file Following is a sample of the mapfile.xml file:
- the config.properties file contains information needed to configure prior probabilities, weighting schemes, and threshold values used by the analyser.
- Threshold values are values that determine if an evidence is completely true, e.g. in the cacheability algorithm rule P(HV
- np) np/MAXnp where MAXnp is the threshold value for np, the probability of high variability given evidence of number of request parameters is equal to 1 if the number of request parameters is greater than or equal to the threshold value defined by MAXnp, otherwise the probability of high variability given evidence of number of request parameters is equal to the number of request parameters divided by the threshold value MAXnp.
- Prior probabilities define the probabilities of an event without any evidence support. These prior probabilities are used in cases where there does not exist evidence support, e.g. missing the information of number and range of request parameters then the calculation of HV will use the prior probabilities.
- the threshold values are values that determine if an evidence is completely true, e.g. in the cacheability algorithm rule P(HV
- np) np/MAXnp where MAXnp is the threshold value for np, the probability of high variability given evidence of number of request parameters is equal to 1 if the number of request parameters is greater than or equal to the threshold value defined by MAXnp, otherwise the probability of high variability given evidence of number of request parameters is equal to the number of request parameters divided by the threshold value MAXnp.
- Prior probabilities define the probabilities of an event without any evidence support. These prior probabilities are used in cases where there does not exist evidence support, e.g. missing the information of number and range of request parameters then the calculation of HV will use the prior probabilities.
- the rules.xml file contains specification of certainty factor rules. These certainty factor rules need to be defined in a rules.xml file for subsequent use by the cacheability algorithm. For example, the first rules within the rule list for CWF is CF[CwF
- HV & HEC & HAR & HRR] 0.9.
- ⁇ cacheadvisor> is the root element that contains a list of ⁇ rulelist> elements
- ⁇ rulelist> element contains a list of ⁇ rule> elements that define the Certainty Factors for certain rules “name” attribute is the name of the hypothesis, e.g.
- CWF - Caching with fragments, CWoF - Caching without fragments, NC - Not Cached ⁇ rule> element contains a proposition that defines the Certainty Factor rule “cf” attribute is the certainty factor for the rule ⁇ proposition> element contains a list of ⁇ proposition> elements that define the components in the rule “type” attribute is the operator of the proposition, the values can be conjunction (and &), disjunction (or
- Cache advisor 200 is an implementation of an embodiment of the present invention in the form of a computer system tool containing specific functions for managing cache data as described in FIG. 2 .
- Components tool GUI 355 ; report 345 and graph 350 correspond to elements as previously shown.
- Tool GUI 355 is used to provide the display mechanism for reports such as report.html 335 as generated by generator 260 .
- Certainty factor rules contained within file rules.xml 300 are provided to analysis functions of cache advisor 300 . These rules comprise the expert based rules system contained within the embodiment. The value ranges from disbelief to belief and is not to be confused with a probability.
- Config.properties 305 is also used by cache advisor 200 functions as it contains data regarding settings and values for thresholds, weights and prior probabilities.
- Mapfile.xml 310 provides the third piece of data to cache advisor 200 in the form of mapping entries. These mapping entries resolve unknown entries to some user known entry. The user can choose to use a sample set of data as may be found in file mapfilesample.xml 315 or modify the sample to provide more installation unique data. Optionally the user may also modify settings contained within rules.xml 300 and config.properties 305 as a means to generate different recommendations.
- Report 345 and graph 350 components of cache advisor 200 may be viewed as part of visualizer 280 of cache advisor 200 .
- the file objects just described may be viewed through report 345 and tool GUI 355 or other tool capable of displaying files having XML data.
- Cache advisor 200 generates output in the form of report.html 335 which is a cacheability report and cachespecsample.xml containing recommended cache data.
- the user would further modify cachespecsample.xml 330 by adding information regarding cache, data dependency and invalidation identifiers to create cache implementation file cachespec.xml 325 .
- Cachespec.xml 325 is further tested and modified by the user throughout production cycles.
- Information from an implementation on server 320 further provides input to modifications of the configuration files (rules.xml 300 , config.properties 305 and mapfile.xml 310 ) to regenerate recommendations as output in cachespec.xml 325 .
- FIG. 4 is a block diagram showing the relationship between cache advisor tool 200 of FIG. 2 and computer system 100 of FIG. 1 .
- runtime 400 is the environment in which the applications of interest execute generating empirical data describing activity and events. This information is collected in various forms such as web logs by a statistics collector activity 410 .
- Collector 410 may also collect other information regarding application usage as defined by collection routines and capabilities of the generating application and support infrastructure. Information obtained through collector 410 is passed to evaluator 430 .
- Evaluator 430 combines the statistics describing application activity with data obtained from caching system 420 .
- Caching system 420 may be at least one of a hardware, software or combination thereof.
- Evaluator 430 determines data regarding cache hit ratio, page invalidation and reusability of pages and other cache related information which may be of value in assessing and improving performance of the dynamic content being served. This evaluated information is then fed into the analyser of cache advisor tool 200 as described in FIG. 2 . Analysis of the received data is performed in cache advisor tool 200 and recommendations are provided and implemented in server 320 of FIG. 3 operating in runtime 400 . All of this activity may occur within a single system such as server 320 or it may be dispersed across a number of physical systems of which FIG. 1 shows but one example.
- the recommendation system as described may operate as an automatic iterative feedback loop generating collecting assessing and recommending change to tune or improve the performance of serving dynamic content.
- the user in the form of a system administrator also has the opportunity to inject values into the process so as to manually override recommendations provided by the cache advisor tool 200 .
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
According to the present invention, there is provided a computer system, method and apparatus for making intelligent recommendations for dynamic content caching. In one embodiment of the present invention there is provided a computer implemented method for generating intelligent caching recommendations related to dynamic web content for use on a caching system. The method comprising extracting data associated with the dynamic content of interest in accordance with a predetermined data model. Next analyzing the extracted data in accordance with a plurality of certainty factors and a rule based expert system. Completing the analysis and generating a set of caching recommendations from the analyzed data suitable for use by the caching system. Implementing the recommendations in the caching system are repeated iteratively, as in a loop, automatically generating intelligent caching recommendations related to the dynamic web content for use on the caching system.
Description
- The present invention relates generally to data caching of web content on a computer system on a network and, more specifically, for making automated recommendations toward intelligent data caching of dynamic web page content.
- The Internet and the World Wide Web (WWW) provide intra-enterprise connectivity, inter-enterprise connectivity and application hosting on a larger scale than ever before. By exploiting the broadly available and deployed standards of the Internet and the WWW, system users and designers can leverage a single architecture to build client/server applications for internal use that can reach outside to customers, business partners and suppliers.
- Each web site normally further provides a plurality of web pages to be served to the local computer systems upon request. Each local computer system may access the remote web sites with web browser software.
- The WWW is a collection of servers on an IP (Internet Protocol) network, such as the Internet, an Intranet or an Extranet, that utilize the Hypertext Transfer Protocol (HTTP). Hereinafter, “Internet” will be used to refer to any IP network. HTTP is a known application protocol that provides users with access to files, which can be in different formats, such as text, graphics, images, sound, and video, using a standard page description language known as Hypertext Markup Language (HTML). Among a number of basic document formatting functions, HTML allows software developers to specify graphical pointers on displayed web pages, commonly referred to as “hyperlinks,” that point to other web pages resident on remote servers. Hyperlinks commonly are displayed as highlighted text or other graphical image on the web page. Selection of a hyperlink with a pointing device, such as a computer mouse, causes the local computer to download the HTML associated with the web page from a remote server. The browser then renders the HTML into the displayed web page.
- Web pages accessed over the Internet, whether by a hyperlink, opening directly via an “open” button in the browser, or some other means, are commonly downloaded into the volatile cache of a local computer system. In a computer system, for example, the volatile cache is a high-speed buffer that temporarily stores web pages from accessed remote web sites. The volatile cache thus enables a user to quickly review web pages that were already downloaded, thereby eliminating the need to repeat the relatively slow process of traversing the Internet to access previously viewed web pages. This is called local caching.
- On the server side, the first web servers were merely HTTP servers that resolved universal resource locators (URLs) by extracting literally from the URL the path to a file that contained the needed page, and transmitting the page back to the browser. Such a server was very simple; it could only be used to access static pages.
- A “static” page is a page which, each time it is requested and served to a requester, has the same byte content. That is, it does not depend upon which requester is requesting the page, when the requester is requesting the page, etc.; the byte content of that page remains the same. By contrast, a “dynamic page” is a page which has byte content that may very well change depending upon the particular requestor, when the page is being requested, etc. This will be discussed further below. It is important that web pages be served as quickly as possible, both to reduce the response time to a single user, and to increase the number of users that can be served concurrently. To improve the response time, the Web server uses caches. Web server caches are used to store web page responses in a readily accessible memory location so that when the web page is requested by a user, a previously cached web page response can be retrieved from cache and served quickly to the user.
- Caching web page responses by the web server works quite well for web page responses having static content, i.e., content that doesn't change frequently. An example of a static web page is one, at a company's web site, comprising a compilation of text and graphics objects describing that company's history.
- In fact, classic web servers cache static pages quite effectively. Specifically, classic web servers serve web page responses, some of which are static, namely, responses comprising HTML from the file system. Each of the static responses has a last modified date associated with it that is maintained by the file system. The contents of the response and its associated last modified date are simply stored in the cache for possible future use by the web server. When a subsequent request is received by the server for that page, the server requests the latest modification date for that page from the file system and compares the latest modification date with the last modified date associated with the candidate cached response. If the latest modification date is the same as the last modified date associated with the candidate cached response, the candidate cached response is considered to be “fresh” and is served to the request (i.e., to the requesting user). If the latest modification date is later than the last modified date associated with the candidate cached response, the candidate cached response is considered “stale” and a “fresh” response is retrieved and built by the web server for serving to the requesting user. The fresh response, along with its associated last modified date, is cached to replace the stale response. This caching scheme saves the time and server processor cycles that otherwise would have been spent to build requested pages which otherwise could have been cached using this classic caching scheme.
- However, newer web servers provide not only static web pages but also dynamic web pages, i.e., a page having byte content that may very well change depending upon the particular requester, when the page is being requested, etc. Examples of dynamic web pages are pages containing content from a number of different sources or pages having computed content. For example, a page may contain macros that compute content for the page, i.e., the page has “computable content”. These macros may change the page content each time the page is accessed. This makes it difficult to cache that page using the classic caching method described above. Macros or formulas are expressions that perform a function, such as determining field values, defining which documents appear in a view, or calculating values for a column.
- Alternatively, the page may contain information from a number of different sources, and that information may or may not have associated last modified dates making it difficult, if not impossible, to cache using the classic caching method. For example, the page may comprise a composite of a number of “parts” including: other documents, designs from databases, content from databases, the present user's identity, the current time, the current environment, etc. Some of these parts are actual entities in the system, e.g., documents, databases, etc. Some parts though are “virtual” and are used to model the effects of the execution of macros or scripts, for example, the user's identity may be accessed via one of a number of macros for performing specialized. They can be used to format text strings, generate dates and times, format dates and times, evaluate conditional statements, calculate numeric values, calculate values in a list, convert text to numbers or numbers to text, or activate agents and actions. These various part types are computable parts and have correspondingly various types of attributes that can not be handled by the classic caching systems and methods.
- Clearly, it is more difficult to use caching as a mechanism for improving user response time for pages with dynamic content. This problem for the server is twofold. First, after building a web page response, the server must determine whether the response that it is preparing to serve the requesting user is cacheable (i.e., determining its cacheability). Second, the server, upon receiving a request for a web page whose previous response has been cached, must determine whether the cached response is valid (i.e., determining its validity) and applicable (i.e., determining its applicability). For instance, web page responses containing macros that are time-dependent may not be cacheable at all. If a page includes a macro for providing the current time, then every access of the page is unique and the page cannot be cached in memory at all. Another example is where is a cached page is valid for serving to some users but not others. For instance, if the page includes a macro for the user's name, then the page can be cached for serving to that particular user, but not for serving to others. (HTML representing a document is specific to a user if macros are dependent on user name or user roles. Using this user data, some data may be made visible based on which user is requesting it.)
- The term “Dynamic HTML” (DHTML) needs to be explained in the context of the embodiments of the present invention. “Dynamic” as used in DHTML is referring primarily to the effect that the code has on the web page appearance at the browser. For instance, the dynamic HTML may comprise scripts that run on the browser to change the appearance of the web page such as by displaying a button that, if pushed, displays additional text or graphics. The key distinction is that “dynamic” in the DHTML sense refers to the browser, not the server. From the server's point of view, a DHTML page may still be “static” in that the byte content may be the same each time the page is requested, so for the purposes of this invention, a DHTML page may be “static” or “dynamic” in the sense of the invention. The content is not dependent on any thing, e.g., the properties of the request, such as the identity of the particular user, the time of day that the request is made, etc. “Dynamic” content, as used in the embodiments of the present invention, refers to content that has such dependencies. Thus, “dynamic” in the DHTML sense is not related to “dynamic” in the sense of the embodiments of the present invention.
- The problems may be further expressed as that of not knowing which dynamic content such as that found in JSP (Java server page) and ASP (active server page) technology need to be cached and then how to cache effectively. Further when implementing a manual cache configuration incorrect assumptions may cause system performance degradation. This problem arises when developers construct dynamic pages but do not cache them appropriately; which may then lead to poor system performance. Additional decisions based on the various dynamic properties of the page are required regarding whether to not cache a particular dynamic page at all or to cache it as a page fragment or a composite page (including sub-fragments). The drawback of these solutions is the need for significant knowledge of the dynamic content design and the system's dynamic content caching infrastructure.
- As can be readily seen, using caching as a means for increasing server performance for responses which have dynamic content has a number of complications and difficulties which have not been overcome by prior systems. As such, HTML representing responses having dynamic content has not been cached in the past. Accordingly, an embodiment to cache content that can include dynamic content without suffering from the drawbacks discussed above is needed. An additional solution is required to make the caching process simpler such that developers can easily determine which dynamic content needs to be cached and how the content should be cached.
- According to the present invention, there is provided a computer system, method and apparatus for making intelligent recommendations for dynamic content caching. In an embodiment of the present invention there is a focus on four main components. Included is a Scanner component that takes a list of dynamic pages as input and based on a dynamic page model (JSP, ASP, etc.) and XML definition file outputs the dynamic page content as entities-relationships data described in an XML file. This data is then fed into an Analyser component to be analyzed using the certainty factor model and rule based expert system implementation with user-defined object weighting schemes based on an Object Dependency Graph (ODG) model. The Analyzer then generates caching recommendations in an analysis report with Certainty Factors for each hypothesis, such as cached as a page fragment, cached as a composite page or not cached at all. The Analyser's output is in the form of an enhanced ODG with additional attributes described by an XML file. This enhanced ODG is then used by a Generator component to generate a cache policy for the computer system based on the computer system's cache policy model. The enhanced ODG may optionally be passed to a Visualizer component providing a graphical view of the cached objects and their dependencies. There is also a cache advisor report which may be viewed through the visualizer component.
- Embodiments of the present invention may be used to minimize requirements for significant knowledge of page design and cache infrastructure of a system. It may also be used in place of using intuition to determine what needs to be cached, as the certainty factor and rule based expert system provides intelligent recommendations based on supporting evidence to optimize caching opportunities. Further by design all components are logically separated affording greater flexibility to plug in each component, such as the use of a different Scanner, Analyzer, Generator, or Visualizer component. Use of a tool such as that which may be an implementation of an embodiment of the present invention may also provide for automatic generation of the cache policy. Further instead of reading text or code, all object dependencies may be visualized in a user-friendly graphical user interface.
- In one embodiment of the present invention there is provided a computer implemented method for generating intelligent caching recommendations related to dynamic web content for use on a caching system, comprising: extracting data associated with the dynamic content of interest in accordance with a predetermined data model; analyzing the extracted data in accordance with plurality of certainty factors and a rules based expert system; and generating a set of caching recommendations from the analyzed data suitable for use by the caching system.
- In another embodiment of the present invention there is provided a computer system for generating intelligent caching recommendations related to dynamic web content for use on a caching system, comprising: a means for extracting data associated with the dynamic content of interest in accordance with a predetermined data model; a means for analyzing the extracted data in accordance with plurality of certainty factors and a rules based expert system; and a means for generating a set of caching recommendations from the analyzed data suitable for use by the caching system.
- In yet another embodiment of the present invention there is provided an article of manufacture for directing a data processing system generate intelligent caching recommendations related to dynamic web content for use on a caching system, the article of manufacture comprising: a computer usable medium embodying one or more instructions executable by the data processing system, the one or more instructions comprising: data processing system executable instructions for extracting data associated with the dynamic content of interest in accordance with a predetermined data model; data processing system executable instructions for analyzing the extracted data in accordance with plurality of certainty factors and a rules based expert system; and data processing system executable instructions for generating a set of caching recommendations from the analyzed data suitable for use by the caching system.
- Other aspects and features of the present invention will become apparent to those of ordinary skill in the art upon review of the following description of specific embodiments of the invention in conjunction with the accompanying figures.
- In the figures, which illustrate embodiments of the present invention by example only,
-
FIG. 1 is a block diagram of a typical computer system in which an embodiment of the present invention may be implemented; -
FIG. 2 is block diagram showing the components of an embodiment of the present invention ofFIG. 1 ; -
FIG. 3 is component flow diagram depicting data flow relationships among components of the embodiment ofFIG. 2 ; and -
FIG. 4 is a block diagram depicting interaction among components of a system ofFIG. 1 and the embodiment ofFIG. 2 . - Like reference numerals refer to corresponding components and steps throughout the drawings.
-
FIG. 1 depicts, in a simplified block diagram, acomputer system 100 suitable for implementing embodiments of the present invention.Computer system 100 has a central processing unit (CPU) 110, which is a programmable processor for executing programmed instructions, such as instructions contained inmemory 108.Memory 108 can also include hard disk, tape or other storage media. While a single CPU is depicted inFIG. 1 , it is understood that other forms of computer systems can be used to implement the invention, including multiple CPUs. It is also appreciated that the present invention can be implemented in a distributed computing environment having a plurality of computers communicating via asuitable network 119, such as the Internet. -
CPU 110 is connected tomemory 108 either through adedicated system bus 105 and/or a general system bus 106.Memory 108 can be a random access semiconductor.Memory 108 is depicted conceptually as a single monolithic entity but it is well known thatmemory 108 can be arranged in a hierarchy of caches and other memory devices.FIG. 1 illustrates thatoperating system 120, may reside inmemory 108. As well may components of an embodiment of the present invention such as that ofcache advisor tool 200 ofFIG. 2 . -
Operating system 120 provides functions such as device interfaces, memory management, multiple task management, and the like as known in the art.CPU 110 can be suitably programmed to read, load, and execute instructions ofoperating system 120 and those ofcache advisor tool 200.Computer system 100 has the necessary subsystems and functional components to implement testing of files as will be discussed later. Other programs (not shown) include server software applications in whichnetwork adapter 118 interacts with the server software application to enablecomputer system 100 to function as a network server vianetwork 119 as well as to provide data from remote instances supporting embodiments ofcache advisor tool 200 anddynamic content 225. - General system bus 106 supports transfer of data, commands, and other information between various subsystems of
computer system 100. While shown in simplified form as a single bus, bus 106 can be structured as multiple buses arranged in hierarchical form.Display adapter 114 supportsvideo display device 115, which is a cathode-ray tube display or a display based upon other suitable display technology that may be used to depict data.Output device 295 ofFIG. 2 may be one of a family of such devices asdevice 115. The Input/output adapter 112 supports devices suited for input and output, such as keyboard ormouse device 113, and a disk drive unit (not shown).Storage adapter 142 supports one or moredata storage devices 144, which could include a magnetic hard disk drive or CD-ROM drive although other types of data storage devices can be used, including removable media for storingdynamic content 225 data as well as intermediate data such as files used to aid in processing of such data and for storing output in the form of reports and caching recommendations as incachespec.xml 325. -
Adapter 117 is used for operationally connecting many types of peripheral computing devices tocomputer system 100 via bus 106, such as printers, bus adapters, and other computers using one or more protocols including Token Ring, LAN connections, as known in the art.Network adapter 118 provides a physical interface to asuitable network 119, such as the Internet.Network adapter 118 includes a modem that can be connected to a telephone line for accessingnetwork 119.Computer system 100 can be connected to another network server via a local area network using an appropriate network protocol and the network server can in turn be connected to the Internet.FIG. 1 is intended as an exemplary representation ofcomputer system 100 by which embodiments of the present invention can be implemented. It is understood that in other computer systems, many variations in system configuration are possible in addition to those mentioned here. - Logical system architecture to effectively automate the dynamic content recommendation process is established. The following
FIG. 2 is a block diagram depicting the components and their relationships of an embodiment of the present invention which may be supported within a system configuration ofFIG. 1 .Scanner 201 extracts information from the dynamic content pages based on a selected dynamic page model, such as JSP or ASP. Extraction uses information regarding the dynamic page model fromdynamic content model 205 in conjunction with data contained inXML file 206. The relationships found are then documented as entities-relationships 210 supported by anotherXML file 215 andrelated DTD 220.XML mapping 206 which representsdynamic content model 205 in an XML format is used byscanner 201 to parse different entity-relationships data 210. - Output of the translated data is in a XML format to be interpreted by
analyser 230. Entities-relationships 210 is then translated throughanalyser 230 into an enhanced ODG (object dependency graph) model withcacheability data 250.Analyser 230 analyzes extracted dynamic page entities-relationships data 210 and extendsODG model 245 by attaching attributes to ODG nodes, obtained through analysis of theODG 245 and the entire web application (with data fromXML file 215 and related DTD 220).Analyser 230 also applies certainty factor model (probabilistic model) and expert system (heuristics) used for caching dynamic web content in conjunction with weighting scheme and XML basedrules 240 for the expert system. The rules may be derived from various different cacheability indicators or evidence. Typical cacheability indicators are as follows: -
- High Variability (HV)—The number of invocations of a dynamic page
- High External code (HEC)—The amount of code that accesses the database
- High Internal Code (HIC)—The amount of code that performs logic
- High Author-Time Reuse (HAR)—The number of times a dynamic page is dynamically included at author-time
- High Run-Time Reuse (HRR)—The number of times a dynamic page is requested at run-time
- High Run-Time Invalidation (HRI)—The number of times a dynamic page is invalidated at run-time
- For example, using the JavaServer Pages (JSP) dynamic page model:
-
- HV is determined based on the number of request parameters (np) and the range of the parameters' values (rp).
- HEC is determined based on the number of Java Bean's (nj), Taglibs (nt) and Plugins (nP).
- HIC is determined based on the number of bytes of Java code (nc) in the JSP file
- HAR is determined based on the number of times a page is dynamically included in another page (ni)
- HRR is determined based on the number of times an instance of a page is dynamically requested at run-time per hour (nr)
- HRI is determined based on the frequency of invalidation of a page at run-time per hour (fi)
- The above indicators or evidence may then be used to compute the certainty factors as found in certainty factors and rules based
system 235 for a following hypothesis: - (Please see http://www.blutner.de/uncert/CertaintyFactorModel.pdf for an introduction on the Certainty Factor Model)
-
- NC=Not Cached
- CwoF=Cached Without Fragments
- CwF=Cached With Fragments
- CF=Certainty Factor
- MB=Measure of Belief
- MD=Measure of Disbelief
- h=hypothesis
- e=evidence
- CF=(MB−MD)/1−min(MB, MD)
- MB[h|e]=1 if P(h)=1, otherwise MB[h|e]=(max[P(h|e), P(h)]−P(h))/(1−P(h))
- MD[h|e]=1 if P(h)=0, otherwise MD[h|e]=(min[P(h|e), P(h)]−P(h))/−P(h)
- An initial set of CF's (rules), given certain evidence, need to be created in order to provide more accurate CF's given a combination involving any of these evidence. An example may be provided as: CF[CwoF|HAR & HRR]=0.9.
- As well, probabilities given certain evidence need to be determined in the calculation of MB/MD as in the next example of P(HV|np)=1 if (np>=MAXnp), otherwise P(HV|np)=np/MAXnp
- Also, a prior probability (probability of an event without any evidence) of a dynamic page is required for each evidence. For example, Prior(HV) is the probability of any page having HV without any evidence.
- Cacheability also takes into account a weighting scheme based on user input, allowing an expert developer to fine tune the algorithm. Prior probabilities for each evidence and values such as MAXnp in the examples above can be defined in a configuration file hence allowing the expert user knowledge to play a role in the cacheability calculation of different systems.
-
Analyser 230 produces output of the analyzed data in a XML format to be interpreted byGenerator 260 for producing acache policy 270 XML file. Output fromanalyser 230 may also be sent asenhanced ODG 250 with or without CA report 255 (cache advisor) to Visualizer 280 for a visual representation.Visualizer 280 createsviewable objects 285 for use withapplication view 290 to be seen onoutput device 295. -
Generator 260 generates a cache policy based on the analysis report ofenhanced ODG 250 and information combined withcache policy model 265. For example, the IBM product WebSphere Application Server Dynamic Cache uses a cache policy in the form of an XML file (cachespec.xml) and its cache policy model is the data type definition file (cachespec.dtd) for the cachespec.xml. Several ofcache policy XML 270 may be generated based on a given confidence level. -
Visualizer 280 provides a user with a graphical view of object dependencies determined through analysis using a colouring scheme to highlight relationships. For example when displaying objects and dependencies, objects that appear in blue may be JSPs, those in red may be Java beans, pink may be for tag libraries, and HTML objects may be in grey, while unknown object may appear in black. Similarly dependencies could be shown as green to indicate they are dynamic, while cyan could indicate static includes, pink for tag library links and grey for HTML anchors. Further any object having an arrow pointing to it by another object is a parent of that object while an object having a pointer to another object is the child of that object. In the usual manner clicking on an object or dependency will cause the properties of that object or dependency to be displayed. In this way thevisualizer 280 provides the ability to view object information at the nodes and at the edges (if dependencies and attributes exist) of theenhanced ODG 250 and maps the analysis report to highlight the important objects with priorities. -
Cache advisor report 255 is presented as a textual display having rows of information related to the relationships and resulting recommendations. - Referring now to
FIG. 3 is an overview of the data flow of an embodiment of the present invention as may be used in a tool. In this embodiment of the present invention in the form of tool, there may be seen three configuration files: a mapfile.xml, a config.properties and a rules.xml. All three files are configurable by the user of the tool. It would be expected that one skilled in the art having knowledge of the system before would be able to modify the config.properties and rules.xml as these files may be provided with default values. In a similar manner a sample mapfile.xml called mapfileSample.xml may be provided to allow a user to use the content within mapfileSample.xml as is by renaming the file to mapfile.xml. In this manner the tool may be used rather quickly and allowing further configuration or customization to suit the intended use situation. - The file mapfile.xml is used to map unknown objects to user known objects, for example where some dynamically included page can become unknown, such as the case where the incfile below can be any value depending on the value of the string includeDir which doesn't get determined until runtime. To overcome such problems, a user defined mapping file to map unknown objects to user known objects is provided.
-
<% String incfile; incfile = includeDir + “CachedHeaderDisplay.jsp”; %> <jsp:include page=“<%=incfile%>” flush=“true”/> - Sample syntax definition of a mapfile.xml may be as follows:
-
<maplist> is the root element that consists of a list of <mapping> elements <mapping> is an element that defines a mapping of unknown objects to user known objects “from” attribute defines the fully qualified path of the unknown object e.g. <%=incfile%> in any JSP's under directory FashionFlow will be defined as from=“<path_to_FashionFlow>\FashionFlow\<%=incfile%>”. Notice that the “<” is defined as < and “>” is defined as > in XML files <destination> is an element that defines the destination mapping of the unknown objects “to” attribute defines the fully qualified path of destination object e.g. if <%=incfile%> really refers to FashionFlow\include\styles\styles1\CachedHeaderDisplay.jsp then one would define the destination as to=“<path_to_FashionFlow\FashionFlow\include\styles\style1\CachedHeaderDisplay. jsp” - Following is a sample of the mapfile.xml file:
-
<?xml version=“1.0” encoding=“UTF-8”?> <maplist> <mapping from=“C:\WSAD\workspace_CAS\CacheAdvisor\WC_Code\FashionFlow\<%=incfile%>”> <destination to=“C:\WSAD\workspace_CAS\CacheAdvisor\WC_Code\FashionFlow\include\styles\style1\HeaderDisplay .jsp”/> <destination to=“C:\WSAD\workspace_CAS\CacheAdvisor\WC_Code\FashionFlow\include\styles\style1\FooterDisplay .jsp”/> <destination to=“C:\WSAD\workspace_CAS\CacheAdvisor\WC_Code\FashionFlow\include\styles\style1\SidebarDisplay .jsp”/> </mapping> - The config.properties file contains information needed to configure prior probabilities, weighting schemes, and threshold values used by the analyser.
- Threshold values are values that determine if an evidence is completely true, e.g. in the cacheability algorithm rule P(HV|np)=1 if (np>=MAXnp), otherwise P(HV|np)=np/MAXnp where MAXnp is the threshold value for np, the probability of high variability given evidence of number of request parameters is equal to 1 if the number of request parameters is greater than or equal to the threshold value defined by MAXnp, otherwise the probability of high variability given evidence of number of request parameters is equal to the number of request parameters divided by the threshold value MAXnp.
- Weight schemes define the weighting of each attribute contributing to the belief or disbelief of the evidence, e.g. users can define how the number of request parameters (np) and the range of the request parameters (rp) will contribute to the calculation of high variability (HV). These weights are expressed in the config.properties as percentages in the form of decimals (30% or 0.3 weight for np and 70% or 0.7 weight for rp), therefore the weight of np+the weight of rp=1.
- Prior probabilities define the probabilities of an event without any evidence support. These prior probabilities are used in cases where there does not exist evidence support, e.g. missing the information of number and range of request parameters then the calculation of HV will use the prior probabilities.
- A sample of the syntax definition of config.properties follows:
-
threshold.max.params (denoted earlier by MAXnp) is the threshold value for the number of request parameters threshold.max.range is the threshold value for the range of request parameters threshold.max.beans is the threshold value for the number of beans threshold.max.tagLibs is the threshold value for the number of tag libraries threshold.max.bytesCode is the threshold value for the number of bytes of the page threshold.max.dynaInc is the threshold value for the number of dynamic includes of the page threshold.max.statInc is the threshold value for the number of static includes of the page threshold.max.runTimeReuse is the threshold value for the number of runtime reuses of the page (this value will not be used until runtime analysis is incorporated into the tool) threshold.max.runTimeInvalid is the threshold value for the number of runtime invalidations of the page (this value will not be used until runtime analysis is incorporated into the tool) weight.numParams is the weight of the number of request parameters weight.totalRange is the weight of the range of request parameters weight.numBeans is the weight of the number of beans weight.numTagLibs is the weight of the number of tag libraries weight.numBytesCode is the weight of the number of bytes of the page weight.numDynaInc is the weight of the number of dynamic includes weight.numStatInc is the weight of the number of static includes weight.runTimeReuse is the weight of the runtime reuses (this value will not be used until runtime analysis is incorporated into the tool) weight.runTimeInvalid is the weight of the runtime invalidations (this value will not be used until runtime analysis is incorporated into the tool) prior.variability is the prior probability of high variability prior.extCode is the prior probability of high external code prior.intCode is the prior probability of high internal code prior.authTimeReuse is the prior probability of high author time reuse prior.runTimeReuse is the prior probability of high runtime reuse (this value will not be used until runtime analysis is incorporated into the tool) prior.runTimeInvalid is the prior probability of high runtime invalidation (this value will not be used until runtime analysis is incorporated into the tool) prior.ncChild is the prior probability of having a non-cached child prior.cwofChild is the prior probability of having a child that's cached without fragments - The following is an example of a config.properties file containing settings for configuring prior probabilities, weighting schemes, and threshold values. The threshold values are values that determine if an evidence is completely true, e.g. in the cacheability algorithm rule P(HV|np)=1 if (np >=MAXnp), otherwise P(HV|np)=np/MAXnp where MAXnp is the threshold value for np, the probability of high variability given evidence of number of request parameters is equal to 1 if the number of request parameters is greater than or equal to the threshold value defined by MAXnp, otherwise the probability of high variability given evidence of number of request parameters is equal to the number of request parameters divided by the threshold value MAXnp.
- Weight schemes define the weighting of each attribute contributing to the belief or disbelief of the evidence, e.g. users can define how the number of request parameters (np) and the range of the request parameters (rp) will contribute to the calculation of high variability (HV). These weights are expressed in this config.properties as percentages in the form of decimals (30% or 0.3 weight for np and 70% or 0.7 weight for rp), therefore the weight of np+the weight of rp=1.
- Prior probabilities define the probabilities of an event without any evidence support. These prior probabilities are used in cases where there does not exist evidence support, e.g. missing the information of number and range of request parameters then the calculation of HV will use the prior probabilities.
- Sample syntax definition of a config.properties file follows:
-
threshold.max.params (denoted earlier by MAXnp) is the threshold value for the number of request parameters threshold.max.range is the threshold value for the range of request parameters threshold.max.beans is the threshold value for the number of beans threshold.max.tagLibs is the threshold value for the number of tag libraries threshold.max.bytesCode is the threshold value for the number of bytes of the page threshold.max.dynaInc is the threshold value for the number of dynamic includes of the page threshold.max.statInc is the threshold value for the number of static includes of the page threshold.max.runTimeReuse is the threshold value for the number of runtime reuses of the page (this value will not be used until runtime analysis is incorporated into the tool) threshold.max.runTimeInvalid is the threshold value for the number of runtime invalidations of the page (this value will not be used until runtime analysis is incorporated into the tool) weight.numParams is the weight of the number of request parameters weight.totalRange is the weight of the range of request parameters weight.numBeans is the weight of the number of beans weight.numTagLibs is the weight of the number of tag libraries weight.numBytesCode is the weight of the number of bytes of the page weight.numDynaInc is the weight of the number of dynamic includes weight.numStatInc is the weight of the number of static includes weight.runTimeReuse is the weight of the runtime reuses (this value will not be used until runtime analysis is incorporated into the tool) weight.runTimeInvalid is the weight of the runtime invalidations (this value will not be used until runtime analysis is incorporated into the tool) prior.variability is the prior probability of high variability prior.extCode is the prior probability of high external code prior.intCode is the prior probability of high internal code prior.authTimeReuse is the prior probability of high author time reuse prior.runTimeReuse is the prior probability of high runtime reuse (this value will not be used until runtime analysis is incorporated into the tool) prior.runTimeInvalid is the prior probability of high runtime invalidation (this value will not be used until runtime analysis is incorporated into the tool) prior.ncChild is the prior probability of having a non-cached child prior.cwofChild is the prior probability of having a child that's cached without fragments - Following is an example of a default config.properties file showing the settings with values:
-
////////////////////////// // THRESHOLDS // ////////////////////////// threshold.max.params = 5 threshold.max.range = 10000 threshold.max.beans = 2 threshold.max.tagLibs = 3 threshold.max.bytesCode = 10000 threshold.max.dynaInc = 4 threshold.max.statInc = 4 threshold.max.runTimeReuse = 600 threshold.max.runTimeInvalid = 60 ////////////////////////// // WEIGHTING SCHEMES // ////////////////////////// // -- Variability weight.numParams = 0.3 weight.totalRange = 0.7 // -- External Code weight.numBeans = 0.95 weight.numTagLibs = 0.05 // -- InternalCode weight.numBytesCode = 1 // -- Author-time Reuse weight.numDynaInc = 0.5 weight.numStatInc = 0.5 // -- Run-time Reuse weight.runTimeReuse = 1 // -- Run-time Invalidation weight.runTimeInvalid = 1 ////////////////////////// // PRIOR PROBABILITIES // ////////////////////////// prior.variability = 0.01 prior.extCode = 0.2 prior.intCode = 0.01 prior.authTimeReuse = 0.1 prior.runTimeReuse = 0.0 prior.runTimeInvalid = 0.0 prior.ncChild = 0.1 prior.cwofChild = 0.2 - The rules.xml file contains specification of certainty factor rules. These certainty factor rules need to be defined in a rules.xml file for subsequent use by the cacheability algorithm. For example, the first rules within the rule list for CWF is CF[CwF|HV & HEC & HAR & HRR]=0.9.
- Sample syntax definitions for a rules.xml type of file are as follows:
-
<cacheadvisor> is the root element that contains a list of <rulelist> elements <rulelist> element contains a list of <rule> elements that define the Certainty Factors for certain rules “name” attribute is the name of the hypothesis, e.g. CWF - Caching with fragments, CWoF - Caching without fragments, NC - Not Cached <rule> element contains a proposition that defines the Certainty Factor rule “cf” attribute is the certainty factor for the rule <proposition> element contains a list of <proposition> elements that define the components in the rule “type” attribute is the operator of the proposition, the values can be conjunction (and &), disjunction (or |), atomic (without operator) “negate” attribute is the negation of the proposition, the values can be true or false “name” attribute is the name of the evidence, the values can be: variability - is determined by two page properties: number of request parameters and the range (possibilities of values) of the request parameters internalCode - is determined by the number of bytes (file size) of the dynamic page externalCode - is determined by the number of beans and tag libraries of the dynamic page authortimeReuse - is determined by the number of times the page is dynamically included runtimeReuse - is determined by the number of times the page is hit at runtime runtimeInvalidation - is determined by the number of invalidations of the page at runtime notCachedChildren - is determined by whether the page has a child that's not cached CWoFChildren - is determined by whether the page has a child that caches without fragments CWFChildren - is determined by whether the page has a child that caches with fragments childRequested - is determined by whether the page is sending request parameters to dynamically included child - The following is an example of a portion of a default rules.xml file showing two rules:
-
<?xml version=“1.0” encoding=“UTF-8”?> <cacheadvisor> <rulelist name=“CWF”> <rule cf=“0.9”> <proposition type=“conjunction”> <proposition type=“atomic” name=“variability”/> <proposition type=“atomic” name=“externalCode”/> <proposition type=“atomic” name= “authortimeReuse”/> <proposition type=“atomic” name=“runtimeReuse”/> </proposition> </rule> <rule cf=“0.7”> <proposition type=“conjunction”> <proposition type=“atomic” name= “authortimeReuse”/> <proposition type=“atomic” name= “notCachedChildren” negate=“true”/> <proposition type=“atomic” name=“CWoFChildren” negate=“true”/> </proposition> </rule> </rulelist> </cacheadvisor> - Having just described the more granular elements a sample usage flow will now be described.
Cache advisor 200 is an implementation of an embodiment of the present invention in the form of a computer system tool containing specific functions for managing cache data as described inFIG. 2 .Components tool GUI 355;report 345 andgraph 350 correspond to elements as previously shown.Tool GUI 355 is used to provide the display mechanism for reports such asreport.html 335 as generated bygenerator 260. Certainty factor rules contained withinfile rules.xml 300 are provided to analysis functions ofcache advisor 300. These rules comprise the expert based rules system contained within the embodiment. The value ranges from disbelief to belief and is not to be confused with a probability.Config.properties 305 is also used bycache advisor 200 functions as it contains data regarding settings and values for thresholds, weights and prior probabilities.Mapfile.xml 310 provides the third piece of data tocache advisor 200 in the form of mapping entries. These mapping entries resolve unknown entries to some user known entry. The user can choose to use a sample set of data as may be found infile mapfilesample.xml 315 or modify the sample to provide more installation unique data. Optionally the user may also modify settings contained withinrules.xml 300 andconfig.properties 305 as a means to generate different recommendations. -
Report 345 andgraph 350 components ofcache advisor 200 may be viewed as part ofvisualizer 280 ofcache advisor 200. The file objects just described may be viewed throughreport 345 andtool GUI 355 or other tool capable of displaying files having XML data.Cache advisor 200 generates output in the form ofreport.html 335 which is a cacheability report and cachespecsample.xml containing recommended cache data. The user would further modifycachespecsample.xml 330 by adding information regarding cache, data dependency and invalidation identifiers to create cache implementationfile cachespec.xml 325.Cachespec.xml 325 is further tested and modified by the user throughout production cycles. Information from an implementation onserver 320 further provides input to modifications of the configuration files (rules.xml 300,config.properties 305 and mapfile.xml 310) to regenerate recommendations as output incachespec.xml 325. - Referring now o
FIG. 4 is a block diagram showing the relationship betweencache advisor tool 200 ofFIG. 2 andcomputer system 100 ofFIG. 1 . Beginning withruntime 400 is the environment in which the applications of interest execute generating empirical data describing activity and events. This information is collected in various forms such as web logs by astatistics collector activity 410.Collector 410 may also collect other information regarding application usage as defined by collection routines and capabilities of the generating application and support infrastructure. Information obtained throughcollector 410 is passed toevaluator 430.Evaluator 430 combines the statistics describing application activity with data obtained fromcaching system 420.Caching system 420 may be at least one of a hardware, software or combination thereof. A particular implementation is not important as the ability to provide data on the functional performance of the caching system itself.Evaluator 430 then determines data regarding cache hit ratio, page invalidation and reusability of pages and other cache related information which may be of value in assessing and improving performance of the dynamic content being served. This evaluated information is then fed into the analyser ofcache advisor tool 200 as described inFIG. 2 . Analysis of the received data is performed incache advisor tool 200 and recommendations are provided and implemented inserver 320 ofFIG. 3 operating inruntime 400. All of this activity may occur within a single system such asserver 320 or it may be dispersed across a number of physical systems of whichFIG. 1 shows but one example. The recommendation system as described may operate as an automatic iterative feedback loop generating collecting assessing and recommending change to tune or improve the performance of serving dynamic content. The user in the form of a system administrator also has the opportunity to inject values into the process so as to manually override recommendations provided by thecache advisor tool 200. - Of course, the above described embodiments are intended to be illustrative only and in no way limiting. The described embodiments of carrying out the present invention are susceptible to many modifications of form, arrangement of parts, details and order of operation. The invention, rather, is intended to encompass all such modification within its scope, as defined by the claims.
Claims (15)
1-7. (canceled)
8. A computer system for generating intelligent caching recommendations related to dynamic web content for use on a caching system, comprising:
a means for extracting data associated with the dynamic content of interest in accordance with a predetermined data model;
a means for analyzing the extracted data in accordance with plurality of certainty factors and a rules based expert system; and
a means for generating a set of caching recommendations from the analyzed data suitable for use by the caching system.
9. The computer system of claim 8 wherein the means for extracting the data comprises:
a dynamic content model in combination with dynamic content descriptors; and
produces a plurality of entity-relationship data.
10. The computer system of claim 8 wherein the means for analyzing the extracted data comprises:
means for obtaining an object dependency model;
means for obtaining a weighting scheme;
means for creating an enhanced object dependency graph model combined with cacheability data; and,
means for creating a cache advisor report.
11. The computer system of claim 8 wherein, the means for generating a set of caching recommendations comprises:
means for obtaining a cache policy model for use with a generator; and
means for generating a cache policy.
12. The computer system of claim 10 , wherein the enhanced object dependency graph model combined with cacheability data and the cache advisor report may be optionally viewed through a visualizer.
13. The computer system of claim 12 , wherein optional viewing through the visualizer provides color-keyed output of the enhanced object dependency graph model combined with cacheability data designating the inherent relationships and properties of the enhanced object dependency model combined with cacheability data.
14. The computer system of claim 8 , wherein the means for extracting data associated with the dynamic content of interest in accordance with a predetermined data model, the analyzing the extracted data in accordance with plurality of certainty factors and a rules based expert system and the generating a set of caching recommendations from the analyzed data suitable for the caching system and implementing the recommendations in the caching system are repeated iteratively, as in a loop, automatically generating intelligent caching recommendations related to the dynamic web content for use on the caching system.
15. An article of manufacture for directing a data processing system generate intelligent caching recommendations related to dynamic web content for use on a caching system, the article of manufacture comprising:
a computer usable medium embodying one or more instructions executable by the data processing system, the one or more instructions comprising:
data processing system executable instructions for extracting data associated with the dynamic content of interest in accordance with a predetermined data model;
data processing system executable instructions for analyzing the extracted data in accordance with plurality of certainty factors and a rules based expert system; and
data processing system executable instructions for generating a set of caching recommendations from the analyzed data suitable for use by the caching system.
16. The article of manufacture of claim 15 wherein the data processing system executable instructions for extracting the data comprises:
a dynamic content model in combination with dynamic content descriptors; and produces a plurality of entity-relationship data.
17. The article of manufacture of claim 15 wherein the data processing system executable instructions for analyzing the extracted data comprises:
data processing system executable instructions for obtaining an object dependency model;
data processing system executable instructions for obtaining a weighting scheme;
data processing system executable instructions for creating an enhanced object dependency graph model combined with cacheability data; and,
data processing system executable instructions for creating a cache advisor report.
18. The article of manufacture of claim 15 wherein, the data processing system executable instructions for generating a set of caching recommendations comprises:
data processing system executable instructions for obtaining a cache policy model for use with a generator; and
data processing system executable instructions for generating a cache policy.
19. The article of manufacture of claim 17 , wherein the enhanced object dependency graph model combined with cacheability data and the cache advisor report may be optionally viewed through a visualizer.
20. The article of manufacture of claim 19 , wherein optional viewing through the visualizer provides color-keyed output of the enhanced object dependency graph model combined with cacheability data designating the inherent relationships and properties of the enhanced object dependency model combined with cacheability data.
21. The article of manufacture of claim 15 , wherein the data processing system executable instructions for extracting data associated with the dynamic content of interest in accordance with a predetermined data model, the analyzing the extracted data in accordance with plurality of certainty factors and a rules based expert system and the generating a set of caching recommendations from the analyzed data suitable for the caching system and implementing the recommendations in the caching system are repeated iteratively, as in a loop, automatically generating intelligent caching recommendations related to the dynamic web content for use on the caching system.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/030,316 US20080147981A1 (en) | 2004-04-21 | 2008-02-13 | Recommendations for intelligent data caching |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA2,465,155 | 2004-04-21 | ||
CA002465155A CA2465155C (en) | 2004-04-21 | 2004-04-21 | Recommendations for intelligent data caching |
US11/088,377 US7389386B2 (en) | 2004-04-21 | 2005-03-24 | Recommendations for intelligent data caching |
US12/030,316 US20080147981A1 (en) | 2004-04-21 | 2008-02-13 | Recommendations for intelligent data caching |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/088,377 Continuation US7389386B2 (en) | 2004-04-21 | 2005-03-24 | Recommendations for intelligent data caching |
Publications (1)
Publication Number | Publication Date |
---|---|
US20080147981A1 true US20080147981A1 (en) | 2008-06-19 |
Family
ID=35137809
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/088,377 Expired - Fee Related US7389386B2 (en) | 2004-04-21 | 2005-03-24 | Recommendations for intelligent data caching |
US12/030,316 Abandoned US20080147981A1 (en) | 2004-04-21 | 2008-02-13 | Recommendations for intelligent data caching |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/088,377 Expired - Fee Related US7389386B2 (en) | 2004-04-21 | 2005-03-24 | Recommendations for intelligent data caching |
Country Status (2)
Country | Link |
---|---|
US (2) | US7389386B2 (en) |
CA (1) | CA2465155C (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070061849A1 (en) * | 2005-09-12 | 2007-03-15 | Walker Philip M | Systems and methods for processing information or data on a computer |
US20090063590A1 (en) * | 2007-08-30 | 2009-03-05 | Microsoft Corporation | Operating System Support of Graceful Degradation for Web Applications |
US20090077481A1 (en) * | 2007-09-14 | 2009-03-19 | Yuuichi Ishii | Information processing apparatus, information processing method, and recording medium |
US7636093B1 (en) * | 2005-07-29 | 2009-12-22 | Adobe Systems Incorporated | Parameterized motion paths |
US20100153928A1 (en) * | 2008-12-16 | 2010-06-17 | Microsoft Corporation | Developing and Maintaining High Performance Network Services |
US20110093771A1 (en) * | 2005-04-18 | 2011-04-21 | Raz Gordon | System and method for superimposing a document with date information |
US8412898B2 (en) | 2007-12-06 | 2013-04-02 | F-Secure Corporation | Method for automatically backing up digital data preserved in memory in a computer installation and data medium readable by a computer having the associated instructions stored in the memory thereof |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4249719B2 (en) * | 2005-03-29 | 2009-04-08 | 株式会社日立製作所 | Backup system, program, and backup method |
US8719363B2 (en) * | 2005-10-19 | 2014-05-06 | Adobe Systems Incorporated | Presentation of secondary local content in a region of a web page after an elapsed time |
US8706964B1 (en) * | 2007-09-28 | 2014-04-22 | The Mathworks, Inc. | Automatic generation of cache-optimized code |
US8180964B1 (en) * | 2007-09-28 | 2012-05-15 | The Mathworks, Inc. | Optimization of cache configuration for application design |
CA2708628C (en) * | 2007-12-18 | 2017-03-07 | Bae Systems Plc | Assisting failure mode and effects analysis of a system comprising a plurality of components |
US8185694B2 (en) * | 2008-07-25 | 2012-05-22 | International Business Machines Corporation | Testing real page number bits in a cache directory |
US20100281035A1 (en) * | 2009-04-30 | 2010-11-04 | David Carmel | Method and System of Prioritising Operations On Network Objects |
US8984048B1 (en) * | 2010-04-18 | 2015-03-17 | Viasat, Inc. | Selective prefetch scanning |
CN102331985B (en) * | 2010-07-12 | 2013-09-25 | 阿里巴巴集团控股有限公司 | Method and device for fragment nested caching of webpage |
US9400851B2 (en) | 2011-06-23 | 2016-07-26 | Incapsula, Inc. | Dynamic content caching |
US9110751B2 (en) | 2012-02-13 | 2015-08-18 | Microsoft Technology Licensing, Llc | Generating and caching software code |
GB2503452A (en) * | 2012-06-26 | 2014-01-01 | Nds Ltd | Supplying a request for content together with a caching recommendation to cloud equipment |
US11521143B2 (en) | 2019-08-14 | 2022-12-06 | International Business Machines Corporation | Supply chain disruption advisor |
US11526446B1 (en) * | 2020-05-22 | 2022-12-13 | Amazon Technologies, Inc. | Modifying caching amongst services from a history of requests and responses |
Citations (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5338117A (en) * | 1993-01-27 | 1994-08-16 | American Packaging Corporation | Bag and method of making the same |
US5367473A (en) * | 1990-06-18 | 1994-11-22 | Bell Communications Research, Inc. | Expert system for computer system resource management |
US6351767B1 (en) * | 1999-01-25 | 2002-02-26 | International Business Machines Corporation | Method and system for automatically caching dynamic content based on a cacheability determination |
US6408360B1 (en) * | 1999-01-25 | 2002-06-18 | International Business Machines Corporation | Cache override control in an apparatus for caching dynamic content |
US20020078300A1 (en) * | 1999-08-16 | 2002-06-20 | Chanda Dharap | Semantics-based caching policy to minimize latency |
US20020099807A1 (en) * | 2001-01-22 | 2002-07-25 | Doyle Ronald P. | Cache management method and system for storIng dynamic contents |
US20020112032A1 (en) * | 2001-02-15 | 2002-08-15 | International Business Machines Corporation | Method and system for specifying a cache policy for caching web pages which include dynamic content |
US20020112125A1 (en) * | 2000-12-18 | 2002-08-15 | Copeland George P. | Command caching to improve network server performance |
US20020111992A1 (en) * | 2000-12-18 | 2002-08-15 | Copeland George P. | JSP composition in a cache for web applications with dynamic content |
US20020116582A1 (en) * | 2000-12-18 | 2002-08-22 | Copeland George P. | Batching of invalidations and new values in a web cache with dynamic content |
US20020116583A1 (en) * | 2000-12-18 | 2002-08-22 | Copeland George P. | Automatic invalidation dependency capture in a web cache with dynamic content |
US20020116474A1 (en) * | 2000-12-18 | 2002-08-22 | Copeland George P. | Detecting and handling affinity breaks in web applications |
US20020133516A1 (en) * | 2000-12-22 | 2002-09-19 | International Business Machines Corporation | Method and apparatus for end-to-end content publishing system using XML with an object dependency graph |
US20020147997A1 (en) * | 1996-03-21 | 2002-10-10 | John Harvey Turner | Animal model for transplantation |
US20020152244A1 (en) * | 2000-12-22 | 2002-10-17 | International Business Machines Corporation | Method and apparatus to dynamically create a customized user interface based on a document type definition |
US6598121B2 (en) * | 1998-08-28 | 2003-07-22 | International Business Machines, Corp. | System and method for coordinated hierarchical caching and cache replacement |
US6606525B1 (en) * | 1999-12-27 | 2003-08-12 | Motorola, Inc. | System and method of merging static data in web pages |
US6622168B1 (en) * | 2000-04-10 | 2003-09-16 | Chutney Technologies, Inc. | Dynamic page generation acceleration using component-level caching |
US20030187935A1 (en) * | 2001-12-19 | 2003-10-02 | International Business Machines Corporation | Method and system for fragment linking and fragment caching |
US20030188021A1 (en) * | 2001-12-19 | 2003-10-02 | International Business Machines Corporation | Method and system for processing multiple fragment requests in a single message |
US20030188016A1 (en) * | 2001-12-19 | 2003-10-02 | International Business Machines Corporation | Method and system for restrictive caching of user-specific fragments limited to a fragment cache closest to a user |
US20030188009A1 (en) * | 2001-12-19 | 2003-10-02 | International Business Machines Corporation | Method and system for caching fragments while avoiding parsing of pages that do not contain fragments |
US6631451B2 (en) * | 1999-12-22 | 2003-10-07 | Xerox Corporation | System and method for caching |
US20030191800A1 (en) * | 2001-12-19 | 2003-10-09 | International Business Machines Corporation | Method and system for a foreach mechanism in a fragment link to efficiently cache portal content |
US20030191812A1 (en) * | 2001-12-19 | 2003-10-09 | International Business Machines Corporation | Method and system for caching role-specific fragments |
US6640207B2 (en) * | 1998-10-27 | 2003-10-28 | Siemens Aktiengesellschaft | Method and configuration for forming classes for a language model based on linguistic classes |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6823360B2 (en) | 2000-12-18 | 2004-11-23 | International Business Machines Corp. | Cofetching in a command cache |
-
2004
- 2004-04-21 CA CA002465155A patent/CA2465155C/en not_active Expired - Fee Related
-
2005
- 2005-03-24 US US11/088,377 patent/US7389386B2/en not_active Expired - Fee Related
-
2008
- 2008-02-13 US US12/030,316 patent/US20080147981A1/en not_active Abandoned
Patent Citations (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5367473A (en) * | 1990-06-18 | 1994-11-22 | Bell Communications Research, Inc. | Expert system for computer system resource management |
US5338117A (en) * | 1993-01-27 | 1994-08-16 | American Packaging Corporation | Bag and method of making the same |
US20020147997A1 (en) * | 1996-03-21 | 2002-10-10 | John Harvey Turner | Animal model for transplantation |
US6598121B2 (en) * | 1998-08-28 | 2003-07-22 | International Business Machines, Corp. | System and method for coordinated hierarchical caching and cache replacement |
US6640207B2 (en) * | 1998-10-27 | 2003-10-28 | Siemens Aktiengesellschaft | Method and configuration for forming classes for a language model based on linguistic classes |
US6351767B1 (en) * | 1999-01-25 | 2002-02-26 | International Business Machines Corporation | Method and system for automatically caching dynamic content based on a cacheability determination |
US6408360B1 (en) * | 1999-01-25 | 2002-06-18 | International Business Machines Corporation | Cache override control in an apparatus for caching dynamic content |
US20020078300A1 (en) * | 1999-08-16 | 2002-06-20 | Chanda Dharap | Semantics-based caching policy to minimize latency |
US6631451B2 (en) * | 1999-12-22 | 2003-10-07 | Xerox Corporation | System and method for caching |
US6606525B1 (en) * | 1999-12-27 | 2003-08-12 | Motorola, Inc. | System and method of merging static data in web pages |
US6622168B1 (en) * | 2000-04-10 | 2003-09-16 | Chutney Technologies, Inc. | Dynamic page generation acceleration using component-level caching |
US20020111992A1 (en) * | 2000-12-18 | 2002-08-15 | Copeland George P. | JSP composition in a cache for web applications with dynamic content |
US20020116474A1 (en) * | 2000-12-18 | 2002-08-22 | Copeland George P. | Detecting and handling affinity breaks in web applications |
US20020116583A1 (en) * | 2000-12-18 | 2002-08-22 | Copeland George P. | Automatic invalidation dependency capture in a web cache with dynamic content |
US20020116582A1 (en) * | 2000-12-18 | 2002-08-22 | Copeland George P. | Batching of invalidations and new values in a web cache with dynamic content |
US20020112125A1 (en) * | 2000-12-18 | 2002-08-15 | Copeland George P. | Command caching to improve network server performance |
US20020133516A1 (en) * | 2000-12-22 | 2002-09-19 | International Business Machines Corporation | Method and apparatus for end-to-end content publishing system using XML with an object dependency graph |
US20020152244A1 (en) * | 2000-12-22 | 2002-10-17 | International Business Machines Corporation | Method and apparatus to dynamically create a customized user interface based on a document type definition |
US20020099807A1 (en) * | 2001-01-22 | 2002-07-25 | Doyle Ronald P. | Cache management method and system for storIng dynamic contents |
US20020112032A1 (en) * | 2001-02-15 | 2002-08-15 | International Business Machines Corporation | Method and system for specifying a cache policy for caching web pages which include dynamic content |
US20030188016A1 (en) * | 2001-12-19 | 2003-10-02 | International Business Machines Corporation | Method and system for restrictive caching of user-specific fragments limited to a fragment cache closest to a user |
US20030188009A1 (en) * | 2001-12-19 | 2003-10-02 | International Business Machines Corporation | Method and system for caching fragments while avoiding parsing of pages that do not contain fragments |
US20030188021A1 (en) * | 2001-12-19 | 2003-10-02 | International Business Machines Corporation | Method and system for processing multiple fragment requests in a single message |
US20030191800A1 (en) * | 2001-12-19 | 2003-10-09 | International Business Machines Corporation | Method and system for a foreach mechanism in a fragment link to efficiently cache portal content |
US20030191812A1 (en) * | 2001-12-19 | 2003-10-09 | International Business Machines Corporation | Method and system for caching role-specific fragments |
US20030187935A1 (en) * | 2001-12-19 | 2003-10-02 | International Business Machines Corporation | Method and system for fragment linking and fragment caching |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110093771A1 (en) * | 2005-04-18 | 2011-04-21 | Raz Gordon | System and method for superimposing a document with date information |
US7636093B1 (en) * | 2005-07-29 | 2009-12-22 | Adobe Systems Incorporated | Parameterized motion paths |
US20070061849A1 (en) * | 2005-09-12 | 2007-03-15 | Walker Philip M | Systems and methods for processing information or data on a computer |
US20090063590A1 (en) * | 2007-08-30 | 2009-03-05 | Microsoft Corporation | Operating System Support of Graceful Degradation for Web Applications |
US20090077481A1 (en) * | 2007-09-14 | 2009-03-19 | Yuuichi Ishii | Information processing apparatus, information processing method, and recording medium |
US8375314B2 (en) * | 2007-09-14 | 2013-02-12 | Ricoh Company, Ltd. | Information processing apparatus, information processing method, and recording medium |
US8412898B2 (en) | 2007-12-06 | 2013-04-02 | F-Secure Corporation | Method for automatically backing up digital data preserved in memory in a computer installation and data medium readable by a computer having the associated instructions stored in the memory thereof |
US8667240B2 (en) | 2007-12-06 | 2014-03-04 | F-Secure Corporation | Method for automatically backing up digital data preserved in memory in a computer installation and data medium readable by a computer having the associated instructions stored in the memory thereof |
US20100153928A1 (en) * | 2008-12-16 | 2010-06-17 | Microsoft Corporation | Developing and Maintaining High Performance Network Services |
Also Published As
Publication number | Publication date |
---|---|
US7389386B2 (en) | 2008-06-17 |
US20050240732A1 (en) | 2005-10-27 |
CA2465155C (en) | 2008-12-09 |
CA2465155A1 (en) | 2005-10-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20080147981A1 (en) | Recommendations for intelligent data caching | |
US11513844B1 (en) | Pipeline set selection based on duty cycle estimation | |
US7113942B2 (en) | Scalable storage and processing of hierarchical documents | |
CA3025493C (en) | Optimizing read and write operations in object schema-based application programming interfaces (apis) | |
US9026898B2 (en) | System and method for managing web-based forms and dynamic content of website | |
EP2808790B1 (en) | Migration assessment for cloud computing platforms | |
US6351767B1 (en) | Method and system for automatically caching dynamic content based on a cacheability determination | |
US6408360B1 (en) | Cache override control in an apparatus for caching dynamic content | |
US9244956B2 (en) | Recommending data enrichments | |
US8001463B2 (en) | Web page communications using parameters and events | |
US20030105838A1 (en) | System and method for actively managing an enterprise of configurable components | |
US9875090B2 (en) | Program analysis based on program descriptors | |
US20060070022A1 (en) | URL mapping with shadow page support | |
US11630695B1 (en) | Dynamic reassignment in a search and indexing system | |
US8429673B2 (en) | Systems and methods of accessing information across distributed computing components | |
US11693710B1 (en) | Workload pool hierarchy for a search and indexing system | |
US11252257B2 (en) | Dynamic rest access | |
US20080209555A1 (en) | Approach for proactive notification of contract changes in a software service | |
US10261765B1 (en) | Enhancing program execution using optimization-driven inlining | |
US20220365921A1 (en) | Verifiable Cacheable Calclulations | |
US6735594B1 (en) | Transparent parameter marker support for a relational database over a network | |
US20130111329A1 (en) | Hinting system at user interface design time | |
US10733192B1 (en) | Expression evaluation infrastructure | |
Arshad | Privacy fox-A JavaScript-based P3P agent for Mozilla Firefox | |
Mertz | Understanding and automating application-level caching |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO PAY ISSUE FEE |