WO2010094927A1 - Plate-forme d'accès à un contenu et procédés et appareil fournissant un accès à un contenu internet pour des dispositifs hétérogènes - Google Patents
Plate-forme d'accès à un contenu et procédés et appareil fournissant un accès à un contenu internet pour des dispositifs hétérogènes Download PDFInfo
- Publication number
- WO2010094927A1 WO2010094927A1 PCT/GB2010/000295 GB2010000295W WO2010094927A1 WO 2010094927 A1 WO2010094927 A1 WO 2010094927A1 GB 2010000295 W GB2010000295 W GB 2010000295W WO 2010094927 A1 WO2010094927 A1 WO 2010094927A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- content
- web
- request
- requestor
- device type
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/957—Browsing optimisation, e.g. caching or content distillation
- G06F16/9577—Optimising the visualization of content, e.g. distillation of HTML documents
Definitions
- the present invention relates to data access and communications, and in particular relates to improved access and customized access to Internet content from heterogeneous devices within a communications network, such as for improved access and customized access to Website content from mobile telephones and other wireless communication devices such as PDAs.
- the Internet has made vast amounts of information available to computer users, as well as large numbers of applications such as games and other content.
- advances in telecommunications and data processing technologies have enabled Websites to be viewed on mobile devices, almost all of the available content is designed for the screen size and processing capability of desktop and laptop computers.
- Most Website providers only consider mobile Web content as an afterthought, rather than as a parallel development and distribution medium, and many consider it prohibitively expensive to customize all of their content for mobile device users.
- Websites It is very common for publicly accessible content such as Websites to include a range of different content types and embedded components or 'assets' including text, images, video and audio content, with feature-rich user interfaces including drop-down menus, scrolling news feeds, pop-up banner advertisements and so on.
- This rich content is desirable, but there are various reasons why it is often not practical to view such content on mobile devices: 1.
- the speed of viewing is affected by the method of access. This can be the result of any one factor or a combination of factors, such as: the resource constraints of the access device - such as limited working memory, limited processing power and/or limited communication bandwidth; the size of the data transfer; and the complexity of Websites.
- conventional Websites can take a long time to download and cannot be displayed correctly on many communication device display screens.
- HTML mark-up is not appropriate with communication device user interfaces or supportable from the limited i/o interfaces of that device. Many communication devices still do not have touch-sensitive screens and most do not have a controller suitable for driving a PC Mouse-type pointer. Hence some Web page features such as drop-down menus, hover over pop-up windows, etc have no easy means of being controlled by the user of a communication device.
- the inventors of the present invention have determined that it is possible to provide an enhanced user experience for mobile device users, while also making a number of improvements for Web site providers and other content providers, for telecommunication network providers, and for advanced users who wish to share their proposals for an optimised content selection and display.
- a partial solution to some of the problems described above is to manually create alternative versions of Web pages in Wireless Markup Language (WML) and to provide access to these pages via the Wireless Application Protocol (WAP) and a WAP Browser running on the user device.
- WML Wireless Markup Language
- WAP Wireless Application Protocol
- a native WML page may still be presented differently on different end-user devices, due in part to variations in how WAP is implemented but primarily because of great differences between end-user devices.
- Developing a WML version of a Web site that enables complex site content to be displayed effectively on a wide range of wireless devices is a difficult and time consuming task.
- Many Website owners do not have the financial means or technical knowledge to be able to build a WAP site, and may be discouraged by the fact that building a WAP site does not guarantee the display of content on every mobile device.
- WAP pages are designed to solve display problems on end-user devices and so their use does not automatically address the problem of network 'bottlenecks'.
- Website content is displayable to users, there is a high overhead inherent in modifying Web pages to create a mobile- friendly version for access by a large number of users of heterogeneous mobile devices, and in maintaining that modified set of pages.
- Another solution is for mobile-friendly WML pages to be generated dynamically when requested by a mobile device user.
- Such run-time transcoding of site content can avoid the Website provider having to pre-generate and maintain separate WAP pages for mobile devices, but the run-time transcoding introduces other problems.
- the additional processing may contribute to unsatisfactory latency, regardless of whether this is implemented on the user's device or elsewhere in the network.
- not all Web page content can be converted to a format that can be displayed via a WAP browser, and not all devices are WAP-enabled.
- a positive user experience is critical to the adoption of new services and providing truly open access to media content, and long upload and download times are a major source of frustration for typical mobile device users.
- Mobile access to Web content and services is discouraged by poor response times, and transcoded content currently fails to provide a really positive user experience.
- current solutions involving dynamic content transformations - whether from HTML to WML, or adapting HTML pages for smaller displays - are also not ideal solutions.
- WO 2008/073207 discloses Web content adaptation for mobile devices.
- a mobile device transmits request data, that identifies a requested Web page and identifies the requestor device type, and awaits a response.
- the identification information in the request is compared with adaptation parameters to determine how response data obtained from a Web server should be adapted before being returned to the requestor mobile device.
- a main adapted sub-page and a set of subsequent adapted sub-pages are returned to the mobile device, each sub-page including adapted content that is in an order suited for viewing on a mobile device.
- the sub-pages may be cached for future reference.
- WO 02/087135 discloses adapting information content for mobile electronic devices that have limited hardware or network communications capabilities. Data content is converted into a document object tree and this is adapted using content folders. The content folders of interest are then used to provide adapted content for the software, hardware and network characteristics of the device.
- the inventors of the present invention have determined that an improved user experience is achievable for users of communication devices, by providing selected content in an optimized manner according to different device types, different content provider preferences, different user group preferences and/or for different categories of Web site and Web content.
- the invention provides a solution for customizing the displayed content and format and for creating new versions of Web content (including 'mini page' versions of Web pages) for a communications device that combines selected content from a number of different sources such as from multiple original Web pages.
- Another aspect of the invention provides a solution for automated analysis of a Web page to identify content components, component layout patterns and component combinations. These can be recognized and matched with stored definitions or templates. Presentation rules or templates are stored that are specific to a recognizable pattern or combination of content components, and specific to a device type or user preferences.
- the Web page analyzer according to an embodiment of this invention cooperates with a mini page generator to enable a reformatting of the content onto a mini page that is suitable for the requesting device type or preferences of the requesting user.
- device-type-specific directory pages and content pages are hosted in a content access platform that greatly improves the end user experience when accessing web content from a resource-constrained device, while also reducing the burden on Website content providers and communication network providers.
- a customised directory definition and a customized set of mini pages can be generated and saved for the preferences of an individual user or user group.
- the required directory page for a given device type can be determined from received requests - either from device information in request header attributes (if available) or by use of predefined subdomain names that include a device type identifier.
- a suitable version of a Web page or other Web contents can be identified based on the requestor device type.
- the present invention provides a solution for use in a communications network, in which multimedia content such as Website content needs to be accessed from many heterogeneous devices such as mobile telephones and PDAs, as well as desktop and laptop computers.
- multimedia content such as Website content needs to be accessed from many heterogeneous devices such as mobile telephones and PDAs, as well as desktop and laptop computers.
- a content access platform and methods and apparatus according to various embodiments of the invention are provided to mitigate the problems identified above.
- Embodiments of the invention enable suitable versions of content such as Web page content to be provided to each end-user device that makes a request for the content, the provided versions being suitable for the capabilities of the requestor device and a desired user experience.
- Web pages that are suitable for display on a particular mobile telephone's screen area via the telephone's Web browser can be generated and stored so as to be more easily available for retrieval.
- the invention can take account of the requirements and capabilities of users' devices such as particular screen sizes, memory capacity, browser and operating system type, and other device-type characteristics.
- the invention can provide open access to Internet content from a very wide and extendable range of heterogeneous access devices in a seamless manner, potentially supporting the total device-type and browser base.
- embodiments of the invention enable elements of Web content to be combined in accordance with user-specific requirements and content-provider specifications, to provide a customized user experience.
- suitable versions of Web pages can be performed preemptively in advance of a request being received, or in response to a request being received.
- suitable versions of Web pages ('mini pages') and/or suitable templates for generating 'mini pages' are generated and stored so as to be available for use when required. Any number of pre-formatted 'mini page' versions may be generated and stored, according to the number of different device types that may request access to this content.
- the invention is also useful in providing improved access to suitable versions of downloadable application programs such as games, to ensure that a request for an application from a particular end user device is responded to by providing a version of the application that is suitable for the device type.
- One aspect of the invention provides a content access platform comprising at least one content server for storing one or more versions of items of content (such as 'mini page' versions of Web pages), and a router for redirecting content requests (e.g. HTTP Web page requests) to a selected content server or repository in response to identifying the content request as having come from a particular requestor device type.
- the content server stores one or more versions of the content that are suitable for display on one or more specific types of requestor devices.
- the router identifies attributes of the request that are representative of the requestor device type, and redirects the request to an appropriate content server at which an appropriate version of the content is stored, or redirects requests to a plurality of content servers to retrieve each of a plurality of elements of a desired mini page.
- the appropriate version of the content is retrieved (or is generated by combining a number of elements of content) and is sent to the requestor device, and the end user can then view the appropriate version of the media content.
- the content access platform preferably includes a Website analyzer for scanning and analyzing the content of Web sites to identify high-level visual components of each Web page.
- the analysis can involve parsing the HTML within a Web page source file to identify static content component types, and generating HTML DOM structures corresponding to the pattern or layout of the Web page.
- the analysis can involve using a script engine or Browser engine to create the DOM tree of a page at runtime, to identify dynamic contents that are created by client-side script (such as Javascript or AJAX).
- the analyzer performs an analysis of the text on a page, such as a statistical analysis of high frequency words on the page or Website, to identify representative key words or to determine whether the site is a sports site, a retail site, a news site, and so on. Category information and/or identified keywords may then be stored in association with the Web page or Website.
- the analyzer also identifies the separate building blocks of the site and their layout (e.g. parsing the source HTML to determine the hierarchical HTML document structure of a page and the page hierarchy within a Website, and identifying major 'assets' that users of the site will require access to).
- the analyzer may perform its operations pre-emptively, for example when prompted by an operator of the content access platform, to generate mini pages in readiness for later retrieval by device users when users request access to the respective Web content.
- New versions of Web contents referred to as 'mini pages' in the context of this patent specification include both minor changes to an original Web page (such as a scaled down and re-ordered pages including substantially the same content as the original page) as well as simplified pages that omit a proportion of the original Web page content for simplified display on a resource-constrained device.
- a 'mini page' may also include components that were not included in a single original page, resulting in a new combination of content.
- device-type-specif ⁇ c mini pages is especially advantageous for users of resource-constrained devices that have either small screen sizes or constraints on download speeds or memory, but no limitation is implied by the phrase 'mini page' which is intended to encompass any replacement Web pages that are created for display on a particular type of device or created for a particular set of users.
- device-type-specific templates can be generated and stored for reconfiguring Web page content for display on devices of that type.
- the use of device-type- specific templates that encapsulate device characteristics and/or user requirements for generation of the mini-pages enables Web content to be presented in a manner that is optimised for the particular user, especially when combined with the intelligent modelling of a Website to categorize and model the site and to use that site model to guide the selection or generation of a suitable template.
- end users are unaware of redirection of requests by the router, and yet the user benefits from faster and more efficient downloads, and benefits from a device-type- suitable version or a customized version of the requested content being displayed on their particular device.
- an improved user experience is achieved transparently. Users can continue making content requests in the conventional manner or via a new mechanism that is described below.
- the telecommunications network providers benefit from embodiments of the invention that enable requestor devices to be sent reduced-data-size versions of media content, as the invention can be used to provide an alternative access mechanism for very popular content that may otherwise suffer from communication bottlenecks and latency. By mitigating such problems, embodiments of the invention improve end user experiences and facilitate the delivery of additional services by the network providers.
- a content server of the content access platform provides a repository into which many different Website providers and other content providers can save mobile-device-compatible 'mini page' versions of their Websites or selected content.
- Web content providers can benefit from the media access platform as a distribution channel for Web content, and content distribution may be a fee-based service to enable the media access platform provider to recover costs.
- the storage provided by the content access platform ensures that Website providers are not required to maintain additional storage for their different versions of Web content, as this and the labour-intensive processes of creating and maintaining a mobile-device-friendly version of a Website can be outsourced to the content access platform's template-based generation and content servers. This lowers the cost barrier for Web content providers who would like to provide a mobile-device-friendly version of their content.
- the original Web servers can be shielded from the problems of potentially very high network traffic arising from mobile devices sending media content requests, as requests from such devices will be redirected to the respective content servers of the content access platform that hold suitable mini pages or hold content for building a mini page.
- Website providers responsibility for maintaining a satisfactory response to requests from mobile devices can be outsourced from the Website providers to the provider of the content access platform.
- This can be very beneficial to a Website provider, especially during periods of very high numbers of requests such as during popular sporting events. Many Web servers will be unable to manage the high workload that arises if requests from every different device type are competing for service from that server.
- the Website providers can rely on the router to achieve efficient routing of requests to the content server that holds the relevant mini pages or elements of mini pages, since the router has the requisite knowledge of where each version of the content is stored. This knowledge can help to avoid network bottlenecks.
- a content access platform provides a highly scalable mechanism for publication of customized Web pages and for publication of other media content such as games and other application programs - enabling a bespoke user experience when this is desired but providing a solution that scales to millions of users of many different Websites.
- Website developers and other content providers that do not have sufficient resources to make their content widely available to mobile device users can take advantage of the content access platform to publish different versions of their content, each version being suitable for a different user device type and/or user group. Requests to retrieve the particular media content can be redirected by the router to retrieve the most suitable version, without the content provider having to participate in the identification of requestor device types or the routing of requests or the handling of potentially large numbers of requests from mobile device users.
- This aspect of the invention can be implemented as a fee-based service, providing commercial opportunities to the provider of the content access platform as well as the Web content providers.
- particular desired combinations of media content may be specified within a new configuration file such that a customized combination of content is available for retrieval by some device users.
- the specification of a desired content combination may be by the original content provider or by another special interest group. End users may register their devices with the content access platform with an indication of which device-type ⁇ and user- group they correspond to. This could involve users specifying language requirements, such that their content requests are routed to one or more content servers holding video and associated Mandarin text and audio, whereas another user may wish to access the same video with English text and audio.
- the customized content combinations are achieved by the user specifying content requirements, which requirements data is then used (for example by a Website provider) to generate a template that captures the content requirements as an XML configuration file that includes one or more URLs for retrieval of desired content.
- requirements data is then used (for example by a Website provider) to generate a template that captures the content requirements as an XML configuration file that includes one or more URLs for retrieval of desired content.
- Some users may desire to have a combination of applications such as a news feed component providing latest sports scores in combination with a small video screen area showing live action, additional images and games.
- the content identifiers can be defined in a template or configuration file and can be used to combine the desired Web assets within a new mini page. The mini page can then be stored for access by the requesting user, and potentially by other users.
- the media access platform thus provides a user-centric mechanism for experiencing content in a preferred and customized way, which differentiates from the mechanical content- transcoding-layer approach of known solutions.
- This embodiment of the invention provides opportunities for placing advertisements with popular content for maximum visibility.
- One embodiment of the invention includes a router that cooperates with a proxy service to enable more efficient route determination to be carried out for sending content requests across the network, using knowledge of the locations of device-type-specific versions of content to enable improved resolution of routing questions.
- the route determination will enable network bottlenecks to be bypassed in many cases. In a conventional system, it is common for bottlenecks to arise when requests from large numbers of mobile telecommunication devices are directed at the same Web server during a short period of time.
- an efficient route to the desired network location can be determined.
- One aspect of the invention provides apparatus for use in a communications network, comprising: a template repository storing a set of template files that each specify information for determining how Web content is to be displayed on a particular type of user device that is associated with the respective template file;
- a request analyzer arranged within the network for analyzing requests for access to Web content, wherein the content requests include information enabling identification of the device type of the respective requestor device, and wherein the request analyzer is adapted to analyze requests from requestor devices to identify the requestor device type;
- a router that is responsive to identification of a requestor device type to retrieve a respective template file from the template repository and, in response to the template specifying a plurality of separate sources of Web content, to send a plurality of Web content requests to the separate sources, thereby to retrieve required Web content from the plurality of separate sources;
- a content generator for combining the retrieved Web content and providing the combined content to the requestor device.
- a further aspect of the invention provides a method for accessing Web content via a communications network, the method comprising:
- a further aspect of the invention provides a router system, for use in a communications network, wherein the router system is arranged for receipt of content requests from content requestor devices, which content requests include information enabling identification of the requestor device type, the router system comprising:
- a repository storing network location information for a plurality of sources of content, the repository also including an identification of respective device types with which each of the plurality of content sources is associated;
- a request analyzer for analyzing received content requests from requestor devices to identify the respective requestor device type
- a request redirection component that is responsive to identification of a requestor device type to redirect the request to a selected plurality of the content sources.
- a further aspect of the invention provides a method for automated generation of a displayable version of Web content, the method comprising: analyzing Web content to detect a set of identifiers of content types within the content;
- the saved presentation can be identified and served to the requestor device.
- a further aspect of the invention provides apparatus for use in a communications network, comprising:
- a Web content analyzer for analyzing and categorizing Web content
- a first content server system comprising a repository for storing a first version of one or more items of Web content, the first version of the Web content being predefined as suitable for display of Web content of a first category on devices of a first device type, and wherein the first content server system is configured to provide access to the first version of the Web content in response to receipt of a request for the Web content;
- a request analyzer arranged within the network for analyzing requests for access to Web content wherein the content requests include information enabling identification of the respective requestor device type, wherein the request analyzer is adapted to analyze requests from requestor devices to identify the requestor device type; and a request redirection component, which is responsive to the requestor device being identified as a device of the first device type, to redirect the request to the first content server system and to retrieve the first version of the Web content.
- a further aspect of the invention provides a method for automated generation of a new version of media content, the method comprising:
- the requirements definition includes information defining processing operations to be performed on selected content and the method of automated generation comprises performing the defined processing operations on selected content before combining the selected content to create a new media content version.
- the content requirements definition preferably comprises an XML configuration file specifying screen dimension and/or layout information for a type of user device.
- Another aspect of the invention provides a media content generator for carrying out a method as described above.
- Various components of the invention described above such as the request analyzer, Website analyzer, mini page generator, template generator and the request redirection component and route determination function of the router and proxy server, may be implemented as one or more computer program products.
- Such products comprise computer program code recorded on machine-readable recording medium and configured to control the operation of a data processing system on which the program code is executed.
- Apparatus such as the proxy server and content server according to preferred embodiments of the invention preferably comprises one or more data processing systems having one or more processors, data storage resources including random access memory and non-volatile storage providing a content repository and cache for mini pages, and network communication capabilities. These systems are capable of running the computer program implementations of the above-described components.
- Figure 1 is a schematic representation of the components of a Web content delivery platform according to an embodiment of the invention, for use in a communications network;
- Figure 2 represents a directory access request and mini page access via the directory, according to an embodiment of the invention, and shows example screen shots including an example mini page and example MiniSite directory pages that can be used to link to the mini page;
- Figures 3 A,3B,4A and 4B show the sequence of operations of a content access method, which operations are performed at various components of the communications network of Figure 1, according to an embodiment of the invention
- Figure 5 shows examples of list modules in a Web page
- Figure 6 shows an example article module
- Figure 7 shows a mini page representation of a list module
- Figure 8 shows an example mini page representation of an article module
- Figure 9 shows an example visual list module and its corresponding DOM structure
- Figure 10 shows an example visual component and the way it is specified in the example DOM structure
- Figure 11 shows an example article module and its subdomain tree
- Figure 12 shows an exemplary snippet of a list module definition
- Figure 13 shows an example of two modules with matching DOM structures
- Figure 14 shows the structure of a configuration file according to one embodiment of the invention.
- Figure 15 shows compression and other transformations for resource handling, according to an embodiment of the invention.
- FIG 1 is a schematic representation of the architecture of a Web content access platform 10 according to the invention, and operational steps for accessing Web content are described with reference to Figures 3A,3B,4A,4B.
- the platform comprises a router 20 that may be running on a data processing system such as a network gateway server system (not shown) for receiving Web content requests from a plurality of different types of end-user devices 100, 110, 120, 130.
- a network gateway server system not shown
- requestor devices 100,110,120,130 including desktop and laptop personal computers and a range of PDAs and high-functionality mobile telephones ('smart phones') and low-end mobile telephones.
- requestor devices 100,110,120,130 including desktop and laptop personal computers and a range of PDAs and high-functionality mobile telephones ('smart phones') and low-end mobile telephones.
- the Web content requests may comprise conventional HTTP requests or an alternative form of Web content request such as described below.
- the Web content access platform 10 of this embodiment hosts 'mini pages' in a content repository 60 and hosts a MiniSite directory in directory repository 140.
- the MiniSite directory comprises a list of available mini pages, and can be provided as a set of alternative versions of the directory pages, with each version being suitable for presentation on a different type of device.
- Example screen shots showing a MiniSite Directory screen display are included in Figure 2, but the displayed representation can vary greatly between devices.
- the router 20 within the content access platform 10 includes a mechanism for intercepting directory requests and Web content requests. This mechanism is provided by the request analyzer 30, and its operations are described in detail below.
- user agent header attributes (such as HTTP request headers) are analyzed on receipt of a Web content request, and are compared with a device database that can match the user agent (UA) header attributes to a set of characteristics and capabilities of the requestor device. This is described below.
- end users devices have pre-installed user agents that are configured to specify their device type explicitly within non-standard requests sent to the content access platform 10. This is especially advantageous if the communication network includes other communication gateway servers that do not pass on UA header attributes when routing requests.
- a directory-reference URL is embedded in a user device for use by the device browser.
- the device type is specified by means of a subdomain name within the URL.
- a mobile telephone's Web browser is configured to send standard requests that include a unique resource identifier comprising a domain name pref ⁇ xe.d by a device type identifier such as: 'e71.url23.com' f where 'e71 ' identifies a specific device type such as a Nokia e71 mobile phone, and 'url23.com' identifies the domain name associated with a MiniSite directory
- intuitive identifiers for device types (such as this example 'e71 ') are desirable, it is known in the art that there can be many levels within the hierarchy of labels within a domain name and each label can include up to 63 ASCII characters up to a total of 253 characters for the domain name, and so the simple example given above is just one simple example of use of a subdomain name as
- the content access platform of this embodiment allows for both request types to be used.
- Other implementations of the non-standard resource request can be used, as long as this is predefined for the requestor devices 100,110,120,130 and the request analyzer 30 of the router 20 such that any such non-standard requests can be interpreted correctly.
- the use of device-type identifiers in subdomain names is advantageous because this can easily be extracted from requests and compared with a device type database.
- the device type prefix defining a subdomain e.g. the prefix 'e71 ' of subdomain 'e71.url23.com' of domain 'url23.com'
- the device database can be searched for the request and compared with a device database, to retrieve device characteristics or display requirements, and/or to determine which pre-stored MiniSite directory page 210,220,230 or mini pages 240 are suitable for the type of device.
- the device-type identifier within the request can be directly matched with a stored directory page that is associated with that device type. This process is described in more detail below with reference to Figures 3A, 3B, 4A and 4B.
- Each type of mobile handset e.g. each device series or model
- Customized directories can be provided and hosted within the content access server, with different content categories and different recommended URL links for different users.
- a user's personalised directory page can be held in storage of the content access platform and accessed by user requests specifying a sub-domain- name that identifies the directory page.
- This hosting of personalized resources enables users to share their directory pages (like music playlist sharing) with friends or work colleagues using either SMS or email to build a social network or increase workplace efficiency.
- the embodiment described above takes advantage of a pre-installed -directory-reference URL on mobile handsets 100,110,120,130 that is used by an installed browser 250,251 to send directory requests 200 via the router 20, but in an alternative embodiment a version of the MiniSite directory page 210,220,230 can be installed on the mobile handset itself.
- a MiniSite directory page is the improved user experience that results from avoiding the need for users to key in Mini-page URLs.
- the content access platform 10 of this embodiment includes a plurality of content servers that each provide data storage suitable for storing content in an accessible form or for storing directories listing available content.
- Figure 1 represents the content access platform 10 as a single system including a directory repository 140 and a content repository 60, but no limitation to a single system is required and it is well known that a single mainframe server computer can be replaced by a distributed network of servers, and references to a content server or a repository herein are intended to include the possibility of distributed architectures. Additionally, volatile cache storage is provided within the router 20 to provide fast access to recently used mini pages.
- the content access platform 10 has internet connectivity for accessing external Web servers 40 for accessing Web content that is not stored in these repositories, using known communication mechanisms.
- FIGs 3 A, 3B, 4A and 4B show processes carjied out within the content access platform 10 in response to receipt of a request 300 from an end user device.
- each request is received via the router 20 and analyzed by the router's request analyzer 30.
- the analyzer 30 initially checks 310 for a prefix other than 'www' within the subdomain name, identifying any such prefix as a potential device type identifier.
- the analyzer compares 321 the subdomain name with known subdomain names representing device-type-specific resources (as a first mechanism for identifying device types or model numbers).
- the analyzer If there is no identified match for the subdomain name, the analyzer also obtains 322 HTTP header attributes from the request and checks 323 for a device-identifier within HTTP header attributes, if such device attributes are received (some countries have communication infrastructures that do not pass on such header attributes, so it is advantageous that another mechanism, such as the use of subdomain names, is also implemented). Having identified a device-type-identifying subdomain name or HTTP header attribute, the analyzer compares 320 the identifier with entries in a device database 50 that maps device type identifiers to device model numbers.
- the subdomain name is compared with the device database 50 to identify a device model number and this model number is compared 330 with the database to determine whether the characteristics or display requirements are known for that model. If the model is known, the subdomain name is also compared 340 with tables of user-specific subdomains to determine whether the request is for a directory page that is specific to an individual user or user group. If yes, the respective user-specific and device-type-compatible directory page is served 350 to the requestor device; if no, a device-type-compatible directory page is served 360.
- a standard directory page is served 390 and an alternative mechanism is provided to determine an appropriate version of the directory page. This is achieved by sending 370 to the requestor device a request device information - preferably in the form of a user prompt, although this could be automated if communication devices are configured to automatically respond to requests for device information.
- the device model and retrieved information obtained in this way can then be added 380 to the device" database. Thereafter, devices of their type will be recognizable and can be matched to a respective device-compatible directory page. This provides a self-learning mechanism, such that the content access platform will support more devices over time.
- the served directory pages enable easy user selection of mini pages that are available via the content access platform, via the known mechanism of browser activation when a user selects a hyperlink within a displayed Web page.
- Described below is the analysis and processing of content requests, other than directory requests, when received by the router 20 of the content access platform 10. This includes requests that are sent from a user's device when a user interacts with a displayed MiniSite directory page, as well as other requests for mini pages and conventional Web page requests.
- the request analyzer 30 determines 310, from the domain name, that it is not a MiniSite directory request.
- the requestor device type and a user preference token are included in the request.
- user agent information in HTTP request header attributes can be obtained 401 (when these attributes are available) and analyzed 402 and compared with a device profile database by the request analyzer 30 to determine 400 the device type. This can provide a specific device model number that has known screen size, processing power and browser capability. A check is carried out
- the requestor device is determined to be a conventional PC
- the request is directed 420 to a standard Website on a Web server that includes content suitable for display or execution by the PC.
- the Web server provides access to the required resource in a conventional manner - retrieving and serving Web contents to the requestor device. This content can be served via the server system of the router, or via other network links.
- the Web access content server 10 includes features for hosting customized Web page versions that match user preferences, so some users may wish to exploit the capabilities of the content access platform even when there is no device-related motivation for special processing.
- conventional requests from PCs are routed to the target Web servers without serving mini pages but, in an alternative embodiment, PC users could register an interest in receiving customized content from the content access system.
- the requestor device is not a PC, the requestor device is assumed to be a device that would benefit from selection of a mini page or a version of Web content that is suitable for the requestor device.
- the router 20 thus determines 410 that the request should be processed further within the content access platform 10.
- the request analyzer 30 analyzes 430 the request and compares 440 the requestor device model number with an index of device types in a device database 50 that is accessed by the router 20.
- HTTP header attributes can include additional explicit device characteristics
- the device model number is extracted from the HTTP header for non-PC-originated requests and this number is used as a search key for accessing device capabilities from a device profile database 50.
- the device profile database 50 contains detailed characteristics for each of a large number of mobile device types, and is updateable as new devices come onto the market. Such device profile databases are known in the art (for example, open source lists of device capabilities are maintained in a publicly accessible manner). If user agent information is not available (e.g. is blocked within a communication network), a query page can be sent 450 to the requestor device to invite users to specify device characteristics (screen size, manufacturer, supported media types, etc). The user can insert information into the query page and submit this to the content access platform. This device information can then be permanently stored (e.g. using cookies or a user account database).
- the main additional information held in the device profile database 50 of the present embodiment is a device type identifier that can be used to represent the characteristics of the requestor device.
- device type A all requestor devices having a particular range of screen dimensions (i.e. similar sized screens), memory within a defined range and communication bandwidth within a defined range
- device type B devices having an alternative screen layout
- device type C devices having an alternative screen layout
- a device with an identical screen size to device type A, but with less processing power, memory and colour depth may be device type D, and so on.
- the model number of a current requestor device is held in the database 50 with device capability information and an associated device type identifier.
- the device type identifier is retrieved from the device profile database and combined with the
- Web content identifier typically a URL from the HTTP request header that identifies a requested Web page
- this combination of a device type identifier and Web content identifier are then compared 500 with an index of cached versions of Web contents that are stored in association with the router 20.
- a cache 25 within the router system holds this index and holds copies of the most recently retrieved mini pages and other Web contents, to enable fast access. If a suitable mini page is held in the cache, it is served 510 to the requestor.
- the combined device type identifier and Web content identifier is compared 520 with an index of Web content versions stored in a content repository 60 of a content server.
- the router forwards 550 the content request to the content server holding the required version of the requested Web contents, and this version is served 530,560 to the requestor.
- the content server holding content repository 60 may comprise a highly distributed server network including a plurality of content server systems. With a distributed approach, network bottlenecks can be avoided and less expensive server systems can be implemented (with built in redundancy and peer-recovery if required - as is known in the art). However, although contents may be distributed, in the present embodiment of the invention the index of stored versions is stored at each system that provides routing functions.
- each of the content versions stored within a repository of the access server 10 is a 'mini page' comprising an alternative version of the original target Web page, or is a device-type-specific version of a content component - such as an image, a video clip, a game or another application program.
- the mini pages and stored content versions will have a smaller data size than the original Web page contents, and will be more suitable for display on a small display screen or execution within a mobile device.
- 'mini pages' in the context of this patent specification include both minor changes to an original Web page (such as a scaled down and re-ordered pages including substantially the same content as the original page) as well as simplified pages that omit a proportion of the original Web page content for simplified display on a resource-constrained device.
- a 'mini page' may also include components that were not included in a single original page, resulting in a new combination of content, and no limitation is implied by the word 'mini' which is intended to encompass any replacement Web pages that are created for display on a particular type of device or created for a particular set of users.
- the corresponding mini page or content component is retrieved from the content repository 60 of the content server and transmitted 530 to the requestor device 100,110,120,130.
- the content repository 60 may be located in close proximity to the server data processing system that is providing the router 20 functionality, but ihe content access platform of the present embodiment is not limited in this way, and a typical content request will be routed across the network to the relevant content server. For static content, this routing relies on the index of stored content held at the router system to route the request to the respective content server holding the respective repository.
- a request that requires access to Web content as a mini-page is routed to a Web site (or ⁇ to a plurality of content sources) that includes suitable content.
- the request may be routed to the same Web server as holds the original Web page, if the mini-page and original web page are held on the same server.
- the requests are typically re-routed to a different content server that holds mini-page versions of Web pages.
- This is implemented by means of a proxy service 70 that re-routes requests (that request content from an original site) to an alternative network location at which is held an alternative version of the requested content.
- the proxy service is implemented using the Domain Name System (DNS) and uses knowledge of the location of different versions of particular requested content ('mini page' versions).
- DNS Domain Name System
- a Web page analyzer 150 is provided by the content access server for analyzing the contents of selected Web sites.
- the Web page analyzer is used to identify content components and component combinations within a Web page, and to compare these with known component combinations and layout patterns. An identified combination or pattern can be matched with stored presentation definitions or templates. Presentation rules or templates are stored that are specific to a recognizable pattern or combination of content components, and specific to a device type or user preferences.
- the Web page analyzer according to this embodiment cooperates with a mini page generator to enable a reformatting of Web page content onto a mini page that is suitable for the requesting device type or preferences of the requesting user.
- the first major function of the Web page analyzer 150 of this embodiment is to semantically identify the high-level components of a Web page and to extract their meta data. These are the components that would be visually identifiable as separate components by a typical PC user.
- the analysis involves parsing the HTML within a Web page source file to identify static content component types, but also involves generating HTML DOM structures corresponding to the pattern or layout of the Web page so that dynamic contents that are created by client-side scripts are also identifiable.
- a script engine is implemented within the Web page analyzer to create the DOM tree of the page so that this structure (representing the layout pattern of the components) is available for analysis.
- a combination of specific component types is referred to hereafter as a 'pattern', and a 'module' is the term used to describe a particular combination of actual components - i.e. a module is an instance or implementation of a pattern.
- the components and modules are tagged using keywords extracted from their title and description fields in the component metadata.
- a module comprising a combination of Web site content components is thus recognizable if it corresponds to a predefined pattern within the DOM structure of the page.
- the modules that correspond to known patterns can then be used by a mini page generator (described below) to generate a mini page based on a set of stored templates that define presentation requirements of a particular requestor device type. Templates can also be customized according to particular user or user-group requirements.
- a 'List module' is a reference to content that is available on a separate Web page, and mainly contains the following elements:
- Figure 5 provides examples of list modules (represented as Module- 1, Module-2, etc) in a conventional Web page.
- Figure 7 shows a mini page representation 710 of a list module.
- An 'Article module' contains actual Web page content, and mainly contains
- An example article module is shown in Figure 6, with a set of separately identifiable components title 600, metadata 610, picture 620 and text 630, as well as some related links 640.
- a mini page article module 720 is shown in Figure 8.
- Module content may be static or dynamic, with a different analysis procedure required for dynamic contents.
- HTML elements within the Web page source file can be analyzed using an html parser to extract the component-identifying HTML headers.
- an article title and the body of a text article can be identified as separate elements.
- client side script i.e. java-script, AJAX
- the content components can be identified by a script engine that creates the Web page DOM tree at execution time. An example is user comments on a news Web site.
- the module identification process includes a pattern recognition step.
- pattern recognition can be iteratively improved by means of an adaptive learning process as new pattern are identified and stored.
- the system generates .or is provided with a basic set of module-definitionrfiles (in XML files format). These can be obtained by analyzing a selected set of popular web sites (e.g. The Guardian, BBC, YouTube, etc) and identifying the static HTML components and dynamic components that are part of the DOM structure of pages on these sites. Having generated and saved a set of module-definitions, the content access platform will be able to automatically identify modules from unknown Web sites by using existing module-definition-files obtained from sites that have identical or similar HTML DOM structures. Additionally, each time the Web page analyzer within the content access platform encounters unknown patterns, a system administrator can be prompted to create new definitions for the new patterns.
- module-definitionrfiles in XML files format.
- FIG. 9 An example visual list module component 910 and its corresponding definition 920 in the DOM tree are shown in Figure 9. This can involve manual steps as set out below.
- the system administrator can: Specify URLs or URL-patterns applied to the module; Specify location(s) of contents in the module, including:
- List module specify the outer-most enclosing HTML element (i.e. ⁇ DIV>, ⁇ TABLE>, etc)
- Article module specify HTML elements for each attribute in the module, including related links (which are list modules embedded in an article module)
- List module extract all ANCHOR ( ⁇ A> tag) within the enclosing element, then combine textual or imagery date with same "HREF" attribute
- Article module locate and extract data from html elements of every attribute.
- An example URL pattern is given below. This comprises a regular-expression (REGX) used as criteria to define a set of URLs. Any URL that matches the URL pattern REGX belongs to the set.
- REGX regular-expression
- Figure 11 provides an example of an article module 1110, showing the association between the visual components within that module and their details in the subdomain tree.
- a snippet of a list module definition is provided in Figure 12.
- module identification can be automated, as described below.
- the analysis method involves downloading the page source file (e.g. using a traditional crawler) and parsing the HTML file to generate a DOM tree (using an html parser). The DOM tree is then traversed and, for every node N traversed, the following steps are performed:
- the specifying and locating of HTML elements involves use of a tree-path string of the element node in the DOM tree.
- a tree-path string of the element node in the DOM tree For example, referring to Figure 10, to specify the location of text component 1010 "Ajax dynamic content ", according to its element 1020 in the DOM tree, its tree-path string is Html-body-div[id:header]-div[class:title]-hl. Locating a module or its attributes is then simply a tree path search.
- the tree path string comes in two forms:
- modules within new Web sites that implement one of these patterns can be automatically recognized, as follows. Given a URL whose content is unknown to the system, the referenced page is retrieved and parsed into a DOM tree. The DOM tree is broken into Sub-trees and the system checks each sub-tree against existing module-definitions, trying to find a best match for each sub-tree. The best match module-definition is used to automatically extract data for the sub-tree (module). Note that if two HTML visual components have similar or identical DOM structures, the module definition of one module can be applied to another module. An example of two modules 1310 and 1320 having the same module definition (by virtue of having the same DOM structure) is shown in Figure 13. The "more news" and "latest multimedia" modules on an exampletnews Web page have different content but the same DOM structure.
- Modules can be saved in categories according to keywords as well as HTML DOM structural information. Each of a set of categories such as news, finance, sports, is assigned with a set of representative tags or keywords. Keywords can be extracted from a module based o txt word frequency or other measures, and the extracted keywords can be matched against a keyword set that is stored for each category. The keyword matching is evaluated by use of a weighted matching score, with the module identified as belonging to the category having the highest matching score.
- a new mini page may be built in response to a user request at runtime, or as a pre-emptive operation when Web pages are identified that are expected to have a high demand for requests from mobile devices.
- Newly created mini pages can stored at a content server for reuse (for example caching in volatile storage, if this is in accordance with a mini page caching policy).
- Such mini pages can combine multiple components that are located on different content servers within a network.
- a configuration file is used to identify the components that are to be included in a mini page, the IP address of the respective server holding each component, and how the components are to be displayed on a particular type of requestor device. The structure and use of such configuration files is described in more detail below.
- a respective device-type-suitable configuration file is retrieved 570 from a content server and information within this file is used to identify components of interest.
- the components can be retrieved from respective content servers and processed and combined 290 as specified in the configuration file.
- the proxy server that re-routes requests to specific content servers can invoke the Web site analyzer program (described above) to retrieve 540 required Web content over the Internet.
- the retrieved page content can be decomposed into its component parts using analysis functions within the Web site analyzer, and content components can be extracted as specified within the configuration file.
- the configuration file specifies how the components are to be combined and adapted to form 590 a new mini page, including the component order and format determining how this mini page is to be displayed on relevant devices.
- the mini page is then returned 530,550 to the requestor device.
- the mini page can be cached 595 at this time.
- the content access platform 10 thus provides a distribution mechanism for mini pages, some of which may be manually created and some of which may be generated automatically.
- the access and distribution mechanism described above functions in the same manner regardless of how the mini page was created, and regardless of whether it was added to the platform by the original Website provider or was generated automatically within the content access platform.
- Mini pages can also be provided by other interested parties (such as a mini page designer or an end user).
- the content access platform of one embodiment of the invention avoids this manual effort by providing a mini page generator 90 for automated mini page generation 590.
- a first function of the mini page generator is to apply pre-generated templates 800 (representing a suitable page layout and content display properties and requirements for a particular device type) to content obtained 560 from a Website of interest, selecting required components of the retrieved Web page and reformatting the content for display.
- the templates 800 may be implemented as XML configuration files that define mini page presentation properties for an identified device type, and which can be applied to Web page contents to control the generation of a new version of the Web page contents.
- a new version ('mini page') that is appropriate for a requestor device, and making that available to the requestor device, optimized presentation of the requested Web content is enabled for the particular requestor device.
- the pre-generated templates can be stored in a combined template and content repository within the content servers, such that an initial search 500,520 of the cache and mini page repository for a matching mini page can, if unsuccessful, be quickly followed by identification and retrieval 570 of the relevant template.
- An appropriate mini page can then be created 590 by applying the configuration instructions of the template to the original Web page contents, selecting components of the Web content and reformatting for display on the requestor device screen.
- Figure 14 shows an exemplary template structure.
- the customized configuration file (template) structure of Figure 14 is implemented as an XML file.
- the tree structure illustrated is part of the Document Type Definition (DTD) of these XML templates and various nodes within the DTD tree structure are explained below.
- DTD Document Type Definition
- 'Identification information' 802 - includes a unique template name 803 (such as 'nba data' for a template associated with the National Basketball Association's statistical results data); and a template description 804 (for holding a description such as 'nba statistic data', and may include other options).
- a unique template name 803 such as 'nba data' for a template associated with the National Basketball Association's statistical results data
- a template description 804 for holding a description such as 'nba statistic data', and may include other options.
- 'List of URLs of interest' 805 exists to hold URLs for content that is to be included in the mini pages created from this template. This includes a URL pattern 806 (such as 'http://data.nba.tom.com/stats.htmr and 'http://nba.tom.com/news').
- 'Web page analyzer' 807 - defines crawling properties, including a content server IP address 808 at which the template may be applied to Web content.
- the analyzer node also specifies page content 809 to be crawled (i.e. portions or parts of the target page to be retrieved and stored, such as specific HTML elements to be discarded and elements to be retained, and whether original style tags should be retained or not).
- an analyzer bean 810 which is a specific Java bean for extracting data from a source Web page.
- This node includes a default view Java Bean 821 (which may be included if device-specific reformatting information is not included in the node 820) and a device rendering list 822 which lists characteristics for one or more device types. For example, a device may be specified by its device number 824 'nokia95' or 'nokia72' for example, its screen width 825 '240px' or '176px' and other capabilities such as whether there is JavaScript support ('yes' or 'no') and flash memory support ('yes' or 'no').
- the device rendering list also includes a rendering style/format 828 and identification of a page component to be rendered 827.
- • 'Accelerator Properties' 830 specifies whether to apply download acceleration or not ('yes' or 'no'). This can make use of a conventional dedicated download manager program (80 in Figure 1), which implements such techniques as searching 831 (i.e. crawling the Web) for mirror sites and performing parallel segmented downloading, variable bandwidth usage and limiting, resuming paused downloads, etc. Also specified, within a cache behaviour definition node 832, is a cache interval in seconds (e.g. 180 seconds).
- the creation of a template for use to create and serve mini pages involves the following steps:
- the contents included in the mini page must be specified within the XML configuration file, for example by specifying: URLs of original Web pages, whose contents will be used in the mini page; and, for each original Web page, a definition of which part(s) need to be included (the content components or "assets" of a Web page).
- a component can be specified using an HTML element ID, or using the position in the
- HTML document tree of the page such as, for example, the content contained in 3 r division (DIV) of the page body.
- a component is uniquely identified by its ID.
- a component can be reused by other mini pages. For example, if a user wanted to create a mini page that shows live stock index chart from YAHOO and MSN money, the user needs to identify the source pages www.vahoo.com and www.msn.com as well as identifying the 2 components
- mini page metadata such as a page title, CSS style-sheet, background, etc.
- IDs of components to be included in the mini page are specified.
- Preferences for reformatting are specified (e.g. background, font).
- Templates representing content display properties for a particular device type may be stored in the template and content repository for that particular device type without reference to a particular Web site or other Web content.
- a first embodiment of the invention selects a stored template that is appropriate for the particular type of Web site as well as the particular user-device type whenever such a template is available.
- a site may be identified as a sports event site, a news site, a retail store site, and so on, and the way Website content is selected and presented on the requestor device can then be adapted for the type of site being considered. The achievement of this adaptation of display properties for different Website types or categories as well as different device types is described below.
- Templates as set out above thus enable the creation of customized combinations and customized layouts and formats of content components, that may be retrieved from a plurality of different Web sites and other network-accessible sources.
- the content access platform of the present invention makes it possible for Website providers and others to provide these new combinations of content for access by a wide range of different types of communication devices - including PCs, PDAs and high-end or low-end mobile telephones.
- embodiments of the invention can significantly improve access to Web content and enhance the user experience for people accessing Web content from a mobile device.
- a Website analyzer component is associated with the mini page generator, and implements an analysis of the contents of the selected Website in order to determine whether the Website can be categorized as one of a number of predefined Website types.
- the Website analyzer is implemented as a computer program that is configured to scan selected Websites (or other online content) to analyze their contents.
- a first function of the Website analyzer program is to determine whether the content matches any one of a set of predefined categories of content.
- the analyzer program scans the text of a Web page to identify high frequency words, and then compares the identified high frequency words with a knowledge base in which keywords are associated with specific categories of Website or Web content.
- This provides a first high-level categorization and indexing of crawled pages, enabling identification of sports-related Websites or pages, and separate identification of financial information Websites or pages, etc. Additional pattern matching algorithms and data mining techniques can be implemented, including image analysis and other content and context analysis techniques may be employed, but this is not essential. High frequency keywords and categorization information can then be stored in association with a Web page or Website.
- HTML tags can identify titles distinctly from other text paragraphs.
- the tags and word frequency statistics and other content-analysis can identify specific content within a Web page.
- the identification of different elements or "assets" within a Web page can thus make use of existing HTML tags and the HTML can be parsed to identify the separate elements.
- the HTML tags or other characteristics of these elements can then be used to assign each element to one of a number of categories of content.
- a template can be automatically selected or a new template can be generated.
- the selected or generated template will indicate a subset of the identified categories of content that are to be included in a particular 'mini page' version of an original Web page or other content.
- the template also specifies their layout within a defined display screen area.
- the templates in this embodiment are implemented as XML configuration files, and can specify which categories of content are to be included in a mini page without changes, and which categories of content are to be reformatted or transcoded in some other way when being included in the mini page.
- an image, a video clip or an audio file could be replaced with reduced-data- size versions - providing the 'best fit' in terms of resolution/quality of video, image and audio while taking account of the resource constraints of the user's device (screen size, memory, bandwidth, etc).
- a text column could be reformatted to fit within a predefined screen area.
- the configuration file that is defined for a particular device type can then be applied to the categorized set of building blocks of the Web page of interest to control generation of a mini page.
- the mini page can then be stored in a content server with other 'mini pages' and content that is suitable for devices of the ' particular type.
- templates or configuration files are generated with reference to end-user device characteristics, such as screen size data, and with reference to the requirements of the target demographic of device users and the requirements of the Web site provider. Some Web site components may be prioritized more highly than others to achieve particular emphasis on a display device.
- the generation of a template is described in detail above.
- the above-described Website analysis can be performed pre-emptively by the Website analyzer in advance of specific requests being received for content from that site, with the Website analyzer running as a background task to identify Websites having content which matches a list of content types and predefined topics that are expected to be of interest to mobile device users and hence for which mini pages are likely to be required.
- a pre-emptive determination of which sites are to be analyzed and to have mini pages created for them is based on a combination of pre-emptive Website provider decisions and measured network traffic or measured Website hits corresponding to actual received requests. This approach ensures that mini pages are generated for the most frequently accessed Websites.
- this is implemented by storing both a requestor-device-type identifier and a Website-content-type identifier with a stored template. Taking account of both the characteristics of the device and the characteristics of the Website enables selection of the best available stored template for generating a mini page.
- the cache is organized in two layers, comprising a first cache layer in volatile memory (e.g. RAM of the content server) forstoring frequently access mini pages and templates, and a second cache layer in non- volatile storage (e.g. disk storage) for less-frequently accessed mini pages and templates.
- volatile memory e.g. RAM of the content server
- non- volatile storage e.g. disk storage
- a mini page should not be cached for such a long period that the match scores become very outdated and yet retaining a still image on a mini page after that image has been replaced on the main site is likely to be less important to users.
- the analysis and categorization of a Website and its contents by the analyzer enables this Website- specific caching policy to be implemented.
- the content access platform of the present embodiment also provides support for generation of mini pages using a default template, and support for the generation of new templates, in order to cater for new device types that become available in the future.
- a requestor device type cannot be identified from a device model number within the HTTP header attributes or a received request, two options exist.
- the HTTP header details are investigated more fully to identify characteristics of the requestor device type, and these characteristics are compared with the device capabilities information in the device profile database to identify similar device types.
- a device type identifier corresponding to the best fit device type is then used to select a template that is likely to be suitable for the new device type.
- this template is used to generate a suitable mini page for the requested Web page, and the mini page is returned to the device user, but this mini page can be returned with a user prompt asking for further details of the device type.
- the user can then be invited to download and execute a small footprint device analysis program, which obtains device capability information for use in generating a new device-type- specific template.
- the template structure shown in Figure 14 is used for the newly generated template, with device characteristics and other fields being populated with information obtained by the device analysis program.
- the new template is then added to the template and content repository of the content server.
- Some users will decline to run the device analysis program, and an alternative approach is to ask the user a set of questions about their device, but if some users execute the program the results can be used to update the device profile database, and one or more new templates can be generated.
- An alternative embodiment selects a default template whenever a suitable template cannot be identified and this default template is used to generate a fist mini page for sending to that requestor device.
- the fist mini page can be returned with one or more user prompts to establish whether the mini page could be displayed or to obtain device information.
- the content access platform makes use of resource compression and other transformations, to ensure that the size and format requirements of different devices (e.g. different mobile handset models) are matched by the mini pages that are served to those devices.
- Figure 15 comprises an example table of multimedia resources and the compression and format transformations that are applied to them in one embodiment of the invention.
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Une plate-forme, des procédés et un appareil d'accès à un contenu permettent un meilleur accès à un contenu Internet pour des utilisateurs de dispositifs hétérogènes, tels que des téléphones mobiles et des assistants électroniques personnels. La plate-forme présente un mécanisme de distribution pour le contenu requis qui est adapté aux capacités du type de dispositif demandeur, à l'aide de mini-pages spécifiques au type de dispositif qui remplacent les pages Web d'origine et à l'aide de modèles mis en œuvre sous forme de fichiers de configuration XML pour générer les mini-pages. La plate-forme présente également un support pour les combinaisons personnalisées de contenu à fournir en fonction des exigences spécifiées dans les modèles stockés, de sorte que les fournisseurs de site Web et autres peuvent offrir une expérience utilisateur personnalisée aux différents utilisateurs en fonction de leurs préférences ou de leur type de dispositif. Les mini-pages sont générées en se référant à une analyse et à une catégorisation de la page ou du site Web d'origine ainsi qu'aux capacités des dispositifs demandeurs. L'invention concerne également des pages de répertoire compatibles avec le dispositif, en réponse à des demandes de répertoire comprenant un identifiant de type de dispositif pour s'assurer qu'une page de répertoire appropriée est fournie.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP10708579A EP2399209A1 (fr) | 2009-02-19 | 2010-02-19 | Plate-forme d'accès à un contenu et procédés et appareil fournissant un accès à un contenu internet pour des dispositifs hétérogènes |
IL214766A IL214766A0 (en) | 2009-02-19 | 2011-08-21 | Content access platform and methods and apparatus providing access to internet content for heterogeneous devices |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB0902834.1 | 2009-02-19 | ||
GBGB0902834.1A GB0902834D0 (en) | 2009-02-19 | 2009-02-19 | Content access platform and methods and apparatus providing access to internet content for heterogeneous devices |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2010094927A1 true WO2010094927A1 (fr) | 2010-08-26 |
Family
ID=40565401
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/GB2010/000295 WO2010094927A1 (fr) | 2009-02-19 | 2010-02-19 | Plate-forme d'accès à un contenu et procédés et appareil fournissant un accès à un contenu internet pour des dispositifs hétérogènes |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP2399209A1 (fr) |
GB (1) | GB0902834D0 (fr) |
IL (1) | IL214766A0 (fr) |
WO (1) | WO2010094927A1 (fr) |
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011026223A1 (fr) * | 2009-09-02 | 2011-03-10 | Andrew Echenberg | Distribution de contenu sur un réseau |
EP2429151A1 (fr) * | 2010-09-08 | 2012-03-14 | Saffron Digital Limited | Livraison d'un fichier |
GB2483655A (en) * | 2010-09-14 | 2012-03-21 | Thunderhead Ltd | Device capability modelling and automatic content assembly |
WO2012082397A1 (fr) * | 2010-12-18 | 2012-06-21 | Qualcomm Incorporated | Procédés et appareil pour permettre une application hybride web et native |
CN102932452A (zh) * | 2012-10-31 | 2013-02-13 | 北京奇虎科技有限公司 | 网站类型识别系统 |
WO2013059375A1 (fr) | 2011-10-18 | 2013-04-25 | Microsoft Corporation | Optimisation personnalisée de pages internet |
WO2014058950A1 (fr) * | 2012-10-10 | 2014-04-17 | Microsoft Corporation | Redirection d'application mobile automatique |
EP2687996A4 (fr) * | 2011-03-14 | 2015-05-06 | Xiaomi Inc | Procédé et système de réagencement de page web |
US9386119B2 (en) | 2013-07-30 | 2016-07-05 | International Business Machines Corporation | Mobile web adaptation techniques |
US9450824B2 (en) | 2014-02-04 | 2016-09-20 | Wipro Limited | Systems and methods for smart request processing |
WO2016186882A1 (fr) * | 2015-05-21 | 2016-11-24 | Google Inc. | Service de mandataire pour demandes de contenu |
US9967316B2 (en) | 2014-01-30 | 2018-05-08 | Google Llc | Accessing media item referenced in application |
US10304074B2 (en) | 2012-06-11 | 2019-05-28 | Retailmenot, Inc. | Devices, methods, and computer-readable media for redemption header for merchant offers |
CN110263238A (zh) * | 2019-06-21 | 2019-09-20 | 浙江华坤道威数据科技有限公司 | 一种基于大数据的舆情聆听系统 |
CN110688530A (zh) * | 2019-08-19 | 2020-01-14 | 天津开心生活科技有限公司 | 一种json数据的处理方法及装置 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020143821A1 (en) * | 2000-12-15 | 2002-10-03 | Douglas Jakubowski | Site mining stylesheet generator |
WO2002087135A2 (fr) | 2001-04-25 | 2002-10-31 | Novarra, Inc. | Systeme et procede permettant d'adapter un contenu d'informations pour un dispositif electronique |
US20080139191A1 (en) * | 2006-12-08 | 2008-06-12 | Miguel Melnyk | Content adaptation |
-
2009
- 2009-02-19 GB GBGB0902834.1A patent/GB0902834D0/en not_active Ceased
-
2010
- 2010-02-19 WO PCT/GB2010/000295 patent/WO2010094927A1/fr active Application Filing
- 2010-02-19 EP EP10708579A patent/EP2399209A1/fr not_active Withdrawn
-
2011
- 2011-08-21 IL IL214766A patent/IL214766A0/en unknown
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020143821A1 (en) * | 2000-12-15 | 2002-10-03 | Douglas Jakubowski | Site mining stylesheet generator |
WO2002087135A2 (fr) | 2001-04-25 | 2002-10-31 | Novarra, Inc. | Systeme et procede permettant d'adapter un contenu d'informations pour un dispositif electronique |
US20080139191A1 (en) * | 2006-12-08 | 2008-06-12 | Miguel Melnyk | Content adaptation |
WO2008073207A2 (fr) | 2006-12-08 | 2008-06-19 | Bytemobile, Inc. | Adaptation de contenu |
Cited By (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011026223A1 (fr) * | 2009-09-02 | 2011-03-10 | Andrew Echenberg | Distribution de contenu sur un réseau |
EP2429151A1 (fr) * | 2010-09-08 | 2012-03-14 | Saffron Digital Limited | Livraison d'un fichier |
US8769042B2 (en) | 2010-09-08 | 2014-07-01 | Saffron Digital Limited | Delivering a file |
GB2483655A (en) * | 2010-09-14 | 2012-03-21 | Thunderhead Ltd | Device capability modelling and automatic content assembly |
WO2012082397A1 (fr) * | 2010-12-18 | 2012-06-21 | Qualcomm Incorporated | Procédés et appareil pour permettre une application hybride web et native |
US8645491B2 (en) | 2010-12-18 | 2014-02-04 | Qualcomm Incorporated | Methods and apparatus for enabling a hybrid web and native application |
EP2687996A4 (fr) * | 2011-03-14 | 2015-05-06 | Xiaomi Inc | Procédé et système de réagencement de page web |
EP2769311A4 (fr) * | 2011-10-18 | 2015-05-27 | Microsoft Technology Licensing Llc | Optimisation personnalisée de pages internet |
KR20140078676A (ko) * | 2011-10-18 | 2014-06-25 | 마이크로소프트 코포레이션 | 웹 페이지의 맞춤 최적화 기법 |
WO2013059375A1 (fr) | 2011-10-18 | 2013-04-25 | Microsoft Corporation | Optimisation personnalisée de pages internet |
KR102132137B1 (ko) * | 2011-10-18 | 2020-07-08 | 마이크로소프트 테크놀로지 라이센싱, 엘엘씨 | 웹 페이지의 맞춤 최적화 기법 |
US9887873B2 (en) | 2011-10-18 | 2018-02-06 | Microsoft Technology Licensing, Llc | Custom rendering of web pages based on web page requests |
KR20190107299A (ko) * | 2011-10-18 | 2019-09-19 | 마이크로소프트 테크놀로지 라이센싱, 엘엘씨 | 웹 페이지의 맞춤 최적화 기법 |
KR102021317B1 (ko) * | 2011-10-18 | 2019-09-16 | 마이크로소프트 테크놀로지 라이센싱, 엘엘씨 | 웹 페이지의 맞춤 최적화 기법 |
US10586243B2 (en) | 2012-06-11 | 2020-03-10 | Retailmenot, Inc. | Devices, methods, and computer-readable media for redemption header for merchant offers |
US10304074B2 (en) | 2012-06-11 | 2019-05-28 | Retailmenot, Inc. | Devices, methods, and computer-readable media for redemption header for merchant offers |
WO2014058950A1 (fr) * | 2012-10-10 | 2014-04-17 | Microsoft Corporation | Redirection d'application mobile automatique |
WO2014059148A1 (fr) * | 2012-10-10 | 2014-04-17 | Microsoft Corporation | Redirection automatique d'applications mobiles |
US10834233B2 (en) | 2012-10-10 | 2020-11-10 | Microsoft Technology Licensing, Llc | Automatic mobile application redirection |
CN102932452B (zh) * | 2012-10-31 | 2015-11-25 | 北京奇虎科技有限公司 | 网站类型识别系统 |
CN102932452A (zh) * | 2012-10-31 | 2013-02-13 | 北京奇虎科技有限公司 | 网站类型识别系统 |
US9386119B2 (en) | 2013-07-30 | 2016-07-05 | International Business Machines Corporation | Mobile web adaptation techniques |
US9967316B2 (en) | 2014-01-30 | 2018-05-08 | Google Llc | Accessing media item referenced in application |
US9450824B2 (en) | 2014-02-04 | 2016-09-20 | Wipro Limited | Systems and methods for smart request processing |
WO2016186882A1 (fr) * | 2015-05-21 | 2016-11-24 | Google Inc. | Service de mandataire pour demandes de contenu |
CN110263238A (zh) * | 2019-06-21 | 2019-09-20 | 浙江华坤道威数据科技有限公司 | 一种基于大数据的舆情聆听系统 |
CN110688530A (zh) * | 2019-08-19 | 2020-01-14 | 天津开心生活科技有限公司 | 一种json数据的处理方法及装置 |
CN110688530B (zh) * | 2019-08-19 | 2022-04-26 | 天津开心生活科技有限公司 | 一种json数据的处理方法及装置 |
Also Published As
Publication number | Publication date |
---|---|
GB0902834D0 (en) | 2009-04-08 |
IL214766A0 (en) | 2011-11-30 |
EP2399209A1 (fr) | 2011-12-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2010094927A1 (fr) | Plate-forme d'accès à un contenu et procédés et appareil fournissant un accès à un contenu internet pour des dispositifs hétérogènes | |
US9686374B2 (en) | System and method for fragment level dynamic content regeneration | |
KR100490734B1 (ko) | 주석기반 문서 자동 생성장치 및 방법 | |
US7353246B1 (en) | System and method for enabling information associations | |
US10498847B2 (en) | System and method for mobile application deep linking | |
KR101824222B1 (ko) | 동적 컨텐츠 및 스테일 컨텐츠를 포함하는 웹사이트들의 고속 렌더링 | |
US7505978B2 (en) | Aggregating content of disparate data types from disparate data sources for single point access | |
US7933917B2 (en) | Personalized search method and system for enabling the method | |
US7996754B2 (en) | Consolidated content management | |
US20090006338A1 (en) | User created mobile content | |
US20070067305A1 (en) | Display of search results on mobile device browser with background process | |
US20090083232A1 (en) | Search results with search query suggestions | |
US20070208704A1 (en) | Packaged mobile search results | |
US20070192674A1 (en) | Publishing content through RSS feeds | |
US20070192683A1 (en) | Synthesizing the content of disparate data types | |
US20090235187A1 (en) | System and method for content navigation | |
WO2005104759A2 (fr) | Systeme, procede et appareil de selection, affichage, gestion, suivi et transfert d'acces dans le contenu de pages web et d'autres sources | |
US20080172396A1 (en) | Retrieving Dated Content From A Website | |
US10104196B2 (en) | Method of and server for transmitting a personalized message to a user electronic device | |
US20080297521A1 (en) | System and method for providing skins for a web page | |
CN114528510A (zh) | 网页数据处理的方法、装置、电子设备及介质 | |
RU2739720C2 (ru) | Способ и сервер для передачи персонализированного сообщения на пользовательское электронное устройство | |
CA2505837A1 (fr) | Portail vivant personnalise sur l'internet | |
US20130212126A1 (en) | Method and Apparatus for Conducting a Search |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 10708579 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 214766 Country of ref document: IL |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2010708579 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020117021887 Country of ref document: KR |