US20040153467A1 - System and method for cataloguing digital information for searching and retrieval - Google Patents
System and method for cataloguing digital information for searching and retrieval Download PDFInfo
- Publication number
- US20040153467A1 US20040153467A1 US10/760,472 US76047204A US2004153467A1 US 20040153467 A1 US20040153467 A1 US 20040153467A1 US 76047204 A US76047204 A US 76047204A US 2004153467 A1 US2004153467 A1 US 2004153467A1
- Authority
- US
- United States
- Prior art keywords
- information
- metadata
- stored
- server
- classification
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 238000000034 method Methods 0.000 title claims abstract description 30
- 230000008569 process Effects 0.000 claims abstract description 9
- 238000004891 communication Methods 0.000 claims description 17
- 241000239290 Araneae Species 0.000 claims description 8
- 238000012545 processing Methods 0.000 claims description 2
- 238000011160 research Methods 0.000 description 5
- 230000007123 defense Effects 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 3
- 238000013507 mapping Methods 0.000 description 3
- 230000008859 change Effects 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- RZVHIXYEVGDQDX-UHFFFAOYSA-N 9,10-anthraquinone Chemical compound C1=CC=C2C(=O)C3=CC=CC=C3C(=O)C2=C1 RZVHIXYEVGDQDX-UHFFFAOYSA-N 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000004880 explosion Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
- G06F16/353—Clustering; Classification into predefined classes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/38—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/38—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/383—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/953—Organization of data
- Y10S707/956—Hierarchical
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99932—Access augmentation or optimizing
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99937—Sorting
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99941—Database schema or data structure
- Y10S707/99944—Object-oriented database structure
- Y10S707/99945—Object-oriented database structure processing
Definitions
- the present invention relates to the field of information indexing, cataloguing and retrieval, and in particular to a system and method for automatically cataloguing Internet information repositories, creating an eXtensible Markup Language (XML) metaindex in an encoded XML format (i.e., the Resource Description Framework (RDF) format), and providing a mechanism to effectively search and retrieve the information.
- XML eXtensible Markup Language
- RDF Resource Description Framework
- search system that can be used to perform a search across many heterogeneous information retrieval systems. For example, many organizations have built information retrieval systems to permit users to obtain documents published by that organization. It is desirable to provide a search system that can index and catalogue information stored in many different formats on different websites, permitting users to perform a search through a single web portal.
- Keyword indexing alone is proving inadequate in providing a search system that permits a user to effectively locate and access information on the Internet.
- FIG. 1 depicts a diagram of a digital library according to one embodiment of the present invention
- FIG. 2 depicts an automated cataloguing and index system according to one embodiment of the present invention
- FIG. 3 shows a sample metadata data structure according to one embodiment of the present invention
- FIG. 4 depicts a sample Resource Description Framework (RDF) schema according to one embodiment of the present invention.
- FIG. 5 depicts a sample XML/RDF representation of document metadata according to an embodiment of the present invention.
- An embodiment of the present invention provides a method and system for indexing and cataloguing data stored on one or more information repositories.
- the information repositories may be distributed on a computer network. As the data stored in the information repositories is scanned, keywords are collected and indexed. The keywords are used to catalogue the data and to create metadata that is stored to assist in searching and retrieval of the data.
- One embodiment of the present invention is an information search and retrieval system.
- a user sends a request to perform a search or retrieve data to a web portal server.
- the server uses stored metadata to identify relevant documents.
- the data can then be retrieved and sent to the user.
- the metadata may be stored on the web portal server or may be located on one or more metadata servers.
- Some embodiments of the present invention store metadata information encoded in the eXtensible Markup Language (XML).
- XML eXtensible Markup Language
- some embodiments use the Resource Description Framework (RDF) to define and store the metadata.
- RDF Resource Description Framework
- Integrated information repositories form a federated digital library in the form of an index accessible through web portal technology.
- Such an index encapsulates the specific operations or contents of individual member libraries or data-marts with an XML wrapper, making access to the constituent repositories transparent to the user.
- Some problems that had to be solved to integrate digital libraries in an index structure according to the present invention were the following: (1) integrating existing digital libraries into a federated digital library; (2) insulating the federated digital library from changes made in individual digital libraries; (3) making relocation of individual digital libraries transparent to users; and (4) overcoming the lack of sufficient metadata in some digital libraries.
- Prior art libraries with meta-indexes have no automated classification ability to populate key descriptor fields based on domains or classification schemes.
- One embodiment of the present invention can be used to expand the descriptor fields, allowing a more robust index of library assets.
- XML with its tremendous support by the Web community can serve as a meta language, accepted by most digital libraries, to specify interfaces and methods of interactions.
- the eXtensible Markup Language (XML) is a simple dialect of SGML that has been endorsed by the W3C consortium.
- This meta-tagging approach makes it possible for a library to implement its own policies and features as well as to change them as long as it is able to describe these changes in the XML-based language, specifically an index built in the Resource Description Framework (RDF). In particular, it does not require any existing library to change its architecture but only to describe it.
- RDF Resource Description Framework
- RDF is an XML application that adds semantics to documents by encoding and using metadata.
- RDF could be used to encode content advisory ratings, information about the author, and licensing or copyright information.
- RDF is a general purpose XML application that can be used to encode any metadata.
- One embodiment of the present invention creates metadata regarding a document and stores that information using RDF.
- Other embodiments of the present invention use a relational database to store metadata information.
- FIG. 1 describes a Web portal architecture using RDF indexes based on XML technology integrating different digital libraries. Search engines, with a knowledge of XML/RDF, then can access/filter for relevant data.
- workstation 101 is any computing device that can run a web browser.
- workstation 101 can be: a personal computer running MicrosoftTM Internet ExplorerTM or NetscapeTM CommunicationTM; a personal digital assistant (PDA) such as a PalmTM computing device running a web browser; or a wireless communication device providing access to the Internet or other computer network.
- PDA personal digital assistant
- communications network 102 is the Internet.
- LAN local area network
- WAN wide area network
- corporate intranet a commercial service provider network.
- Workstation 101 connects the communications network 102 through a communications component.
- the communications component includes a 56 Kbps modem, a network adapter, a cable modem, an ethernet card, or any other network access device.
- portal server 103 is a SunTM UnixTM server running the SolarisTM operating system.
- the present invention could also be practiced using a WindowsTM NTTM server, a LinuxTM server, a NovellTM NetwareTM server, or any other computing platform for portal server 103 .
- Portal server 103 connects to communications network 102 through a communications component such as those discussed above with regard to workstation 101 .
- Portal server 103 receives a request from workstation 101 and formulates a request to metadata server 104 .
- metadata server 104 is a WindowsTM NTTM computing device running an LDAP directory server application.
- portal server 103 uses standard LDAP requests to allow permission to retrieve metadata information across communications network 102 .
- Metadata server 104 includes a communications component such as that described above with regards to workstation 101 and portal server 103 .
- the metadata is stored in XML/RDF format.
- This XML-encoded metadata is returned to portal server 103 in response to a request.
- Portal server 103 then sends a request to one or more of the appropriate information repositories 105 .
- Each information repository 105 is a computing device connected to communications network 102 in the same manner as the above-mentioned servers. These repositories store a collection of information.
- portal server 103 is able to identify and retrieve the most relevant information necessary to satisfy a user request.
- FIG. 2 illustrates the automated cataloguing support process according to one embodiment of the present invention.
- the system builds an indexed infrastructure, automatically cataloguing heterogeneous information repositories based on a pre-defined classification hierarchy. Once classified based on the ontology mapping, the documents and other relevant extracted meta-data, the index represents the metadata using a RDF schema.
- the Resource Description Framework is an infrastructure that enables the encoding, exchange, and reuse of structured meta-data. It is an application of XML that imposes needed structural constraints to provide unambiguous methods of expressing semantics. This structural constraint allows the interchangeability of metadata defined by heterogeneous sources. RDF additionally provides a means for publishing both human-readable and machine-processable vocabularies designed to encourage the reuse and extension of meta-data semantics among disparate information communities.
- One embodiment of the present invention uses the RDF schema standard for describing collections of documents that represent a single logical “bucket.”
- RDF schema for describing collections of documents that represent a single logical “bucket.”
- one embodiment of the present invention also includes a “classmark” property for a bucket or container.
- a classmark for a bucket is obtained by matching the bucket with a pre-defined classification hierarchy. This specification results in better search engine capabilities, and also helps in cataloguing for describing the content.
- FIG. 2 shows the automated cataloging and indexing components of one embodiment of the present invention.
- source digital repository 201 stores various documents that are available for retrieval.
- This repository can be a digital library, a database, a website, or any other information repository.
- the system first collects keyword information as shown in 202 .
- the information available in the repository is first scanned using a spider application such as Berkeley's SWISH-ETM.
- the spider collects a list of all keywords contained in each document, generating an index to facilitate searching and further processing.
- the present invention could use additional spiders or other data collection applications.
- the spider can be configured to traverse all available documents on source digital repository 201 .
- the spider can also be configured to only traverse documents to a fixed depth.
- classification hierarchy 203 is a predetermined classification system.
- DTIC Defense Technical Information Center
- Association for Computing Machinery publishes a computer science classification system
- U.S. Patent and Trademark Office publishes a classification system of all technological arts. Any classification system can be used as a domain with the present invention to automatically catalogue and index documents.
- the classification hierarchy 203 is a specific weighted domain ontology used to identify documents based on keywords found within each document.
- classification hierarchy 203 includes a hierarchical list of classifications. Each classification within the hierarchical list includes one or more keywords representative of that class.
- one classification system includes a top-level classification labeled “Aviation Technology.” Within this classification, there are three sub-classifications: “Aerodynamics”; “Aircraft”; and “Flight Control and Instrumentation.”
- Each classification includes keywords representative of that class. For example, “Aerodynamics” includes “dynamics of testing,” “wind tunnel,” etc. These keywords are used to determine the most likely classification of a document.
- the classification hierarchy 203 functions as a thesaurus, assisting in the correct identification and classification of a document based on the keyword index generated in 202 .
- the present invention automatically catalogues documents in source digital repository 201 as shown in 204 .
- the mapping of documents to one or more specific classifications can be performed in many ways.
- documents are catalogued by mapping keywords from 202 against a specific weighted domain ontology, such as classification hierarchy 203 .
- a neural network is used to recognize which categories within classification hierarchy 203 are most likely relevant to the referenced document.
- One of ordinary skill in the art would recognize other methods to categorize documents in accordance with the present invention.
- the automated cataloguing system is effective; however, it is not 100% accurate.
- one embodiment of the present invention includes a review process whereby the automatically suggested classifications are reviewed by a user to ensure they are accurate.
- a user performs the cataloguing process; however, the automated cataloguing system is used to suggest an appropriate classification to the user, thereby aiding the human operator, increasing the operational speed and accuracy of the cataloguing process.
- Metadata information is created and stored as shown in 205 .
- metadata is encoded and stored in XML/RDF format.
- Other embodiments store metadata or update a key descriptor field in a database system, a flat file, or any other mechanism that provides a way to store and retrieve data. For example, for previously built indexes based on full word searching, the existing indexes can be updated with the cataloguing tool.
- This information can be used by portal server 103 to facilitate effective searching and retrieval of data stored in source digital repository 201 .
- FIG. 3 shows a data structure for containing metadata according to one embodiment of the present invention.
- the data structure includes the following attributes: (1) a URL; (2) a title; (3) an author; (4) an abstract; (5) a collection; (6) a keyword; (7) one or more matched words; (8) a path; (9) a classmark; (10) a classification date; and (11) a last modified date.
- attributes will be discussed in turn below.
- FIG. 3 The attributes in FIG. 3 are shown according to one embodiment of the present invention.
- This data structure is designed to record metadata for information stored on the web.
- the present invention could be used to record metadata about data stored in other formats.
- the metadata could be used to facilitate searching of an OracleTM database or any other relational or object-oriented database.
- the metadata structure could be modified to better accommodate the stored data.
- the URL attribute stores a uniform resource locator (URL), a property uniquely identifying the data.
- the most common URL is a web address.
- http://www.saic.research.com/RDF/source/agriculturel.txt uniquely identifies the location of a web page.
- http: defines that protocol that is used to access the information.
- HTTP represents the standard protocol used on the web, the hypertext transfer protocol.
- www.saic.research.com defines the server where the information is stored.
- IP Internet Protocol
- computers When using this protocol, computers must convert host names to IP addresses using a distributed hierarchical database known as the Domain Name Service (DNS).
- DNS Domain Name Service
- the “Title” attribute gives the title of the resource. For most webpages, the title is displayed on the title bar on the top of a web browser. This data is intended to convey the general purpose and content of the document to a user.
- the “Author” attribute identifies the person or persons who wrote the document.
- the “Author” attribute identifies the owner of the document within the server file system. Modern computer operating systems are designed to support multiple users. Each user logs on to the system using a user identifier. When a file is created on a computer, the user creating the file is recorded as the owner of that file or document. In one embodiment, this information is used to populate the “Author” attribute.
- the “Abstract” attribute stores the document's or resource's abstract.
- the abstract gives a brief overview of the document designed to facilitate searching and allowing a user to quickly determine if a document is relevant.
- the “Collection” attribute identifies the type of a resource.
- a document may be a “Technical Report,” a “Proposal,” a “Refereed Journal,” a “Thesis”, and so on. This attribute is used to identify the general type of a document to assist in searching and retrieval of information.
- the “Keyword” attribute is usually stored as a RDF Bag container.
- An RDF Bag container stores multiple values.
- the “Keyword” attribute can store one or more keywords.
- Each keyword is a word identified in a document that assists in identifying the subject matter of that document.
- the “MatchedWords” attribute is one or more words from a document that match the classification. This attribute can include one matched word, or can contain an RDF Bag holding a plurality of matched words. For example in FIG. 3, the “MatchedWords” attribute includes “field” and “general.”
- the “Path” attribute identifies that path component of the URL as discussed above.
- the path is “source/agriculture1.txt.” This identifies the location of the referenced document within the information repository system.
- the “Classmark” attribute identifies a classification for a document.
- the classification can include one or more predetermined classification systems.
- FIG. 3 shows two classifications; “Ordnance.Aerial Bombs” and “Ordnance.Underwater Ordnance”. These classifications are within the Defense Department's DTIC classification system.
- other classification systems are used.
- ACM Association of Computing Machinery
- the U.S. patent and Trademark Office publishes a classification hierarchy for all areas within the technological arts for classifying issued patents.
- the classmark attribute is assigned through an automated process.
- the “Classification_date” attribute stores the date that a classmark was assigned to the referenced document. This identifies when the document was classified.
- the “Last_modified” attribute stores the date the referenced document was last modified.
- this attribute is obtained from the operating system of the information repository.
- the date that a file is created and the date the a file was last modified are stored with each file on the system. Using this information, the date that a document was last modified can be obtained from the operating system and used to populate this field.
- FIG. 4 shows a sample RDF schema according to one embodiment of the present invention.
- the shown RDF schema defines a vocabulary for representing metadata.
- the RDF shown implements the data structure shown in FIG. 3 in an XML/RDF format.
- XML/RDF-aware browsers can use the metadata information to search and retrieve information from the data store.
- FIG. 5 shows a sample document encoded in XML/RDF using the vocabulary defined in FIG. 4.
- a user can further restrict a search to a particular classification. If one possible classification is “Ordnance.Aerial Bombs,” the user can restrict the search to only those documents with this classification in their classmark attribute. Additionally, a user's keyword search will be more effective by utilizing the “MatchedWords” and “Keyword” metadata attributes.
- the present invention provides more effective searching and information retrieval capabilities than the widely used keyword indexing systems.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Library & Information Science (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The system and method for searching and retrieving information stored in heterogeneous information repositories. A portal server retrieves user requests through a computer network and looks up information stored in a metadata databases. For example, the metadata may be encoded in an XML/RDF format and stored in a directory server to facilitate effective searching and retrieval of information from an information repository. Metadata includes information including a classmark definition for each document. The classmark is determined through an automated cataloguing process.
Description
- The present invention relates to the field of information indexing, cataloguing and retrieval, and in particular to a system and method for automatically cataloguing Internet information repositories, creating an eXtensible Markup Language (XML) metaindex in an encoded XML format (i.e., the Resource Description Framework (RDF) format), and providing a mechanism to effectively search and retrieve the information.
- In the last few years, there has been an explosion of information available on the Internet. In the very early 1990s, the Internet was a network consisting of computers from military, research, and educational organizations. There were small collections of information available through mostly file transfer protocol (FTP) sites and Gopher sites. With the advent of the web and increases in bandwidths beginning in about 1993, people began to put more and more information on the Internet.
- Originally, the Internet was only available for non-commercial research and educational use. When the Defense Advanced Research Projects Agency (DARPA) relaxed usage restrictions, finally permitting commercial use, Internet usage exploded. Today, most households have Internet access and anyone with Internet access can publish information on the Internet.
- Shortly after the advent of the web, users realized that there was a need to have a way to search the Internet to assist users in locating information. Websites such as Lycos™ and AltaVista™ were developed to meet this need. These sites used spiders to scan the Internet for content, collecting and indexing keywords. These full-text-based indexes were then used on a website to assist users in searching the Internet to locate needed information. This method was effective when the Internet was young. Recognizing problems associated with large quantities of indexes, many larger search engine sites, such as Yahoo™ and Excite™, began to manually catalogue the indexed material. Manual cataloguing is not an effective methodology for organizing the vast amount of information on the WWW.
- Today, most of the available content is unstructured so that it is difficult to locate pertinent data. Anyone with Internet access can publish any information they wish on the Internet. As the cost of access and disk space has decreased, the volume of information available has grown tremendously. Elementary search engines that simply create indexes of keywords are becoming increasingly ineffective in identifying relevant documents. There is a growing need for more effective search systems.
- There is an additional need to provide a search system that can be used to perform a search across many heterogeneous information retrieval systems. For example, many organizations have built information retrieval systems to permit users to obtain documents published by that organization. It is desirable to provide a search system that can index and catalogue information stored in many different formats on different websites, permitting users to perform a search through a single web portal.
- Finally, there is a need to provide a system for performing automated cataloguing and indexing of documents. Prior art systems have simply created keyword indexes. There is a need for a system that uses a thesaurus and a classification system to determine both keywords for an indexed document but also a class for the document to permit more effective search and retrieval of information.
- As the quantity of information available on the Internet grows, it is becoming more and more important to provide more advanced search and retrieval capabilities. Keyword indexing alone is proving inadequate in providing a search system that permits a user to effectively locate and access information on the Internet.
- In the drawings:
- FIG. 1 depicts a diagram of a digital library according to one embodiment of the present invention;
- FIG. 2 depicts an automated cataloguing and index system according to one embodiment of the present invention;
- FIG. 3 shows a sample metadata data structure according to one embodiment of the present invention;
- FIG. 4 depicts a sample Resource Description Framework (RDF) schema according to one embodiment of the present invention; and
- FIG. 5 depicts a sample XML/RDF representation of document metadata according to an embodiment of the present invention.
- An embodiment of the present invention provides a method and system for indexing and cataloguing data stored on one or more information repositories. The information repositories may be distributed on a computer network. As the data stored in the information repositories is scanned, keywords are collected and indexed. The keywords are used to catalogue the data and to create metadata that is stored to assist in searching and retrieval of the data.
- One embodiment of the present invention is an information search and retrieval system. A user sends a request to perform a search or retrieve data to a web portal server. The server then uses stored metadata to identify relevant documents. The data can then be retrieved and sent to the user. The metadata may be stored on the web portal server or may be located on one or more metadata servers.
- Some embodiments of the present invention store metadata information encoded in the eXtensible Markup Language (XML). In addition, some embodiments use the Resource Description Framework (RDF) to define and store the metadata.
- Various embodiments of the present invention fulfill one or more of the needs discussed above. These embodiments will be described in detail below in the detailed description of the invention.
- To build an effective and growing information infrastructure, it is necessary to integrate or catalogue collections of heterogeneous digital libraries. Integrated information repositories form a federated digital library in the form of an index accessible through web portal technology. Such an index encapsulates the specific operations or contents of individual member libraries or data-marts with an XML wrapper, making access to the constituent repositories transparent to the user.
- Some problems that had to be solved to integrate digital libraries in an index structure according to the present invention were the following: (1) integrating existing digital libraries into a federated digital library; (2) insulating the federated digital library from changes made in individual digital libraries; (3) making relocation of individual digital libraries transparent to users; and (4) overcoming the lack of sufficient metadata in some digital libraries.
- Prior art libraries with meta-indexes have no automated classification ability to populate key descriptor fields based on domains or classification schemes. One embodiment of the present invention can be used to expand the descriptor fields, allowing a more robust index of library assets.
- XML with its tremendous support by the Web community can serve as a meta language, accepted by most digital libraries, to specify interfaces and methods of interactions. The eXtensible Markup Language (XML) is a simple dialect of SGML that has been endorsed by the W3C consortium. This meta-tagging approach makes it possible for a library to implement its own policies and features as well as to change them as long as it is able to describe these changes in the XML-based language, specifically an index built in the Resource Description Framework (RDF). In particular, it does not require any existing library to change its architecture but only to describe it.
- RDF is an XML application that adds semantics to documents by encoding and using metadata. For example, RDF could be used to encode content advisory ratings, information about the author, and licensing or copyright information. RDF is a general purpose XML application that can be used to encode any metadata. One embodiment of the present invention creates metadata regarding a document and stores that information using RDF. Other embodiments of the present invention use a relational database to store metadata information.
- FIG. 1 describes a Web portal architecture using RDF indexes based on XML technology integrating different digital libraries. Search engines, with a knowledge of XML/RDF, then can access/filter for relevant data.
- A user logs on to the
system using workstation 101. In one embodiment,workstation 101 is any computing device that can run a web browser. For example,workstation 101 can be: a personal computer running Microsoft™ Internet Explorer™ or Netscape™ Communication™; a personal digital assistant (PDA) such as a Palm™ computing device running a web browser; or a wireless communication device providing access to the Internet or other computer network. - Using
workstation 101, the user sends a search or retrieval request throughcommunications network 102. In one embodiment of the present invention,communications network 102 is the Internet. One of ordinary skill in the art would appreciate that any other computer network could also be used with the present invention including, as some examples, a local area network (LAN), a wide area network (WAN), a corporate intranet, or a commercial service provider network.Workstation 101 connects thecommunications network 102 through a communications component. For example, in various embodiments of the present invention, the communications component includes a 56 Kbps modem, a network adapter, a cable modem, an ethernet card, or any other network access device. -
Workstation 101 sends a request throughcommunications network 102 toportal server 103. In one embodiment of the present invention,portal server 103 is a Sun™ Unix™ server running the Solaris™ operating system. The present invention could also be practiced using a Windows™ NT™ server, a Linux™ server, a Novell™ Netware™ server, or any other computing platform forportal server 103.Portal server 103 connects tocommunications network 102 through a communications component such as those discussed above with regard toworkstation 101. -
Portal server 103 receives a request fromworkstation 101 and formulates a request tometadata server 104. In one embodiment of the present invention,metadata server 104 is a Windows™ NT™ computing device running an LDAP directory server application. In this embodiment,portal server 103 uses standard LDAP requests to allow permission to retrieve metadata information acrosscommunications network 102.Metadata server 104 includes a communications component such as that described above with regards toworkstation 101 andportal server 103. - In one embodiment of the present invention, the metadata is stored in XML/RDF format. This XML-encoded metadata is returned to
portal server 103 in response to a request.Portal server 103 then sends a request to one or more of theappropriate information repositories 105. Eachinformation repository 105 is a computing device connected tocommunications network 102 in the same manner as the above-mentioned servers. These repositories store a collection of information. Using the metadata obtained frommetadata server 104,portal server 103 is able to identify and retrieve the most relevant information necessary to satisfy a user request. - FIG. 2 illustrates the automated cataloguing support process according to one embodiment of the present invention. In this embodiment, the system builds an indexed infrastructure, automatically cataloguing heterogeneous information repositories based on a pre-defined classification hierarchy. Once classified based on the ontology mapping, the documents and other relevant extracted meta-data, the index represents the metadata using a RDF schema.
- The Resource Description Framework (RDF) is an infrastructure that enables the encoding, exchange, and reuse of structured meta-data. It is an application of XML that imposes needed structural constraints to provide unambiguous methods of expressing semantics. This structural constraint allows the interchangeability of metadata defined by heterogeneous sources. RDF additionally provides a means for publishing both human-readable and machine-processable vocabularies designed to encourage the reuse and extension of meta-data semantics among disparate information communities.
- One embodiment of the present invention uses the RDF schema standard for describing collections of documents that represent a single logical “bucket.” Among other metadata information associated with a bucket, one embodiment of the present invention also includes a “classmark” property for a bucket or container. A classmark for a bucket is obtained by matching the bucket with a pre-defined classification hierarchy. This specification results in better search engine capabilities, and also helps in cataloguing for describing the content.
- FIG. 2 shows the automated cataloging and indexing components of one embodiment of the present invention. In this example, source
digital repository 201 stores various documents that are available for retrieval. This repository can be a digital library, a database, a website, or any other information repository. - According to one embodiment of the present invention, the system first collects keyword information as shown in202. The information available in the repository is first scanned using a spider application such as Berkeley's SWISH-E™. The spider collects a list of all keywords contained in each document, generating an index to facilitate searching and further processing. The present invention could use additional spiders or other data collection applications.
- In one embodiment of the present invention, the spider can be configured to traverse all available documents on source
digital repository 201. The spider can also be configured to only traverse documents to a fixed depth. - Once a keyword index has been generated, one embodiment of the present invention uses
classification hierarchy 203 to automatically catalogue documents as shown in 204. According to one embodiment of the present invention,classification hierarchy 203 is a predetermined classification system. There are many such classification systems currently in use. For example, the Department of Defense publishes the Defense Technical Information Center (DTIC) classification system; the Association for Computing Machinery publishes a computer science classification system; and the U.S. Patent and Trademark Office publishes a classification system of all technological arts. Any classification system can be used as a domain with the present invention to automatically catalogue and index documents. - The
classification hierarchy 203 is a specific weighted domain ontology used to identify documents based on keywords found within each document. For example, according to one embodiment of the present invention,classification hierarchy 203 includes a hierarchical list of classifications. Each classification within the hierarchical list includes one or more keywords representative of that class. For example, one classification system includes a top-level classification labeled “Aviation Technology.” Within this classification, there are three sub-classifications: “Aerodynamics”; “Aircraft”; and “Flight Control and Instrumentation.” Each classification includes keywords representative of that class. For example, “Aerodynamics” includes “dynamics of testing,” “wind tunnel,” etc. These keywords are used to determine the most likely classification of a document. Thus, theclassification hierarchy 203 functions as a thesaurus, assisting in the correct identification and classification of a document based on the keyword index generated in 202. - The present invention automatically catalogues documents in source
digital repository 201 as shown in 204. The mapping of documents to one or more specific classifications can be performed in many ways. In one embodiment of the present invention, documents are catalogued by mapping keywords from 202 against a specific weighted domain ontology, such asclassification hierarchy 203. In another embodiment of the present invention, a neural network is used to recognize which categories withinclassification hierarchy 203 are most likely relevant to the referenced document. One of ordinary skill in the art would recognize other methods to categorize documents in accordance with the present invention. - The automated cataloguing system is effective; however, it is not 100% accurate. To assist in increasing the overall accuracy of the collected metadata, one embodiment of the present invention includes a review process whereby the automatically suggested classifications are reviewed by a user to ensure they are accurate. In an additional embodiment, a user performs the cataloguing process; however, the automated cataloguing system is used to suggest an appropriate classification to the user, thereby aiding the human operator, increasing the operational speed and accuracy of the cataloguing process.
- Once a document has been catalogued, metadata information is created and stored as shown in205. In one embodiment of the present invention, metadata is encoded and stored in XML/RDF format. Other embodiments store metadata or update a key descriptor field in a database system, a flat file, or any other mechanism that provides a way to store and retrieve data. For example, for previously built indexes based on full word searching, the existing indexes can be updated with the cataloguing tool. This information can be used by
portal server 103 to facilitate effective searching and retrieval of data stored in sourcedigital repository 201. - FIG. 3 shows a data structure for containing metadata according to one embodiment of the present invention. The data structure includes the following attributes: (1) a URL; (2) a title; (3) an author; (4) an abstract; (5) a collection; (6) a keyword; (7) one or more matched words; (8) a path; (9) a classmark; (10) a classification date; and (11) a last modified date. Each of these attributes will be discussed in turn below.
- The attributes in FIG. 3 are shown according to one embodiment of the present invention. One of ordinary skill in the art would understand that many variations of this data structure could be made without departing from the scope and spirit of the present invention. Additionally, this data structure is designed to record metadata for information stored on the web. The present invention could be used to record metadata about data stored in other formats. For example, the metadata could be used to facilitate searching of an Oracle™ database or any other relational or object-oriented database. In such an application, the metadata structure could be modified to better accommodate the stored data.
- The URL attribute stores a uniform resource locator (URL), a property uniquely identifying the data. The most common URL is a web address. For example, “http://www.saic.research.com/RDF/source/agriculturel.txt” uniquely identifies the location of a web page. First, “http:” defines that protocol that is used to access the information. “HTTP” represents the standard protocol used on the web, the hypertext transfer protocol. Next, “www.saic.research.com” defines the server where the information is stored. On the Internet, computers communicate using the Internet Protocol (IP). When using this protocol, computers must convert host names to IP addresses using a distributed hierarchical database known as the Domain Name Service (DNS). This host name can be used to look up the IP Address in DNS. Finally, “/source/agriculturel.txt” identifies the path to the information. In combination, the entire URL defines the protocol to be used, the address of the server providing the information, and the path to the provided information.
- The “Title” attribute gives the title of the resource. For most webpages, the title is displayed on the title bar on the top of a web browser. This data is intended to convey the general purpose and content of the document to a user.
- The “Author” attribute identifies the person or persons who wrote the document. In one embodiment of the present invention, the “Author” attribute identifies the owner of the document within the server file system. Modern computer operating systems are designed to support multiple users. Each user logs on to the system using a user identifier. When a file is created on a computer, the user creating the file is recorded as the owner of that file or document. In one embodiment, this information is used to populate the “Author” attribute.
- The “Abstract” attribute stores the document's or resource's abstract. The abstract gives a brief overview of the document designed to facilitate searching and allowing a user to quickly determine if a document is relevant.
- The “Collection” attribute identifies the type of a resource. For example, a document may be a “Technical Report,” a “Proposal,” a “Refereed Journal,” a “Thesis”, and so on. This attribute is used to identify the general type of a document to assist in searching and retrieval of information.
- The “Keyword” attribute is usually stored as a RDF Bag container. An RDF Bag container stores multiple values. Thus, the “Keyword” attribute can store one or more keywords. Each keyword is a word identified in a document that assists in identifying the subject matter of that document.
- The “MatchedWords” attribute is one or more words from a document that match the classification. This attribute can include one matched word, or can contain an RDF Bag holding a plurality of matched words. For example in FIG. 3, the “MatchedWords” attribute includes “field” and “general.”
- The “Path” attribute identifies that path component of the URL as discussed above. For example in FIG. 3, the path is “source/agriculture1.txt.” This identifies the location of the referenced document within the information repository system.
- The “Classmark” attribute identifies a classification for a document. The classification can include one or more predetermined classification systems. For example, FIG. 3 shows two classifications; “Ordnance.Aerial Bombs” and “Ordnance.Underwater Ordnance”. These classifications are within the Defense Department's DTIC classification system. In other embodiments of the present invention other classification systems are used. For example, the Association of Computing Machinery (ACM), an association for computing professionals, publishes a classification hierarchy for areas within the field of computing. Similarly, the U.S. patent and Trademark Office publishes a classification hierarchy for all areas within the technological arts for classifying issued patents. In one embodiment of the present invention, the classmark attribute is assigned through an automated process.
- The “Classification_date” attribute stores the date that a classmark was assigned to the referenced document. This identifies when the document was classified.
- Finally, the “Last_modified” attribute stores the date the referenced document was last modified. In one embodiment of the present invention, this attribute is obtained from the operating system of the information repository. In modern computer operating systems, the date that a file is created and the date the a file was last modified are stored with each file on the system. Using this information, the date that a document was last modified can be obtained from the operating system and used to populate this field.
- FIG. 4 shows a sample RDF schema according to one embodiment of the present invention. In this embodiment, the shown RDF schema defines a vocabulary for representing metadata. The RDF shown implements the data structure shown in FIG. 3 in an XML/RDF format. By defining an RDF vocabulary, XML/RDF-aware browsers can use the metadata information to search and retrieve information from the data store.
- FIG. 5 shows a sample document encoded in XML/RDF using the vocabulary defined in FIG. 4.
- Once documents have been categorized and metadata information has been stored, more effective searches can be performed using the system shown in FIG. 1. For example, a user can further restrict a search to a particular classification. If one possible classification is “Ordnance.Aerial Bombs,” the user can restrict the search to only those documents with this classification in their classmark attribute. Additionally, a user's keyword search will be more effective by utilizing the “MatchedWords” and “Keyword” metadata attributes. By using an automated cataloguing process, the present invention provides more effective searching and information retrieval capabilities than the widely used keyword indexing systems.
- Embodiments of the present invention have now been fully described. It will be appreciated that these examples are merely illustrative of the present invention. Many variations and modifications will be apparent to those of ordinary skill in the art.
Claims (32)
1. A system for providing a portal to information stored in one or more information repositories, comprising:
a computer network;
one or more information repositories, each information repository including a communications component connecting the information repository to the computer network;
a metadata server, the metadata server including a communications component connecting the metadata server to the computer network, the metadata server storing metadata information about data stored in the one or more information repositories; and
a portal server, the portal server having a communications component connecting the portal server to the computer network, the portal server receiving requests and processing the requests using metadata information stored on the metadata server.
2. The system of claim 1 , wherein the computer network is the Internet.
3. The system of claim 1 , wherein the metadata server stores the metadata information encoded in the eXtensible Markup Language (XML).
4. The system of claim 1 , wherein the metadata server stores the metadata information encoded in the Resource Description Framework (RDF) format.
5. The system of claim 1 , wherein the requests that the portal server receives are one or more from the group consisting of: a search request and an information retrieval request.
6. The system of claim 1 , wherein the metadata server and the portal server are run on a single computing device.
7. The system of claim 6 , wherein the metadata server and the portal server use the same communications component to connect to the computer network.
8. The system of claim 1 , wherein the metadata information stored on the metadata server includes classmark information.
9. The system of claim 8 , wherein the classmark information is automatically determined from an index and a class definition.
10. A system for searching and retrieving information on a network, comprising:
a computer network; and
a user workstation, the user workstation including a communications component connecting the user workstation to the computer network, the user workstation receiving a request;
the user workstation sending the request to a portal server, the portal server using metadata information about data stored in one or more information repositories to process the request, returning resulting information to the user workstation.
11. The system of claim 10 , wherein the metadata information is encoded in the eXtensible Markup Language (XML).
12. The system of claim 10 , wherein the metadata information is encoded in the Resource Description Framework (RDF) format.
13. The system of claim 10 , wherein the request received by the user workstation is one or more from the group consisting of: a search request and an information retrieval request.
14. A system for cataloguing information stored in an information repository, the system comprising:
a keyword index of data stored in an information repository;
one or more domain class definitions; and
a computing device, the computing device cataloguing documents stored in the information repository using the keyword index and the one or more class definitions.
15. The system of claim 14 , wherein the keyword index is built by a spider.
16. The system of claim 14 , wherein each of the one or more domain class definitions includes one or more classes, each of the one or more classes including keywords representative of that class.
17. The system of claim 16 , wherein each of the keywords representative of a class are weighted.
18. A computer-readable medium comprising data stored in a data structure for describing data stored in an information repository, the data structure including:
a resource locator attribute identifying a document stored in an information repository; and
a classmark attribute identifying a classification of the document, the classification automatically determined using a keyword index and a classification definition.
19. The computer-readable medium of claim 18 , wherein the data structure further comprises:
an author attribute identifying the author of the document; and
a title attribute specifying a title for the document.
20. The computer-readable medium of claim 18 , wherein the data structure further comprises an abstract attribute specifying an abstract of the document.
21. The computer-readable medium of claim 18 , wherein the data structure further comprises a keyword attribute, the keyword attribute identifying zero or more keywords for the document.
22. A method for cataloguing data stored in an information repository, comprising:
receiving a keyword index of data stored in an information repository;
receiving a classification definition, the classification definition including a plurality of classes;
determining the classification of data stored in the information repository using the keyword index and the classification definition; and
storing the determined classification.
23. The method of claim 22 , wherein receiving a keyword index is built by a spider.
24. The method of claim 22 , wherein the determined classification is stored in the eXtensible Markup Language (XML) format.
25. The method of claim 22 , wherein the determined classification is stored in the Resource Description Framework (RDF) format.
26. A method for providing access to one or more information repositories, the method comprising:
receiving a request on a portal server; and
using metadata about data stored in one or more information repositories to process the request.
27. The method of claim 26 , wherein the request is one or more from the group consisting of: a search request and an information retrieval request.
28. The method of claim 26 , wherein the metadata is encoded in the eXtensible Markup Language (XML).
29. The method of claim 26 , wherein the metadata is encoded in the Resource Description Framework (RDF) format.
30. The method of claim 26 , wherein the metadata is stored on the portal server.
31. The method of claim 26 , wherein the metadata is stored on a metadata server.
32. The method of claim 26 , wherein the metadata includes a classmark attribute, the classmark attribute being automatically generated from a keyword index and a class definition.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/760,472 US20040153467A1 (en) | 2000-01-21 | 2004-01-21 | System and method for cataloguing digital information for searching and retrieval |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/489,735 US6701314B1 (en) | 2000-01-21 | 2000-01-21 | System and method for cataloguing digital information for searching and retrieval |
US10/760,472 US20040153467A1 (en) | 2000-01-21 | 2004-01-21 | System and method for cataloguing digital information for searching and retrieval |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/489,735 Continuation US6701314B1 (en) | 2000-01-21 | 2000-01-21 | System and method for cataloguing digital information for searching and retrieval |
Publications (1)
Publication Number | Publication Date |
---|---|
US20040153467A1 true US20040153467A1 (en) | 2004-08-05 |
Family
ID=31716076
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/489,735 Expired - Lifetime US6701314B1 (en) | 2000-01-21 | 2000-01-21 | System and method for cataloguing digital information for searching and retrieval |
US10/760,472 Abandoned US20040153467A1 (en) | 2000-01-21 | 2004-01-21 | System and method for cataloguing digital information for searching and retrieval |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/489,735 Expired - Lifetime US6701314B1 (en) | 2000-01-21 | 2000-01-21 | System and method for cataloguing digital information for searching and retrieval |
Country Status (1)
Country | Link |
---|---|
US (2) | US6701314B1 (en) |
Cited By (48)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050243346A1 (en) * | 2004-05-03 | 2005-11-03 | Microsoft Corporation | Planar mapping of graphical elements |
US20050243355A1 (en) * | 2004-05-03 | 2005-11-03 | Microsoft Corporation | Systems and methods for support of various processing capabilities |
US20050248790A1 (en) * | 2004-04-30 | 2005-11-10 | David Ornstein | Method and apparatus for interleaving parts of a document |
US20050251740A1 (en) * | 2004-04-30 | 2005-11-10 | Microsoft Corporation | Methods and systems for building packages that contain pre-paginated documents |
US20050262134A1 (en) * | 2004-05-03 | 2005-11-24 | Microsoft Corporation | Spooling strategies using structured job information |
US20050268221A1 (en) * | 2004-04-30 | 2005-12-01 | Microsoft Corporation | Modular document format |
US20050273701A1 (en) * | 2004-04-30 | 2005-12-08 | Emerson Daniel F | Document mark up methods and systems |
US20050278272A1 (en) * | 2004-04-30 | 2005-12-15 | Microsoft Corporation | Method and apparatus for maintaining relationships between parts in a package |
US20060069987A1 (en) * | 2004-09-30 | 2006-03-30 | Microsoft Corporation | Method, apparatus and computer-readable medium for managing specific types of content in an electronic document |
US20060136143A1 (en) * | 2004-12-17 | 2006-06-22 | General Electric Company | Personalized genetic-based analysis of medical conditions |
US20060136467A1 (en) * | 2004-12-17 | 2006-06-22 | General Electric Company | Domain-specific data entity mapping method and system |
US20060136417A1 (en) * | 2004-12-17 | 2006-06-22 | General Electric Company | Method and system for search, analysis and display of structured data |
US20060136466A1 (en) * | 2004-12-17 | 2006-06-22 | General Electric Company | Computer assisted domain specific entity mapping method and system |
US20060136259A1 (en) * | 2004-12-17 | 2006-06-22 | General Electric Company | Multi-dimensional analysis of medical data |
US20060150085A1 (en) * | 2005-01-06 | 2006-07-06 | Microsoft Corporation | Data binding in a word-processing application |
US20060167868A1 (en) * | 2005-01-27 | 2006-07-27 | Weijia Zhang | Universal and extensible packaging process for computer system software integration and deployment |
US20060184489A1 (en) * | 2004-12-17 | 2006-08-17 | General Electric Company | Genetic knowledgebase creation for personalized analysis of medical conditions |
US20060190815A1 (en) * | 2004-12-20 | 2006-08-24 | Microsoft Corporation | Structuring data for word processing documents |
EP1696347A1 (en) * | 2005-02-25 | 2006-08-30 | Microsoft Corporation | Data store for software application documents |
US20060195783A1 (en) * | 2005-01-06 | 2006-08-31 | Microsoft Corporation | Programmability for binding data |
US20060195454A1 (en) * | 2005-01-06 | 2006-08-31 | Microsoft Corporation | XML schema for binding data |
US20060248094A1 (en) * | 2005-04-28 | 2006-11-02 | Microsoft Corporation | Analysis and comparison of portfolios by citation |
US20070061382A1 (en) * | 2005-09-09 | 2007-03-15 | Microsoft Corporation | Real-time synchronization of XML data between applications |
US20070073751A1 (en) * | 2005-09-29 | 2007-03-29 | Morris Robert P | User interfaces and related methods, systems, and computer program products for automatically associating data with a resource as metadata |
US20070073688A1 (en) * | 2005-09-29 | 2007-03-29 | Fry Jared S | Methods, systems, and computer program products for automatically associating data with a resource as metadata based on a characteristic of the resource |
US20070073770A1 (en) * | 2005-09-29 | 2007-03-29 | Morris Robert P | Methods, systems, and computer program products for resource-to-resource metadata association |
US20070078873A1 (en) * | 2005-09-30 | 2007-04-05 | Avinash Gopal B | Computer assisted domain specific entity mapping method and system |
US20070198542A1 (en) * | 2006-02-09 | 2007-08-23 | Morris Robert P | Methods, systems, and computer program products for associating a persistent information element with a resource-executable pair |
US20070198541A1 (en) * | 2006-02-06 | 2007-08-23 | International Business Machines Corporation | Method and system for efficiently storing semantic web statements in a relational database |
US20070198456A1 (en) * | 2006-02-06 | 2007-08-23 | International Business Machines Corporation | Method and system for controlling access to semantic web statements |
US20080154848A1 (en) * | 2006-12-20 | 2008-06-26 | Microsoft Corporation | Search, Analysis and Comparison of Content |
US20080172379A1 (en) * | 2007-01-17 | 2008-07-17 | Fujitsu Limited | Recording medium storing a design support program, design support method, and design support apparatus |
US20090177777A1 (en) * | 2008-01-09 | 2009-07-09 | International Business Machines Corporation | Machine-Processable Semantic Description For Resource Management |
US7673235B2 (en) | 2004-09-30 | 2010-03-02 | Microsoft Corporation | Method and apparatus for utilizing an object model to manage document parts for use in an electronic document |
US7752632B2 (en) | 2004-12-21 | 2010-07-06 | Microsoft Corporation | Method and system for exposing nested data in a computer-generated document in a transparent manner |
US7752224B2 (en) | 2005-02-25 | 2010-07-06 | Microsoft Corporation | Programmability for XML data store for documents |
US7770180B2 (en) | 2004-12-21 | 2010-08-03 | Microsoft Corporation | Exposing embedded data in a computer-generated document |
US20100223252A1 (en) * | 2009-03-02 | 2010-09-02 | Yahoo! Inc. | Method and system for web searching |
US20120203734A1 (en) * | 2009-04-15 | 2012-08-09 | Evri Inc. | Automatic mapping of a location identifier pattern of an object to a semantic type using object metadata |
US8243317B2 (en) | 2004-05-03 | 2012-08-14 | Microsoft Corporation | Hierarchical arrangement for spooling job data |
US8363232B2 (en) | 2004-05-03 | 2013-01-29 | Microsoft Corporation | Strategies for simultaneous peripheral operations on-line using hierarchically structured job information |
US8661332B2 (en) | 2004-04-30 | 2014-02-25 | Microsoft Corporation | Method and apparatus for document processing |
US20150074007A1 (en) * | 2013-09-09 | 2015-03-12 | UnitedLex Corp. | Interactive case management system |
US9607089B2 (en) | 2009-04-15 | 2017-03-28 | Vcvc Iii Llc | Search and search optimization using a pattern of a location identifier |
US10033799B2 (en) | 2002-11-20 | 2018-07-24 | Essential Products, Inc. | Semantically representing a target entity using a semantic object |
CN110275874A (en) * | 2019-02-25 | 2019-09-24 | 广州金越软件技术有限公司 | A kind of intelligent resource inventory method that big data resource is administered |
US10628847B2 (en) | 2009-04-15 | 2020-04-21 | Fiver Llc | Search-enhanced semantic advertising |
US11763260B2 (en) * | 2017-01-12 | 2023-09-19 | Halliburton Energy Services, Inc. | Bridging various standards for drilling projects |
Families Citing this family (201)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8423648B2 (en) * | 1999-06-01 | 2013-04-16 | Yodlee.Com, Inc. | Method and system for verifying state of a transaction between a client and a service over a data-packet-network |
US8165146B1 (en) * | 1999-10-28 | 2012-04-24 | Lightwaves Systems Inc. | System and method for storing/caching, searching for, and accessing data |
US7213024B2 (en) * | 2000-03-09 | 2007-05-01 | The Web Access, Inc. | Method and apparatus for accessing information within an electronic system |
CA2405399A1 (en) * | 2000-04-18 | 2001-10-25 | Web Wombat Pty Ltd. | Retrieving and processing stored information using a distributed network of remote computers |
US6704728B1 (en) * | 2000-05-02 | 2004-03-09 | Iphase.Com, Inc. | Accessing information from a collection of data |
JP4037999B2 (en) * | 2000-05-15 | 2008-01-23 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Website, robot type search engine response system, robot type search engine registration method, storage medium, and program transmission device |
AU2001255714A1 (en) * | 2000-06-13 | 2001-12-24 | Industria Solutions, Incorporated | Systems and methods for the collaborative design, construction, and maintenance of fluid processing plants |
AUPR033800A0 (en) * | 2000-09-25 | 2000-10-19 | Telstra R & D Management Pty Ltd | A document categorisation system |
US7660740B2 (en) * | 2000-10-16 | 2010-02-09 | Ebay Inc. | Method and system for listing items globally and regionally, and customized listing according to currency or shipping area |
US7200627B2 (en) * | 2001-03-21 | 2007-04-03 | Nokia Corporation | Method and apparatus for generating a directory structure |
WO2002078286A2 (en) * | 2001-03-27 | 2002-10-03 | Bea Systems, Inc. | System and method for managing objects and resources with access rights embedded in nodes within a hierarchical tree structure |
US6925457B2 (en) * | 2001-07-27 | 2005-08-02 | Metatomix, Inc. | Methods and apparatus for querying a relational data store using schema-less queries |
US7890517B2 (en) * | 2001-05-15 | 2011-02-15 | Metatomix, Inc. | Appliance for enterprise information integration and enterprise resource interoperability platform and methods |
US7058637B2 (en) * | 2001-05-15 | 2006-06-06 | Metatomix, Inc. | Methods and apparatus for enterprise application integration |
US6856992B2 (en) * | 2001-05-15 | 2005-02-15 | Metatomix, Inc. | Methods and apparatus for real-time business visibility using persistent schema-less data storage |
US20030208499A1 (en) * | 2002-05-03 | 2003-11-06 | David Bigwood | Methods and apparatus for visualizing relationships among triples of resource description framework (RDF) data sets |
US6954749B2 (en) * | 2002-10-07 | 2005-10-11 | Metatomix, Inc. | Methods and apparatus for identifying related nodes in a directed graph having named arcs |
US7302440B2 (en) * | 2001-07-27 | 2007-11-27 | Metatomix, Inc. | Methods and apparatus for statistical data analysis and reduction for an enterprise application |
US6886046B2 (en) * | 2001-06-26 | 2005-04-26 | Citrix Systems, Inc. | Methods and apparatus for extendible information aggregation and presentation |
WO2003007140A2 (en) * | 2001-07-13 | 2003-01-23 | Wind River Systems, Inc. | Directional focus manager |
US6917944B1 (en) * | 2001-08-30 | 2005-07-12 | Cisco Technology, Inc. | Method and apparatus for configuring access to a plurality of data repositories |
WO2003032188A1 (en) * | 2001-10-05 | 2003-04-17 | Vitria Technology, Inc. | System and method for vocabulary-based data transformation |
US7752266B2 (en) | 2001-10-11 | 2010-07-06 | Ebay Inc. | System and method to facilitate translation of communications between entities over a network |
US6944610B2 (en) * | 2001-10-31 | 2005-09-13 | Bellsouth Intellectual Property Corporation | System and method for searching heterogeneous electronic directories |
GB2387085A (en) * | 2002-03-25 | 2003-10-01 | Sony Uk Ltd | System |
US7277924B1 (en) * | 2002-05-07 | 2007-10-02 | Oracle International Corporation | Method and mechanism for a portal website architecture |
US7548957B1 (en) | 2002-05-07 | 2009-06-16 | Oracle International Corporation | Method and mechanism for a portal website architecture |
US8078505B2 (en) | 2002-06-10 | 2011-12-13 | Ebay Inc. | Method and system for automatically updating a seller application utilized in a network-based transaction facility |
US7152059B2 (en) * | 2002-08-30 | 2006-12-19 | Emergency24, Inc. | System and method for predicting additional search results of a computerized database search user based on an initial search query |
US8661498B2 (en) | 2002-09-18 | 2014-02-25 | Symantec Corporation | Secure and scalable detection of preselected data embedded in electronically transmitted messages |
US7472114B1 (en) * | 2002-09-18 | 2008-12-30 | Symantec Corporation | Method and apparatus to define the scope of a search for information from a tabular data source |
US8041719B2 (en) | 2003-05-06 | 2011-10-18 | Symantec Corporation | Personal computing device-based mechanism to detect preselected data |
US8225371B2 (en) * | 2002-09-18 | 2012-07-17 | Symantec Corporation | Method and apparatus for creating an information security policy based on a pre-configured template |
US7673344B1 (en) * | 2002-09-18 | 2010-03-02 | Symantec Corporation | Mechanism to search information content for preselected data |
US7283989B1 (en) * | 2002-09-27 | 2007-10-16 | At&T Bls Intellectual Property, Inc. | System and method for use of application metadata |
US7076497B2 (en) * | 2002-10-11 | 2006-07-11 | Emergency24, Inc. | Method for providing and exchanging search terms between internet site promoters |
US20030088553A1 (en) * | 2002-11-23 | 2003-05-08 | Emergency 24, Inc. | Method for providing relevant search results based on an initial online search query |
US20040133560A1 (en) * | 2003-01-07 | 2004-07-08 | Simske Steven J. | Methods and systems for organizing electronic documents |
JP4405736B2 (en) * | 2003-01-31 | 2010-01-27 | コニカミノルタホールディングス株式会社 | Database system |
US20050005237A1 (en) * | 2003-07-03 | 2005-01-06 | Rail Peter D. | Method for maintaining a centralized, multidimensional master index of documents from independent repositories |
EP1690210A2 (en) * | 2003-07-07 | 2006-08-16 | Metatomix, Inc. | Surveillance, monitoring and real-time events platform |
US8131739B2 (en) * | 2003-08-21 | 2012-03-06 | Microsoft Corporation | Systems and methods for interfacing application programs with an item-based storage platform |
US8238696B2 (en) * | 2003-08-21 | 2012-08-07 | Microsoft Corporation | Systems and methods for the implementation of a digital images schema for organizing units of information manageable by a hardware/software interface system |
US7401104B2 (en) * | 2003-08-21 | 2008-07-15 | Microsoft Corporation | Systems and methods for synchronizing computer systems through an intermediary file system share or device |
US20050055354A1 (en) * | 2003-08-21 | 2005-03-10 | Microsoft Corporation | Systems and methods for representing units of information manageable by a hardware/software interface system but independent of physical representation |
US7483915B2 (en) * | 2003-08-21 | 2009-01-27 | Microsoft Corporation | Systems and method for representing relationships between units of information manageable by a hardware/software interface system |
US7428546B2 (en) * | 2003-08-21 | 2008-09-23 | Microsoft Corporation | Systems and methods for data modeling in an item-based storage platform |
US7739316B2 (en) * | 2003-08-21 | 2010-06-15 | Microsoft Corporation | Systems and methods for the implementation of base schema for organizing units of information manageable by a hardware/software interface system |
US7555497B2 (en) * | 2003-08-21 | 2009-06-30 | Microsoft Corporation | Systems and methods for separating units of information manageable by a hardware/software interface system from their physical organization |
US7590643B2 (en) * | 2003-08-21 | 2009-09-15 | Microsoft Corporation | Systems and methods for extensions and inheritance for units of information manageable by a hardware/software interface system |
US8166101B2 (en) | 2003-08-21 | 2012-04-24 | Microsoft Corporation | Systems and methods for the implementation of a synchronization schemas for units of information manageable by a hardware/software interface system |
US7349913B2 (en) * | 2003-08-21 | 2008-03-25 | Microsoft Corporation | Storage platform for organizing, searching, and sharing data |
US7130819B2 (en) * | 2003-09-30 | 2006-10-31 | Yahoo! Inc. | Method and computer readable medium for search scoring |
US7844589B2 (en) * | 2003-11-18 | 2010-11-30 | Yahoo! Inc. | Method and apparatus for performing a search |
GR1004902B (en) * | 2004-04-19 | 2005-05-27 | Αθανασιος Σαββιδης | Method for searching information in computers and mechanism for storing text "per paragraph" |
US9189568B2 (en) | 2004-04-23 | 2015-11-17 | Ebay Inc. | Method and system to display and search in a language independent manner |
US7778962B2 (en) * | 2004-04-30 | 2010-08-17 | Microsoft Corporation | Client store synchronization through intermediary store change packets |
US7349901B2 (en) | 2004-05-21 | 2008-03-25 | Microsoft Corporation | Search engine spam detection using external data |
US7665063B1 (en) | 2004-05-26 | 2010-02-16 | Pegasystems, Inc. | Integration of declarative rule-based processing with procedural programming |
US7428530B2 (en) * | 2004-07-01 | 2008-09-23 | Microsoft Corporation | Dispersing search engine results by using page category information |
US7363296B1 (en) | 2004-07-01 | 2008-04-22 | Microsoft Corporation | Generating a subindex with relevant attributes to improve querying |
TWI254880B (en) * | 2004-10-18 | 2006-05-11 | Avectec Com Inc | Method for classifying electronic document analysis |
US8335704B2 (en) | 2005-01-28 | 2012-12-18 | Pegasystems Inc. | Methods and apparatus for work management and routing |
US20060184549A1 (en) * | 2005-02-14 | 2006-08-17 | Rowney Kevin T | Method and apparatus for modifying messages based on the presence of pre-selected data |
US8011003B2 (en) * | 2005-02-14 | 2011-08-30 | Symantec Corporation | Method and apparatus for handling messages containing pre-selected data |
US7706895B2 (en) * | 2005-02-25 | 2010-04-27 | Rockwell Automation Technologies, Inc. | Reliable messaging instruction |
US7805422B2 (en) | 2005-02-28 | 2010-09-28 | Microsoft Corporation | Change notification query multiplexing |
US7565351B1 (en) | 2005-03-14 | 2009-07-21 | Rockwell Automation Technologies, Inc. | Automation device data interface |
US7461044B2 (en) * | 2005-04-27 | 2008-12-02 | International Business Machines Corporation | It resource event situation classification and semantics |
US7233830B1 (en) * | 2005-05-31 | 2007-06-19 | Rockwell Automation Technologies, Inc. | Application and service management for industrial control devices |
US9020906B2 (en) * | 2005-08-15 | 2015-04-28 | National Instruments Corporation | Method for intelligent storing and retrieving in an enterprise data system |
US20070168325A1 (en) * | 2006-01-13 | 2007-07-19 | Julian Bourne | System and method for workflow processing using a portable knowledge format |
US20070088704A1 (en) * | 2005-10-17 | 2007-04-19 | Julian Bourne | System and method for creation, distribution, and utilization of portable knowledge format |
US20070100862A1 (en) * | 2005-10-23 | 2007-05-03 | Bindu Reddy | Adding attributes and labels to structured data |
US7933900B2 (en) * | 2005-10-23 | 2011-04-26 | Google Inc. | Search over structured data |
US20070112856A1 (en) * | 2005-11-17 | 2007-05-17 | Aaron Schram | System and method for providing analytics for a communities framework |
US7493329B2 (en) * | 2005-11-17 | 2009-02-17 | Bea Systems, Inc. | System and method for providing generic controls in a communities framework |
US8046696B2 (en) | 2005-11-17 | 2011-10-25 | Oracle International Corporation | System and method for providing active menus in a communities framework |
US7680927B2 (en) * | 2005-11-17 | 2010-03-16 | Bea Systems, Inc. | System and method for providing testing for a communities framework |
US20070113188A1 (en) * | 2005-11-17 | 2007-05-17 | Bales Christopher E | System and method for providing dynamic content in a communities framework |
US20070112781A1 (en) * | 2005-11-17 | 2007-05-17 | Mcmullen Cindy | System and method for providing search controls in a communities framework |
US8255818B2 (en) | 2005-11-17 | 2012-08-28 | Oracle International Corporation | System and method for providing drag and drop functionality in a communities framework |
US8185643B2 (en) * | 2005-11-17 | 2012-05-22 | Oracle International Corporation | System and method for providing security in a communities framework |
US7805459B2 (en) | 2005-11-17 | 2010-09-28 | Bea Systems, Inc. | Extensible controls for a content data repository |
US8078597B2 (en) * | 2005-11-17 | 2011-12-13 | Oracle International Corporation | System and method for providing extensible controls in a communities framework |
US7590687B2 (en) * | 2005-11-17 | 2009-09-15 | Bea Systems, Inc. | System and method for providing notifications in a communities framework |
US20070112799A1 (en) * | 2005-11-17 | 2007-05-17 | Bales Christopher E | System and method for providing resource interlinking for a communities framework |
US20070112798A1 (en) * | 2005-11-17 | 2007-05-17 | Bea Systems, Inc. | System and method for providing unique key stores for a communities framework |
US20070174296A1 (en) * | 2006-01-17 | 2007-07-26 | Andrew Gibbs | Method and system for distributing a database and computer program within a network |
US7657546B2 (en) * | 2006-01-26 | 2010-02-02 | International Business Machines Corporation | Knowledge management system, program product and method |
US7552151B2 (en) * | 2006-02-06 | 2009-06-23 | International Business Machines Corporation | System, method and program product for adding, updating and removing RDF statements stored on a server |
US8924335B1 (en) | 2006-03-30 | 2014-12-30 | Pegasystems Inc. | Rule-based user interface conformance methods |
US20090132232A1 (en) * | 2006-03-30 | 2009-05-21 | Pegasystems Inc. | Methods and apparatus for implementing multilingual software applications |
KR100691400B1 (en) * | 2006-03-31 | 2007-03-12 | 엔에이치엔(주) | Method for analyzing morpheme using additional information and morpheme analyzer for executing the method |
US7890499B1 (en) * | 2006-07-28 | 2011-02-15 | Google Inc. | Presentation of search results with common subject matters |
US8661057B1 (en) * | 2006-07-31 | 2014-02-25 | Elsevier Inc. | Methods and apparatus for post-search automated full-article retrieval |
US8639782B2 (en) | 2006-08-23 | 2014-01-28 | Ebay, Inc. | Method and system for sharing metadata between interfaces |
US8799218B2 (en) | 2006-12-01 | 2014-08-05 | Ebay Inc. | Business channel synchronization |
US7987185B1 (en) | 2006-12-29 | 2011-07-26 | Google Inc. | Ranking custom search results |
US8250525B2 (en) | 2007-03-02 | 2012-08-21 | Pegasystems Inc. | Proactive performance management for multi-user enterprise software systems |
US20080270462A1 (en) * | 2007-04-24 | 2008-10-30 | Interse A/S | System and Method of Uniformly Classifying Information Objects with Metadata Across Heterogeneous Data Stores |
US7823761B2 (en) * | 2007-05-16 | 2010-11-02 | The Invention Science Fund I, Llc | Maneuverable surgical stapler |
US8010567B2 (en) * | 2007-06-08 | 2011-08-30 | GM Global Technology Operations LLC | Federated ontology index to enterprise knowledge |
US8843471B2 (en) | 2007-08-14 | 2014-09-23 | At&T Intellectual Property I, L.P. | Method and apparatus for providing traffic-based content acquisition and indexing |
US9268849B2 (en) * | 2007-09-07 | 2016-02-23 | Alexander Siedlecki | Apparatus and methods for web marketing tools for digital archives—web portal advertising arts |
US10733223B2 (en) * | 2008-01-08 | 2020-08-04 | International Business Machines Corporation | Term-driven records file plan and thesaurus design |
US8112424B2 (en) * | 2008-03-11 | 2012-02-07 | International Business Machines Corporation | Flexible and resilient information collaboration management infrastructure |
US7996374B1 (en) | 2008-03-28 | 2011-08-09 | Symantec Corporation | Method and apparatus for automatically correlating related incidents of policy violations |
US7996373B1 (en) | 2008-03-28 | 2011-08-09 | Symantec Corporation | Method and apparatus for detecting policy violations in a data repository having an arbitrary data schema |
US8065739B1 (en) | 2008-03-28 | 2011-11-22 | Symantec Corporation | Detecting policy violations in information content containing data in a character-based language |
US8140531B2 (en) * | 2008-05-02 | 2012-03-20 | International Business Machines Corporation | Process and method for classifying structured data |
US8990896B2 (en) | 2008-06-24 | 2015-03-24 | Microsoft Technology Licensing, Llc | Extensible mechanism for securing objects using claims |
US8001154B2 (en) * | 2008-06-26 | 2011-08-16 | Microsoft Corporation | Library description of the user interface for federated search results |
US8037525B2 (en) * | 2008-07-16 | 2011-10-11 | International Business Machines Corporation | Access control and entitlement determination for hierarchically organized content |
US8826443B1 (en) | 2008-09-18 | 2014-09-02 | Symantec Corporation | Selective removal of protected content from web requests sent to an interactive website |
US10481878B2 (en) * | 2008-10-09 | 2019-11-19 | Objectstore, Inc. | User interface apparatus and methods |
EP2353112A2 (en) * | 2008-10-24 | 2011-08-10 | Indigo Biosystems, Inc. | Storage of complex data |
US8843435B1 (en) | 2009-03-12 | 2014-09-23 | Pegasystems Inc. | Techniques for dynamic data processing |
US8935752B1 (en) | 2009-03-23 | 2015-01-13 | Symantec Corporation | System and method for identity consolidation |
US8468492B1 (en) | 2009-03-30 | 2013-06-18 | Pegasystems, Inc. | System and method for creation and modification of software applications |
US20110153680A1 (en) * | 2009-12-23 | 2011-06-23 | Brinks Hofer Gilson & Lione | Automated document classification and routing |
US8689004B2 (en) | 2010-11-05 | 2014-04-01 | Microsoft Corporation | Pluggable claim providers |
US8880487B1 (en) | 2011-02-18 | 2014-11-04 | Pegasystems Inc. | Systems and methods for distributed rules processing |
JP5389130B2 (en) * | 2011-09-15 | 2014-01-15 | 株式会社東芝 | Document classification apparatus, method and program |
US9195936B1 (en) | 2011-12-30 | 2015-11-24 | Pegasystems Inc. | System and method for updating or modifying an application without manual coding |
US9235603B2 (en) * | 2012-03-27 | 2016-01-12 | Verizon Patent And Licensing Inc. | Activity based search |
EP2836920A4 (en) | 2012-04-09 | 2015-12-02 | Vivek Ventures Llc | Clustered information processing and searching with structured-unstructured database bridge |
US10469396B2 (en) | 2014-10-10 | 2019-11-05 | Pegasystems, Inc. | Event processing with enhanced throughput |
US20220164840A1 (en) | 2016-04-01 | 2022-05-26 | OneTrust, LLC | Data processing systems and methods for integrating privacy information management systems with data loss prevention tools or other tools for privacy design |
US10698599B2 (en) | 2016-06-03 | 2020-06-30 | Pegasystems, Inc. | Connecting graphical shapes using gestures |
US11625502B2 (en) | 2016-06-10 | 2023-04-11 | OneTrust, LLC | Data processing systems for identifying and modifying processes that are subject to data subject access requests |
US11475136B2 (en) | 2016-06-10 | 2022-10-18 | OneTrust, LLC | Data processing systems for data transfer risk identification and related methods |
US11222139B2 (en) | 2016-06-10 | 2022-01-11 | OneTrust, LLC | Data processing systems and methods for automatic discovery and assessment of mobile software development kits |
US11134086B2 (en) | 2016-06-10 | 2021-09-28 | OneTrust, LLC | Consent conversion optimization systems and related methods |
US11366786B2 (en) | 2016-06-10 | 2022-06-21 | OneTrust, LLC | Data processing systems for processing data subject access requests |
US11675929B2 (en) | 2016-06-10 | 2023-06-13 | OneTrust, LLC | Data processing consent sharing systems and related methods |
US10909488B2 (en) | 2016-06-10 | 2021-02-02 | OneTrust, LLC | Data processing systems for assessing readiness for responding to privacy-related incidents |
US10284604B2 (en) | 2016-06-10 | 2019-05-07 | OneTrust, LLC | Data processing and scanning systems for generating and populating a data inventory |
US10740487B2 (en) | 2016-06-10 | 2020-08-11 | OneTrust, LLC | Data processing systems and methods for populating and maintaining a centralized database of personal data |
US11354435B2 (en) | 2016-06-10 | 2022-06-07 | OneTrust, LLC | Data processing systems for data testing to confirm data deletion and related methods |
US11188862B2 (en) | 2016-06-10 | 2021-11-30 | OneTrust, LLC | Privacy management systems and methods |
US11461500B2 (en) | 2016-06-10 | 2022-10-04 | OneTrust, LLC | Data processing systems for cookie compliance testing with website scanning and related methods |
US11636171B2 (en) | 2016-06-10 | 2023-04-25 | OneTrust, LLC | Data processing user interface monitoring systems and related methods |
US11188615B2 (en) | 2016-06-10 | 2021-11-30 | OneTrust, LLC | Data processing consent capture systems and related methods |
US11481710B2 (en) | 2016-06-10 | 2022-10-25 | OneTrust, LLC | Privacy management systems and methods |
US11438386B2 (en) | 2016-06-10 | 2022-09-06 | OneTrust, LLC | Data processing systems for data-transfer risk identification, cross-border visualization generation, and related methods |
US11651104B2 (en) | 2016-06-10 | 2023-05-16 | OneTrust, LLC | Consent receipt management systems and related methods |
US12052289B2 (en) | 2016-06-10 | 2024-07-30 | OneTrust, LLC | Data processing systems for data-transfer risk identification, cross-border visualization generation, and related methods |
US11562097B2 (en) | 2016-06-10 | 2023-01-24 | OneTrust, LLC | Data processing systems for central consent repository and related methods |
US11520928B2 (en) | 2016-06-10 | 2022-12-06 | OneTrust, LLC | Data processing systems for generating personal data receipts and related methods |
US11403377B2 (en) | 2016-06-10 | 2022-08-02 | OneTrust, LLC | Privacy management systems and methods |
US11416798B2 (en) | 2016-06-10 | 2022-08-16 | OneTrust, LLC | Data processing systems and methods for providing training in a vendor procurement process |
US11341447B2 (en) | 2016-06-10 | 2022-05-24 | OneTrust, LLC | Privacy management systems and methods |
US11416589B2 (en) | 2016-06-10 | 2022-08-16 | OneTrust, LLC | Data processing and scanning systems for assessing vendor risk |
US11416109B2 (en) | 2016-06-10 | 2022-08-16 | OneTrust, LLC | Automated data processing systems and methods for automatically processing data subject access requests using a chatbot |
US12136055B2 (en) | 2016-06-10 | 2024-11-05 | OneTrust, LLC | Data processing systems for identifying, assessing, and remediating data processing risks using data modeling techniques |
US10909265B2 (en) | 2016-06-10 | 2021-02-02 | OneTrust, LLC | Application privacy scanning systems and related methods |
US11222142B2 (en) | 2016-06-10 | 2022-01-11 | OneTrust, LLC | Data processing systems for validating authorization for personal data collection, storage, and processing |
US10997318B2 (en) | 2016-06-10 | 2021-05-04 | OneTrust, LLC | Data processing systems for generating and populating a data inventory for processing data access requests |
US12118121B2 (en) | 2016-06-10 | 2024-10-15 | OneTrust, LLC | Data subject access request processing systems and related methods |
US11227247B2 (en) | 2016-06-10 | 2022-01-18 | OneTrust, LLC | Data processing systems and methods for bundled privacy policies |
US11586700B2 (en) | 2016-06-10 | 2023-02-21 | OneTrust, LLC | Data processing systems and methods for automatically blocking the use of tracking tools |
US11336697B2 (en) | 2016-06-10 | 2022-05-17 | OneTrust, LLC | Data processing systems for data-transfer risk identification, cross-border visualization generation, and related methods |
US11418492B2 (en) | 2016-06-10 | 2022-08-16 | OneTrust, LLC | Data processing systems and methods for using a data model to select a target data asset in a data migration |
US11392720B2 (en) | 2016-06-10 | 2022-07-19 | OneTrust, LLC | Data processing systems for verification of consent and notice processing and related methods |
US10318761B2 (en) | 2016-06-10 | 2019-06-11 | OneTrust, LLC | Data processing systems and methods for auditing data request compliance |
US12045266B2 (en) | 2016-06-10 | 2024-07-23 | OneTrust, LLC | Data processing systems for generating and populating a data inventory |
US11366909B2 (en) | 2016-06-10 | 2022-06-21 | OneTrust, LLC | Data processing and scanning systems for assessing vendor risk |
US11294939B2 (en) | 2016-06-10 | 2022-04-05 | OneTrust, LLC | Data processing systems and methods for automatically detecting and documenting privacy-related aspects of computer software |
US11651106B2 (en) | 2016-06-10 | 2023-05-16 | OneTrust, LLC | Data processing systems for fulfilling data subject access requests and related methods |
US10949565B2 (en) | 2016-06-10 | 2021-03-16 | OneTrust, LLC | Data processing systems for generating and populating a data inventory |
US11727141B2 (en) | 2016-06-10 | 2023-08-15 | OneTrust, LLC | Data processing systems and methods for synching privacy-related user consent across multiple computing devices |
US11416590B2 (en) | 2016-06-10 | 2022-08-16 | OneTrust, LLC | Data processing and scanning systems for assessing vendor risk |
US10846433B2 (en) | 2016-06-10 | 2020-11-24 | OneTrust, LLC | Data processing consent management systems and related methods |
US11354434B2 (en) | 2016-06-10 | 2022-06-07 | OneTrust, LLC | Data processing systems for verification of consent and notice processing and related methods |
US10678945B2 (en) | 2016-06-10 | 2020-06-09 | OneTrust, LLC | Consent receipt management systems and related methods |
US11544667B2 (en) | 2016-06-10 | 2023-01-03 | OneTrust, LLC | Data processing systems for generating and populating a data inventory |
US11410106B2 (en) | 2016-06-10 | 2022-08-09 | OneTrust, LLC | Privacy management systems and methods |
US11343284B2 (en) | 2016-06-10 | 2022-05-24 | OneTrust, LLC | Data processing systems and methods for performing privacy assessments and monitoring of new versions of computer code for privacy compliance |
US10698647B2 (en) | 2016-07-11 | 2020-06-30 | Pegasystems Inc. | Selective sharing for collaborative application usage |
US10013577B1 (en) | 2017-06-16 | 2018-07-03 | OneTrust, LLC | Data processing systems for identifying whether cookies contain personally identifying information |
US11048488B2 (en) | 2018-08-14 | 2021-06-29 | Pegasystems, Inc. | Software code optimizer and method |
US11544409B2 (en) | 2018-09-07 | 2023-01-03 | OneTrust, LLC | Data processing systems and methods for automatically protecting sensitive data within privacy management systems |
US10803202B2 (en) | 2018-09-07 | 2020-10-13 | OneTrust, LLC | Data processing systems for orphaned data identification and deletion and related methods |
EP4179435B1 (en) | 2020-07-08 | 2024-09-04 | OneTrust LLC | Systems and methods for targeted data discovery |
WO2022026564A1 (en) | 2020-07-28 | 2022-02-03 | OneTrust, LLC | Systems and methods for automatically blocking the use of tracking tools |
US11475165B2 (en) | 2020-08-06 | 2022-10-18 | OneTrust, LLC | Data processing systems and methods for automatically redacting unstructured data from a data subject access request |
US11567945B1 (en) | 2020-08-27 | 2023-01-31 | Pegasystems Inc. | Customized digital content generation systems and methods |
WO2022060860A1 (en) | 2020-09-15 | 2022-03-24 | OneTrust, LLC | Data processing systems and methods for detecting tools for the automatic blocking of consent requests |
US11526624B2 (en) | 2020-09-21 | 2022-12-13 | OneTrust, LLC | Data processing systems and methods for automatically detecting target data transfers and target data processing |
US11397819B2 (en) | 2020-11-06 | 2022-07-26 | OneTrust, LLC | Systems and methods for identifying data processing activities based on data discovery results |
US12038979B2 (en) * | 2020-11-25 | 2024-07-16 | International Business Machines Corporation | Metadata indexing for information management using both data records and associated metadata records |
WO2022159901A1 (en) * | 2021-01-25 | 2022-07-28 | OneTrust, LLC | Systems and methods for discovery, classification, and indexing of data in a native computing system |
WO2022170047A1 (en) | 2021-02-04 | 2022-08-11 | OneTrust, LLC | Managing custom attributes for domain objects defined within microservices |
US11494515B2 (en) | 2021-02-08 | 2022-11-08 | OneTrust, LLC | Data processing systems and methods for anonymizing data samples in classification analysis |
WO2022173912A1 (en) | 2021-02-10 | 2022-08-18 | OneTrust, LLC | Systems and methods for mitigating risks of third-party computing system functionality integration into a first-party computing system |
US11775348B2 (en) | 2021-02-17 | 2023-10-03 | OneTrust, LLC | Managing custom workflows for domain objects defined within microservices |
US11546661B2 (en) | 2021-02-18 | 2023-01-03 | OneTrust, LLC | Selective redaction of media content |
EP4305539A1 (en) | 2021-03-08 | 2024-01-17 | OneTrust, LLC | Data transfer discovery and analysis systems and related methods |
US11562078B2 (en) | 2021-04-16 | 2023-01-24 | OneTrust, LLC | Assessing and managing computational risk involved with integrating third party computing functionality within a computing system |
US11620142B1 (en) | 2022-06-03 | 2023-04-04 | OneTrust, LLC | Generating and customizing user interfaces for demonstrating functions of interactive user environments |
Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5708825A (en) * | 1995-05-26 | 1998-01-13 | Iconovex Corporation | Automatic summary page creation and hyperlink generation |
US6151624A (en) * | 1998-02-03 | 2000-11-21 | Realnames Corporation | Navigating network resources based on metadata |
US6182066B1 (en) * | 1997-11-26 | 2001-01-30 | International Business Machines Corp. | Category processing of query topics and electronic document content topics |
US6223575B1 (en) * | 1998-08-24 | 2001-05-01 | Kusakabe Electric & Machinery Co., Ltd. | Tube forming machine using three point bending |
US6236991B1 (en) * | 1997-11-26 | 2001-05-22 | International Business Machines Corp. | Method and system for providing access for categorized information from online internet and intranet sources |
US6253239B1 (en) * | 1997-09-23 | 2001-06-26 | Information Architects Corporation | System for indexing and display requested data having heterogeneous content and representation |
US6301579B1 (en) * | 1998-10-20 | 2001-10-09 | Silicon Graphics, Inc. | Method, system, and computer program product for visualizing a data structure |
US6366575B1 (en) * | 1996-11-01 | 2002-04-02 | Teloquent Communications Corporation | Extended access for automatic call distributing system |
US6397209B1 (en) * | 1996-08-30 | 2002-05-28 | Telexis Corporation | Real time structured summary search engine |
US6480835B1 (en) * | 1998-12-31 | 2002-11-12 | Intel Corporation | Method and system for searching on integrated metadata |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6233575B1 (en) * | 1997-06-24 | 2001-05-15 | International Business Machines Corporation | Multilevel taxonomy based on features derived from training documents classification using fisher values as discrimination values |
-
2000
- 2000-01-21 US US09/489,735 patent/US6701314B1/en not_active Expired - Lifetime
-
2004
- 2004-01-21 US US10/760,472 patent/US20040153467A1/en not_active Abandoned
Patent Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5708825A (en) * | 1995-05-26 | 1998-01-13 | Iconovex Corporation | Automatic summary page creation and hyperlink generation |
US6397209B1 (en) * | 1996-08-30 | 2002-05-28 | Telexis Corporation | Real time structured summary search engine |
US6366575B1 (en) * | 1996-11-01 | 2002-04-02 | Teloquent Communications Corporation | Extended access for automatic call distributing system |
US6253239B1 (en) * | 1997-09-23 | 2001-06-26 | Information Architects Corporation | System for indexing and display requested data having heterogeneous content and representation |
US6182066B1 (en) * | 1997-11-26 | 2001-01-30 | International Business Machines Corp. | Category processing of query topics and electronic document content topics |
US6236991B1 (en) * | 1997-11-26 | 2001-05-22 | International Business Machines Corp. | Method and system for providing access for categorized information from online internet and intranet sources |
US6151624A (en) * | 1998-02-03 | 2000-11-21 | Realnames Corporation | Navigating network resources based on metadata |
US6223575B1 (en) * | 1998-08-24 | 2001-05-01 | Kusakabe Electric & Machinery Co., Ltd. | Tube forming machine using three point bending |
US6301579B1 (en) * | 1998-10-20 | 2001-10-09 | Silicon Graphics, Inc. | Method, system, and computer program product for visualizing a data structure |
US6480835B1 (en) * | 1998-12-31 | 2002-11-12 | Intel Corporation | Method and system for searching on integrated metadata |
Cited By (80)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10033799B2 (en) | 2002-11-20 | 2018-07-24 | Essential Products, Inc. | Semantically representing a target entity using a semantic object |
US7383500B2 (en) * | 2004-04-30 | 2008-06-03 | Microsoft Corporation | Methods and systems for building packages that contain pre-paginated documents |
US20050248790A1 (en) * | 2004-04-30 | 2005-11-10 | David Ornstein | Method and apparatus for interleaving parts of a document |
US20050251740A1 (en) * | 2004-04-30 | 2005-11-10 | Microsoft Corporation | Methods and systems for building packages that contain pre-paginated documents |
US7752235B2 (en) | 2004-04-30 | 2010-07-06 | Microsoft Corporation | Method and apparatus for maintaining relationships between parts in a package |
US20050268221A1 (en) * | 2004-04-30 | 2005-12-01 | Microsoft Corporation | Modular document format |
US20050273701A1 (en) * | 2004-04-30 | 2005-12-08 | Emerson Daniel F | Document mark up methods and systems |
US20050278272A1 (en) * | 2004-04-30 | 2005-12-15 | Microsoft Corporation | Method and apparatus for maintaining relationships between parts in a package |
US7836094B2 (en) | 2004-04-30 | 2010-11-16 | Microsoft Corporation | Method and apparatus for maintaining relationships between parts in a package |
US8122350B2 (en) | 2004-04-30 | 2012-02-21 | Microsoft Corporation | Packages that contain pre-paginated documents |
US8661332B2 (en) | 2004-04-30 | 2014-02-25 | Microsoft Corporation | Method and apparatus for document processing |
US8243317B2 (en) | 2004-05-03 | 2012-08-14 | Microsoft Corporation | Hierarchical arrangement for spooling job data |
US20050243346A1 (en) * | 2004-05-03 | 2005-11-03 | Microsoft Corporation | Planar mapping of graphical elements |
US20050243355A1 (en) * | 2004-05-03 | 2005-11-03 | Microsoft Corporation | Systems and methods for support of various processing capabilities |
US20050262134A1 (en) * | 2004-05-03 | 2005-11-24 | Microsoft Corporation | Spooling strategies using structured job information |
US7755786B2 (en) | 2004-05-03 | 2010-07-13 | Microsoft Corporation | Systems and methods for support of various processing capabilities |
US8024648B2 (en) | 2004-05-03 | 2011-09-20 | Microsoft Corporation | Planar mapping of graphical elements |
US8363232B2 (en) | 2004-05-03 | 2013-01-29 | Microsoft Corporation | Strategies for simultaneous peripheral operations on-line using hierarchically structured job information |
US8639723B2 (en) | 2004-05-03 | 2014-01-28 | Microsoft Corporation | Spooling strategies using structured job information |
US9110877B2 (en) | 2004-09-30 | 2015-08-18 | Microsoft Technology Licensing, Llc | Method and apparatus for utilizing an extensible markup language schema for managing specific types of content in an electronic document |
US20060080590A1 (en) * | 2004-09-30 | 2006-04-13 | Microsoft Corporation | Method and apparatus for utilizing an extensible markup language schema for managing specific types of content in an electronic document |
US7707498B2 (en) | 2004-09-30 | 2010-04-27 | Microsoft Corporation | Specific type content manager in an electronic document |
US7712016B2 (en) | 2004-09-30 | 2010-05-04 | Microsoft Corporation | Method and apparatus for utilizing an object model for managing content regions in an electronic document |
US20060069987A1 (en) * | 2004-09-30 | 2006-03-30 | Microsoft Corporation | Method, apparatus and computer-readable medium for managing specific types of content in an electronic document |
US7673235B2 (en) | 2004-09-30 | 2010-03-02 | Microsoft Corporation | Method and apparatus for utilizing an object model to manage document parts for use in an electronic document |
US20060069989A1 (en) * | 2004-09-30 | 2006-03-30 | Microsoft Corporation | Method and apparatus for utilizing an object model for managing content regions in an electronic document |
US20060136143A1 (en) * | 2004-12-17 | 2006-06-22 | General Electric Company | Personalized genetic-based analysis of medical conditions |
US20060136467A1 (en) * | 2004-12-17 | 2006-06-22 | General Electric Company | Domain-specific data entity mapping method and system |
US20060136259A1 (en) * | 2004-12-17 | 2006-06-22 | General Electric Company | Multi-dimensional analysis of medical data |
US20060136466A1 (en) * | 2004-12-17 | 2006-06-22 | General Electric Company | Computer assisted domain specific entity mapping method and system |
US20060184489A1 (en) * | 2004-12-17 | 2006-08-17 | General Electric Company | Genetic knowledgebase creation for personalized analysis of medical conditions |
US20060136417A1 (en) * | 2004-12-17 | 2006-06-22 | General Electric Company | Method and system for search, analysis and display of structured data |
US20060190815A1 (en) * | 2004-12-20 | 2006-08-24 | Microsoft Corporation | Structuring data for word processing documents |
US7770180B2 (en) | 2004-12-21 | 2010-08-03 | Microsoft Corporation | Exposing embedded data in a computer-generated document |
US7752632B2 (en) | 2004-12-21 | 2010-07-06 | Microsoft Corporation | Method and system for exposing nested data in a computer-generated document in a transparent manner |
US20060195783A1 (en) * | 2005-01-06 | 2006-08-31 | Microsoft Corporation | Programmability for binding data |
US7945590B2 (en) | 2005-01-06 | 2011-05-17 | Microsoft Corporation | Programmability for binding data |
US7617234B2 (en) | 2005-01-06 | 2009-11-10 | Microsoft Corporation | XML schema for binding data |
US20060195454A1 (en) * | 2005-01-06 | 2006-08-31 | Microsoft Corporation | XML schema for binding data |
US20060150085A1 (en) * | 2005-01-06 | 2006-07-06 | Microsoft Corporation | Data binding in a word-processing application |
US7730394B2 (en) | 2005-01-06 | 2010-06-01 | Microsoft Corporation | Data binding in a word-processing application |
US20060167868A1 (en) * | 2005-01-27 | 2006-07-27 | Weijia Zhang | Universal and extensible packaging process for computer system software integration and deployment |
AU2006200047B2 (en) * | 2005-02-25 | 2011-02-03 | Microsoft Technology Licensing, Llc | Data store for software application documents |
US20060195777A1 (en) * | 2005-02-25 | 2006-08-31 | Microsoft Corporation | Data store for software application documents |
US7752224B2 (en) | 2005-02-25 | 2010-07-06 | Microsoft Corporation | Programmability for XML data store for documents |
US7668873B2 (en) | 2005-02-25 | 2010-02-23 | Microsoft Corporation | Data store for software application documents |
EP1696347A1 (en) * | 2005-02-25 | 2006-08-30 | Microsoft Corporation | Data store for software application documents |
US20060248094A1 (en) * | 2005-04-28 | 2006-11-02 | Microsoft Corporation | Analysis and comparison of portfolios by citation |
US7953696B2 (en) | 2005-09-09 | 2011-05-31 | Microsoft Corporation | Real-time synchronization of XML data between applications |
US20070061382A1 (en) * | 2005-09-09 | 2007-03-15 | Microsoft Corporation | Real-time synchronization of XML data between applications |
US20070073770A1 (en) * | 2005-09-29 | 2007-03-29 | Morris Robert P | Methods, systems, and computer program products for resource-to-resource metadata association |
US20070073751A1 (en) * | 2005-09-29 | 2007-03-29 | Morris Robert P | User interfaces and related methods, systems, and computer program products for automatically associating data with a resource as metadata |
US20100332559A1 (en) * | 2005-09-29 | 2010-12-30 | Fry Jared S | Methods, Systems, And Computer Program Products For Automatically Associating Data With A Resource As Metadata Based On A Characteristic Of The Resource |
US20070073688A1 (en) * | 2005-09-29 | 2007-03-29 | Fry Jared S | Methods, systems, and computer program products for automatically associating data with a resource as metadata based on a characteristic of the resource |
US7797337B2 (en) | 2005-09-29 | 2010-09-14 | Scenera Technologies, Llc | Methods, systems, and computer program products for automatically associating data with a resource as metadata based on a characteristic of the resource |
US9280544B2 (en) | 2005-09-29 | 2016-03-08 | Scenera Technologies, Llc | Methods, systems, and computer program products for automatically associating data with a resource as metadata based on a characteristic of the resource |
US20070078873A1 (en) * | 2005-09-30 | 2007-04-05 | Avinash Gopal B | Computer assisted domain specific entity mapping method and system |
US7840542B2 (en) | 2006-02-06 | 2010-11-23 | International Business Machines Corporation | Method and system for controlling access to semantic web statements |
US20070198456A1 (en) * | 2006-02-06 | 2007-08-23 | International Business Machines Corporation | Method and system for controlling access to semantic web statements |
US20070198541A1 (en) * | 2006-02-06 | 2007-08-23 | International Business Machines Corporation | Method and system for efficiently storing semantic web statements in a relational database |
US20070198542A1 (en) * | 2006-02-09 | 2007-08-23 | Morris Robert P | Methods, systems, and computer program products for associating a persistent information element with a resource-executable pair |
US8065307B2 (en) | 2006-12-20 | 2011-11-22 | Microsoft Corporation | Parsing, analysis and scoring of document content |
US20080154848A1 (en) * | 2006-12-20 | 2008-06-26 | Microsoft Corporation | Search, Analysis and Comparison of Content |
US8019761B2 (en) * | 2007-01-17 | 2011-09-13 | Fujitsu Limited | Recording medium storing a design support program, design support method, and design support apparatus |
US20080172379A1 (en) * | 2007-01-17 | 2008-07-17 | Fujitsu Limited | Recording medium storing a design support program, design support method, and design support apparatus |
US8140680B2 (en) * | 2008-01-09 | 2012-03-20 | International Business Machines Corporation | Machine-processable semantic description for resource management |
US20090177777A1 (en) * | 2008-01-09 | 2009-07-09 | International Business Machines Corporation | Machine-Processable Semantic Description For Resource Management |
US20100223252A1 (en) * | 2009-03-02 | 2010-09-02 | Yahoo! Inc. | Method and system for web searching |
US9477763B2 (en) * | 2009-03-02 | 2016-10-25 | Excalibur IP, LC | Personalized search results utilizing previously navigated web sites |
US9934315B2 (en) | 2009-03-02 | 2018-04-03 | Excalibur Ip, Llc | Method and system for web searching |
US10628847B2 (en) | 2009-04-15 | 2020-04-21 | Fiver Llc | Search-enhanced semantic advertising |
US20120203734A1 (en) * | 2009-04-15 | 2012-08-09 | Evri Inc. | Automatic mapping of a location identifier pattern of an object to a semantic type using object metadata |
US9607089B2 (en) | 2009-04-15 | 2017-03-28 | Vcvc Iii Llc | Search and search optimization using a pattern of a location identifier |
US9613149B2 (en) * | 2009-04-15 | 2017-04-04 | Vcvc Iii Llc | Automatic mapping of a location identifier pattern of an object to a semantic type using object metadata |
US20150074007A1 (en) * | 2013-09-09 | 2015-03-12 | UnitedLex Corp. | Interactive case management system |
US10453071B2 (en) * | 2013-09-09 | 2019-10-22 | UnitedLex Corp. | Interactive case management system |
US11803860B2 (en) | 2013-09-09 | 2023-10-31 | UnitedLex Corp. | Email mappings |
US11978057B2 (en) | 2013-09-09 | 2024-05-07 | UnitedLex Corp. | Single instance storage of metadata and extracted text |
US11763260B2 (en) * | 2017-01-12 | 2023-09-19 | Halliburton Energy Services, Inc. | Bridging various standards for drilling projects |
CN110275874A (en) * | 2019-02-25 | 2019-09-24 | 广州金越软件技术有限公司 | A kind of intelligent resource inventory method that big data resource is administered |
Also Published As
Publication number | Publication date |
---|---|
US6701314B1 (en) | 2004-03-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6701314B1 (en) | System and method for cataloguing digital information for searching and retrieval | |
Benjelloun et al. | Google dataset search by the numbers | |
US8510339B1 (en) | Searching content using a dimensional database | |
US7124358B2 (en) | Method for dynamically generating reference identifiers in structured information | |
US20020065857A1 (en) | System and method for analysis and clustering of documents for search engine | |
Jenkins et al. | Automatic RDF metadata generation for resource discovery | |
US20070185860A1 (en) | System for searching | |
Guptill | Metadata and data catalogues | |
AU6509800A (en) | Indexing a network with agents | |
US20070271228A1 (en) | Documentary search procedure in a distributed system | |
López et al. | An efficient and scalable search engine for models | |
Jepsen et al. | Characteristics of scientific Web publications: Preliminary data gathering and analysis | |
Desai et al. | Resource discovery: modelling, cataloguing and searching | |
Roszkowski et al. | A distributed architecture for resource discovery using metadata | |
Wariyapola et al. | Ontology and metadata creation for the poseidon distributed coastal zone management system | |
Wang et al. | An application specific knowledge engine for researches in intelligent transportation systems | |
Francisco‐Revilla et al. | Encoded archival description: Data quality and analysis | |
Lam | The Overview of Web Search Engines | |
Foulonneau et al. | Strategies for reprocessing aggregated metadata | |
Tolosana-Calasanz et al. | CatServer: a server of GATOS | |
Heery et al. | Metadata | |
Hughes et al. | Intelligent resource discovery using ontology-based resource profiles | |
Chen et al. | SE4SC: A specific search engine for software components | |
Pouchard et al. | Data Grid discovery and Semantic Web technologies for the earth sciences | |
Zapilko et al. | A LOD backend infrastructure for scientific search portals |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: SCIENCE APPLICATIONS INTERNATIONAL CORPORATION, CA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:CONOVER, JOAN EVELYN;ANTHONY, DOUGLAS MCCOY;REEL/FRAME:014911/0211 Effective date: 20011101 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |