DOI:10.1145/1376616.1376750
demonstration

SchemaScope: a system for inferring and cleaning XML schemas

Published: 09 June 2008

Abstract

We present SchemaScope, a system to derive Document Type Definitions and XML schema from a corpus of sample XML documents. Tools are provided to visualize, clean and refine existing or inferred schemas. A number of use cases illustrate the versatility of the system, as well as various types of applications.

Published In

SIGMOD '08: Proceedings of the 2008 ACM SIGMOD international conference on Management of data
June 2008
1396 pages
ISBN:9781605581026
DOI:10.1145/1376616
Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. regular expressions
  2. schema inference
  3. xml

Qualifiers

  • Demonstration

Conference

SIGMOD/PODS '08

Acceptance Rates

Overall Acceptance Rate 785 of 4,003 submissions, 20%


Cited By

  • (2022) Designing XML Schema Inference Algorithm for Intra-enterprise Use. Perspectives in Business Informatics Research, pages 35-49. Online publication date: 16-Sep-2022. DOI:10.1007/978-3-031-16947-2_3
  • (2014) Discovering XSD Keys from XML Data. ACM Transactions on Database Systems 39:4, pages 1-49. Online publication date: 30-Dec-2014. DOI:10.1145/2638547
  • (2013) Discovering XSD keys from XML data. Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, pages 61-72. Online publication date: 22-Jun-2013. DOI:10.1145/2463676.2463705
  • (2013) Inference of XML Integrity Constraints. Advances in Databases and Information Systems, pages 285-296. Online publication date: 2013. DOI:10.1007/978-3-642-32741-4_26
  • (2012) Foundations of XML based on logic and automata. Proceedings of the 7th international conference on Foundations of Information and Knowledge Systems, pages 23-33. Online publication date: 5-Mar-2012. DOI:10.1007/978-3-642-28472-4_2
  • (2011) XEvolve. Proceedings of the 2011 ACM Symposium on Applied Computing, pages 1645-1650. Online publication date: 21-Mar-2011. DOI:10.1145/1982185.1982530
  • (2010) Ambiguous content and disambiguation of XML schemata. Proceedings of the Fourteenth International Database Engineering & Applications Symposium, pages 75-81. Online publication date: 16-Aug-2010. DOI:10.1145/1866480.1866492
  • (2010) A context-free markup language for semi-structured text. ACM SIGPLAN Notices 45:6, pages 221-232. Online publication date: 5-Jun-2010. DOI:10.1145/1809028.1806622
  • (2010) A context-free markup language for semi-structured text. Proceedings of the 31st ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 221-232. Online publication date: 5-Jun-2010. DOI:10.1145/1806596.1806622
  • (2009) Simplifying XML schema. Proceedings of the 2009 ACM SIGMOD International Conference on Management of data, pages 731-744. Online publication date: 29-Jun-2009. DOI:10.1145/1559845.1559922
