
Semantic Text Summarization Based on Syntactic Patterns

Published: 01 October 2013

Abstract

Text summarization is the machine-based generation of a shortened version of a text. The summary should be a non-redundant extract from the original text. Most research on text summarization uses sentence extraction rather than abstraction to produce a summary. Extraction relies mainly on sentences already contained in the original input, which makes it more accurate and more concise. When all input articles cover a particular event, however, extracting similar sentences produces a highly repetitive summary. In this paper, a novel model for text summarization is proposed, based on removing non-effective sentences when producing an extract from the text. The model applies semantic analysis by evaluating sentence similarity. This similarity is obtained by evaluating the similarity of individual words as well as the syntactic relationships between neighboring words. These relationships are referred to throughout the model as syntactic patterns. Word senses and the corresponding part of speech of each word in context are supplied during the semantic processing of matched patterns. Introducing knowledge of syntactic patterns supports text reduction by mapping the matched patterns onto summarized ones. In addition, syntactic patterns draw on the evaluation of sentence relatedness to decide which sentences to keep and which to drop. Experiments show that the model presented in the paper performs well when evaluated on compression rate, accuracy, recall, and human criteria such as correctness, novelty, fluency, and usefulness.
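
The idea of dropping sentences that are too similar to ones already kept can be illustrated with a minimal sketch. This is not the model proposed in the paper: it swaps in plain word-overlap (Jaccard) similarity for the semantic and syntactic-pattern analysis described in the abstract, and the function names and the 0.6 threshold are hypothetical choices made only for illustration.

```python
import re


def tokens(sentence):
    """Lowercased word set; a crude stand-in for semantic analysis (assumption)."""
    return set(re.findall(r"[a-z0-9']+", sentence.lower()))


def jaccard(a, b):
    """Word-overlap similarity between two sentences, in the range [0, 1]."""
    ta, tb = tokens(a), tokens(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def drop_redundant(sentences, threshold=0.6):
    """Keep a sentence only if it is not too similar to any sentence already kept.

    The threshold is a hypothetical value, not one taken from the paper.
    """
    kept = []
    for sentence in sentences:
        if all(jaccard(sentence, k) < threshold for k in kept):
            kept.append(sentence)
    return kept


if __name__ == "__main__":
    article = [
        "The storm hit the coast on Monday.",
        "On Monday the storm hit the coast.",
        "Rescue teams arrived two days later.",
    ]
    for line in drop_redundant(article):
        print(line)
```

In this example the second sentence uses exactly the same words as the first, so it is dropped, while the third shares no words with the first and is kept. A similarity measure that accounts for word senses and syntactic relationships, as the abstract describes, would also catch paraphrases that word overlap misses.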


Cited By

  • (2018) A New LSA and Entropy-Based Approach for Automatic Text Document Summarization. International Journal on Semantic Web & Information Systems, 14:4 (1-32). DOI: 10.4018/IJSWIS.2018100101. Online publication date: 1-Oct-2018
  • (2018) Automatic Text Document Summarization Using Graph Based Centrality Measures on Lexical Network. International Journal of Information Retrieval Research, 8:3 (14-32). DOI: 10.4018/IJIRR.2018070102. Online publication date: 1-Jul-2018
  • (2016) Text Summarization Using FrameNet-Based Semantic Graph Model. Scientific Programming, 2016 (5). DOI: 10.1155/2016/5130603. Online publication date: 1-Nov-2016

Information & Contributors

Information

Published In

International Journal of Information Retrieval Research, Volume 3, Issue 4
October 2013
140 pages
ISSN:2155-6377
EISSN:2155-6385

Publisher

IGI Global

United States

Author Tags

  1. Semantic Analysis
  2. Syntactic Analysis
  3. Syntactic Patterns
  4. Text Summarization
  5. Word Sense Disambiguation

Qualifiers

  • Article
