[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Combining Information Extraction with Genetic Algorithms for Text Mining

Published: 01 May 2004 Publication History

Abstract

Text mining discovers unseen patterns in textual databases. But these discoveries are useless unless they contribute valuable knowledge for users who make strategic decisions. Confronting this issue can lead to a complicated activity called knowledge discovery from texts, which deals with both discovering unseen knowledge and evaluating this potentially valuable knowledge. KDT can benefit from techniques that have been useful in data mining or knowledge discovery from databases. However, we can't immediately apply data mining techniques to text data for text mining because they assume a structure in the source data that isn't in free text. We must therefore use new representations for text data. An evolutionary approach that combines information extraction technology and genetic algorithms can produce a new, integrated hypothesis for text mining.

References

[1]
J. Han and M. Kamber, Data Mining: Concepts and Techniques, Morgan Kaufmann, 2001.
[2]
D. Swanson, "On the Fragmentation of Knowledge, the Connection Explosion, and Assembling Other People's Ideas," Bull. of the Am. Soc. for Information Science and Technology, vol. 27, no. 3, Feb./Mar. 2001, pp. 12-14; www.asis.org/Bulletin/Mar-01/ swanson.html.
[3]
M. Hearst, "Automated Discovery of WordNet Relations," WordNet: An Electronic Lexical Database, MIT Press, 1998, pp. 131-151.
[4]
C. Jacquemin, "Syntagmatic and Paradigmatic Representation of Terms Variation," Proc. 37th Ann. Meeting Assoc. for Computational Linguistics, Assoc. for Computational Linguistics, 1999, pp. 341-348.
[5]
S. Basu, et al., "Using Lexical Knowledge to Evaluate the Novelty of Rules Mined from Text," Proc. NAACL 2001 Workshop WordNet and Other Lexical Resources: Applications, Extensions and Customizations, Assoc. for Computational Linguistics, 2001, pp. 144- 149.
[6]
K. Deb, Multiobjective Optimization Using Evolutionary Algorithms, John Wiley & Sons, 2001.
[7]
S. Teufel and M. Moens, "Discourse-Level Argumentation in Scientific Articles: Human and Automatic Annotation," Proc. ACL 1999 Workshop towards Standards and Tools for Discourse Tagging, Assoc. for Computational Linguistics, 1999.
[8]
T. Landauer P. Foltz and D. Laham, "An Introduction to Latent Semantic Analysis," Discourse Processes, vol. 10, no. 25, 1998, pp. 259-284.
[9]
W. Kintsch, "Predication," Cognitive Science, vol. 25, no. 2, 2001, pp. 173-202.
[10]
P. Foltz W. Kintsch and T. Landauer, "The Measurement of Textual Coherence with Latent Semantic Analysis," Discourse Processes, vol. 25, no. 2, 1998, pp. 259-284.
[11]
S. Basu, et al., "Evaluating the Novelty of Text-Mined Rules Using Lexical Knowledge," Proc. 7th Int'l Conf. Knowledge Discovery and Data Mining, ACM Press, Aug. 2001, pp. 233-238.
[12]
E. Zitzler and L. Thiele, An Evolutionary Algorithm for Multiobjective Optimisation: The Strength Pareto Approach, tech. report 43, Swiss Federal Inst. of Technology (ETH), 1998.
[13]
M. Mitchell, An Introduction to Genetic Algorithms, MIT Press, 1996.

Cited By

View all
  • (2023)Edge detection algorithm in complex image text information extractionJournal of Computational Methods in Sciences and Engineering10.3233/JCM-22672223:3(1381-1393)Online publication date: 30-May-2023
  • (2023)Ontology-Based Similarity Computation of Two Sentences Using Word-Net DatabaseNew Generation Computing10.1007/s00354-023-00228-z41:3(723-737)Online publication date: 18-Aug-2023
  • (2022)A Complete Process of Text Classification System Using State-of-the-Art NLP ModelsComputational Intelligence and Neuroscience10.1155/2022/18836982022Online publication date: 1-Jan-2022
  • Show More Cited By
  1. Combining Information Extraction with Genetic Algorithms for Text Mining

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image IEEE Intelligent Systems
    IEEE Intelligent Systems  Volume 19, Issue 3
    May 2004
    93 pages

    Publisher

    IEEE Educational Activities Department

    United States

    Publication History

    Published: 01 May 2004

    Author Tags

    1. genetic algorithms
    2. knowledge discovery from texts
    3. multiobjective optimization
    4. semantic analysis
    5. text mining

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 18 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Edge detection algorithm in complex image text information extractionJournal of Computational Methods in Sciences and Engineering10.3233/JCM-22672223:3(1381-1393)Online publication date: 30-May-2023
    • (2023)Ontology-Based Similarity Computation of Two Sentences Using Word-Net DatabaseNew Generation Computing10.1007/s00354-023-00228-z41:3(723-737)Online publication date: 18-Aug-2023
    • (2022)A Complete Process of Text Classification System Using State-of-the-Art NLP ModelsComputational Intelligence and Neuroscience10.1155/2022/18836982022Online publication date: 1-Jan-2022
    • (2021)Research on anti-conflict extraction method of multimedia video information based on machine learningMultimedia Tools and Applications10.1007/s11042-019-07755-280:15(22701-22718)Online publication date: 1-Jun-2021
    • (2020)An Approach to Mine SBVR Vocabularies and Rules from Business DocumentsProceedings of the 13th Innovations in Software Engineering Conference (formerly known as India Software Engineering Conference)10.1145/3385032.3385046(1-11)Online publication date: 27-Feb-2020
    • (2020)RETRACTED ARTICLE: Medical image analysis of phosphorylated protein interaction extraction algorithm based on text mining technologyMultimedia Tools and Applications10.1007/s11042-019-07853-179:15-16(10551-10579)Online publication date: 1-Apr-2020
    • (2020)A Context-Aware Computing Method of Sentence Similarity Based on Frame SemanticsAdvanced Data Mining and Applications10.1007/978-3-030-65390-3_9(114-126)Online publication date: 12-Nov-2020
    • (2018)Relation Identification in Business Rules for Domain-specific DocumentsProceedings of the 11th Innovations in Software Engineering Conference10.1145/3172871.3172884(1-5)Online publication date: 9-Feb-2018
    • (2018)Generating Hard to Comprehend Fake Documents for Defensive Cyber DeceptionIEEE Intelligent Systems10.1109/MIS.2018.287727733:5(16-25)Online publication date: 1-Sep-2018
    • (2017)An Approach to Mine Business Rule Intents from Domain-specific DocumentsProceedings of the 10th Innovations in Software Engineering Conference10.1145/3021460.3021470(96-106)Online publication date: 5-Feb-2017
    • Show More Cited By

    View Options

    View options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media