Text mining applications have advanced tremendously because of Web 2.0 and social networking applications. Recent advances in hardware and software technology have led to a number of unique scenarios in which text mining algorithms are learned. Mining Text Data introduces an important niche in the text analytics field, and is an edited volume contributed by leading international researchers and practitioners focused on social networks and data mining. The book covers a wide swath of topics across social networks and data mining. Each chapter contains a comprehensive survey that includes the key research content on the topic and the future directions of research in the field. There is a special focus on text embedded with heterogeneous and multimedia data, which makes the mining process much more challenging. A number of methods, such as transfer learning and cross-lingual mining, have been designed for such cases. Mining Text Data simplifies the content so that advanced-level students, practitioners and researchers in computer science can benefit from this book. Academic and corporate libraries, as well as ACM, IEEE, and Management Science focused on information security, electronic commerce, databases, data mining, machine learning, and statistics are the primary buyers for this reference book.
Cited By
- Boutalbi K, Boutalbi R, Verjus H, Salamatian K, Telisson D and Le Van O IEcons: A New Consensus Approach Using Multi-Text Representations for Clustering Task Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, (3632-3636)
- Setzu M, Corbara S, Monreale A, Moreo A and Sebastiani F (2024). Explainable Authorship Identification in Cultural Heritage Applications, Journal on Computing and Cultural Heritage, 17:3, (1-23), Online publication date: 30-Sep-2024.
- Do Carmo F, Menezes P, Barata B, Jacob A and Lobato F CRM Market Overview: A Case Study of Job Vacancies Proceedings of the 20th Brazilian Symposium on Information Systems, (1-10)
- Md Suhaimin M, Ahmad Hijazi M, Moung E, Nohuddin P, Chua S and Coenen F (2024). Social media sentiment analysis and opinion mining in public security, Journal of King Saud University - Computer and Information Sciences, 35:9, Online publication date: 1-Oct-2023.
- Barbella M and Tortora G (2023). A semi-automatic data integration process of heterogeneous databases, Pattern Recognition Letters, 166:C, (134-142), Online publication date: 1-Feb-2023.
- Ignaczak L, Goldschmidt G, Costa C and Righi R (2021). Text Mining in Cybersecurity, ACM Computing Surveys, 54:7, (1-36), Online publication date: 30-Sep-2022.
- Boutalbi R, Ait-Saada M, Iurshina A, Staab S and Nadif M Tensor-based Graph Modularity for Text Data Clustering Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, (2227-2231)
- Krasniqi R and Do H Automatically Capturing Quality-Related Concerns in Bug Report Descriptions for Efficient Bug Triaging Proceedings of the 26th International Conference on Evaluation and Assessment in Software Engineering, (10-19)
- Obie H, Ilekura I, Du H, Shahin M, Grundy J, Li L, Whittle J and Turhan B On the violation of honesty in mobile apps Proceedings of the 19th International Conference on Mining Software Repositories, (321-332)
- Malgaonkar S, Licorish S and Savarimuthu B (2022). Prioritizing user concerns in app reviews – A study of requests for new features, enhancements and bug fixes, Information and Software Technology, 144:C, Online publication date: 1-Apr-2022.
- Ferreira Mello R, Fiorentino G, Oliveira H, Miranda P, Rakovic M and Gasevic D Towards automated content analysis of rhetorical structure of written essays using sequential content-independent features in Portuguese LAK22: 12th International Learning Analytics and Knowledge Conference, (404-414)
- Subhashini L, Li Y, Zhang J and Atukorale A (2022). Assessing the effectiveness of a three-way decision-making framework with multiple features in simulating human judgement of opinion classification, Information Processing and Management: an International Journal, 59:2, Online publication date: 1-Mar-2022.
- Meena G, Mohbey K and Indian A (2022). Categorizing Sentiment Polarities in Social Networks Data Using Convolutional Neural Network, SN Computer Science, 3:2, Online publication date: 1-Mar-2022.
- Nota G, Postiglione A and Carvello R (2022). Text mining techniques for the management of predictive maintenance, Procedia Computer Science, 200:C, (778-792), Online publication date: 1-Jan-2022.
- Najar F and Bouguila N (2022). Emotion recognition, Engineering Applications of Artificial Intelligence, 107:C, Online publication date: 1-Jan-2022.
- Hosseinzadeh Aghdam M and Daryaie Zanjani M (2022). A novel regularized asymmetric non-negative matrix factorization for text clustering, Information Processing and Management: an International Journal, 58:6, Online publication date: 1-Nov-2021.
- Naseem U, Razzak I, Khan S and Prasad M (2021). A Comprehensive Survey on Word Representation Models: From Classical to State-of-the-Art Word Representation Language Models, ACM Transactions on Asian and Low-Resource Language Information Processing, 20:5, (1-35), Online publication date: 30-Sep-2021.
- Sun W, Zhang S, Balog K, Ren Z, Ren P, Chen Z and de Rijke M Simulating User Satisfaction for the Evaluation of Task-oriented Dialogue Systems Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, (2499-2506)
- Yang T, Hu L, Shi C, Ji H, Li X and Nie L (2021). HGAT: Heterogeneous Graph Attention Networks for Semi-supervised Short Text Classification, ACM Transactions on Information Systems, 39:3, (1-29), Online publication date: 4-Jul-2021.
- Hirsch L, Nuovo A and Haddela P Document Clustering with Evolved Single Word Search Queries 2021 IEEE Congress on Evolutionary Computation (CEC), (280-287)
- Sjögårde P, Ahlgren P and Waltman L (2021). Algorithmic labeling in hierarchical classifications of publications, Journal of the Association for Information Science and Technology, 72:7, (853-869), Online publication date: 9-Jun-2021.
- Arslan Y, Allix K, Veiber L, Lothritz C, Bissyandé T, Klein J and Goujon A A Comparison of Pre-Trained Language Models for Multi-Class Text Classification in the Financial Domain Companion Proceedings of the Web Conference 2021, (260-268)
- Yang C, Liu J and Shi C Extract the Knowledge of Graph Neural Networks and Go Beyond it: An Effective Knowledge Distillation Framework Proceedings of the Web Conference 2021, (1227-1237)
- Zhang Y, Shen Z, Dong Y, Wang K and Han J MATCH: Metadata-Aware Text Classification in A Large Hierarchy Proceedings of the Web Conference 2021, (3246-3257)
- Ragesh R, Sellamanickam S, Iyer A, Bairi R and Lingam V HeteGCN Proceedings of the 14th ACM International Conference on Web Search and Data Mining, (860-868)
- Felicia Ilona K and Budi I Classification of Inundation Level using Tweets in Indonesian Language Proceedings of the 2021 10th International Conference on Software and Computer Applications, (137-143)
- Mohotti W and Nayak R (2020). Efficient Outlier Detection in Text Corpus Using Rare Frequency and Ranking, ACM Transactions on Knowledge Discovery from Data, 14:6, (1-30), Online publication date: 31-Dec-2021.
- Nery L, de Freitas Neto F and Moreira D LiB Proceedings of the 10th Euro-American Conference on Telematics and Information Systems, (1-8)
- Zhao N, Chen J, Wang Z, Peng X, Wang G, Wu Y, Zhou F, Feng Z, Nie X, Zhang W, Sui K and Pei D Real-time incident prediction for online service systems Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, (315-326)
- Chen X, Du C, He X and Wang J JIT2R: A Joint Framework for Item Tagging and Tag-based Recommendation Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, (1681-1684)
- Qian C, Feng F, Wen L, Lin L and Chua T Enhancing Text Classification via Discovering Additional Semantic Clues from Logograms Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, (1201-1210)
- Cai W and Chen L Predicting User Intents and Satisfaction with Dialogue-based Conversational Recommendations Proceedings of the 28th ACM Conference on User Modeling, Adaptation and Personalization, (33-42)
- Suh J, Ghorashi S, Ramos G, Chen N, Drucker S, Verwey J and Simard P (2019). AnchorViz, ACM Transactions on Interactive Intelligent Systems, 10:1, (1-38), Online publication date: 31-Mar-2020.
- Boteanu A, Dutile E, Kiezun A and Artzi S Subjective Search Intent Predictions using Customer Reviews Proceedings of the 2020 Conference on Human Information Interaction and Retrieval, (303-307)
- Silva T, Viana A, Benevenuto F, Villas L, Salles J, Loureiro A and Quercia D (2019). Urban Computing Leveraging Location-Based Social Network Data, ACM Computing Surveys, 52:1, (1-39), Online publication date: 31-Jan-2020.
- Neshatpour K, Homayoun H and Sasan A (2019). ICNN, ACM Transactions on Embedded Computing Systems, 18:6, (1-27), Online publication date: 22-Jan-2020.
- Boudjellal N, Zhang H, Khan A, Ahmad A and Ali S (2020). Biomedical Relation Extraction Using Distant Supervision, Scientific Programming, 2020, Online publication date: 1-Jan-2020.
- Sun Y, Platoš J and Lee C (2020). High-Dimensional Text Clustering by Dimensionality Reduction and Improved Density Peak, Wireless Communications & Mobile Computing, 2020, Online publication date: 1-Jan-2020.
- Shen K, Hao P, Li R and Zhang N (2020). A Compressive Sensing Model for Speeding Up Text Classification, Computational Intelligence and Neuroscience, 2020, Online publication date: 1-Jan-2020.
- Elnagar A, Al-Debsi R and Einea O (2022). Arabic text classification using deep learning models, Information Processing and Management: an International Journal, 57:1, Online publication date: 1-Jan-2020.
- Chen J, Liu H, Yang Y and He J (2019). Effective Selection of a Compact and High-Quality Review Set with Information Preservation, ACM Transactions on Management Information Systems, 10:4, (1-22), Online publication date: 31-Dec-2020.
- Wang S, Aggarwal C and Liu H Beyond word2vec Proceedings of the 28th ACM International Conference on Information and Knowledge Management, (1041-1050)
- Silva V, Bittencourt I and Maldonado J (2019). Automatic Question Classifiers: A Systematic Review, IEEE Transactions on Learning Technologies, 12:4, (485-502), Online publication date: 1-Oct-2019.
- Olsson T, Ericsson M and Wingkvist A Semi-automatic mapping of source code using naive Bayes Proceedings of the 13th European Conference on Software Architecture - Volume 2, (209-216)
- Schouten K, Frasincar F, Dekker R and Riezebos M (2022). Heracles, Expert Systems with Applications: An International Journal, 127:C, (68-84), Online publication date: 1-Aug-2019.
- Bastani K, Namavari H and Shaffer J (2022). Latent Dirichlet allocation (LDA) for topic modeling of the CFPB consumer complaints, Expert Systems with Applications: An International Journal, 127:C, (256-271), Online publication date: 1-Aug-2019.
- Khan J, Alam A, Hussain J and Lee Y (2019). EnSWF, Applied Intelligence, 49:8, (3123-3145), Online publication date: 1-Aug-2019.
- Siow E, Tiropanis T and Hall W (2018). Analytics for the Internet of Things, ACM Computing Surveys, 51:4, (1-36), Online publication date: 31-Jul-2019.
- Kejriwal M, Shao R and Szekely P Expert-Guided Entity Extraction using Expressive Rules Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, (1353-1356)
- Soize C, Ghanem R, Safta C, Huan X, Vane Z, Oefelein J, Lacaze G, Najm H, Tang Q and Chen X (2022). Entropy-based closure for probabilistic learning on manifolds, Journal of Computational Physics, 388:C, (518-533), Online publication date: 1-Jul-2019.
- Tessore J, Esnaola L, Russo C and Baldassarri S Comparative analysis of preprocessing tasks over social media texts in Spanish Proceedings of the XX International Conference on Human Computer Interaction, (1-8)
- Zhao C and He Y Auto-EM: End-to-end Fuzzy Entity-Matching using Pre-trained Deep Models and Transfer Learning The World Wide Web Conference, (2413-2424)
- Song M, Park H and Shin K (2022). Attention-based long short-term memory network using sentiment lexicon embedding for aspect-level sentiment analysis in Korean, Information Processing and Management: an International Journal, 56:3, (637-653), Online publication date: 1-May-2019.
- Solovyev V, Solnyshkina M, Gafiyatova E, McNamara D and Ivanov V Sentiment in Academic Texts Proceedings of the 24th Conference of Open Innovations Association FRUCT, (408-414)
- Liu J (2019). Using big data database to construct new GFuzzy text mining and decision algorithm for targeting and classifying customers, Computers and Industrial Engineering, 128:C, (1088-1095), Online publication date: 1-Feb-2019.
- El-Assady M, Sperrle F, Deussen O, Keim D and Collins C (2018). Visual Analytics for Topic Model Optimization based on User-Steerable Speculative Execution, IEEE Transactions on Visualization and Computer Graphics, 25:1, (374-384), Online publication date: 1-Jan-2019.
- Elnaggar A, Waltl B, Glaser I, Landthaler J, Scepankova E and Matthes F Stop Illegal Comments Proceedings of the 2018 Artificial Intelligence and Cloud Computing Conference, (41-47)
- Karmaker Santu S, Geigle C, Ferguson D, Cope W, Kalantzis M, Searsmith D and Zhai C (2018). SOFSAT, ACM SIGKDD Explorations Newsletter, 20:2, (21-30), Online publication date: 11-Dec-2018.
- Angiani G, Fornacciari P, Lombardo G, Mordonini M, Pietroni U and Tomaiuolo M Automatic processing and classification of citizens' reports Proceedings of the 4th EAI International Conference on Smart Objects and Technologies for Social Good, (310-311)
- Ma T, Li R, Ou G and Yue M (2018). Topic based research competitiveness evaluation, Scientometrics, 117:2, (789-803), Online publication date: 1-Nov-2018.
- Wood A, Rodeghero P, Armaly A and McMillan C Detecting speech act types in developer question/answer conversations during bug repair Proceedings of the 2018 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, (491-502)
- Nik Bakht M, El-Diraby T and Hosseini M (2022). Game-based crowdsourcing to support collaborative customization of the definition of sustainability, Advanced Engineering Informatics, 38:C, (501-513), Online publication date: 1-Oct-2018.
- Gollapalli S and Li X Using PageRank for Characterizing Topic Quality in LDA Proceedings of the 2018 ACM SIGIR International Conference on Theory of Information Retrieval, (115-122)
- Georgakopoulos S, Tasoulis S, Vrahatis A and Plagianakos V Convolutional Neural Networks for Toxic Comment Classification Proceedings of the 10th Hellenic Conference on Artificial Intelligence, (1-6)
- Sohrabi B, Vanani I and Abedin E (2018). Human Resources Management and Information Systems Trend Analysis Using Text Clustering, International Journal of Human Capital and Information Technology Professionals, 9:3, (1-24), Online publication date: 1-Jul-2018.
- Santos I, Araújo J, Lima C, Prudêncio R and Barros F AVS Proceedings of the XIV Brazilian Symposium on Information Systems, (1-7)
- Ashish N and Patawari A (2018). Machine Reading of Biomedical Data Dictionaries, Journal of Data and Information Quality, 9:4, (1-20), Online publication date: 22-May-2018.
- Pandove D, Goel S and Rani R (2018). Systematic Review of Clustering High-Dimensional and Large Datasets, ACM Transactions on Knowledge Discovery from Data, 12:2, (1-68), Online publication date: 30-Apr-2018.
- Moumtzidou A, Andreadis S, Gialampoukidis I, Karakostas A, Vrochidis S and Kompatsiaris I Flood Relevance Estimation from Visual and Textual Content in Social Media Streams Companion Proceedings of the The Web Conference 2018, (1621-1627)
- Di Castro D, Gamzu I, Grabovitch-Zuyev I, Lewin-Eytan L, Pundir A, Sahoo N and Viderman M Automated Extractions for Machine Generated Mail Companion Proceedings of the The Web Conference 2018, (655-662)
- Fruchter N and Liccardi I Consumer Attitudes Towards Privacy and Security in Home Assistants Extended Abstracts of the 2018 CHI Conference on Human Factors in Computing Systems, (1-6)
- Kejriwal M and Szekely P Technology-assisted Investigative Search Extended Abstracts of the 2018 CHI Conference on Human Factors in Computing Systems, (1-9)
- Cao Y, Li H, Luo P and Yao J Towards Automatic Numerical Cross-Checking Proceedings of the 2018 World Wide Web Conference, (1795-1804)
- Karisani P and Agichtein E Did You Really Just Have a Heart Attack? Proceedings of the 2018 World Wide Web Conference, (137-146)
- Peng H, Li J, He Y, Liu Y, Bao M, Wang L, Song Y and Yang Q Large-Scale Hierarchical Text Classification with Recursively Regularized Deep Graph-CNN Proceedings of the 2018 World Wide Web Conference, (1063-1072)
- Nourashrafeddin S, Sherkat E, Minghim R and Milios E (2018). A Visual Approach for Interactive Keyterm-Based Clustering, ACM Transactions on Interactive Intelligent Systems, 8:1, (1-35), Online publication date: 13-Mar-2018.
- Uylaş Satı N, Ordin B and Lanza-Gutiérrez J (2018). Application of the Polyhedral Conic Functions Method in the Text Classification and Comparative Analysis, Scientific Programming, 2018, Online publication date: 1-Jan-2018.
- Park Y, Kim H, Kim D, Lee H, Kim S and Kang P (2017). A deep learning-based sports player evaluation model based on game statistics and news articles, Knowledge-Based Systems, 138:C, (15-26), Online publication date: 15-Dec-2017.
- Tang S, Zhang X, Cryan J, Metzger M, Zheng H and Zhao B (2017). Gender Bias in the Job Market, Proceedings of the ACM on Human-Computer Interaction, 1:CSCW, (1-19), Online publication date: 6-Dec-2017.
- Afful-Dadzie E and Afful-Dadzie A (2017). Liberation of public data, International Journal of Information Management: The Journal for Information Professionals, 37:6, (664-672), Online publication date: 1-Dec-2017.
- Kumar B and Ravi V LDA Based Feature Selection for Document Clustering Proceedings of the 10th Annual ACM India Compute Conference, (125-130)
- Śmieja M, Struski Ł and Tabor J (2017). Semi-supervised model-based clustering with controlled clusters leakage, Expert Systems with Applications: An International Journal, 85:C, (146-157), Online publication date: 1-Nov-2017.
- Bhatia M and Sood S (2017). Game theoretic decision making in IoT-assisted activity monitoring of defence personnel, Multimedia Tools and Applications, 76:21, (21911-21935), Online publication date: 1-Nov-2017.
- Fan S, Jiang M, Shen Z, Koenig B, Kankanhalli M and Zhao Q The Role of Visual Attention in Sentiment Prediction Proceedings of the 25th ACM international conference on Multimedia, (217-225)
- Cirqueira D, Pinheiro M, Braga T, Jacob A, Reinhold O, Alt R and Santana Á Improving relationship management in universities with sentiment analysis and topic modeling of social media channels Proceedings of the International Conference on Web Intelligence, (998-1005)
- Neto J, Yokoyama K and Becker K Studying toxic behavior influence and player chat in an online video game Proceedings of the International Conference on Web Intelligence, (26-33)
- Alharbi A, Li Y and Xu Y Topical term weighting based on extended random sets for relevance feature selection Proceedings of the International Conference on Web Intelligence, (654-661)
- Käfer V Summarizing software engineering communication artifacts from different sources Proceedings of the 2017 11th Joint Meeting on Foundations of Software Engineering, (1038-1041)
- Banchev B Text Mining Based Adaptive Case Management Automation in the Field of Forensic Medicine Proceedings of the 18th International Conference on Computer Systems and Technologies, (111-118)
- Yang E, Grossman D, Frieder O and Yurchak R Effectiveness results for popular e-discovery algorithms Proceedings of the 16th edition of the International Conference on Artificial Intelligence and Law, (261-264)
- Hirsch L and Di Nuovo A Document clustering with evolved search queries 2017 IEEE Congress on Evolutionary Computation (CEC), (1239-1246)
- Cocos A, Qian T, Callison-Burch C and Masino A (2017). Crowd control, Journal of Biomedical Informatics, 69:C, (86-92), Online publication date: 1-May-2017.
- Albishre K, Li Y and Xu Y Effective pseudo-relevance for Microblog retrieval Proceedings of the Australasian Computer Science Week Multiconference, (1-6)
- Law D, Gruss R and Abrahams A (2017). Automated defect discovery for dishwasher appliances from online consumer reviews, Expert Systems with Applications: An International Journal, 67:C, (84-94), Online publication date: 1-Jan-2017.
- D'Addio R and Manzato M Exploiting Item Representations for Soft Clustering Recommendation Proceedings of the 22nd Brazilian Symposium on Multimedia and the Web, (271-278)
- Kanan T and Fox E (2016). Automated arabic text classification with P-Stemmer, machine learning, and a tailored news article taxonomy, Journal of the Association for Information Science and Technology, 67:11, (2667-2683), Online publication date: 1-Nov-2016.
- Schulz C, Nocaj A, El-Assady M, Frey S, Hlawatsch M, Hund M, Karch G, Netzel R, Schätzle C, Butt M, Keim D, Ertl T, Brandes U and Weiskopf D Generative Data Models for Validation and Evaluation of Visualization Techniques Proceedings of the Sixth Workshop on Beyond Time and Errors on Novel Evaluation Methods for Visualization, (112-124)
- Zamani H, Dadashkarimi J, Shakery A and Croft W Pseudo-Relevance Feedback Based on Matrix Factorization Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, (1483-1492)
- Lv X and El-Gohary N (2016). Enhanced context-based document relevance assessment and ranking for improved information retrieval to support environmental decision making, Advanced Engineering Informatics, 30:4, (737-750), Online publication date: 1-Oct-2016.
- Yin J and Wang J A Text Clustering Algorithm Using an Online Clustering Scheme for Initialization Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, (1995-2004)
- Zimmerman C, Madsen R, Eliassen H and Vatrapu R Space vs. Place Proceedings of the 7th 2016 International Conference on Social Media & Society, (1-10)
- Schubotz M, Grigorev A, Leich M, Cohl H, Meuschke N, Gipp B, Youssef A and Markl V Semantification of Identifiers in Mathematics for Better Math Information Retrieval Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, (135-144)
- Zhang C, Fan W, Du N and Yu P Mining User Intentions from Medical Queries Proceedings of the 25th International Conference on World Wide Web, (1373-1384)
- Lossio-Ventura J, Hacid H, Roche M and Poncelet P Communication overload management through social interactions clustering Proceedings of the 31st Annual ACM Symposium on Applied Computing, (1166-1169)
- Bhanuse S, Kamble S and Kakde S (2016). Text Mining Using Metadata for Generation of Side Information, Procedia Computer Science, 78:C, (807-814), Online publication date: 1-Mar-2016.
- Mahmoud A and Bradshaw G (2015). Estimating Semantic Relatedness in Source Code, ACM Transactions on Software Engineering and Methodology, 25:1, (1-35), Online publication date: 2-Dec-2015.
- Babu T, Chatterjee A, Khandeparker S, Subhash A and Gupta S Geographical address classification without using geolocation coordinates Proceedings of the 9th Workshop on Geographic Information Retrieval, (1-10)
- Vranjković V, Struharik R and Novak L (2015). Hardware acceleration of homogeneous and heterogeneous ensemble classifiers, Microprocessors & Microsystems, 39:8, (782-795), Online publication date: 1-Nov-2015.
- Roychoudhury S, Kulkarni V and Bellarykar N Analyzing Document Intensive Business Processes using Ontology Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, (1899-1902)
- Yesha R and Gangopadhyay A A method for analyzing health behavior in online forums Proceedings of the 6th ACM Conference on Bioinformatics, Computational Biology and Health Informatics, (615-621)
- King M, Abrahams A and Ragsdale C (2015). Ensemble learning methods for pay-per-click campaign management, Expert Systems with Applications: An International Journal, 42:10, (4818-4829), Online publication date: 15-Jun-2015.
- Roychoudhury S, Kulkarni V and Bellarykar N Mining enterprise models for knowledgeable decision making Proceedings of the Fourth International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering, (1-6)
- Hauff C and Gousios G Matching GitHub developer profiles to job advertisements Proceedings of the 12th Working Conference on Mining Software Repositories, (362-366)
- Kim H, Hong K and Chang J Semantically enriching text representation model for document clustering Proceedings of the 30th Annual ACM Symposium on Applied Computing, (922-925)
- D'Addio R and Manzato M A sentiment-based item description approach for kNN collaborative filtering Proceedings of the 30th Annual ACM Symposium on Applied Computing, (1060-1065)
- Long J, Wang L, Li Z and Zhang Z Service retrieval based on hybrid SLVM of WSDL Proceedings of the 6th Asia-Pacific Symposium on Internetware, (120-126)
- Wang F, Wang Z, Li Z and Wen J Concept-based Short Text Classification and Ranking Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, (1069-1078)
- Rekha V, Divya N and Bagavathi P A Hybrid Auto-tagging System for StackOverflow Forum Questions Proceedings of the 2014 International Conference on Interdisciplinary Advances in Applied Computing, (1-5)
- Yin J and Wang J A dirichlet multinomial mixture model-based approach for short text clustering Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, (233-242)
- Rezaei M and Fränti P Matching Similarity for Keyword-Based Clustering Proceedings of the Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition - Volume 8621, (193-202)
- Sinoara R, Sundermann C, Marcacini R, Domingues M and Rezende S Named entities as privileged information for hierarchical text clustering Proceedings of the 18th International Database Engineering & Applications Symposium, (57-66)
- Mitankin P, Gerdjikov S and Mihov S An approach to unsupervised historical text normalisation Proceedings of the First International Conference on Digital Access to Textual Cultural Heritage, (29-34)
- Lämmel R, Schmorleiz T and Varanovich A The 101haskell Chrestomathy Proceedings of the 25th symposium on Implementation and Application of Functional Languages, (25-36)
- Aggarwal C (2013). On the equivalence of PLSI and projected clustering, ACM SIGMOD Record, 41:4, (45-50), Online publication date: 17-Jan-2013.
- Bast H, Bäurle F, Buchhold B and Haussmann E A case for semantic full-text search Proceedings of the 1st Joint International Workshop on Entity-Oriented and Semantic Search, (1-3)
- Munaf M, Afzal H, Mahmood K and Iltaf N Low Resource Summarization using Pre-trained Language Models, ACM Transactions on Asian and Low-Resource Language Information Processing, 0:0
- Ahmed K, Tazi N and Hossny A Sentiment Analysis over Social Networks: An Overview 2015 IEEE International Conference on Systems, Man, and Cybernetics, (2174-2179)
- Boyko A, Kaidina A, Kim Y, Lupatov A, Panov A, Suvorov R and Shvets A A framework for automated meta-analysis: Dendritic cell therapy case study 2016 IEEE 8th International Conference on Intelligent Systems (IS), (160-166)