Abstract
Searching for a specific and meaningful piece of information in the humongous textual data volumes found on the internet and knowledge repositories is a very challenging task. This problem is usually constrained to answering simple, factoid questions by resorting to a question answering (QA) system built on top of complex approaches such as heuristics, information retrieval, and machine learning. More precisely, deep learning methods became into sharp focus of this research field because such purposes can realize the benefits of the vast amounts of data to boost the practical results of QA systems. In this paper, we present a systematic survey on deep learning-based QA systems concerning factoid questions, with particular focus on how each existing system addresses their critical features in terms of learning end-to-end models. We also detail the evaluation process carried out on these systems and discuss how each approach differs from the others in terms of the challenges tackled and the strategies employed. Finally, we present the most prominent research problems still open in the field.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Abualigah LM, Khader AT, Hanandeh ES (2018) A combination of objective functions and hybrid krill herd algorithm for text document clustering analysis. Eng Appl Artif Intell 73:111–125
Abualigah LMQ (2019) Feature selection and enhanced krill herd algorithm for text document clustering. Springer, Berlin
Abualigah LMQ, Hanandeh ES (2015) Applying genetic algorithms to information retrieval using vector space model. Int J Comput Sci Eng Appl 5(1):19
ACM (2013) Acm digital library. http://dl.acm.org
Aghaebrahimian A, Jurčíček F (2016) Open-domain factoid question answering via knowledge graph search. In: Proceedings of the workshop on human-computer question answering, pp 22–28
Bast H, Haussmann E (2015) More accurate question answering on freebase. In: Proceedings of the 24th ACM international on conference on information and knowledge management, ACM, pp 1431–1440
Berant J, Chou A, Frostig R, Liang P (2013) Semantic parsing on freebase from question-answer pairs. In: Proceedings of the 2013 conference on empirical methods in natural language processing, pp 1533–1544
Bollacker K, Evans C, Paritosh P, Sturge T, Taylor J (2008) Freebase: a collaboratively created graph database for structuring human knowledge. In: Proceedings of the 2008 ACM SIGMOD international conference on Management of data, AcM, pp 1247–1250
Boné R, Crucianu M, de Beauville JPA (2002) Learning long-term dependencies by the selective addition of time-delayed connections to recurrent neural networks. Neurocomputing 48(1–4):251–266
Bordes A, Chopra S, Weston J (2014) Question answering with subgraph embeddings. arXiv preprint arXiv:14063676
Bordes A, Usunier N, Chopra S, Weston J (2015) Large-scale simple question answering with memory networks. arXiv preprint arXiv:150602075
Buchholz S, Daelemans W (2001) Complex answers: a case study using a www question answering system. Natl Lang Eng 7(4):301–323
Buzaaba H, Amagasa T (2019) A modular approach for efficient simple question answering over knowledge base. In: International conference on database and expert systems applications, Springer, pp 237–246
Camacho-Collados J, Pilehvar MT (2018) From word to sense embeddings: a survey on vector representations of meaning. J Artif Intell Res 63:743–788. https://doi.org/10.1613/jair.1.11259
Cho K, Van Merriënboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y (2014) Learning phrase representations using rnn encoder-decoder for statistical machine translation. arXiv preprint arXiv:14061078
Cimiano P, Minock M (2009) Natural language interfaces: what is the problem?–a data-driven quantitative analysis. In: International conference on application of natural language to information systems, Springer, pp 192–206
Cimiano P, Lopez V, Unger C, Cabrio E, Ngomo ACN, Walter S (2013) Multilingual question answering over linked data (qald-3): Lab overview. In: International conference of the cross-language evaluation forum for european languages, Springer, pp 321–332
Dai Z, Li L, Xu W (2016) Cfo: Conditional focused neural question answering with large-scale knowledge bases. arXiv preprint arXiv:160601994
Devlin J, Chang MW, Lee K, Toutanova K (2018) Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:181004805
Diefenbach D, Lopez V, Singh K, Maret P (2018) Core techniques of question answering systems over knowledge bases: a survey. Knowl Inf Syst 55(3):529–569
Direct S (2013) Science direct. http://www.sciencedirect.com/
Goldberg Y (2017) Neural network methods for natural language processing. Synth Lect Human Lang Technol 10(1):1–309
Golub D, He X (2016) Character-level question answering with attention. arXiv preprint arXiv:160400727
Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT Press, Cambridge, http://www.deeplearningbook.org
Graves A, Mohamed Ar, Hinton G (2013) Speech recognition with deep recurrent neural networks. In: 2013 IEEE international conference on acoustics, speech and signal processing, IEEE, pp 6645–6649
Hakimov S, Unger C, Walter S, Cimiano P (2015) Applying semantic parsing to question answering over linked data: Addressing the lexical gap. In: International conference on applications of natural language to information systems, Springer, pp 103–109
Hannun A, Case C, Casper J, Catanzaro B, Diamos G, Elsen E, Prenger R, Satheesh S, Sengupta S, Coates A, et al. (2014) Deep speech: scaling up end-to-end speech recognition. arXiv preprint arXiv:14125567
He K, Zhang X, Ren S, Sun J (2015) Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In: Proceedings of the IEEE international conference on computer vision, pp 1026–1034
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
Hochreiter S, Bengio Y, Frasconi P, Schmidhuber J, et al. (2001) Gradient flow in recurrent nets: the difficulty of learning long-term dependencies
Höffner K, Walter S, Marx E, Usbeck R, Lehmann J, Ngonga Ngomo AC (2017) Survey on challenges of question answering in the semantic web. Semantic Web 8(6):895–920
Hu B, Lu Z, Li H, Chen Q (2014) Convolutional neural network architectures for matching natural language sentences. In: Advances in neural information processing systems, pp 2042–2050
IEEXplore (2013) Ieeexplore digital library. http://ieeexplore.ieee.org/Xplore/home.jsp
Jain S (2016) Question answering over knowledge base using factual memory networks. In: Proceedings of the NAACL student research workshop, pp 109–115
Joulin A, Grave E, Bojanowski P, Nickel M, Mikolov T (2017) Fast linear model for knowledge graph embeddings. arXiv preprint arXiv:171010881
Kalchbrenner N, Grefenstette E, Blunsom P (2014) A convolutional neural network for modelling sentences. arXiv preprint arXiv:14042188
Kumar A, Irsoy O, Ondruska P, Iyyer M, Bradbury J, Gulrajani I, Zhong V, Paulus R, Socher R (2016) Ask me anything: Dynamic memory networks for natural language processing. In: International conference on machine learning, pp 1378–1387
Lafferty J, McCallum A, Pereira FC (2001) Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proceedings of the eighteenth international conference on machine learning, Morgan Kaufmann, pp 282–289
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436
Lopez V, Unger C, Cimiano P, Motta E (2013) Evaluating question answering over linked data. Web Semant Sci Serv Agents World Wide Web 21:3–13
Lukovnikov D, Fischer A, Lehmann J, Auer S (2017) Neural network-based question answering over knowledge graphs on word and character level. In: Proceedings of the 26th international conference on World Wide Web, international world wide web conferences steering committee, pp 1211–1220
Lukovnikov D, Fischer A, Lehmann J (2019) Pretrained transformers for simple question answering over knowledge graphs. In: International semantic web conference, Springer, pp 470–486
Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. arXiv preprint arXiv:13013781
Mohammed S, Shi P, Lin J (2017) Strong baselines for simple question answering over knowledge graphs with and without neural networks. arXiv preprint arXiv:171201969
Pai M, McCulloch M, Gorman JD, Pai N, Enanoria W, Kennedy G, Tharyan P, Colford JJ (2004) Systematic reviews and meta-analyses: an illustrated, step-by-step guide. Natl Med J India 17(2):86–95
Pennington J, Socher R, Manning CD (2014) Glove: global vectors for word representation. In: Empirical methods in natural language processing (EMNLP), pp 1532–1543. http://www.aclweb.org/anthology/D14-1162
Peters ME, Neumann M, Iyyer M, Gardner M, Clark C, Lee K, Zettlemoyer L (2018) Deep contextualized word representations. arXiv preprint arXiv:180205365
Petrochuk M, Zettlemoyer L (2018) Simplequestions nearly solved: a new upperbound and baseline approach. arXiv preprint arXiv:180408798
Qin P, Xu W, Guo J (2016) An empirical convolutional neural network approach for semantic relation classification. Neurocomputing 190:1–9
Rajpurkar P, Zhang J, Lopyrev K, Liang P (2016) Squad: 100,000+ questions for machine comprehension of text. arXiv preprint arXiv:160605250
Rao Y, Lu J, Zhou J (2017) Attention-aware deep reinforcement learning for video face recognition. In: Proceedings of the IEEE international conference on computer vision, pp 3931–3940
Rush AM, Chopra S, Weston J (2015) A neural attention model for abstractive sentence summarization. arXiv preprint arXiv:150900685
Sak H, Senior A, Rao K, Beaufays F (2015) Fast and accurate recurrent neural network acoustic models for speech recognition. arXiv preprint arXiv:150706947
Schaefer AM, Udluft S, Zimmermann HG (2008) Learning long-term dependencies with recurrent neural networks. Neurocomputing 71(13–15):2481–2488
Scopus (2013) Scopus. http://www.scopus.com/
Severyn A, Moschitti A (2015) Learning to rank short text pairs with convolutional deep neural networks. In: Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval, ACM, New York, NY, USA, SIGIR ’15, pp 373–382, 10.1145/2766462.2767738,
Sharma Y, Gupta S (2018) Deep learning approaches for question answering system. Procedia Computer Science 132:785–794
Suen CY (1979) N-gram statistics for natural language understanding and text processing. IEEE Trans Pattern Anal Mach Intell PAMI 1(2):164–172
Sukhbaatar S, Weston J, Fergus R, et al. (2015) End-to-end memory networks. In: Advances in neural information processing systems, pp 2440–2448
Sutskever I, Vinyals O, Le QV (2014) Sequence to sequence learning with neural networks. In: Advances in neural information processing systems, pp 3104–3112
Ture F, Jojic O (2017) No need to pay attention: Simple recurrent neural networks work!(for answering“ simple” questions). arXiv preprint arXiv:160605029
Unger C, Forascu C, Lopez V, Ngomo ACN, Cabrio E, Cimiano P, Walter S (2014) Question answering over linked data (qald-4). In: Working Notes for CLEF 2014 Conference
Wang P, Xu B, Xu J, Tian G, Liu CL, Hao H (2016) Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification. Neurocomputing 174:806–814
Wani MA, Bhat FA, Afzal S, Khan AI (2020) Advances in deep learning. Springer, Berlin
Weston J, Chopra S, Bordes A (2014) Memory networks. arXiv preprint arXiv:14103916
Yao X, Van Durme B (2014) Information extraction over structured data: Question answering with freebase. In: Proceedings of the 52nd annual meeting of the association for computational linguistics, vol 1, pp 956–966
Yin W, Yu M, Xiang B, Zhou B, Schütze H (2016) Simple question answering by attentive convolutional neural network. arXiv preprint arXiv:160603391
Zhou J, Xu W (2015) End-to-end learning of semantic role labeling using recurrent neural networks. In: Proceedings of the 53rd annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing, vol 1, Long Papers, Association for computational linguistics, Beijing, China, pp 1127–1137, https://doi.org/10.3115/v1/P15-1109. https://www.aclweb.org/anthology/P15-1109
Zhu S, Cheng X, Su S, Lang S (2017) Knowledge-based question answering by jointly generating, copying and paraphrasing. In: Proceedings of the 2017 ACM on conference on information and knowledge management, ACM, pp 2439–2442
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
da Silva, J.W.F., Venceslau, A.D.P., Sales, J.E. et al. A short survey on end-to-end simple question answering systems. Artif Intell Rev 53, 5429–5453 (2020). https://doi.org/10.1007/s10462-020-09826-5
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10462-020-09826-5