[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2664551.2664579acmotherconferencesArticle/Chapter ViewAbstractPublication PageswebmediaConference Proceedingsconference-collections
research-article

Sentiment Analysis for Streams of Web Data: A Case Study of Brazilian Financial Markets

Published: 18 November 2014 Publication History

Abstract

With the rise of Web 2.0 applications, most people started consuming information and sharing opinions and ideas about most aspects of their lives on a variety of social media platforms, creating massive and continuous streams of valuable data. While this opened the door for information extraction and mining techniques that can help us understand different aspects of society, extracting useful information from such streams of Web data is far from trivial. In this setting, sentiment analysis techniques can be convenient as they are capable of summarizing general feeling about entities people care about, such as products and companies. Therefore, they can be quite applicable in scenarios like the stock market, which also has tremendous impact on society. This paper describes and evaluates two different techniques for sentiment analysis applied to the Brazilian stock market data: lexicon-based and machine learning based, considering a wide range of text pre-processing and feature selection approaches.

References

[1]
Seu futuro bem investido - relatorio anual bm&fbovespa 2013. http://ri.bmfbovespa.com.br/ptb/2002/RELATORIOWEBPORTUGUESFINAL.pdf. Accessed: 2014-07-20.
[2]
Ten largest stock exchanges in the world by market capitalization in 2011. http://www.world-stock-exchanges.net/top10.html. Accessed: 2014-07-20.
[3]
World's largest stock exchanges. http://www.relbanks.com/stock-exchanges/largest-stock-exchanges. Accessed: 2014-07-20.
[4]
S. Bird, E. Klein, and E. Loper. Natural Language Processing with Python. O'Reilly Media, Inc., 1st edition, 2009.
[5]
J. Bollen, H. Mao, and X. Zeng. Twitter mood predicts the stock market. Journal of Computational Science, 2(1):1--8, 2011.
[6]
W. F. M. D. Bondt and R. Thaler. Does the stock market overreact? The Journal of Finance, 40(3):793--805, 1986.
[7]
N.-F. Chen, R. Roll, and S. A. Ross. Economic forces and the stock market. The Journal of Business, 59(3):83--403, 1986.
[8]
E. F. Fama. The behavior of stock-market prices. The Journal of Business, 38(1):34--105, 1965.
[9]
R. Feldman. Techniques and applications for sentiment analysis. Commun. ACM, 56(4):82--89, Apr. 2013.
[10]
E. Fonseca and J. Rosa. A two-step convolutional neural network approach for semantic role labeling. In Neural Networks (IJCNN), The 2013 International Joint Conference on, pages 1--7, Aug 2013.
[11]
S. Goel, J. M. Hofman, S. Lahaie, D. M. Pennock, and D. J. Watts. Predicting consumer behavior with web search. Proceedings of the National Academy of Sciences, 2010.
[12]
M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten. The weka data mining software: An update. SIGKDD Explor. Newsl., 11(1):10--18, Nov. 2009.
[13]
T. J. P. Lopes, G. K. L. Hiratani, F. J. Barth, O. Rodrigues, Jr., and J. M. Pinto. Mineracao de opinioes aplicada a analise de investimentos. In Companion Proceedings of the XIV Brazilian Symposium on Multimedia and the Web, WebMedia'08, pages 117--120, Vila Velha, Espírito Santo, Brazil, 2008.
[14]
C. D. Manning and H. Schütze. Foundations of Statistical Natural Language Processing. MIT Press, Cambridge, MA, USA, 1999.
[15]
P. P. Balage Filho, T. A. S. Pardo, and S. M. Alusio. An evaluation of the brazilian portuguese liwc dictionary for sentiment analysis. In In 9th Brazilian Symposium in Information and Human Language Technology, Fortaleza, Ceara, 2013.
[16]
B. Pang and L. Lee. Opinion mining and sentiment analysis. Found. Trends Inf. Retr., 2(1-2):1--135, Jan. 2008.
[17]
B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up?: Sentiment classification using machine learning techniques. In Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing - Volume 10, EMNLP'02, pages 79--86, Stroudsburg, PA, USA, 2002. Association for Computational Linguistics.
[18]
M. J. Silva, P. Carvalho, and L. Sarmento. Building a sentiment lexicon for social judgement mining. In Proceedings of the 10th International Conference on Computational Processing of the Portuguese Language, PROPOR'12, pages 218--228, Berlin, Heidelberg, 2012. Springer-Verlag.
[19]
M. J. Silva and R. TEAM. Notas sobre a realizacao e qualidade do twitometro. Technical report, University of Lisbon, Faculty of Sciences, LASIGE, Lisbon, Portugal, May 2011.
[20]
M. Souza and R. Vieira. Sentiment analysis on twitter data for portuguese language. In H. Caseli, A. Villavicencio, A. Teixeira, and F. Perdigao, editors, Computational Processing of the Portuguese Language, volume 7243 of Lecture Notes in Computer Science, pages 241--247. Springer Berlin Heidelberg, 2012.
[21]
M. Taboada, J. Brooke, M. Tofiloski, K. Voll, and M. Stede. Lexicon-based methods for sentiment analysis. Comput. Linguist., 37(2):267--307, June 2011.
[22]
J. Yi, T. Nasukawa, R. Bunescu, and W. Niblack. Sentiment analyzer: Extracting sentiments about a given topic using natural language processing techniques. In Proceedings of the Third IEEE International Conference on Data Mining, ICDM'03, pages 427--, Washington, DC, USA, 2003. IEEE Computer Society.

Cited By

View all
  • (2023)Feasibility analysis of machine learning for performance-related attributional statementsInternational Journal of Accounting Information Systems10.1016/j.accinf.2022.10059748(100597)Online publication date: Mar-2023
  • (2023)Sentiment analysis in Portuguese tweets: an evaluation of diverse word representation modelsLanguage Resources and Evaluation10.1007/s10579-023-09661-458:1(223-272)Online publication date: 28-Jun-2023
  • (2022)Real-Time Text Classification of User-Generated Content on Social Media: Systematic ReviewIEEE Transactions on Computational Social Systems10.1109/TCSS.2021.31201389:4(1154-1166)Online publication date: Aug-2022
  • Show More Cited By

Index Terms

  1. Sentiment Analysis for Streams of Web Data: A Case Study of Brazilian Financial Markets

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      WebMedia '14: Proceedings of the 20th Brazilian Symposium on Multimedia and the Web
      November 2014
      256 pages
      ISBN:9781450332309
      DOI:10.1145/2664551
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      • SBC: Brazilian Computer Society

      In-Cooperation

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 18 November 2014

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. brazilian stock market
      2. lexicon
      3. machine learning
      4. sentiment analysis
      5. social data
      6. web and social networks

      Qualifiers

      • Research-article

      Conference

      WebMedia'14
      Sponsor:
      • SBC
      WebMedia'14: 20th Brazilian Symposium on Multimedia and the Web
      November 18 - 21, 2014
      João Pessoa, Brazil

      Acceptance Rates

      WebMedia '14 Paper Acceptance Rate 25 of 86 submissions, 29%;
      Overall Acceptance Rate 270 of 873 submissions, 31%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)4
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 25 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Feasibility analysis of machine learning for performance-related attributional statementsInternational Journal of Accounting Information Systems10.1016/j.accinf.2022.10059748(100597)Online publication date: Mar-2023
      • (2023)Sentiment analysis in Portuguese tweets: an evaluation of diverse word representation modelsLanguage Resources and Evaluation10.1007/s10579-023-09661-458:1(223-272)Online publication date: 28-Jun-2023
      • (2022)Real-Time Text Classification of User-Generated Content on Social Media: Systematic ReviewIEEE Transactions on Computational Social Systems10.1109/TCSS.2021.31201389:4(1154-1166)Online publication date: Aug-2022
      • (2022)Soybean Price Trend Forecast Using Deep Learning Techniques Based on Prices and Text SentimentsInformation and Communication Technologies for Agriculture—Theme II: Data10.1007/978-3-030-84148-5_10(235-266)Online publication date: 18-Mar-2022
      • (2019)The influence of tweets and news on the brazilian stock market through sentiment analysisProceedings of the 25th Brazillian Symposium on Multimedia and the Web10.1145/3323503.3349564(385-392)Online publication date: 29-Oct-2019
      • (2018)Creating a social media-based personal emotional lexiconProceedings of the 24th Brazilian Symposium on Multimedia and the Web10.1145/3243082.3264668(261-264)Online publication date: 16-Oct-2018
      • (2018)Spatio-Temporal Trend Analysis of the Brazilian Elections Based on Twitter Data2018 IEEE International Conference on Data Mining Workshops (ICDMW)10.1109/ICDMW.2018.00192(1355-1360)Online publication date: Nov-2018

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media