[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2389707.2389713acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Towards large-scale twitter mining for drug-related adverse events

Published: 29 October 2012 Publication History

Abstract

Drug-related adverse events pose substantial risks to patients who consume post-market or Drug-related adverse events pose substantial risks to patients who consume post-market or investigational drugs. Early detection of adverse events benefits not only the drug regulators, but also the manufacturers for pharmacovigilance. Existing methods rely on patients' "spontaneous" self-reports that attest problems. The increasing popularity of social media platforms like the Twitter presents us a new information source for finding potential adverse events. Given the high frequency of user updates, mining Twitter messages can lead us to real-time pharmacovigilance. In this paper, we describe an approach to find drug users and potential adverse events by analyzing the content of twitter messages utilizing Natural Language Processing (NLP) and to build Support Vector Machine (SVM) classifiers. Due to the size nature of the dataset (i.e., 2 billion Tweets), the experiments were conducted on a High Performance Computing (HPC) platform using MapReduce, which exhibits the trend of big data analytics. The results suggest that daily-life social networking data could help early detection of important patient safety issues.

References

[1]
E. Aramaki, S. Maskawa, and M. Morita. Twitter catches the flu: detecting influenza epidemics using twitter. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP '11, pages 1568--1576, Stroudsburg, PA, USA, 2011. Association for Computational Linguistics.
[2]
A. R. Aronson. Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program. Proc AMIA Symp, pages 17--21, 2001.
[3]
B. E. Boser, I. M. Guyon, and V. N. Vapnik. A training algorithm for optimal margin classifiers. In Proceedings of the fifth annual workshop on Computational learning theory, COLT '92, pages 144--152, New York, NY, USA, 1992. ACM.
[4]
C.-C. Chang and C.-J. Lin. Libsvm: A library for support vector machines. ACM Trans. Intell. Syst. Technol., 2:27:1--27:27, May 2011.
[5]
B. W. Chee, R. Berlin, and B. Schatz. Predicting adverse drug events from personal health messages. AMIA Annu Symp Proc, 2011:217--226, 2011.
[6]
Y. Chen and C. Lin. Combining svms with various feature selection strategies. Feature Extraction, pages 315--324, 2006.
[7]
Y.-W. Chen and C.-J. Lin. Combining SVMs with various feature selection strategies. In I. Guyon, S. Gunn, M. Nikravesh, and L. Zadeh, editors, Feature extraction, foundations and applications. Springer, 2006.
[8]
N. K. e. a. Cobb. Online social networks and smoking cessation: a scientific research agenda. J. Med. Internet Res., 13:e119, 2011.
[9]
C. Cortes and V. Vapnik. Support-vector networks. Mach. Learn., 20:273--297, September 1995.
[10]
S. C. A. S. Course. Statistical Concepts: A Second Course. Richard G. Lomax and Debbie L. Hahs-Vaughn, 3 edition, 2007.
[11]
A. Culotta. Towards detecting influenza epidemics by analyzing twitter messages. In Proceedings of the First Workshop on Social Media Analytics, SOMA '10, pages 115--122, New York, NY, USA, 2010. ACM.
[12]
European Medicines Agency. Eudravigilance -- pharmacovigilance in eea. http://eudravigilance.ema.europa.eu/human/index.asp, March 2012.
[13]
C. Friedman. Discovering novel adverse drug events using natural language processing and mining of the electronic health record. In Proceedings of the 12th Conference on Artificial Intelligence in Medicine: Artificial Intelligence in Medicine, AIME '09, pages 1--5, Berlin, Heidelberg, 2009. Springer-Verlag.
[14]
S. A. Golder and M. W. Macy. Diurnal and seasonal mood vary with work, sleep, and daylength across diverse cultures. Science, 333(6051):1878--1881, Sep 2011.
[15]
C. W. Hsu, C. C. Chang, and C. J. Lin. A practical guide to support vector classification. Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan, 2003.
[16]
A. Iyengar, T. Finin, and A. Joshi. Content-based prediction of temporal boundaries for events in twitter. In SocialCom/PASSAT, pages 186--191, 2011.
[17]
J. P. Juergens, S. L. Szeinbach, M. J. Janssen, T. R. Brown, and D. D. Garner. An evaluation of interventions designed to stimulate physician reporting of adverse drug events. Top Hosp Pharm Manage, 12(2):12--18, Jul 1992.
[18]
M. Kuhn, M. Campillos, I. Letunic, L. J. Jensen, and P. Bork. A side effect resource to capture phenotypic effects of drugs. Mol. Syst. Biol., 6:343, 2010.
[19]
M. McCandless. chromium-compact-language-detector. http://code.google.com/p/chromium-compact-language-detector/, March 2012.
[20]
A. T. McCray, A. Burgun, and O. Bodenreider. Aggregating UMLS semantic types for reducing conceptual complexity. Stud Health Technol Inform, 84:216--220, 2001.
[21]
E. A. Millman, P. J. Pronovost, M. A. Makary, and A. W. Wu. Patient-assisted incident reporting: including the patient in patient safety. J Patient Saf, 7(2):106--108, Jun 2011.
[22]
B. O'Connor, R. Balasubramanyan, B. R. Routledge, and N. A. Smith. From tweets to polls: Linking text sentiment to public opinion time series. In Proceedings of the Fourth International Conference on Weblogs and Social Media, 2010.
[23]
M. J. Paul and M. Dredze. You are what you tweet : Analyzing twitter for public health. Artificial Intelligence, 38:265--272, 2011.
[24]
K. W. Prier, M. S. Smith, C. Giraud-Carrier, and C. L. Hanson. Identifying health-related topics on twitter: an exploration of tobacco-related tweets as a test topic. In Proceedings of the 4th international conference on Social computing, behavioral-cultural modeling and prediction, SBP'11, pages 18--25, Berlin, Heidelberg, 2011. Springer-Verlag.
[25]
O. Scharf and A. D. Colevas. Adverse event reporting in publications compared with sponsor database for cancer clinical trials. J. Clin. Oncol., 24(24):3933--3938, Aug 2006.
[26]
P. L. Schuyler, W. T. Hole, M. S. Tuttle, and D. D. Sherertz. The UMLS Metathesaurus: representing different views of biomedical concepts. Bull Med Libr Assoc, 81:217--222, Apr 1993.
[27]
The Apache Software Foundation. Apache lucene core, March 2012.
[28]
A. Tumasjan, T. O. Sprenger, P. G. Sandner, and I. M. Welpe. Predicting elections with twitter: What 140 characters reveal about political sentiment. In W. W. Cohen and S. Gosling, editors, ICWSM. The AAAI Press, 2010.
[29]
Twitter. Rest api resources. https://dev.twitter.com/docs/api, March 2012.
[30]
U.S. Food and Drug Administration. Adverse event reporting system (aers). http://www.fda.gov/drugs/, March 2012.
[31]
U.S. National Institutes of Health. Clinicaltrails.gov. http://clinicaltrials.gov/, March 2012.
[32]
U.S. National Library of Medicine. Current semantic types. http://www.nlm.nih.gov/research/umls/META3_current_semantic_types.html, March 2012.
[33]
W. Wang, K. Haerian, H. Salmasian, R. Harpaz, H. Chase, and C. Friedman. A drug-adverse event extraction algorithm to support pharmacovigilance knowledge mining from citations. AMIA Annu Symp Proc, 2011:1464--1470, 2011.

Cited By

View all
  • (2024)A Hybrid Neural Network Model with Entity-related Knowledge for Adverse Drug Reaction Detection2024 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)10.1109/BIBM62325.2024.10822078(7117-7120)Online publication date: 3-Dec-2024
  • (2024)Using Social Media as a Source of Real-World Data for Pharmaceutical Drug Development and Regulatory Decision MakingDrug Safety10.1007/s40264-024-01409-547:5(495-511)Online publication date: 6-Mar-2024
  • (2024)Mining Patient-Generated Content for Medication Relations and Transition Network to Predict the Rankings and Volumes of Different MedicationsInformation Systems Frontiers10.1007/s10796-024-10530-wOnline publication date: 7-Sep-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SHB '12: Proceedings of the 2012 international workshop on Smart health and wellbeing
October 2012
72 pages
ISBN:9781450317122
DOI:10.1145/2389707
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 29 October 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. big-data analytic
  2. drug-related adverse events
  3. high performance computing
  4. mapreduce
  5. natural language processing
  6. public health
  7. twitter mining

Qualifiers

  • Research-article

Conference

CIKM'12
Sponsor:

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)40
  • Downloads (Last 6 weeks)2
Reflects downloads up to 18 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)A Hybrid Neural Network Model with Entity-related Knowledge for Adverse Drug Reaction Detection2024 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)10.1109/BIBM62325.2024.10822078(7117-7120)Online publication date: 3-Dec-2024
  • (2024)Using Social Media as a Source of Real-World Data for Pharmaceutical Drug Development and Regulatory Decision MakingDrug Safety10.1007/s40264-024-01409-547:5(495-511)Online publication date: 6-Mar-2024
  • (2024)Mining Patient-Generated Content for Medication Relations and Transition Network to Predict the Rankings and Volumes of Different MedicationsInformation Systems Frontiers10.1007/s10796-024-10530-wOnline publication date: 7-Sep-2024
  • (2023)Sentiment Analysis of COVID-19 Tweets Using Deep Learning and Lexicon-Based ApproachesSustainability10.3390/su1503257315:3(2573)Online publication date: 31-Jan-2023
  • (2023)Public Perception of ChatGPT and Transfer Learning for Tweets Sentiment Analysis Using Wolfram MathematicaData10.3390/data81201808:12(180)Online publication date: 28-Nov-2023
  • (2023)Identifying adverse drug reactions from patient reviews on social media using natural language processingHealth Informatics Journal10.1177/1460458222113671229:1(146045822211367)Online publication date: 1-Mar-2023
  • (2023)Adverse event signal extraction from cancer patients’ narratives focusing on impact on their daily-life activitiesScientific Reports10.1038/s41598-023-42496-113:1Online publication date: 19-Sep-2023
  • (2023)Efficient parameter tuning of neural foundation models for drug perspective prediction from unstructured socio-medical dataEngineering Applications of Artificial Intelligence10.1016/j.engappai.2023.106214123:PAOnline publication date: 1-Aug-2023
  • (2023)Computationale Methoden in den Sozial- und HumanwissenschaftenForschungsmethoden und Evaluation in den Sozial- und Humanwissenschaften10.1007/978-3-662-64762-2_19(1011-1062)Online publication date: 23-Feb-2023
  • (2023)KESDT: Knowledge Enhanced Shallow and Deep Transformer for Detecting Adverse Drug ReactionsNatural Language Processing and Chinese Computing10.1007/978-3-031-44696-2_47(601-613)Online publication date: 8-Oct-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media