[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3152178.3152192acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article
Open access

Predictive Analytics Using Text Classification for Restaurant Inspections

Published: 07 November 2017 Publication History

Abstract

According to the Center for Disease Control (CDC), there are almost 48 million people affected by foodborne diseases in the U.S. every year, including 3,000 deaths. The most effective way of avoiding food poisoning would be its prevention. However, complete prevention is not possible, therefore Public Health departments perform routine restaurant inspections, combined with the practice of inspecting specific restaurants once a disease outbreak is identified. Following other health applications (e.g., prediction of a flu outbreak using Twitter), we use social media and a predictive analytics approach to identify the need for targeted visits by city inspectors.

References

[1]
Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G. S., Davis, A., Dean, J., Devin, M., et al. TensorFlow: Large-scale Machine Learning on Heterogeneous Distributed Systems. CoRR abs/1603.04467 (2016).
[2]
Bardach, N. S., Asteria-Peñaloza, R., Boscardin, W. J., and Dudley, R. A. The Relationship between Commercial Website Ratings and Traditional Hospital Performance Measures in the USA. BMJ Quality & Safety (2012), 194--202.
[3]
Breiman, L. Random Forests. Machine Learning 45, 1 (2001), 5--32.
[4]
CDC. Burden of Foodborne Illness: Overview, July 2016.
[5]
Choucair, B. The Future of Public Health: Preventing Food Poisonings from Occurring, February 2014.
[6]
Colas, F., and Brazdil, P. Comparison of SVM and Some Older Classification Algorithms in Text Classification Tasks. Artificial Intelligence in Theory and Practice (2006), 169--178.
[7]
Ghiassi, M., Skinner, J., and Zimbra, D. Twitter Brand Sentiment Analysis: A Hybrid System Using n-gram Analysis and Dynamic Artificial Neural Network. Expert Systems with Applications 40, 16 (2013), 6266--6282.
[8]
Goutte, C., and Gaussier, E. A Probabilistic Interpretation of Precision, Recall and F-score, with Implication for Evaluation. In European Conference on IR Research (ECIR) (2005), vol. 5, Springer, pp. 345--359.
[9]
Harrison, C., Jorder, M., Stern, H., Stavinsky, F., Reddy, V., Hanson, H., Waechter, H., Lowe, L., Gravano, L., Balter, S., et al. Using Online Reviews by Restaurant Patrons to Identify Unreported Cases of Foodborne Illness-New York City, 2012--2013. 441--445.
[10]
Kilaru, A. S., Meisel, Z. F., Paciotti, B., Ha, Y. P., Smith, R.J., Ranard, B. L., and Merchant, R. M. What Do Patients Say About Emergency Departments in Online Reviews? A Qualitative Study. BMJ Quality & Safety 25, 1 (2016), 14--24.
[11]
Kiritchenko, S., Zhu, X., Cherry, C., and Mohammad, S. NRC-Canada-2014: Detecting Aspects and Sentiment in Customer Reviews. In SemEval@COLING (2014), pp. 437--442.
[12]
Luca, M., and Zervas, G. Fake It Till You Make It: Reputation, Competition, and Yelp Review Fraud. Management Science 62, 12 (2016), 3412--3427.
[13]
Metsis, V., Androutsopoulos, I., and Paliouras, G. Spam Filtering with Naive Bayes - Which Naive Bayes? In CEAS (2006), vol. 17, pp. 28--69.
[14]
Pontiki, M., Galanis, D., Papageorgiou, H., Androutsopoulos, I., Manandhar, S., Al-Smadi, M., Al-Ayyoub, M., Zhao, Y., Qin, B., De Clercq, O., et al. SemEval-2016 Task 5: Aspect Based Sentiment Analysis. In Workshop on Semantic Evaluation (SemEval-2016) (2016), Association for Computational Linguistics, pp. 19--30.
[15]
Ranard, B. L., Werner, R. M., Antanavicius, T., Schwartz, H. A., Smith, R. J., Meisel, Z. F., Asch, D. A., Ungar, L. H., and Merchant, R. M. Yelp Reviews of Hospital Care Can Supplement and Inform Traditional Surveys of the Patient Experience of Care. Health Affairs 35, 4 (2016), 697--705.
[16]
Sadilek, A., Brennan, S., Kautz, H., and Silenzio, V. nEmesis: Which Restaurants Should You Avoid Today? In AAAI Conference on Human Computation and Crowdsourcing (2013).
[17]
Sadilek, A., Kautz, H. A., DiPrete, L., Labus, B., Portman, E., Teitel, J., and Silenzio, V. Deploying nEmesis: Preventing Foodborne Illness by Data Mining Social Media. In AAAI (2016), pp. 3982--3990.
[18]
Tausczik, Y. R., and Pennebaker, J. W. The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods. Journal of Language and Social Psychology 29, 1 (2010), 24--54.
[19]
Tran, N. N., and Lee, J. Online Reviews as Health Data: Examining the Association between Availability of Health Care Services and Patient Star Ratings Exemplified by the Yelp Academic Dataset. JMIR Public Health and Surveillance 3, 3 (2017).
[20]
Ullman, J. D. Mining of Massive Datasets. Cambridge University Press, 2011.
[21]
Webstaurantstore. Preparing for a Health Inspection, September 2017.

Cited By

View all
  • (2021)Exploring Text-based Emotions Recognition Machine Learning Techniques on Social Media ConversationProcedia Computer Science10.1016/j.procs.2021.01.099179(821-828)Online publication date: 2021
  • (2020)Analysis of the Impact of COVID-19 on Education Based on Geotagged TwitterProceedings of the 1st ACM SIGSPATIAL International Workshop on Modeling and Understanding the Spread of COVID-1910.1145/3423459.3430756(15-23)Online publication date: 3-Nov-2020
  • (2020)Conceptual Model of Professional Supervision Study Based on Data Mining: A Study in the Regional Council of Nutritionists of the 4th Brazilian Region (Rio de Janeiro and Espirito Santo States)Advances in Multidisciplinary Medical Technologies ─ Engineering, Modeling and Findings10.1007/978-3-030-57552-6_2(11-27)Online publication date: 8-Nov-2020
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
UrbanGIS'17: Proceedings of the 3rd ACM SIGSPATIAL Workshop on Smart Cities and Urban Analytics
November 2017
118 pages
ISBN:9781450354950
DOI:10.1145/3152178
This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 November 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Text mining
  2. online reviews
  3. prediction models
  4. public health
  5. supervised learning

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

Conference

SIGSPATIAL'17
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)62
  • Downloads (Last 6 weeks)7
Reflects downloads up to 10 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Exploring Text-based Emotions Recognition Machine Learning Techniques on Social Media ConversationProcedia Computer Science10.1016/j.procs.2021.01.099179(821-828)Online publication date: 2021
  • (2020)Analysis of the Impact of COVID-19 on Education Based on Geotagged TwitterProceedings of the 1st ACM SIGSPATIAL International Workshop on Modeling and Understanding the Spread of COVID-1910.1145/3423459.3430756(15-23)Online publication date: 3-Nov-2020
  • (2020)Conceptual Model of Professional Supervision Study Based on Data Mining: A Study in the Regional Council of Nutritionists of the 4th Brazilian Region (Rio de Janeiro and Espirito Santo States)Advances in Multidisciplinary Medical Technologies ─ Engineering, Modeling and Findings10.1007/978-3-030-57552-6_2(11-27)Online publication date: 8-Nov-2020
  • (2017)Ontology-based Instance Matching for Geospatial Urban Data IntegrationProceedings of the 3rd ACM SIGSPATIAL Workshop on Smart Cities and Urban Analytics10.1145/3152178.3152186(1-8)Online publication date: 7-Nov-2017

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media