8000 GitHub - j-ranasinghe/nlp_cw
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

j-ranasinghe/nlp_cw

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

36 Commits
 
 
 
 
 
 

Repository files navigation

TripAdvisor Hotel Reviews Analysis

This repository contains code for analyzing TripAdvisor reviews of hotels in Sri Lanka. The goal is to perform sentiment analysis and clustering to extract insights from the reviews. The analysis covers various stages, including data cleaning, sentiment classification, feature extraction, and text clustering.

This was done as part of a coursework for the module CM 4603 - Language Processing and Information Retrieval

Contents

  • Data Collection: Reviews were extracted from TripAdvisor for 205 hotels in Sri Lanka.
  • Data Cleaning: The dataset was preprocessed for sentiment analysis and clustering tasks.
  • Establishing Ground Truth:
  • Feature Extraction:
  • Text classification(Sentiment Analysis): Reviews were classified based on sentiment.
  • Clustering & Topic Modeling: Reviews were clustered based on hotel aspects.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •  
0