Skip to main content

Datasets

Standard Dataset

Hotel Reviews from around the world with Sentiment Values and Review Ratings in different Categories for Natural Language Processing

Average: 5 (132 votes)

Abstract

The dataset consists of reviews for various hotels throughout the world and data columns range from Location, Trip Type to various parameters of reviewing with individual review score. The data can be preprocessed and used for various purposes ranging from review categorization, topic extraction, sentiment analysis, location based quality calculation etc. Trustworthy real world data comes handy now-a-days and is tough to get a grasp on. So this dataset will be a good contribution for the researcher community as well as professionals. 

 

Instructions:

The dataset consists of 69308 instances containing all columns. A seperate file containing only sentiment values ranging from

[-1,1] have also been added. 

The various data headings are :

·         ReviewId       

·         UserLocation

·         ReviewedDate

·         HotelName

·         DateOfStay

·         ReviewText   

·         TripType

·         Value    

·         Cleanliness   

·         Service 

·         Location

·         Sleep Quality 

·         Rooms  

·         Check in / front desk    

·         Business service (e.g., internet access)         



Starting from Value all columns are ratings given by reviewers. Presence of -1 depicts missing values.

The sentiment data file consists of Id, Text and Sentiment Value .

The sentiment value was extracted from given dataset using some preprocessing and semi-supervised algorithm.

All the values are Tab separated Values.

 

For selecting Bibtex contents, double click on IEEE contents. Then use Ctrl+C to copy. It's a bug and we need to wait till its fixed. Till then this is how you can cite.
Avishek Garain Tue, 10/06/2020 - 07:47 Permalink
i am unable to download the dataset actually i want to do extend this but unable to download the dataset.
Muhammad Awais Thu, 06/17/2021 - 17:14 Permalink
After reading paper "An ensemble-based hotel recommender system using sentiment analysis and aspect categorization of hotel reviews" i like this paper and i want to reproduce the paper. So, please mail me this dataset my mail address is "[email protected]".
AHSAN UL HAQ Wed, 05/25/2022 - 06:26 Permalink
I am unable to download the dataset. Is there any any way I an download the dataset?
Tirath Savasaiya Wed, 11/15/2023 - 23:35 Permalink