0% found this document useful (0 votes)

13 views34 pages

Project Presentation

Restaurant recomendation ml project

Uploaded by

nirannjanss

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views34 pages

Project Presentation

Restaurant recomendation ml project

Uploaded by

nirannjanss

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 34

City & Cuisine-Based

Restaurant
z z
Recommender
Using Yelp Dataset
z
School of Computer Science And
Engineering
Data Mining and Analysis(18ECSC301)
Course Project
on

Yelp Data Challenge

Round 12

Team Leader: Adarsh Raj Team Members:

 Abhijeet Prakash
 Abhishek Sawant
 Adarsh Raj
 Apeksha Ninnekar
z
INTRODUCT
• Yelp's website, Yelp.com, is a crowd-sourced
local business
. review and social networking
site active in major metropolitan areas.
• Yelp users can submit a review of their products
or services using a one to five star rating system.

• Yelp has over 135 million restaurant and

business reviews worldwide.
Description
z
 Yelp has 135 millions restaurants worldwide.

 Whether you’re looking for a continental food, a

great coffee shop nearby, a new salon, or the best
handyman in town, Yelp is your city guide to
finding the perfect places to eat, shop, drink, relax,
visit and play.
Problem
statement
“To search for and recommend best restaurants
in a city for different kinds of cuisines based on
reviews given by customers .”
Project vision
z

 Yelp contains review data of various restaurants

in a city and helps users in choosing a
restaurant.
 In this project we have used review text for
recommending restaurants to the users for
different cuisines.
 We have investigated features of yelp data for
rating prediction and recommendation tasks
Dataset z

 The size of the Data is 6.84 Gb including the Attributes of Business

data
sub files
 Business Dataset(139 Mb)

 Check-In Dataset (50.3 Mb)

 Photo Dataset (34.9 Mb)

 Review Dataset (4.39 Gb)

 Tips Dataset (203 Mb)

 Users Dataset (2.03 Gb)

Data sets used :
• Business
• review
Exploratory analysis
z

• Graph to check no of food businesses in each

state .

Conclusion: Number of businesses for food categories

was highest in Ontario state i.e. 17907 businesses
Exploratory analysis
z

• Top 10 cities with highest review ratings in Ontario

state.
Exploratory analysis
z

Majority of the stars for food

business are 5-stars
Categories selected
z

Over all food categories:

 ‘Food’, ' Restaurants’, 'Pizza', 'Mexican', 'American (Traditional)',
'American (New)', 'Italian', ''Indian', ' Pakistani', 'Thai', ' Japanese',
'French’,’ Canadian (New ), ' Middle Eastern', 'German', 'Vietnamese',
'Chinese', 'Hungarian'

Cuisines:
 Indian
 Chinese
 Thai
 Italian
 Japanese
Exploratory analysis
z

 Majority of the food categories selected was

restaurant .
Data Reduction
z

After exploratory analysis , we trimmed our

dataset:
We selected instances with:
 Food related businesses

 State as ‘ Ontario’
Methodology
z
Pre processing
z

Dropped :
 28 columns from business file
 4 columns from review file
 Adding new columns: senti-polarity and text clear

Data integration:
 Combining the two dataset : business and the review the total
no of columns after integration is 36 columns and 482384 rows
Predictive tasks
z

There are two major tasks in our project:

 Predict rating from review text:

- Linear support vector machine

classifier
 Find the sentiment polarity and recommend the
top best restaurants for each cuisine type.
– Sentiment polarity
- Mean of star ratings
Linear support vector
machinez
’ Linear support vector machine classifier
 text pre-processing

 removed punctuations, stop words and tokenized the reviews

 converted each review into a vector using tf-idf

Training the model

 split the dataset into training and test set by 80:20 ratio

 build a multiclass svm classifier and fit it to our training set

Test and evaluating the model

 tested the model for 5 classes(1,2,3,4,5 rating)
Using five
z
 ’

classes(1,2,3,4,5)
Task2: Recommending
Restaurants To Users
z

 ’

 Calculate sentiment polarity for each review

text
 Find mean sentiment polarity for each
business_id
 Find mean stars for each business_id
 Considering the business with mean stars
greater than 3.5 and sentiment polarity
greater than 0 as good restaurants.
Plotting graphs of stars vs
sentiment polarity
z
From graph we see that all the stars greater than
3.5 are above 0 of senti-polarity.

 ’
Displaying restaurants on
map z
Mapping
z the restaurants on
world map
Finding the best
z

restaurants
Finding the top best restaurants on YELP:
• based on the stars and the highest senti polarity value
z

 ’
Indian cuisine
z

Finding top restaurants for Indian

cuisine.
z

 ’
WORDCLOUD
z
Chinese cuisine
z
 Finding the top best restaurants for Chinese cuisine.
GUI z
GUI z
z
z

Thank you
z
z
z

Q'anjob'al
No ratings yet
Q'anjob'al
30 pages
Full Download Understanding Nutrition 13th Edition Whitney Solutions Manual
100% (51)
Full Download Understanding Nutrition 13th Edition Whitney Solutions Manual
35 pages
Traditional Toum (Lebanese Garlic Sauce) Recipe
No ratings yet
Traditional Toum (Lebanese Garlic Sauce) Recipe
2 pages
Trip Advisor-Yelp Market Research Assignment Power Point
No ratings yet
Trip Advisor-Yelp Market Research Assignment Power Point
4 pages
Chapter One(New)
No ratings yet
Chapter One(New)
25 pages
IAJSE1014
No ratings yet
IAJSE1014
8 pages
DAT SCIENCE PROGRAMING ASSEMENT 5 (1)
No ratings yet
DAT SCIENCE PROGRAMING ASSEMENT 5 (1)
6 pages
Swiggy Growth
No ratings yet
Swiggy Growth
4 pages
Swiggy_project_ppt
No ratings yet
Swiggy_project_ppt
13 pages
Recommendation System
No ratings yet
Recommendation System
14 pages
Zomato Recommendation and Price Prediction System
No ratings yet
Zomato Recommendation and Price Prediction System
5 pages
Project Report
No ratings yet
Project Report
16 pages
Zomato Data Analysis Presentation
No ratings yet
Zomato Data Analysis Presentation
16 pages
289
No ratings yet
289
1 page
Data Visualization 2
No ratings yet
Data Visualization 2
12 pages
IBPS Clerk Prelims Mock Test 202402 - Hindi
No ratings yet
IBPS Clerk Prelims Mock Test 202402 - Hindi
10 pages
RIT-39
No ratings yet
RIT-39
19 pages
LIQUOR
No ratings yet
LIQUOR
2 pages
lit1F
No ratings yet
lit1F
7 pages
Food Recommendation System
No ratings yet
Food Recommendation System
13 pages
Report
No ratings yet
Report
18 pages
Analyzing The Impact of Components of Yelp - Com On Recommender System Performance Case of Austin
No ratings yet
Analyzing The Impact of Components of Yelp - Com On Recommender System Performance Case of Austin
11 pages
Thesis On Food Processing Industry in India
100% (2)
Thesis On Food Processing Industry in India
6 pages
02 ruchiJWoo35-49
No ratings yet
02 ruchiJWoo35-49
16 pages
Data Mining Capstone Project Report
No ratings yet
Data Mining Capstone Project Report
15 pages
Restaurants Rating Prediction Using Machine Learning Algorithms
No ratings yet
Restaurants Rating Prediction Using Machine Learning Algorithms
1 page
NSOsheet1 CLASS1
No ratings yet
NSOsheet1 CLASS1
3 pages
DA Report PDF
No ratings yet
DA Report PDF
4 pages
BBM Prelim Labs and Assessments
No ratings yet
BBM Prelim Labs and Assessments
10 pages
f12
No ratings yet
f12
3 pages
Restaurant Recommendation System Using Machine Learning
No ratings yet
Restaurant Recommendation System Using Machine Learning
5 pages
ExL1 - 030414-2 Jeffrey Is A Pet Camel
No ratings yet
ExL1 - 030414-2 Jeffrey Is A Pet Camel
2 pages
A Recommendation System For Food Tourism
No ratings yet
A Recommendation System For Food Tourism
10 pages
10 1109@icasert 2019 8934655
No ratings yet
10 1109@icasert 2019 8934655
6 pages
Popularity-Based and Collaborative Filtering Based Restaurant Recommender System
No ratings yet
Popularity-Based and Collaborative Filtering Based Restaurant Recommender System
19 pages
RuiJian MastersThesis
No ratings yet
RuiJian MastersThesis
71 pages
Sentiment Analysis and Classification of Restaurant Reviews Using Machine Learning
No ratings yet
Sentiment Analysis and Classification of Restaurant Reviews Using Machine Learning
6 pages
TOPIC 4 Premises and Facilities Hygiene in Islamic Perspective Latest
No ratings yet
TOPIC 4 Premises and Facilities Hygiene in Islamic Perspective Latest
38 pages
Modern NLP in Python
No ratings yet
Modern NLP in Python
46 pages
Sentimental Analysis of Resturant Reviews
No ratings yet
Sentimental Analysis of Resturant Reviews
30 pages
Focus - 4. Godina - Orijentacioni Plan Za Drustveni Smer
No ratings yet
Focus - 4. Godina - Orijentacioni Plan Za Drustveni Smer
3 pages
Introduction To Text Mining
No ratings yet
Introduction To Text Mining
54 pages
Yelp Business Rating Prediction
No ratings yet
Yelp Business Rating Prediction
8 pages
Restaurant Review Predictionusing Machine Learning and Neural Network
No ratings yet
Restaurant Review Predictionusing Machine Learning and Neural Network
5 pages
Report-Converted Sip
No ratings yet
Report-Converted Sip
14 pages
f14
No ratings yet
f14
3 pages
Final Project Report DA
No ratings yet
Final Project Report DA
3 pages
Macros 101
No ratings yet
Macros 101
2 pages
Edunet
No ratings yet
Edunet
14 pages
TeamMess M#
No ratings yet
TeamMess M#
15 pages
Grammar 3
No ratings yet
Grammar 3
18 pages
The Quiet Power of Introverts
No ratings yet
The Quiet Power of Introverts
2 pages
Ashish Gandhe, Restaurant Recommendation System
No ratings yet
Ashish Gandhe, Restaurant Recommendation System
5 pages
Legend Situ Bagendit
No ratings yet
Legend Situ Bagendit
6 pages
Evaluation of Customer Ratings On Restaurant by Clustering Techniques Using R
No ratings yet
Evaluation of Customer Ratings On Restaurant by Clustering Techniques Using R
8 pages
BCD3002 Business Intelligence and Analytics
No ratings yet
BCD3002 Business Intelligence and Analytics
5 pages
South Indian Veg Meal Plan - Female
No ratings yet
South Indian Veg Meal Plan - Female
1 page
Zomato Data Analysis (1) (1)
No ratings yet
Zomato Data Analysis (1) (1)
11 pages
Dataset Description
No ratings yet
Dataset Description
2 pages
RESTAURANT RECOMMANDATION SYSTEM(1)
No ratings yet
RESTAURANT RECOMMANDATION SYSTEM(1)
15 pages
Ashish Gandhe, Restaurant Recommendation System PDF
No ratings yet
Ashish Gandhe, Restaurant Recommendation System PDF
5 pages
Livestock Systems and Forage Resources of Small Ruminant Farms in Some Selected Districts in Sierra Leone
No ratings yet
Livestock Systems and Forage Resources of Small Ruminant Farms in Some Selected Districts in Sierra Leone
8 pages
Yelp Explorers Report
No ratings yet
Yelp Explorers Report
10 pages
Data report
No ratings yet
Data report
7 pages
Iasec Upsc Syllabus
No ratings yet
Iasec Upsc Syllabus
1 page
Child Welfare and Overindulgence
No ratings yet
Child Welfare and Overindulgence
10 pages
Zomato Customer Satisfaction Code and Results
No ratings yet
Zomato Customer Satisfaction Code and Results
11 pages
DA - Project 1
No ratings yet
DA - Project 1
12 pages
Ashish Gandhe, Restaurant Recommendation System
No ratings yet
Ashish Gandhe, Restaurant Recommendation System
6 pages
Modena Modular Pizza Oven Kit Datasheet
No ratings yet
Modena Modular Pizza Oven Kit Datasheet
10 pages
Final Examination: Course: Reading - Writing B2 (Reading)
No ratings yet
Final Examination: Course: Reading - Writing B2 (Reading)
6 pages
Test 11: Choose The Best Option To Complete The Following Sentences
No ratings yet
Test 11: Choose The Best Option To Complete The Following Sentences
6 pages
Restaurant Recommendation1
No ratings yet
Restaurant Recommendation1
5 pages
Work Immersion
No ratings yet
Work Immersion
4 pages
Roguish Archetype: Poisoner
No ratings yet
Roguish Archetype: Poisoner
3 pages
Project Detailed Review
No ratings yet
Project Detailed Review
9 pages
Grade 7 - First Term Revision - 2021-2022
No ratings yet
Grade 7 - First Term Revision - 2021-2022
6 pages
Restaurant Review Classification and Recommender System
No ratings yet
Restaurant Review Classification and Recommender System
5 pages
Presentation1 COOKIES
No ratings yet
Presentation1 COOKIES
14 pages
Zomato Data Aanalysis Using Machine Learning Algorithms
No ratings yet
Zomato Data Aanalysis Using Machine Learning Algorithms
7 pages
Understanding Educational Statistics Using Microsoft Excel and SPSS
From Everand
Understanding Educational Statistics Using Microsoft Excel and SPSS
Martin Lee Abbott
No ratings yet
Restaurants Rating Prediction Using Machine Learning Algorithms
No ratings yet
Restaurants Rating Prediction Using Machine Learning Algorithms
4 pages
Restaurants Rating Prediction Using Machine Learning Algorithms
No ratings yet
Restaurants Rating Prediction Using Machine Learning Algorithms
4 pages
Unit 5. Food and Drink
No ratings yet
Unit 5. Food and Drink
8 pages
Yelp Vs Zomato Analysis
No ratings yet
Yelp Vs Zomato Analysis
8 pages
Companies Database
100% (1)
Companies Database
653 pages
Zomato Data Analysis
No ratings yet
Zomato Data Analysis
8 pages
Customer Satisfaction Towards Coca Cola Shivam Patel Rbmi
100% (3)
Customer Satisfaction Towards Coca Cola Shivam Patel Rbmi
94 pages
Data Mining of Restaurant Review Using W PDF
No ratings yet
Data Mining of Restaurant Review Using W PDF
4 pages
Syllabus Compilation BSHM & BSTM 11111
100% (1)
Syllabus Compilation BSHM & BSTM 11111
41 pages

Project Presentation

Uploaded by

Project Presentation

Uploaded by

City & Cuisine-Based

Yelp Data Challenge

Team Leader: Adarsh Raj Team Members:

• Yelp has over 135 million restaurant and

 Whether you’re looking for a continental food, a

 Yelp contains review data of various restaurants

 The size of the Data is 6.84 Gb including the Attributes of Business

 Check-In Dataset (50.3 Mb)

 Photo Dataset (34.9 Mb)

 Review Dataset (4.39 Gb)

 Tips Dataset (203 Mb)

 Users Dataset (2.03 Gb)

• Graph to check no of food businesses in each

Conclusion: Number of businesses for food categories

• Top 10 cities with highest review ratings in Ontario

Majority of the stars for food

Over all food categories:

 Majority of the food categories selected was

After exploratory analysis , we trimmed our

There are two major tasks in our project:

- Linear support vector machine

 removed punctuations, stop words and tokenized the reviews

 converted each review into a vector using tf-idf

Training the model

 build a multiclass svm classifier and fit it to our training set

Test and evaluating the model

 Calculate sentiment polarity for each review

Finding top restaurants for Indian

You might also like