0% found this document useful (0 votes)

48 views5 pages

Book Database

The document describes a dataset from Kaggle containing information on Amazon's Top 50 bestselling books from 2009 to 2019, with details on 550 books including title, author, user rating, reviews, price, year, and genre. It outlines the structure of the dataset and introduces a Book class to manage the book data, along with Java files for reading the dataset and performing various tasks such as counting books by an author and listing books by rating. The document also specifies tasks that can be performed on the dataset, such as retrieving books by a specific author or rating.

Uploaded by

Ujjwal Jain

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

48 views5 pages

Book Database

Uploaded by

Ujjwal Jain

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

For this project, we have taken a dataset from Kaggle.

This
dataset is on Amazon’s Top 50 bestselling books from 2009 to
2019. It keeps the record of 550 books in a .csv file.

Amazon’s top 50 bestselling books

This is a Kaggle dataset in .csv format. It includes the information

on name, author, user rating, reviews, price, year, and genre of
550 different books. So, data is arranged using the seven
columns below.

Dataset entries

Name Author User Reviews Price Year Genre

Rating

10-Day Green JJ Smith 4.7 17350 8 2016 Non fiction

Smoothie
Cleanse

12 Rules for Jordan B. 4.7 18979 15 2018 Non fiction

Life: An Peterson
Antidote to
Chaos

1984 (Signet George 4.7 21424 6 2017 Fiction

Classics) Orwell

5,000 National 4.8 7665 12 2019 Non fiction

Awesome Facts Geographic
(About Kids
Everything!)
(National
Geographic
Kids)
A Dance with George R. 4.4 12643 11 2011 Fiction
Dragons (A R. Martin
Song of Ice
and Fire)

... ... ... ... ... ... ...

The above table represents a book with various attributes

detailing its characteristics and performance on Amazon. Let’s

discuss these columns as follows:

● Name: This column contains the title of the book

● Author: This column lists the author’s name.
● User Rating: It shows the average Amazon user rating, which
ranges from 3.3 to 4.9.
● Reviews: It indicates the number of reviews written by users
on Amazon, with a minimum of 37 and a maximum of
87,800 reviews.
● Price: It provides the cost of the book, spanning from $0 to
$105.
● Year: It specifies the year or years the book appeared on the
bestseller list, covering the period from 2009 to 2019.
● Genre: Lastly, it classifies the book as either fiction or
nonfiction.

Reading the Dataset

To begin working with the dataset, we need to read the data from
a CSV file named data.csv. This file contains information about
various books, structured in a tabular format. Each row
represents a book and includes details such as the title, author,
user rating, number of reviews, price, publication year, and
genre.

Define Book class

In this section, we will define a Book class that models the

attributes of a book based on the dataset provided. The Book
class will contain all the necessary details about each book, such
as its title, author, user rating, number of reviews, price,
publication year, and genre.

This class is designed to provide a structured way to manage and

manipulate book data within our application.

Attributes:

○ title: The title of the book.

○ author: The author of the book.
○ userRating: The average user rating of the book.
○ reviews: The number of user reviews.
○ price: The price of the book.
○ year: The year the book appeared on the bestseller list.
○ genre: The genre of the book (either fiction or
non-fiction).
● Constructor: Initializes a Book object with the provided
values for each attribute.
● Getters and setters: These methods provide access to and
modification of the book's attributes.

In the code above, we can have three java files used to read the
dataset. Lets explore the objective of each file as follows:
● The Book.java file defines the Book class, This class
represents a Book object with attributes for the title, author,
user rating, reviews, price, year, and genre. It includes
getters for each attribute and a printDetails method to print
the details of the book in a formatted manner.
● The DatasetReader.java file is responsible for reading a CSV
file and creating a list of Book objects. It handles the parsing
of each line in the CSV, ensuring that each book has the
required data fields, and skips malformed lines.
● The driver.java file contains the main method, which serves
as the entry point of the program. It uses DatasetReader to
read the dataset from the CSV file, and then iterates over
the list of Book objects to print their details using the
printDetails method of the Book class.

Tasks
1. Total number of books by an author
○ It takes the name of an author and dataset as input
and returns the total number of books written by the
author
2. All the authors in the dataset
○ Print name of all authors in the dataset
3. Names of all the books by an author
○ It takes the author as an input and returns all the
books written by the author. Just for reference, Author
is the second column, and Name (name of the book) is
the first column in the dataset.
4. Classify with a user rating
○ It takes the rating as an input and returns all books
with the user rating equal to rating.
5. Price of all the books by an author
○ It takes the name of the author as an input and returns
the names and prices of all the books written by the
author.

Updated Pr1-Assignment
No ratings yet
Updated Pr1-Assignment
14 pages
Assignment: OOP: PR1 - Fall 2021
No ratings yet
Assignment: OOP: PR1 - Fall 2021
13 pages
C++ Object-Oriented Programming Tasks
No ratings yet
C++ Object-Oriented Programming Tasks
30 pages
Beattie Scda Finalproj-1
No ratings yet
Beattie Scda Finalproj-1
24 pages
Team Renegades MMLA Report
No ratings yet
Team Renegades MMLA Report
27 pages
Python-Based Library Management System
No ratings yet
Python-Based Library Management System
19 pages
Introduction To Computer Science - Chapter 4 Lab: Writing Multiple Classes - 25 Points
No ratings yet
Introduction To Computer Science - Chapter 4 Lab: Writing Multiple Classes - 25 Points
5 pages
Lab2 Pandas1
No ratings yet
Lab2 Pandas1
8 pages
Exercises: Spring Data Advanced Quering: 1. Books Titles by Age Restriction
No ratings yet
Exercises: Spring Data Advanced Quering: 1. Books Titles by Age Restriction
5 pages
Capstone Project: I. Definition
No ratings yet
Capstone Project: I. Definition
17 pages
PySpark Practice
No ratings yet
PySpark Practice
2 pages
Bookrecommendations 230615063942 3b1016c9
No ratings yet
Bookrecommendations 230615063942 3b1016c9
22 pages
BOOK Recommendation That Help To Analsis The
No ratings yet
BOOK Recommendation That Help To Analsis The
22 pages
Assignment 1
No ratings yet
Assignment 1
3 pages
On Classes - 1
No ratings yet
On Classes - 1
3 pages
Project
No ratings yet
Project
4 pages
Assignment 1
No ratings yet
Assignment 1
3 pages
Google Play Store Apps
100% (1)
Google Play Store Apps
63 pages
PlayStore Analysis
No ratings yet
PlayStore Analysis
28 pages
Final
No ratings yet
Final
36 pages
AssignmentRequirements Eng
No ratings yet
AssignmentRequirements Eng
13 pages
App Rating Prediction Project by CHANCHAL SINGH
No ratings yet
App Rating Prediction Project by CHANCHAL SINGH
15 pages
Movie Data Insights & Predictions
No ratings yet
Movie Data Insights & Predictions
22 pages
App Rating Prediction Model
No ratings yet
App Rating Prediction Model
51 pages
Business Intelligence Project Report
No ratings yet
Business Intelligence Project Report
14 pages
Analyzing IMDB Scores of Netflix Films
No ratings yet
Analyzing IMDB Scores of Netflix Films
14 pages
Assignment On " Exploring Public Datasets" Subject: Big Data Technologies
No ratings yet
Assignment On " Exploring Public Datasets" Subject: Big Data Technologies
8 pages
Mini Project
No ratings yet
Mini Project
17 pages
App Rating Prediction Project
100% (5)
App Rating Prediction Project
14 pages
Movie Ticket Booking
No ratings yet
Movie Ticket Booking
30 pages
Practical File
No ratings yet
Practical File
27 pages
Book Class Activity With Instructions
No ratings yet
Book Class Activity With Instructions
5 pages
C++ Assignment 1
No ratings yet
C++ Assignment 1
8 pages
R Raj
No ratings yet
R Raj
9 pages
Dat7302 Bda Assessment Brief
No ratings yet
Dat7302 Bda Assessment Brief
9 pages
Python Data Handling and SQL Practices
No ratings yet
Python Data Handling and SQL Practices
16 pages
Java Bookstore Management System
No ratings yet
Java Bookstore Management System
13 pages
CSV File Program PDF
No ratings yet
CSV File Program PDF
4 pages
Internal Assessment Practical 2
No ratings yet
Internal Assessment Practical 2
2 pages
Practicals - CS Class 12
No ratings yet
Practicals - CS Class 12
8 pages
05.advanced Querying Exercises
No ratings yet
05.advanced Querying Exercises
7 pages
06.advanced Querying Exercises
No ratings yet
06.advanced Querying Exercises
7 pages
Arpit
No ratings yet
Arpit
30 pages
Java File Handling and Stream Classes
No ratings yet
Java File Handling and Stream Classes
7 pages
Vihari
No ratings yet
Vihari
27 pages
Book Recommendation System Analysis
No ratings yet
Book Recommendation System Analysis
31 pages
Data Mining Journal 1 Kashan
No ratings yet
Data Mining Journal 1 Kashan
13 pages
C++ Programming Assignment 2024
No ratings yet
C++ Programming Assignment 2024
20 pages
Class 12 IP Practical File 2025-26
No ratings yet
Class 12 IP Practical File 2025-26
28 pages
Class 12 CS Practical List 2023-24
No ratings yet
Class 12 CS Practical List 2023-24
31 pages
Unit 1-Part3-Compressed
No ratings yet
Unit 1-Part3-Compressed
28 pages
Movie Recommendation System Using Graph Database
No ratings yet
Movie Recommendation System Using Graph Database
31 pages
OOP Assign 3
No ratings yet
OOP Assign 3
21 pages
Lab Questions IDSE 2024
No ratings yet
Lab Questions IDSE 2024
7 pages
Amazon Review Data Analysis
No ratings yet
Amazon Review Data Analysis
23 pages
Assignment 2 22261
No ratings yet
Assignment 2 22261
4 pages
Book Recommendation Project
No ratings yet
Book Recommendation Project
15 pages
Lecture 7 - CS50x
No ratings yet
Lecture 7 - CS50x
9 pages
Alfred North Whitehead, On Beauty
No ratings yet
Alfred North Whitehead, On Beauty
11 pages
A Study On Consumer Satisfaction Analysis Towards The Product, Price, Service of Lifestyle International Pvt. LTD Bhubaneswar
No ratings yet
A Study On Consumer Satisfaction Analysis Towards The Product, Price, Service of Lifestyle International Pvt. LTD Bhubaneswar
11 pages
Database Design Lecture Notes
No ratings yet
Database Design Lecture Notes
9 pages
Wsma Unit-1
No ratings yet
Wsma Unit-1
15 pages
MySQL Certification Guide.1Z0 873 Sample
No ratings yet
MySQL Certification Guide.1Z0 873 Sample
4 pages
Creating and Managing SQL Views and Indexes
No ratings yet
Creating and Managing SQL Views and Indexes
4 pages
DMW - Unit - 1 MCQS
No ratings yet
DMW - Unit - 1 MCQS
6 pages
Data Engineering Glossary for Beginners
No ratings yet
Data Engineering Glossary for Beginners
2 pages
Demystifying The Big Data Ecosystem... - Param Natarajan
100% (1)
Demystifying The Big Data Ecosystem... - Param Natarajan
8 pages
Module 6 Measures - TOT
No ratings yet
Module 6 Measures - TOT
69 pages
G8 SLM1a Q2 Final
No ratings yet
G8 SLM1a Q2 Final
23 pages
Stats Thesis Help for Students
100% (4)
Stats Thesis Help for Students
7 pages
Classic Star Schema As Data Model of Data Warehouse
No ratings yet
Classic Star Schema As Data Model of Data Warehouse
7 pages
The Fujitsu 3.5-Inch 15K RPM Enterprise Hard Disk Drives
No ratings yet
The Fujitsu 3.5-Inch 15K RPM Enterprise Hard Disk Drives
2 pages
MATLAB Data Analysis PDF Download
No ratings yet
MATLAB Data Analysis PDF Download
2 pages
VLOOKUP Guide for Beginners
No ratings yet
VLOOKUP Guide for Beginners
2 pages
Question Bank For DBT
No ratings yet
Question Bank For DBT
27 pages
Database Normalization
100% (1)
Database Normalization
44 pages
RAM vs ROM: Key Differences Explained
No ratings yet
RAM vs ROM: Key Differences Explained
3 pages
The Osi Reference Model
No ratings yet
The Osi Reference Model
15 pages
Dblink Creation
No ratings yet
Dblink Creation
4 pages
SANGHMITRA
No ratings yet
SANGHMITRA
57 pages
Practical PDF
No ratings yet
Practical PDF
46 pages
Training and Development in Organizations: Start at The Beginning
No ratings yet
Training and Development in Organizations: Start at The Beginning
12 pages
Database Systems Design Implementation and Management 13th Edition Coronel Solutions Manual
100% (54)
Database Systems Design Implementation and Management 13th Edition Coronel Solutions Manual
20 pages
CBSE Class 11 Computer Science Sample Paper 2018
No ratings yet
CBSE Class 11 Computer Science Sample Paper 2018
6 pages
Kafka Commands
No ratings yet
Kafka Commands
5 pages
BCA DBMS Exam Paper
No ratings yet
BCA DBMS Exam Paper
7 pages
Unit 1 - Scsa3008 - Distributed Database and Information
No ratings yet
Unit 1 - Scsa3008 - Distributed Database and Information
23 pages
Chapter 3 Sample
No ratings yet
Chapter 3 Sample
12 pages

Book Database

Uploaded by

Book Database

Uploaded by

For this project, we have taken a dataset from Kaggle.

Amazon’s top 50 bestselling books

This is a Kaggle dataset in .csv format. It includes the information

Name Author User Reviews Price Year Genre

10-Day Green JJ Smith 4.7 17350 8 2016 Non fiction

12 Rules for Jordan B. 4.7 18979 15 2018 Non fiction

1984 (Signet George 4.7 21424 6 2017 Fiction

5,000 National 4.8 7665 12 2019 Non fiction

... ... ... ... ... ... ...

The above table represents a book with various attributes

detailing its characteristics and performance on Amazon. Let’s

discuss these columns as follows:

● Name: This column contains the title of the book

Reading the Dataset

Define Book class

In this section, we will define a Book class that models the

This class is designed to provide a structured way to manage and

○ title: The title of the book.

You might also like