0% found this document useful (0 votes)

68 views5 pages

Experiment 10

The document outlines the implementation of a Recurrent Neural Network (RNN) for classifying IMDB movie reviews as positive or negative. It details the objectives, program code, and step-by-step explanation of loading data, preprocessing, building, compiling, training, and evaluating the model. The RNN achieves a test accuracy of approximately 85-87%, demonstrating its effectiveness in sentiment analysis.

Uploaded by

gnanesh847

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

68 views5 pages

Experiment 10

Uploaded by

gnanesh847

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

# Experiment 10: Implement an RNN for IMDB Movie Review Classification

## Title

Recurrent Neural Network (RNN) for IMDB Movie Review Classification

## Aim

To implement a Recurrent Neural Network (RNN) for classifying IMDB movie reviews as
either positive or negative.

## Objectives

- Understand the use of RNN for text classification.

- Preprocess text data and convert it into sequences using word embeddings.

- Train an RNN model using TensorFlow/Keras for sentiment analysis.

- Evaluate the model's performance using accuracy metrics.

---

## Program with Line-by-Line Explanation

Below is the complete Python code to implement an RNN for sentiment classification
on the IMDB dataset:

```python

# Import required libraries

import tensorflow as tf

from tensorflow import keras

from [Link] import sequence

from [Link] import Sequential

from [Link] import Embedding, SimpleRNN, Dense

from [Link] import imdb

# Step 1: Load the IMDB dataset

max_features = 10000 # Vocabulary size (top 10,000 words)

maxlen = 500 # Max length of a review (truncate/pad to this size)

batch_size = 32

# Load dataset with only top `max_features` words

(x_train, y_train), (x_test, y_test) = imdb.load_data(num_words=max_features)

# Step 2: Preprocess the data (pad sequences to ensure equal length)

x_train = sequence.pad_sequences(x_train, maxlen=maxlen)

x_test = sequence.pad_sequences(x_test, maxlen=maxlen)

# Step 3: Build the RNN model

model = Sequential([

Embedding(input_dim=max_features, output_dim=32), # Embedding layer

SimpleRNN(32), # Simple RNN layer with 32 units

Dense(1, activation='sigmoid') # Output layer for binary classification

])

# Step 4: Compile the model

[Link](loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])

# Step 5: Train the model

[Link](x_train, y_train, epochs=5, batch_size=batch_size, validation_data=(x_test,

y_test))

# Step 6: Evaluate the model

test_loss, test_acc = [Link](x_test, y_test)

print(f"Test Accuracy: {test_acc:.4f}")

```

### Explanation of Code (Line by Line)

#### Step 1: Load the IMDB Dataset

```python

max_features = 10000 # Vocabulary size (top 10,000 words)

maxlen = 500 # Max length of a review (truncate/pad to this size)

batch_size = 32

(x_train, y_train), (x_test, y_test) = imdb.load_data(num_words=max_features)

```

- The IMDB dataset contains 50,000 movie reviews (25,000 for training and 25,000 for
testing).

- Each review is a sequence of integers representing word indices.

- `num_words=max_features` limits the vocabulary to the 10,000 most frequent

words.

- Reviews are labeled as positive (1) or negative (0).

#### Step 2: Preprocess the Data

```python

x_train = sequence.pad_sequences(x_train, maxlen=maxlen)

x_test = sequence.pad_sequences(x_test, maxlen=maxlen)

```

- Reviews vary in length, so they are padded or truncated to a fixed length of 500
words.

- This ensures all input sequences have the same shape, which is required for the
RNN.

#### Step 3: Build the RNN Model

```python

model = Sequential([

Embedding(input_dim=max_features, output_dim=32), # Embedding layer

SimpleRNN(32), # Simple RNN layer with 32 units

Dense(1, activation='sigmoid') # Output layer for binary classification

])

```

- **Embedding Layer**: Converts word indices into dense vectors of size 32, learning
word representations during training.
- **SimpleRNN Layer**: A basic RNN with 32 units that processes the sequence and
captures temporal dependencies between words.

- Dense Layer: A single neuron with a sigmoid activation function outputs a

probability (0 to 1) for binary classification.

#### Step 4: Compile the Model

```python

[Link](loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])

```

- Loss Function: `binary_crossentropy` is suitable for binary classification tasks.

- Optimizer: `adam` adapts the learning rate for efficient training.

- Metrics: `accuracy` measures the model's performance.

#### Step 5: Train the Model

```python

[Link](x_train, y_train, epochs=5, batch_size=batch_size, validation_data=(x_test,

y_test))

```

- Trains the model for 5 epochs with a batch size of 32.

- Uses training data (`x_train`, `y_train`) and validates on test data (`x_test`, `y_test`)
after each epoch.

#### Step 6: Evaluate the Model

```python

test_loss, test_acc = [Link](x_test, y_test)

print(f"Test Accuracy: {test_acc:.4f}")

```

- Evaluates the model on the test dataset and prints the test accuracy, showing
performance on unseen data.

---

## Expected Output

After training for 5 epochs, the output might look like this:
```

Epoch 1/5

782/782 [==============================] - 35s 45ms/step - loss:

0.6500 - accuracy: 0.6000 - val_loss: 0.5500 - val_accuracy: 0.7000

Epoch 2/5

782/782 [==============================] - 32s 41ms/step - loss:

0.4500 - accuracy: 0.8000 - val_loss: 0.4000 - val_accuracy: 0.8200

Epoch 3/5

782/782 [==============================] - 32s 41ms/step - loss:

0.3000 - accuracy: 0.8800 - val_loss: 0.3500 - val_accuracy: 0.8500

Epoch 4/5

782/782 [==============================] - 32s 41ms/step - loss:

0.2000 - accuracy: 0.9200 - val_loss: 0.3200 - val_accuracy: 0.8600

Epoch 5/5

782/782 [==============================] - 32s 41ms/step - loss:

0.1200 - accuracy: 0.9500 - val_loss: 0.3100 - val_accuracy: 0.8700

Test Accuracy: 0.8700

```

The model typically achieves a test accuracy of around 85–87%, meaning it correctly
classifies reviews as positive or negative about 85% of the time.

---

## Conclusion

- Successfully implemented an RNN for IMDB movie review classification.

- Used word embeddings to numerically represent text data, enabling sequence

processing.

- The model effectively learns sentiment patterns, achieving good accuracy on the
test set.

This experiment demonstrates the power of RNNs in handling sequential data like text
for sentiment analysis tasks.

Experiment 2
No ratings yet
Experiment 2
5 pages
IMDB Sentiment Analysis with RNN
No ratings yet
IMDB Sentiment Analysis with RNN
8 pages
Neuralnetworks Research Assignment
No ratings yet
Neuralnetworks Research Assignment
7 pages
Untitled34: Tensorflow TF
No ratings yet
Untitled34: Tensorflow TF
2 pages
Satish Deep Learning Lab MAnual
No ratings yet
Satish Deep Learning Lab MAnual
85 pages
Experiment No 6
No ratings yet
Experiment No 6
3 pages
DL Lab1
No ratings yet
DL Lab1
15 pages
FDL 6
No ratings yet
FDL 6
3 pages
DL 3
No ratings yet
DL 3
6 pages
Wa0000.
No ratings yet
Wa0000.
40 pages
Ex NO 9 DL LAB
No ratings yet
Ex NO 9 DL LAB
3 pages
Assignment No 2
No ratings yet
Assignment No 2
8 pages
Exp 6,7,8
No ratings yet
Exp 6,7,8
17 pages
MNIST MLP Digit Classifier Guide
No ratings yet
MNIST MLP Digit Classifier Guide
43 pages
CCS355
No ratings yet
CCS355
29 pages
DL Lab Manual
No ratings yet
DL Lab Manual
18 pages
Sequence Classification With LSTM Recurrent Neural Networks
No ratings yet
Sequence Classification With LSTM Recurrent Neural Networks
6 pages
LSTM and Neural Network Models in TensorFlow
No ratings yet
LSTM and Neural Network Models in TensorFlow
6 pages
Plotting Loss History in TensorFlow
No ratings yet
Plotting Loss History in TensorFlow
65 pages
RNNs for Sequential Data Modeling
No ratings yet
RNNs for Sequential Data Modeling
33 pages
Deep Learning Lab Assignments - 6-9
No ratings yet
Deep Learning Lab Assignments - 6-9
14 pages
DL - LSTM - 3.ipynb - Colab
No ratings yet
DL - LSTM - 3.ipynb - Colab
3 pages
Keras NLP Encoding and Sentiment Analysis
No ratings yet
Keras NLP Encoding and Sentiment Analysis
8 pages
Deep Learning Lab With Output
No ratings yet
Deep Learning Lab With Output
12 pages
DL Lab Answers Batch 2
No ratings yet
DL Lab Answers Batch 2
27 pages
DL Exp-10,11,12
No ratings yet
DL Exp-10,11,12
6 pages
Hand Writing Using - CNN
No ratings yet
Hand Writing Using - CNN
5 pages
Keras RNN Guide for Beginners
No ratings yet
Keras RNN Guide for Beginners
13 pages
Practical 1
No ratings yet
Practical 1
6 pages
Deep Learning LAB
No ratings yet
Deep Learning LAB
47 pages
Case Study - Sentiment Analysis With RNNs
No ratings yet
Case Study - Sentiment Analysis With RNNs
8 pages
Experiment 3.3
No ratings yet
Experiment 3.3
3 pages
DL Programs
No ratings yet
DL Programs
12 pages
Shaurya DL File
No ratings yet
Shaurya DL File
75 pages
Exercise 12
No ratings yet
Exercise 12
5 pages
Advanced Deep Learning Practical File
No ratings yet
Advanced Deep Learning Practical File
29 pages
Lab 6 ML
No ratings yet
Lab 6 ML
7 pages
Lab 1 Assignment - W2022
No ratings yet
Lab 1 Assignment - W2022
7 pages
ML PPT G3
No ratings yet
ML PPT G3
15 pages
Deep Learning Lab
No ratings yet
Deep Learning Lab
26 pages
Deep DL Manual Nainish
No ratings yet
Deep DL Manual Nainish
8 pages
Neural Networks
No ratings yet
Neural Networks
8 pages
RLDL
No ratings yet
RLDL
27 pages
Experiment 3 (A, B, C) (RNN) (Recuurent) (IMDB) )
No ratings yet
Experiment 3 (A, B, C) (RNN) (Recuurent) (IMDB) )
11 pages
IMDB Sentiment Analysis with LSTM
No ratings yet
IMDB Sentiment Analysis with LSTM
5 pages
Unit 4
No ratings yet
Unit 4
23 pages
Predicting Chlorophyll with Neural Networks
No ratings yet
Predicting Chlorophyll with Neural Networks
17 pages
NLP Lab Assignment - 05
No ratings yet
NLP Lab Assignment - 05
6 pages
ADL Exp File
No ratings yet
ADL Exp File
56 pages
DL 8
No ratings yet
DL 8
4 pages
DL NN Pra 9-10
No ratings yet
DL NN Pra 9-10
6 pages
DL 2
No ratings yet
DL 2
4 pages
Design A Neural Network For Classifying Movie Reviews
No ratings yet
Design A Neural Network For Classifying Movie Reviews
5 pages
DL Record Merged
No ratings yet
DL Record Merged
113 pages
Neural Network Implementation Guide
No ratings yet
Neural Network Implementation Guide
12 pages
Exp 2
No ratings yet
Exp 2
4 pages
DLTF Lab Manual.1
No ratings yet
DLTF Lab Manual.1
29 pages
Deep Learning Lab With Tensorflow
No ratings yet
Deep Learning Lab With Tensorflow
84 pages
Computer Vision Lab Guide
No ratings yet
Computer Vision Lab Guide
120 pages
C&NS Unit - 2
No ratings yet
C&NS Unit - 2
46 pages
C&ns. Unit 1
No ratings yet
C&ns. Unit 1
28 pages
Experiment 5
No ratings yet
Experiment 5
7 pages
Experiment 1
No ratings yet
Experiment 1
2 pages
Abb E4.2h 4000 4000a 3p Acb Breaker
No ratings yet
Abb E4.2h 4000 4000a 3p Acb Breaker
3 pages
Bartók's Harmonic Innovations
100% (1)
Bartók's Harmonic Innovations
49 pages
Engineering Students' Project Report
No ratings yet
Engineering Students' Project Report
32 pages
Psychometric Chart Calculations
No ratings yet
Psychometric Chart Calculations
7 pages
Xi B Maths Magazine
No ratings yet
Xi B Maths Magazine
2 pages
IGNOU - B.Sc. - MTE01: Calculus
80% (5)
IGNOU - B.Sc. - MTE01: Calculus
370 pages
Lesson Plans for Grade 1-3 Math Concepts
No ratings yet
Lesson Plans for Grade 1-3 Math Concepts
9 pages
MM 1000 (2016) Introduction To Micromine (2016-08)
100% (1)
MM 1000 (2016) Introduction To Micromine (2016-08)
320 pages
Untitled
100% (1)
Untitled
106 pages
GCSE Mathematics Higher Tier Paper 3
No ratings yet
GCSE Mathematics Higher Tier Paper 3
26 pages
Unit 10 - Mathematical Functions and Notations
No ratings yet
Unit 10 - Mathematical Functions and Notations
23 pages
Work Application Plan
No ratings yet
Work Application Plan
2 pages
Hill Cipher
No ratings yet
Hill Cipher
4 pages
Dynamic Analysis of Offshore Jacket Sub Structure Using FEM
No ratings yet
Dynamic Analysis of Offshore Jacket Sub Structure Using FEM
5 pages
Olympiade
No ratings yet
Olympiade
5 pages
Introduction To Python Programming (BPLCK105B/205B) Lab Manual
No ratings yet
Introduction To Python Programming (BPLCK105B/205B) Lab Manual
20 pages
MATHEMATICS Class 7 CBSE Integers and Fractions
No ratings yet
MATHEMATICS Class 7 CBSE Integers and Fractions
2 pages
Laplace Transform in Cryptography
No ratings yet
Laplace Transform in Cryptography
19 pages
COT 1 Illustrating Linear Inequalities v2
No ratings yet
COT 1 Illustrating Linear Inequalities v2
3 pages
RC Chapter03 2016
No ratings yet
RC Chapter03 2016
36 pages
R Prograaming Journal
No ratings yet
R Prograaming Journal
16 pages
AI - SMPS-Unit 6 - Week 2.3
No ratings yet
AI - SMPS-Unit 6 - Week 2.3
5 pages
Practice Problems For Curve Fitting - Solution
No ratings yet
Practice Problems For Curve Fitting - Solution
8 pages
Weight Estimation of Vessels Apps
60% (5)
Weight Estimation of Vessels Apps
28 pages
Quiz Solutions
95% (20)
Quiz Solutions
11 pages
p3 Demand and Low Demand
No ratings yet
p3 Demand and Low Demand
12 pages
UNIT 5: Solids Liquids and Gases: Igcse May 2020 (Physics)
No ratings yet
UNIT 5: Solids Liquids and Gases: Igcse May 2020 (Physics)
16 pages
Year 1 Math Workbook Overview
50% (2)
Year 1 Math Workbook Overview
106 pages
Fusing Hyperspectral and Multispectral Images Via Coupled Sparse Tensor Factorization-Dian Renwei-TIP2018
No ratings yet
Fusing Hyperspectral and Multispectral Images Via Coupled Sparse Tensor Factorization-Dian Renwei-TIP2018
12 pages
Vector Assignment
No ratings yet
Vector Assignment
2 pages

Experiment 10

Uploaded by

Experiment 10

Uploaded by

# Experiment 10: Implement an RNN for IMDB Movie Review Classification

Recurrent Neural Network (RNN) for IMDB Movie Review Classification

- Understand the use of RNN for text classification.

- Train an RNN model using TensorFlow/Keras for sentiment analysis.

- Evaluate the model's performance using accuracy metrics.

## Program with Line-by-Line Explanation

# Import required libraries

from tensorflow import keras

from [Link] import sequence

from [Link] import Sequential

from [Link] import Embedding, SimpleRNN, Dense

from [Link] import imdb

# Step 1: Load the IMDB dataset

max_features = 10000 # Vocabulary size (top 10,000 words)

# Load dataset with only top `max_features` words

(x_train, y_train), (x_test, y_test) = imdb.load_data(num_words=max_features)

# Step 2: Preprocess the data (pad sequences to ensure equal length)

x_train = sequence.pad_sequences(x_train, maxlen=maxlen)

x_test = sequence.pad_sequences(x_test, maxlen=maxlen)

# Step 3: Build the RNN model

Embedding(input_dim=max_features, output_dim=32), # Embedding layer

SimpleRNN(32), # Simple RNN layer with 32 units

Dense(1, activation='sigmoid') # Output layer for binary classification

# Step 4: Compile the model

[Link](loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])

# Step 5: Train the model

[Link](x_train, y_train, epochs=5, batch_size=batch_size, validation_data=(x_test,

# Step 6: Evaluate the model

test_loss, test_acc = [Link](x_test, y_test)

print(f"Test Accuracy: {test_acc:.4f}")

### Explanation of Code (Line by Line)

#### Step 1: Load the IMDB Dataset

max_features = 10000 # Vocabulary size (top 10,000 words)

maxlen = 500 # Max length of a review (truncate/pad to this size)

(x_train, y_train), (x_test, y_test) = imdb.load_data(num_words=max_features)

- Each review is a sequence of integers representing word indices.

- `num_words=max_features` limits the vocabulary to the 10,000 most frequent

- Reviews are labeled as positive (1) or negative (0).

#### Step 2: Preprocess the Data

x_train = sequence.pad_sequences(x_train, maxlen=maxlen)

x_test = sequence.pad_sequences(x_test, maxlen=maxlen)

#### Step 3: Build the RNN Model

Embedding(input_dim=max_features, output_dim=32), # Embedding layer

SimpleRNN(32), # Simple RNN layer with 32 units

Dense(1, activation='sigmoid') # Output layer for binary classification

- **Dense Layer**: A single neuron with a sigmoid activation function outputs a

#### Step 4: Compile the Model

[Link](loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])

- **Loss Function**: `binary_crossentropy` is suitable for binary classification tasks.

- **Optimizer**: `adam` adapts the learning rate for efficient training.

- **Metrics**: `accuracy` measures the model's performance.

#### Step 5: Train the Model

[Link](x_train, y_train, epochs=5, batch_size=batch_size, validation_data=(x_test,

- Trains the model for 5 epochs with a batch size of 32.

#### Step 6: Evaluate the Model

test_loss, test_acc = [Link](x_test, y_test)

print(f"Test Accuracy: {test_acc:.4f}")

782/782 [==============================] - 35s 45ms/step - loss:

782/782 [==============================] - 32s 41ms/step - loss:

782/782 [==============================] - 32s 41ms/step - loss:

782/782 [==============================] - 32s 41ms/step - loss:

782/782 [==============================] - 32s 41ms/step - loss:

Test Accuracy: 0.8700

- Successfully implemented an RNN for IMDB movie review classification.

- Used word embeddings to numerically represent text data, enabling sequence

You might also like

- Dense Layer: A single neuron with a sigmoid activation function outputs a

- Loss Function: `binary_crossentropy` is suitable for binary classification tasks.

- Optimizer: `adam` adapts the learning rate for efficient training.

- Metrics: `accuracy` measures the model's performance.