firstReport
firstReport
Engineering
Progress Report-I
Synopsis
On
Convolutional Neural Networks (CNNs) are pivotal in modern computer vision, offering
unparalleled performance in image classification tasks. This research paper aim to review,
implement, and evaluate a CNN for classifying images from the CIFAR-10 dataset. The
research emphasizes CNN architecture, training methodologies, advantages, and
challenges, while exploring its potential for improvement through advanced techniques
such as deeper networks and data augmentation. This paper provides a comprehensive
overview of CNN architecture, layers, and training processes, followed by an end-to-end
application of CNN for image classification. We highlight its efficiency, advantages, and
limitations, and conclude with a discussion of future research directions.
Introduction
Introduced in the late 1980s by Yann LeCun, CNNs have evolved with advancements in
hardware and deep learning. CNN models like LeNet, AlexNet, VGG, and ResNet have
set new benchmarks in tasks such as object detection, segmentation, and recognition.
Convolutional Neural Networks (CNNs) are a type of deep learning model designed to
analyze structured data, such as images. Inspired by how the human visual system works,
CNNs are especially good at recognizing patterns and extracting important features from
data. They consist of several layers, each serving a different purpose:
1. Convolutional Layers: These layers apply filters to the input data to find specific
features like edges or textures. The filters slide over the data, creating a feature map
that highlights the presence of these features.
2. Pooling Layers: These layers reduce the size of the feature maps, making the model
faster and less likely to overfit. Pooling operations, like max pooling or average
pooling, help summarize the features in a region.
3. Fully Connected Layers: In these layers, every neuron is connected to every neuron
in the previous layer. This allows the model to combine the features extracted by
earlier layers to make predictions.
CNNs are powerful because they can automatically learn and adapt to the features of the
data, making them especially useful for tasks like image classification, object detection,
and facial recognition. They are widely used in many areas due to their strength in
handling complex data with little need for manual preprocessing.
Objectives/Aim of the project
Software Requirements:
• Programming Language: Python
• Libraries: TensorFlow, Keras, NumPy, Matplotlib, and Pandas.
• Tools: Jupyter Notebook or an Integrated Development Environment (IDE) like
PyCharm.
• Dataset: CIFAR-10 from TensorFlow's datasets module.
Role of team members
1. Team Lead & Performance Analyst (Charu Bansal): Oversees the project workflow, ensures
deadlines are met, and manages resources & evaluates the model's accuracy and suggests
optimization techniques.
2. Research Analyst (Girraj Jha): Investigates CNN architecture, methodologies, and best practices.
3. Data Engineer & Documentation Specialist (Jatin Garg): Prepares and preprocesses the CIFAR-
10 dataset & prepares the project report, collating findings and outcomes.
4. Model Architect (Divykrati Niranjan): Designs and implements the CNN structure using Python
and TensorFlow.
References
1. LeCun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). Gradient-Based Learning Applied to
Document Recognition. Proceedings of the IEEE, 86(11), 2278-2324.
2. Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). ImageNet Classification with Deep
Convolutional Neural Networks. Advances in Neural Information Processing Systems.
3. He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep Residual Learning for Image Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
4. R. Chauhan, K. K. Ghanshala and R. C. Joshi, "Convolutional Neural Network (CNN)
for Image Detection and Recognition," 2018 First International Conference on Secure
Cyber Computing and Communication (ICSCCC), Jalandhar, India, 2018
5. The CIFAR-10 dataset, [online] Available: https://www.cs.toronto.edu/∼kriz/cifar.html.