Sip Final Review
Sip Final Review
By
Koushik Ghosh
Reg No:- 23BMMCA031
CMR UNIVERSITY
SCHOOL OF SCIENCE & COMPUTER STUDIES (SSCS)
Lakeside Campus, Bagalur, Bengaluru, Karnataka, 562149 February
2025
1|Page
INTERNSHIP CERTIFICATE
2|Page
CMR UNIVERSITY
SCHOOL OF SCIENCE AND COMPUTER STUDIES,
BANGALORE, KARNATAKA, 562149
CERTIFICATE
This is to certify that the “Summer Internship Program Project Report” entitled “Bitcoin
Prediction” submitted by Koushik Ghosh (23BMMCA031) is the work done by him at
PHOENIX SOLUTIONS in partial fulfilment of the requirements for the award of the Degree of
MASTER OF COMPUTER APPLICATIONS (MCA) during the academic year 2024-2025.
3|Page
DECLARATION
This Internship project report entitled as “Summer Internship Program Project Report”
entitled “Bitcoin Prediction” is submitted to the SSCS, CMR UNIVERSITY, BANGALORE
in partial fulfilment of the requirements for the Degree of Master of Computer Applications
(MCA). I hereby declare that this internship report is based on the work done by me during the
internship period 10/12/2024 – 10/02/2025. This internship report, neither in whole, nor in part,
has been previously submitted for any degree.
Signature: …………………….
4|Page
ACKNOWLEDGEMENTS
First, I would like to thank Garvit Garg founder and CEO of PHOENIX SOLUTIONS for
giving me the opportunity to do an internship within the organization.
I would also like to thank all the people who worked along with me at PHOENIX SOLUTIONS
with their patience and openness. They created an enjoyable working environment. It is indeed
with a great sense of pleasure and immense sense of gratitude that I acknowledge the help of these
individuals.
I’m greatly indebted to Dr. T. A. Ashok Kumar, Director and Dr. Pughazendi N,HOD, School
of Science and Computer Studies (Lakeside Campus), CMR University,Bangalore, for their
encouragement and suggestion at every step of my Internship.
I would like to thank Prof Syeeda Mujeebunnisa, Department Internship Coordinator and
Prof. Sudharsan S Department Internship Mentor for their support and advice to get and
complete internship in the above said organization.
5|Page
TABLE OF CONTENTS
SR. No CONTENTS PAGE NO
Chapter 1 Theoretical Background and
7
Methodology 1.1: Significance of
Internship Study
7
1.2 : Objective of the study
7
1.3 : Scope of the Study
12
Chapter 4 Consolidated Reports
14-19
Chapter 6 Projects
20
Conclusion
References 21
LIST OF TABLES
SL. NO TABLE DESCRIPTION PAGE NO
LIST OF FIGURES
SL. NO FIGURE DESCRIPTION PAGE NO
1 Company Logo 9
6|Page
Theoretical Background and Methodology
1.1 Significance of Internship: Study Internships play a crucial role in bridging the gap between
academic learning and real-world applications. This internship in Machine Learning using Python
provided hands-on experience with fundamental ML concepts, algorithms, and practical
implementations. It was my first exposure to an industrial setting, which helped me understand the
significance of data-driven decision-making. By working on real-world datasets and projects, I gained
a deeper appreciation for how machine learning can be applied in various domains such as finance,
healthcare, and e-commerce. The knowledge and skills acquired during this internship will serve as a
strong foundation for my future endeavours in the field of artificial intelligence and data science.
7|Page
1.4 Methodology of the Study
The study methodology involved a combination of theoretical learning and hands-on practical
exercises. The key steps included:
• Literature Review: Understanding the core concepts of machine learning through provided study
materials, research papers, and online resources.
• Hands-on Practice: Implementing ML models using Python-based environments such as Jupyter
Notebook and Google Colab.
• Data Analysis: Working with real-world datasets to apply preprocessing and feature engineering
techniques.
• Model Building: Developing and training machine learning models using various algorithms.
• Evaluation and Optimization: Assessing model performance and fine-tuning hyperparameters
for better accuracy.
• Project Implementation: Working on a mini-project (e.g., Bitcoin prediction) to apply the
acquired knowledge in a practical setting.
While the internship was highly beneficial, certain limitations were encountered:
• Limited Timeframe: Being a two-month internship, there was limited time to explore advanced
topics such as deep learning and reinforcement learning in depth.
• Basic-Level Exposure: The internship focused on foundational ML concepts, leaving advanced
techniques like neural networks and natural language processing for future study.
• Resource Constraints: Some real-world datasets required extensive computational resources that
were beyond the scope of the internship environment.
• Industry-Specific Applications: The internship provided generic ML knowledge, but industry-
specific applications and domain expertise were not covered in depth.
8|Page
Company Overview
EduPhoenix Solutions is a dynamic and forward-thinking company that specializes in providing
innovative educational technology solutions to institutions worldwide. Established with a vision to
revolutionize the way education is delivered and experienced, EduPhoenix Solutions has quickly emerged
as a leader in the field, offering a wide range of cutting-edge products and services tailored to meet the
diverse needs of educators and learners alike.
At EduPhoenix Solutions, we believe that technology has the power to transform education by making
learning more accessible, engaging, and effective. With this belief at the core of our mission, we strive to
develop solutions that empower educators to deliver high-quality instruction and enable learners to reach
their full potential.
Our comprehensive suite of products and services encompasses everything from learning management
systems and virtual classrooms to interactive content authoring tools and professional development
programs. Whether it's providing educators with the tools they need to create immersive learning
experiences or offering learners access to dynamic and interactive learning environments, EduPhoenix
Solutions is committed to driving positive change in education through technology.
One of the key strengths of EduPhoenix Solutions lies in our team of experts, who bring a wealth of
experience and expertise from diverse fields such as education, technology, instructional design, and user
experience. By combining our knowledge and skills, we are able to develop innovative solutions that
address the evolving needs of the education sector and deliver measurable results.
In addition to our product offerings, EduPhoenix Solutions also provides consultancy services to
educational institutions seeking to leverage technology effectively. From needs assessment and solution
design to implementation and ongoing support, our team works closely with clients to develop customized
strategies and solutions that align with their unique goals and objectives.
As a company, we are deeply committed to excellence, integrity, and customer satisfaction. We take pride
in our ability to deliver high-quality products and services that meet the highest standards of quality and
reliability. With a customer-centric approach and a focus on continuous improvement, we strive to exceed
our clients' expectations and build long-lasting partnerships based on trust and mutual respect.
9|Page
Company Departments
Human Resources (HR) Department: This involves identifying staffing needs, sourcing candidates
through job postings and recruitment agencies, conducting interviews, and selecting the best-fit
candidates.
Information Technology (IT) and Technical Support Department: This includes managing servers,
networks, hardware, and software systems to ensure seamless operation, security, and scalability.
Research and Development (R&D) Department: R&D conducts market research to identify industry
trends, customer preferences, and competitive landscapes to inform product development strategies.
Sales and Marketing Department: Sales and marketing professionals identify and segment target
markets based on demographics, behaviors, and preferences to tailor marketing messages and campaigns.
Customer Success Department: This involves guiding new customers through the implementation
process, providing training, support, and resources to ensure successful adoption and utilization of
EduPhoenix Solutions.
Finance and Accounting Department: This department is responsible for budgeting, forecasting,
financial modeling, and providing insights to support strategic decision-making.
Legal and Compliance Department: Legal professionals draft, review, and negotiate contracts with
clients, partners, and vendors to protect EduPhoenix Solutions' interests and mitigate legal risks.
Customer Experience Department: This team focuses on designing intuitive, user-friendly interfaces
and interactions for digital products and platforms to enhance customer satisfaction and engagement.
Quality Assurance and Testing Department: Quality Assurance (QA) Testing is a critical function that
ensures products and services meet specified quality standards before they are released to customers. This
involves systematically testing various aspects of software, hardware, or other deliverables to identify
defects or areas for improvement.
Grievance and Conflict Resolution Department: The Grievance and Conflict Resolution department is
responsible for managing disputes, grievances, and conflicts within the organization. This department
serves as a neutral mediator resolve issues in a fair and equitable manner.
10 | P a g e
Role and Responsibilities
During my two-month internship, I was responsible for various tasks related to machine learning and data
analysis, including:
• Understanding Machine Learning Concepts: Engaging in training sessions and self-learning to
grasp the basics of supervised and unsupervised learning.
• Data Preprocessing and Cleaning: Handling missing data, performing feature engineering, and
normalizing datasets for machine learning models.
• Implementing Machine Learning Algorithms: Developing models using Python libraries such
as scikit-learn and applying regression, classification, and clustering techniques.
• Working with Real-World Datasets: Conducting exploratory data analysis (EDA) to extract
insights and prepare data for model training.
• Model Evaluation and Optimization: Using metrics like mean squared error (MSE), accuracy,
and precision-recall to assess model performance and improve results.
• Collaborating on a Mini-Project: Contributing to a project (Bitcoin prediction) where I applied
machine learning techniques to analyse financial trends.
• Documentation and Reporting: Preparing reports on findings, model performance, and learning
outcomes to document the progress of the internship.
• Feature Engineering: Extracting and selecting important features to improve model performance
and predictive power.
• Hyperparameter Tuning: Experimenting with different parameter settings to enhance the
accuracy and efficiency of machine learning models.
• Visualization and Interpretation: Creating visual representations of data trends and model
performance using Matplotlib and Seaborn.
• Code Optimization: Writing efficient Python code to ensure smooth execution of machine
learning tasks and reduce computation time.
• Collaboration and Teamwork: Engaging with peers and mentors to discuss project
improvements and troubleshoot technical challenges.
11 | P a g e
Consolidated Report
12 | P a g e
Skills and Experiences
13 | P a g e
Bitcoin Prediction Project
Source Code:
Bitcoin.py
import numpy as np
import pandas as pd
14 | P a g e
import matplotlib.pyplot as plt
import seaborn as sb
import warnings
warnings.filterwarnings('ignore')
df = pd.read_csv('BTC-2021min.csv')
print(df.head())
print(df.shape)
print(df.describe())
print(df.info())
plt.figure(figsize=(15, 5))
plt.plot(df['close'])
plt.title('Bitcoin close price.', fontsize=15)
plt.ylabel('Price in dollars.')
plt.show()
plt.subplots(figsize=(20,10))
for i, col in enumerate(features):
plt.subplot(2,2,i+1)
sb.distplot(df[col])
plt.show()
plt.subplots(figsize=(20,10))
for i, col in enumerate(features):
range_min = 0.0
range_max = 10000.0
filtered_data = [x for x in df[col] if range_min <= x <= range_max]
plt.subplot(2,2,i+1)
sb.distplot(filtered_data)
plt.show()
plt.subplots(figsize=(20,10))
for i, col in enumerate(features):
plt.subplot(2,2,i+1)
15 | P a g e
ax = sb.boxplot(df[col])
q1, median, q3 = df[col].quantile([0.25, 0.5, 0.75])
label_text=f"25% : {q1:.2f} median : {median:.2f} 75% : {q3:.2f}"
plt.text(20000, -0.35, label_text, fontsize=12)
IQR = q3 - q1
k = 1.5 # Adjust this value if needed
lower_fence = q1 - k * IQR
upper_fence = q3 + k * IQR
label_fence=f"Fence line : {upper_fence:.2f}"
plt.text(20000, -0.25, label_fence, fontsize=12)
plt.show()
df['year'] = splitted[0].astype('int')
df['month'] = splitted[1].astype('int')
df['day'] = splitted[2].astype('int')
print(df.head())
data_grouped = df.groupby('year').mean()
plt.subplots(figsize=(20,10))
for i, col in enumerate(['open', 'high', 'low', 'close']):
plt.subplot(2,2,i+1)
data_grouped[col].plot.bar()
dataLabel = f"{col}"
plt.text(3, 39000, dataLabel, fontsize=12)
plt.show()
df['is_quarter_end'] = np.where(df['month']%3==0,1,0)
print(df.head())
df['open-close'] = df['open'] - df['close']
df['low-high'] = df['low'] - df['high']
df['target'] = np.where(df['close'].shift(-1) > df['close'], 0, 1)
plt.pie(df['target'].value_counts().values,
labels=["Goes down", "Goes up"], autopct='%1.1f%%')
plt.show()
plt.figure(figsize=(10, 10))
scaler = StandardScaler()
features = scaler.fit_transform(features)
16 | P a g e
print(X_train.shape, X_valid.shape)
for i in range(3):
models[i].fit(X_train, Y_train)
print(f'{models[i]} : ')
print('Training Accuracy : ', metrics.roc_auc_score(Y_train, models[i].predict_proba(X_train)[:,1]))
print('Validation Accuracy : ', metrics.roc_auc_score(Y_valid, models[i].predict_proba(X_valid)[:,1]))
print()
print('\n\n0 : Goes up')
print('1 : Goes down')
metrics.plot_confusion_matrix(models[0], X_valid, Y_valid)
plt.show()
BTC-2021.CSV
17 | P a g e
OUTPUT
Fig.- 2
Fig.- 3
Fig.- 4
18 | P a g e
Fig.- 5
Fig.- 6
19 | P a g e
CONCLUSION
The machine learning internship at EduPhoenix Solutions provided me with a comprehensive and
enriching learning experience. Through hands-on projects, mentorship, and exposure to real-world
applications, I gained a deep understanding of the various aspects of machine learning and its potential to
drive innovation and business transformation.
During the internship, I had the opportunity to work on a diverse range of machine learning tasks,
including supervised learning, unsupervised learning, and deep learning. I applied these techniques to
solve complex problems in domains such as predictive analytics, computer vision, and natural language
processing. The experience of working with cross-functional teams, iterating on solutions, and presenting
my findings to stakeholders was invaluable.
Moreover, the internship allowed me to explore the different career paths within the machine learning
field, from machine learning engineering and data science to AI research and product management. This
exposure broadened my understanding of the industry and helped me identify the areas that align best
with my interests and strengths.
Beyond the technical skills, the internship also emphasized the importance of critical thinking, problem-
solving, and effective communication – all essential for success in the rapidly evolving field of machine
learning. The mentorship and guidance provided by the experienced professionals at EduPhoenix
Solutions were instrumental in shaping my professional development and preparing me for the challenges
that lie ahead.
As I reflect on my time at EduPhoenix Solutions, I am grateful for the opportunity to have been a part of
this dynamic and innovative environment. The knowledge, skills, and practical experience gained during
the internship have not only strengthened my technical expertise but also inspired me to continue
exploring the boundless possibilities of machine learning. I am excited to apply these learnings and
embark on the next chapter of my career, driven by the passion to make a meaningful impact through the
power of machine learning.
20 | P a g e
REFERENCES
Alpaydin, E. (2020). Introduction to Machine Learning (4th ed.). The MIT Press.
Cutler, A., Cutler, D. R., & Stevens, J. R. (2012). Random Forests. In C. Zhang & Y. Ma (Eds.), Ensemble Machine Learning (pp. 157–
175). Springer.
Domingos, P. (2015). The Master Algorithm: How the Quest for the Ultimate Learning Machine Will Remake Our World. Basic Books.
Géron, A. (2019). Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow (2nd ed.). O'Reilly Media.
Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep Learning. The MIT Press.
Hastie, T., Tibshirani, R., & Friedman, J. (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction (2nd
ed.). Springer.
Hyndman, R. J., & Athanasopoulos, G. (2018). Forecasting: Principles and Practice (3rd ed.). OTexts.
Kingma, D. P., & Ba, J. (2015). Adam: A Method for Stochastic Optimization. In ICLR.
LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep Learning. Nature, 521(7553), 436–444.
21 | P a g e