Final Proj
Final Proj
MACHINE LEARNING
A Mini Project thesis submitted to the JAWAHARLAL NEHRU
TECHNOLOGICAL UNIVERSITY HYDERABAD in Partial Fulfillment of the
requirement for the award of the degree of
BACHELOR OF TECHNOLOGY
In
COMPUTER SCIENCE AND ENGINEERING(AI&ML)
Submitted By
SHAHANAZ BEGUM : (22D31A6644)
MD.ABDUL RAHMAN : (22D31A6633)
J.SAI LAXMAN : (22D31A6623)
K.SAKETH : (22D31A6628)
i
Date: / /2025
CERTIFICATE
This is to certify that the thesis “CAREER PREDICTION WEBSITE
USING MACHINE LEARNING” being
submitted by
SHAHANAZ BEGUM : (22D31A6644)
MD.ABDUL RAHMAN : (22D31A6633)
J.SAI LAXMAN : (22D31A6623)
K.SAKETH : (22D31A6628)
In partial fulfilment for the award of “BACHELOR OF TECHNOLOGY” in
the Department of “COMPUTER SCIENCE & ENGINEERING”(B.Tech) to the
“JAWAHARLAL NEHRU TECHNOLOGICAL UNIVERSITY HYDERABAD” is
a record of bonafide Mini Project Work carried out by them under our guidance and
supervision.
The results embodied in this thesis have not been submitted to any other
University or Institute for the award of any degree or diploma.
EXTERNAL EXAMINER
ii
ACKNOWLEDGEMENT
We would like to say thanks to all of my friends for their timely help
and encouragement.
SHAHANAZ BEGUM : (22D31A6644)
MD.ABDUL RAHMAN : (22D31A6633)
J.SAI LAXMAN : (22D31A6623)
K.SAKETH : (22D31A6628)
iii
DECLARATION
We hereby declare that the project entitled “CAREER PREDICTION
BY:
K.SAKETH : (22D31A6628)
iv
ABSTRACT
v
CONTENTS
1 INTRODUCTION 1
vi
4 IMPLEMENTATION 42
4.1 Modules 42
4.1.1 User 42
4.1.2 Admin 42
4.1.3 Machine Learning 42
4.2 Input and Output Design 43
4.2.1 Input Design 43
4.2.2 Output Design 44
5 RESULTS 46
5.1 Home Page 46
5.2 User Register Form 46
5.3 Admin Login Page 47
5.4 Admin Home Page 47
5.5 User View Page 48
5.6 User Login Page 48
5.7 User Home Page: 49
5.8 Dataset View: 49
5.12 ML Score 51
6 SYSTEM TESTING 53
6.1 Unit Testing 53
6.2 Integration Testing 53
6.3 Functional Testing 54
6.4 System Testing 54
6.5 White Box Testing 55
6.6 Black Box Testing 55
vii
6.7 Unit Testing 55
6.8 Integration Testing 56
6.9 Acceptance Testing 56
7 CONCLUSION 57
7.1 Conclusion 58
7.2 Future Enhancement 58
8 BIBLIOGRAPHY 59
9 APPENDICES 62
viii
LIST OF FIGURES
FIGURE NO. FIGURE NAME PAGE NO.
Fig 2.1 Django’s Architecture 23
Fig 2.2 Model View Template Architecture 24
Fig 3.1 Use case Diagram 36
Fig 3.2 Class Diagram 37
Fig 3.3 Sequence Diagram 38
Fig 3.4 Activity Diagram 39
Fig 5.1 Home page 44
Fig 5.2 Registration form 44
Fig 5.3 Admin Login page 45
Fig 5.4 Admin home page 45
Fig 5.5 User View page 46
Fig 5.6 User login page 46
Fig 5.7 User home page 47
Fig 5.8 Dataset view page 47
Fig 5.9 K-Nearest Neighbours 48
Fig 5.10 Confusion Matrix 48
Fig 5.11 Entropy Confusion Matrix 49
Fig 5.12 ML Score 49
Fig 5.13 Prediction page 50
ix
CHAPTER 1
INTRODUCTION
While choosing a career path, it is important not only to choose a course, but also
what you admire to become after graduation and your interests. Career advice plays a role
by providing assistance in acknowledging yourself and your skills and abilities. Machine
learning is used in various fields and industries such as clinical analysis, image
processing, classification, and regression. It has the ability to develop and explore
automation without being explicit. Machine learning can be applied in three ways:
unsupervised machine learning, supervised machine learning, and reinforced machine
learning algorithms. Simply put, machine learning is the science of learning and acting
like humans. Analyzing the student's abilities is very important and should guide the
student on the right path. This concerns career choice and related training, then a job and
then whether to stay or change another job, additional formal and informal training
courses, etc. Many people encounter difficulties when making such decisions, often
hindering them or forcing them to choose suboptimal options. The IT revolution[1] is
influencing individuals' career choices in two ways. First, there is increasing demand for
employees like engineers, mathematicians, scientists and technical experts [1], but several
jobs could be lost to robotization [1].. Second, communication and information
technology has facilitated access to a variety of judgment and analysis, both during
individual career counseling and while using various self-help websites. Therefore, career
advisors must constantly expand their knowledge and skills to obtain new sources of
information about available career assessments and environments that can help them find
the best assessments.
1
1.1 LITERATURE SURVEY
2
the course, self-exploration, generational financial status and parental and
constraints of peer group are major governing factors in the choice of students.
According to the research by[18], individuals were most affected by prior course
knowledge, connections to professionals in the field, and work-related course
experiences. The results of the Senior School Certificate Examination (SSCE), the
Joint Admission Matriculation Board's (JAMB's) benchmarks for Unified Tertiary
Matriculation Examination (UTME), and Post-UTME results, which are typically
conducted by selected Nigerian universities, is one of the most recent factors in the
country of Nigeria [19]. [20] noted that many candidates were dumped from the
courses that they don't have interest in by the latter due to their incompetence in
meeting the cut-off level of the selected Universities. [21] believed that hiring a
teacher who is knowledgeable about the subject matter is necessary to draw students
to a path of study. Apart from this, the pros and cons of the course also define and
further alter their perception[22], [23]. This suggests that the morals of the profession
can affect their notions of a profession[24]. In the time of network and media
everyone [25] believes that media visibility can highly influence secondary school
pupils' profession choices. The studied literature indicated determinants of career
options among senior school students but did not understand the usage of computer
advancements in collecting information of the field for the students. Insights from
[26], [27], [28] indicate that expert systems advise high school students by utilizing
the characteristics of a student to recommend related courses. These studies did not
pay particular attention to less popular courses, like the construction profession, by
giving data about the profession, university admission requirements, and faculty in
such professions for further career planning in the construction profession and
outlook of information regarding career opportunities in construction
3
CHAPTER 2
SYSTEM ANALYSIS
2.1 EXISTING SYSTEM
2.1.1 Threshold-based monitoring
• Utilizes pre-defined thresholds on device attributes to detect anomalies.
Advantage: Simple to implement.
Disadvantage: Requires extensive domain expertise, low generalizability, and high
false alarms.
2.1.2 Traditional statistical models
• Applies methods like linear regression on device metrics to predict failure.
Advantage: Interpretable, low complexity.
Disadvantage: Often oversimplified, incapable of capturing complex patterns.
2.2PROPOSED SYSTEM
The proposed career prediction system integrates machine learning models,
including Random Forest, SVM, and Neural Networks.
2.2.1 ADVANTAGES AND DISADVANTAGES
Advantages:
• Adaptive, adept at handling complex relationships in data.
• Automatically detects novel failure patterns.
• Scales effectively for large datasets.
• Offers insights into feature importance.
Disadvantages:
• Requires substantial training data.
• Lacks interpretability.
• Tedious hyperparameter tuning.
• Risk of overfitting.
4
2.3 FEASIBILITY STUDY
The feasibility of the project is analyzed in this phase and business proposal
is put forth with a very general plan for the project and some cost estimates. During
system analysis the feasibility study of the proposed system is to be carried out. This
is to ensure that the proposed system is not a burden to the company. For feasibility
analysis, some understanding of the major requirements for the system is essential.
Three key considerations involved in the feasibility analysis are,
• ECONOMICAL FEASIBILITY
• TECHNICAL FEASIBILITY
• SOCIAL FEASIBILITY
2.3.1 ECONOMICAL FEASIBILITY
This study is carried out to check the economic impact that the system will have
on the organization. The amount of fund that the company can pour into the research
and development of the system is limited. The expenditures must be justified. Thus
the developed system as well within the budget and this was achieved because most
of the technologies used are freely available. Only the customized products had to be
purchased.
5
confidence must be raised so that he is also able to make some constructive criticism,
which is welcomed, as he is the final user of the system.
6
2.6 SYSTEM SPECIFICATION:
2.7.1 PYTHON
Python is a general-purpose interpreted, interactive, object-oriented, and
high-level programming language. An interpreted language, Python has a design
philosophy that emphasizes code readability (notably using whitespace indentation to
delimit code blocks rather than curly brackets or keywords), and a syntax that allows
programmers to express concepts in fewer lines of code than might be used in languages
such as C++or Java. It provides constructs that enable clear programming on both small
and large scales. Python interpreters are available for many operating systems. CPython,
the reference implementation of Python, is open source software and has a community-
based development model, as do nearly all of its variant implementations. CPython is
managed by the non-profit Python Software Foundation. Python features a dynamic type
system and automatic memory management. It supports multiple programming
paradigms, including object-oriented, imperative, functional and procedural, and has a
large and comprehensive standard library.
7
Interactive Mode Programming
Invoking the interpreter without passing a script file as a parameter brings up the
following prompt −
$ python
>>>
Type the following text at the Python prompt and press the Enter −
If you are running new version of Python, then you would need to use print statement
with parenthesis as in print ("Hello, Python!");. However in Python version 2.4.3, this
produces the following result −
Hello, Python!
Invoking the interpreter with a script parameter begins execution of the script
and continues until the script is finished. When the script is finished, the interpreter is no
longer active.
Let us write a simple Python program in a script. Python files have extension .py. Type
the following source code in a test.py file −
Live Demo
We assume that you have Python interpreter set in PATH variable. Now, try to run this
program as follows −
8
$ python test.py
Hello, Python!
Let us try another way to execute a Python script. Here is the modified test.py file −
Live Demo
#!/usr/bin/python
We assume that you have Python interpreter available in /usr/bin directory. Now, try to
run this program as follows −
$./test.py
Hello, Python!
Python Identifiers
Python does not allow punctuation characters such as @, $, and % within identifiers.
Python is a case sensitive programming language. Thus, Manpower and manpower are
two different identifiers in Python.
Class names start with an uppercase letter. All other identifiers start with a lowercase
letter.
9
Starting an identifier with a single leading underscore indicates that the identifier is
private.
Starting an identifier with two leading underscores indicates a strongly private identifier.
If the identifier also ends with two trailing underscores, the identifier is a language-
defined special name.
Reserved Words
The following list shows the Python keywords. These are reserved words and you cannot
use them as constant or variable or any other identifier names. All the Python keywords
contain lowercase letters only.
assert finally or
def if return
elif in while
else is with
Python provides no braces to indicate blocks of code for class and function
definitions or flow control. Blocks of code are denoted by line indentation, which is
rigidly enforced.
10
The number of spaces in the indentation is variable, but all statements within the block
must be indented the same amount. For example −
if True:
print "True"
else:
print "False"
if True:
print "Answer"
print "True"
else:
print "Answer"
print "False"
Thus, in Python all the continuous lines indented with same number of spaces would form
a block. The following example has various statement blocks −
Note − Do not try to understand the logic at this point of time. Just make sure you
understood various blocks even if they are without braces.
#!/usr/bin/python
import sys
try:
except IOError:
11
sys.exit()
if file_text == file_finish:
file.close
break
file.write(file_text)
file.write("\n")
file.close()
if len(file_name) == 0:
sys.exit()
try:
except IOError:
sys.exit()
file_text = file.read()
file.close()
12
print file_text
Multi-Line Statements
Statements in Python typically end with a new line. Python does, however, allow
the use of the line continuation character (\) to denote that the line should continue. For
example −
total = item_one + \
item_two + \
item_three
Statements contained within the [], {}, or () brackets do not need to use the line
continuation character. For example −
'Thursday', 'Friday']
Quotation in Python
Python accepts single ('), double (") and triple (''' or """) quotes to denote string literals,
as long as the same type of quote starts and ends the string.
The triple quotes are used to span the string across multiple lines. For example, all the
following are legal −
word = 'word'
Comments in Python
13
A hash sign (#) that is not inside a string literal begins a comment. All characters
after the # and up to the end of the physical line are part of the comment and the Python
interpreter ignores them.
Live Demo
#!/usr/bin/python
# First comment
Hello, Python!
You can type a comment on the same line after a statement or expression −
# This is a comment.
Following triple-quoted string is also ignored by Python interpreter and can be used as a
multiline comments:
'''
This is a multiline
comment.
'''
A line containing only whitespace, possibly with a comment, is known as a blank line and
Python totally ignores it.
14
In an interactive interpreter session, you must enter an empty physical line to terminate a
multiline statement.
The following line of the program displays the prompt, the statement saying “Press the
enter key to exit”, and waits for the user to take action −
#!/usr/bin/python
Here, "\n\n" is used to create two new lines before displaying the actual line. Once the
user presses the key, the program ends. This is a nice trick to keep a console window open
until the user is done with an application.
The semicolon ( ; ) allows multiple statements on the single line given that neither
statement starts a new code block. Here is a sample snip using the semicolon.
A group of individual statements, which make a single code block are called suites
in Python. Compound or complex statements, such as if, while, def, and class require a
header line and a suite.
Header lines begin the statement (with the keyword) and terminate with a colon ( : ) and
are followed by one or more lines which make up the suite. For example −
if expression :
suite
elif expression :
suite
else :
15
suite
Many programs can be run to provide you with some basic information about how
they should be run. Python enables you to do this with -h −
$ python -h
usage: python [option] ... [-c cmd | -m mod | file | -] [arg] ...
You can also program your script in such a way that it should accept various options.
Command Line Arguments is an advanced topic and should be studied a bit later once
you have gone through rest of the Python concepts.
Python Lists
The list is a most versatile datatype available in Python which can be written as
a list of comma-separated values (items) between square brackets. Important thing about
a list is that items in a list need not be of the same type.
list2 = [1, 2, 3, 4, 5 ];
Similar to string indices, list indices start at 0, and lists can be sliced, concatenated and
so on.
16
A tuple is a sequence of immutable Python objects. Tuples are sequences, just like lists.
The differences between tuples and lists are, the tuples cannot be changed unlike lists and
tuples use parentheses, whereas lists use square brackets.
tup2 = (1, 2, 3, 4, 5 );
tup1 = ();
To write a tuple containing a single value you have to include a comma, even though there
is only one value −
tup1 = (50,);
Like string indices, tuple indices start at 0, and they can be sliced, concatenated, and so
on.
To access values in tuple, use the square brackets for slicing along with the index or
indices to obtain value available at that index. For example −
Live Demo
#!/usr/bin/python
tup2 = (1, 2, 3, 4, 5, 6, 7 );
17
tup1[0]: physics
tup2[1:5]: [2, 3, 4, 5]
Updating Tuples
To access dictionary elements, you can use the familiar square brackets along with the
key to obtain its value. Following is a simple example −
Live Demo
#!/usr/bin/python
dict['Name']: Zara
dict['Age']: 7
If we attempt to access a data item with a key, which is not part of the dictionary, we get
an error as follows −
Live Demo
#!/usr/bin/python
dict['Alice']:
18
File "test.py", line 4, in <module>
KeyError: 'Alice'
Updating Dictionary
You can update a dictionary by adding a new entry or a key-value pair, modifying an
existing entry, or deleting an existing entry as shown below in the simple example −
Live Demo
#!/usr/bin/python
dict['Age']: 8
You can either remove individual dictionary elements or clear the entire contents of a
dictionary. You can also delete entire dictionary in a single operation.
To explicitly remove an entire dictionary, just use the del statement. Following is a simple
example −
Live Demo
#!/usr/bin/python
19
del dict['Name']; # remove entry with key 'Name'
This produces the following result. Note that an exception is raised because after del dict
dictionary does not exist any more −
dict['Age']:
(a) More than one entry per key not allowed. Which means no duplicate key is allowed.
When duplicate keys encountered during assignment, the last assignment wins. For
example −
Live Demo
#!/usr/bin/python
20
When the above code is executed, it produces the following result −
dict['Name']: Manni
(b) Keys must be immutable. Which means you can use strings, numbers or tuples as
dictionary keys but something like ['key'] is not allowed. Following is a simple example
−
Live Demo
#!/usr/bin/python
Tuples are immutable which means you cannot update or change the values of tuple
elements. You are able to take portions of existing tuples to create new tuples as the
following example demonstrates −
Live Demo
#!/usr/bin/python
# tup1[0] = 100;
21
print tup3;
Removing individual tuple elements is not possible. There is, of course, nothing wrong
with putting together another tuple with the undesired elements discarded.
To explicitly remove an entire tuple, just use the del statement. For example −
Live Demo
#!/usr/bin/python
print tup;
del tup;
print tup;
This produces the following result. Note an exception raised, this is because after del tup
tuple does not exist any more −
print tup;
22
2.7.2 DJANGO
Django is a high-level Python Web framework that encourages rapid
development and clean, pragmatic design. Built by experienced developers, it takes care
of much of the hassle of Web development, so you can focus on writing your app without
needing to reinvent the wheel. It’s free and open source.
Django also provides an optional administrative create, read, update and delete interface
that is generated dynamically through introspection and configured via admin models
23
Fig 2.2 Model View Template Architecture
Create a Project
Whether you are on Windows or Linux, just get a terminal or a cmd prompt and
navigate to the place you want your project to be created, then use this code −
myproject/
manage.py
myproject/
_init_.py
settings.py
urls.py
wsgi.py
The “myproject” folder is just your project container, it actually contains two elements −
24
manage.py − This file is kind of your project local django-admin for interacting with your
project via command line (start the development server, sync db...). To get a full list of
command accessible via manage.py you can use the code −
The “myproject” subfolder − This folder is the actual python package of your project. It
contains four files −
urls.py − All links of your project and the function to call. A kind of ToC of your project.
DEBUG = True
This option lets you set if your project is in debug mode or not. Debug mode lets you get
more information about your project's error. Never set it to ‘True’ for a live project.
However, this has to be set to ‘True’ if you want the Django light server to serve static
files. Do it only in the development mode.
DATABASES = {
'default': {
'ENGINE': 'django.db.backends.sqlite3',
'NAME': 'database.sql',
'USER': '',
'PASSWORD': '',
'HOST': '',
25
'PORT': '',
Database is set in the ‘Database’ dictionary. The example above is for SQLite engine. As
stated earlier, Django also supports −
MySQL (django.db.backends.mysql)
PostGreSQL (django.db.backends.postgresql_psycopg2)
MongoDB (django_mongodb_engine)
Before setting any new engine, make sure you have the correct db driver installed.
Now that your project is created and configured make sure it's working −
You will get something like the following on running the above code −
Validating models...
0 errors found
A project is a sum of many applications. Every application has an objective and can be
reused into another project, like the contact form on a website can be an application, and
can be reused for others. See it as a module of your project.
26
Create an Application
We assume you are in your project folder. In our main “myproject” folder, the
same folder then manage.py −
You just created myapp application and like project, Django create a “myapp” folder with
the application structure −
myapp/
_init_.py
admin.py
models.py
tests.py
views.py
admin.py − This file helps you make the app modifiable in the admin interface.
At this stage we have our "myapp" application, now we need to register it with our Django
project "myproject". To do so, update INSTALLED_APPS tuple in the settings.py file of
your project (add your app name) −
INSTALLED_APPS = (
'django.contrib.admin',
27
'django.contrib.auth',
'django.contrib.contenttypes',
'django.contrib.sessions',
'django.contrib.messages',
'django.contrib.staticfiles',
'myapp',
Creating forms in Django, is really similar to creating a model. Here again, we just need
to inherit from Django class and the class attributes will be the form fields. Let's add a
forms.py file in myapp folder to contain our app forms. We will create a login form.
myapp/forms.py
class LoginForm(forms.Form):
As seen above, the field type can take "widget" argument for html rendering; in our case,
we want the password to be hidden, not displayed. Many others widget are present in
Django: DateInput for dates, CheckboxInput for checkboxes, etc.
There are two kinds of HTTP requests, GET and POST. In Django, the request object
passed as parameter to your view has an attribute called "method" where the type of the
request is set, and all data passed via POST can be accessed via the request.POST
dictionary.
28
#-- coding: utf-8 --
def login(request):
if request.method == "POST":
MyLoginForm = LoginForm(request.POST)
if MyLoginForm.is_valid():
username = MyLoginForm.cleaned_data['username']
else:
MyLoginForm = Loginform()
The view will display the result of the login form posted through the loggedin.html. To
test it, we will first need the login form template. Let's call it login.html.
<html>
<body>
<center>
</center>
</div>
29
<br>
<center>
</center>
</div>
<br>
<center>
<strong>Login</strong>
</button>
</center>
</div>
</form>
</body>
</html>
The template will display a login form and post the result to our login view above. You
have probably noticed the tag in the template, which is just to prevent Cross-site Request
Forgery (CSRF) attack on your site.
30
{% csrf_token %}
Once we have the login template, we need the loggedin.html template that will be
rendered after form treatment.
<html>
<body>
</body>
</html>
urlpatterns = patterns('myapp.views',
url(r'^connection/',TemplateView.as_view(template_name = 'login.html')),
Setting Up Sessions
In Django, enabling session is done in your project settings.py, by adding some lines to
the MIDDLEWARE_CLASSES and the INSTALLED_APPS options. This should be
done while creating the project, but it's always good to know, so
MIDDLEWARE_CLASSES should have −
'django.contrib.sessions.middleware.SessionMiddleware'
31
'django.contrib.sessions'
When session is enabled, every request (first argument of any view in Django) has a
session (dict) attribute.
Let's create a simple sample to see how to create and save sessions. We have built a simple
login system before (see Django form processing chapter and Django Cookies Handling
chapter). Let us save the username in a cookie so, if not signed out, when accessing our
login page you won’t see the login form. Basically, let's make our login system we used
in Django Cookies handling more secure, by saving cookies server side.
For this, first lets change our login view to save our username cookie server side −
def login(request):
if request.method == 'POST':
MyLoginForm = LoginForm(request.POST)
if MyLoginForm.is_valid():
username = MyLoginForm.cleaned_data['username']
request.session['username'] = username
else:
MyLoginForm = LoginForm()
Then let us create formView view for the login form, where we won’t display the form if
cookie is set −
def formView(request):
if request.session.has_key('username'):
32
username = request.session['username']
else:
Now let us change the url.py file to change the url so it pairs with our new view −
urlpatterns = patterns('myapp.views',
When accessing /myapp/connection, you will get to see the following page
33
CHAPTER 3
SYSTEM DESIGN
2. The data flow diagram (DFD) is one of the most important modeling tools. It is
used to model the system components. These components are the system process, the data
used by the process, an external entity that interacts with the system and the information
flows in the system.
3. DFD shows how the information moves through the system and how it is modified
by a series of transformations. It is a graphical technique that depicts information flow
and the transformations that are applied as data moves from input to output.
4. DFD is also known as bubble chart. A DFD may be used to represent a system at
any level of abstraction. DFD may be partitioned into levels that represent increasing
information flow and functional detail.
The goal is for UML to become a common language for creating models of object oriented
computer software. In its current form UML is comprised of two major components: a
Meta-model and a notation. In the future, some form of method or process may also be
added to; or associated with, UML.
34
The Unified Modeling Language is a standard language for specifying,
Visualization, Constructing and documenting the artifacts of software system, as well as
for business modeling and other non-software systems.
The UML represents a collection of best engineering practices that have proven successful
in the modeling of large and complex systems.
The UML is a very important part of developing objects oriented software and the
software development process. The UML uses mostly graphical notations to express the
design of software projects.
GOALS:
35
3.3.1 USE CASE DIAGRAM:
36
3.3.2 CLASS DIAGRAM:
In software engineering, a class diagram in the Unified Modeling Language
(UML) is a type of static structure diagram that describes the structure of a system by
showing the system's classes, their attributes, operations (or methods), and the
relationships among the classes. It explains which class contains information.
37
3.3.3 SEQUENCE DIAGRAM:
A sequence diagram in Unified Modeling Language (UML) is a kind of
interaction diagram that shows how processes operate with one another and in what order.
It is a construct of a Message Sequence Chart. Sequence diagrams are sometimes called
event diagrams, event scenarios, and timing diagrams.
38
3.3.4 ACTIVITY DIAGRAM:
Activity diagrams are graphical representations of workflows of stepwise
activities and actions with support for choice, iteration and concurrency. In the Unified
Modeling Language, activity diagrams can be used to describe the business and
operational step-by-step workflows of components in a system. An activity diagram
shows the overall flow of control.
39
CHAPTER 4
IMPLEMENTATION
4.1 MODULES:
• User
• Admin
• Machine Learning
MODULES DESCRIPTION:
4.1.1 User:
The User can register the first. While registering he required a valid user email and
mobile for further communications. Once the user register then admin can activate the
user. Once admin activated the user then user can login into our system. User can upload
the dataset based on our dataset column matched. For algorithm execution data must be
in float format. Here we took Employment Scam Aegean Dataset (EMSCAD) containing
18000 sample dataset. User can also add the new data for existing dataset based on our
Django application. User can click the Classification in the web page so that the data
calculated Accuracy and macro avg, weighted avg based on the algorithms. User can
display the ml results. user can also display the prediction results.
4.1.2 Admin:
Admin can login with his login details. Admin can activate the registered users.
Once he activate then only the user can login into our system. Admin can view the overall
data in the browser. Admin can click the Results in the web page so calculated Accuracy
and macro avg, weighted avg based on the algorithms is displayed. All algorithms
40
execution complete then admin can see the overall accuracy in web page. And also display
the classification results.
OBJECTIVES
1.Input Design is the process of converting a user-oriented description of the input into a
computer-based system. This design is important to avoid errors in the data input process
41
and show the correct direction to the management for getting correct information from
the computerized system.
2. It is achieved by creating user-friendly screens for the data entry to handle large
volume of data. The goal of designing input is to make data entry easier and to be free
from errors. The data entry screen is designed in such a way that all the data manipulates
can be performed. It also provides record viewing facilities.
3.When the data is entered it will check for its validity. Data can be entered with the help
of screens. Appropriate messages are provided as when needed so that the user will not
be in maize of instant. Thus the objective of input design is to create an input layout that
is easy to follow
A quality output is one, which meets the requirements of the end user and
presents the information clearly. In any system results of processing are communicated to
the users and to other system through outputs. In output design it is determined how the
information is to be displaced for immediate need and also the hard copy output. It is the
most important and direct source information to the user. Efficient and intelligent output
design improves the system’s relationship to help user decision-making.
1. Designing computer output should proceed in an organized, well thought out manner;
the right output must be developed while ensuring that each output element is designed
so that people will find the system can use easily and effectively. When analysis design
computer output, they should Identify the specific output that is needed to meet the
requirements.
3.Create document, report, or other formats that contain information produced by the
system.
The output form of an information system should accomplish one or more of the
following objectives.
42
• Future.
• Trigger an action.
• Confirm an action
43
CHAPTER 5
RESULTS
44
5.3Admin Login page
45
5.5 User View page
46
5.7 User home page
47
5.9 K-Nearest Neighbours Confusion Matrix
48
5.10 Confusion Matrix (Decision Tree)
49
5.12 ML Score
50
CHAPTER 6
SYSTEM TESTING
The purpose of testing is to discover errors. Testing is the process of
trying to discover every conceivable fault or weakness in a work product. It provides a
way to check the functionality of components, sub assemblies, assemblies and/or a
finished product It is the process of exercising software with the intent of ensuring that
the Software system meets its requirements and user expectations and does not fail in an
unacceptable manner. There are various types of test. Each test type addresses a specific
testing requirement.
TYPES OF TESTS
51
6.3 Functional test
Functional tests provide systematic demonstrations that functions tested are
available as specified by the business and technical requirements, system documentation,
and user manuals.
be exercised.
52
purpose. It is purpose. It is used to test areas that cannot be reached from a black box
level.
Test objectives
Features to be tested
53
6.8 Integration Testing
Software integration testing is the incremental integration testing of two or
more integrated software components on a single platform to produce failures caused by
interface defects.
The task of the integration test is to check that components or software applications, e.g.
components in a software system or – one step up – software applications at the company
level – interact without error.
Test Results: All the test cases mentioned above passed successfully. No defects
encountered.
Test Results: All the test cases mentioned above passed successfully. No defects
encountered.
54
CHAPTER 7
CONCLUSION
7.1 CONCLUSION
The motive of this research was to design and develop a website for career
prediction which predicts fitting options for a candidate in choosing a suitable field. The
Options predicted in the proposed system are more valid & precise than the present career
guidance systems in the field. We have used the KNN algorithm to classify the skill sets
of the candidate and predict a suitable discipline with the help of the answers of MCQs
which the candidate filled as feedback, and the K-Means Clustering algorithm is used to
form the clusters by dividing the student responses for a particular skill set and predicting
the rate of success for the respective fields in each cluster. For field specific prediction
purposes, the success rate in each of the clusters are determined; higher success rates and
lower failure rates were expected. In this project, various career prediction systems were
studied thoroughly for building a web-based application with expected results. More
research is needed to better understand the framework's accuracy rate and introduction of
additional features and the removal of outliers in the framework.
The motive of this research was to design and develop a website for
career prediction which predicts fitting options for a candidate in choosing a suitable field.
The Options predicted in the proposed system are more valid & precise than the present
career guidance systems in the field. We have used the KNN algorithm to classify the skill
sets of the candidate and predict a suitable discipline with the help of the answers of
MCQs which the candidate filled as feedback, and the K-Means Clustering algorithm is
used to form the clusters by dividing the student responses for a particular skill set and
predicting the rate of success for the respective fields in each cluster. For field specific
prediction purposes, the success rate in each of the clusters are determined; higher success
rates and lower failure rates were expected. In this project, various career prediction
systems were studied thoroughly for building a web-based application with expected
55
results. More research is needed to better understand the framework's accuracy rate and
introduction of additional features and the removal of outliers in the framework.
56
CHAPTER 8
BIBLIOGRAPHY
[1] Viktória Kulcsár, Anca Dobrean, Itamar Gati. "Challenges and difficulties in career
decision making: Their causes, and their effects on the process and the decision", Journal
of Vocational Behavior, 2020.
[5] C. Clutter, “The Effects of Parental Influence on Their Children‘s Career Choices,”
Published Master of Science Thesis Submitted to the School of Family Studies and
Human Service, College of Human Ecology, Kansas State University, Manhattan, Kansas,
2010.
[6] K. A. Jungen, “Parental Influence and Career Choice: How Parents Affect the Career
Aspirations of Their Children,” Published M.Sc. Guidance and Counselling Project
Submitted to the Graduate School, University of Wisconsin-Stout, Menomonie, WI,
2008.
[8] I. H. Alika, “Parental and Peer Group Influence as Correlates of Career Choice in
Humanities among Secondary School Students in Edo State, Nigeria,” Journal of
Research in Edu. and Society, 2010, vol. 1 (1), pp. 178 - 185.
57
[9] B. O. Ehigbor, and T. N. Akinlosotu, “Parents’ Occupation as Correlate of Students’
Career Aspiration in Public Secondary Schools in Ekpoma Metropolis,” Int. Journal of
Arts and Humanities, 2016, vol. 5 (3), pp. 197 – 212.
[12] I. A. Durosaro, and M. A. Nuhu, “An evaluation of the relevance of career choice to
school Subject selection among school going adolescents in Ondo state,” Asian Journal
of Mgt Sci. and Edu., 2012, vol. 1 (2), pp. 140 – 145.
[15] W. Stanley, “The relationship between fear of Success, Self-Concept and Career
making,” Report-Research 9143: Speeches/meeting papers(150), U.S. Kentucky, 1996.
58
CHAPTER 9
APPENDICES
SAMPLE CODE
Admin views
from django.shortcuts import render
def AdminLoginCheck(request):
if request.method == 'POST':
usrid = request.POST.get('loginid')
pswd = request.POST.get('pswd')
else:
def AdminHome(request):
def RegisterUsersView(request):
59
data = UserRegistrationModel.objects.all()
return render(request,'admins/viewregisterusers.html',{'data':data})
def ActivaUsers(request):
if request.method == 'GET':
id = request.GET.get('uid')
status = 'activated'
UserRegistrationModel.objects.filter(id=id).update(status=status)
data = UserRegistrationModel.objects.all()
return render(request,'admins/viewregisterusers.html',{'data':data})
Base.html :
{% load static %}
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8">
<title>Career Prediction</title>
<link
href="https://fonts.googleapis.com/css?family=Open+Sans:300,300i,400,400i,600,600i,
60
700,700i|Jost:300,300i,400,400i,500,500i,600,600i,700,700i|Poppins:300,300i,400,400i
,500,500i,600,600i,700,700i" rel="stylesheet">
</head>
<body>
<style>
span{
color:white;
header{
background-color:GREEN;
61
}
body{
</style>
<ul>
</ul>
</div>
62
</header><!-- End Header -->
{% block contents %}
{% endblock %}
<footer id="footer">
<div id="preloader"></div>
</body>
</html>
63
User Views :
import Career_prediction_Using_Machine_Learning
import pandas as pd
import numpy as np
import datetime as dt
64
from sklearn.preprocessing import LabelEncoder
def UserRegisterActions(request):
if request.method == 'POST':
form = UserRegistrationForm(request.POST)
if form.is_valid():
print('Data is Valid')
form.save()
form = UserRegistrationForm()
else:
print("Invalid form")
else:
form = UserRegistrationForm()
def UserLoginCheck(request):
if request.method == "POST":
loginid = request.POST.get('loginid')
pswd = request.POST.get('pswd')
65
try:
check = UserRegistrationModel.objects.get(
loginid=loginid, password=pswd)
status = check.status
if status == "activated":
request.session['id'] = check.id
request.session['loggeduser'] = check.name
request.session['loginid'] = loginid
request.session['email'] = check.email
else:
except Exception as e:
pass
def UserHome(request):
def DatasetView(request):
66
path = settings.MEDIA_ROOT + "//" + 'roo_data.csv'
df = pd.read_csv(path, nrows=100)
df = df.to_html
def ml(request):
import pandas as pd
import numpy as np
dataset =
pd.read_csv(r"C:\\Users\giris\OneDrive\Desktop\F.career\F.career\career\Career_predict
ion_Using_Machine_Learning\media\roo_data.csv")
dataset
labelencoder = LabelEncoder()
dataset['memorycapabilityscore'] = labelencoder.fit_transform(
dataset['memorycapabilityscore'])
dataset['Typeofcompanywanttosettlein'] = labelencoder.fit_transform(
dataset['Typeofcompanywanttosettlein'])
dataset['ManagementorTechnical'] = labelencoder.fit_transform(
dataset['ManagementorTechnical'])
67
dataset['workedinteamsever'] = labelencoder.fit_transform(
dataset['workedinteamsever'])
dataset['SuggestedJobRole'] = labelencoder.fit_transform(
dataset['SuggestedJobRole'])
data1=dataset
normalized_data = Normalizer().fit_transform(data1)
print(normalized_data.shape)
normalized_data
data2=dataset
df1 = np.append(normalized_data,data2,axis=1)
dataset.columns
col_to_drop = ['SuggestedJobRole',
'PercentageinComputerNetworks',
'PercentageinSoftwareEngineering',
'PercentageinProgrammingConcepts',
'percentageinAlgorithms']
X = dataset.drop(col_to_drop, axis=1)
y = dataset['SuggestedJobRole']
68
import matplotlib.pyplot as plt
clf = tree.DecisionTreeClassifier()
clf.fit(X_train, y_train)
y_pred = clf.predict(X_test)
knn_model = KNeighborsClassifier()
knn_model.fit(X_train, y_train)
knn_y_pred = knn_model.predict(X_test)
print(knn_cm)
job_counts = {
69
3: 'Network Engineer',
4: 'Project Manager',
5: 'Database Administrator',
6: 'Portal Administrator',
8: 'Software Engineer',
9: 'UX Designer',
70
26: 'Technical Services/Help Desk/Tech Support',
print(msg_knn)
print(classification_rep_knn)
plt.figure(figsize=(8, 6))
plt.xlabel("Predicted")
plt.ylabel("Actual")
71
plt.title("K-Nearest Neighbors Confusion Matrix")
plt.show()
print("Accuracy:", accuracy)
cm = confusion_matrix(y_test, y_pred)
print("Confusion Matrix:")
print(cm)
print("Classification Report:")
print(classification_rep)
plt.figure(figsize=(8, 6))
plt.xlabel("Predicted")
plt.ylabel("Actual")
plt.title("Confusion Matrix")
plt.show()
72
clf_entropy.fit(X_train, y_train)
entropy_y_pred = clf_entropy.predict(X_test)
print(cm_entropy)
print(classification_rep_entropy)
plt.figure(figsize=(8, 6))
plt.xlabel("Predicted")
plt.ylabel("Actual")
plt.show()
73
'accuracy': accuracy * 2500,
'msg_knn': msg_knn
})
def predictTrustWorthy(request):
if request.method == 'POST':
hackathons = request.POST.get("hackathons")
codingskillsrating = request.POST.get("codingskillsrating")
certifications = request.POST.get("certifications")
talentteststaken = request.POST.get("talentteststaken")
olympiads = request.POST.get("olympiads")
memorycapabilityscore = request.POST.get("memorycapabilityscore")
Interestedsubjects = request.POST.get("Interestedsubjects")
Typeofcompanywanttosettlein=
request.POST.get("Typeofcompanywanttosettlein")
ManagementorTechnical = request.POST.get("ManagementorTechnical")
workedinteamsever = request.POST.get("workedinteamsever")
df = pd.read_csv(path)
data = df.dropna()
74
data['SuggestedJobRole'] = data['SuggestedJobRole']
features=['hackathons','codingskillng’,
certifications','talentteststaken','olympiads','memorycapabilityscore',
'Interestedsubjects','Typeofcompanywanttosettlein',
'ManagementorTechnical','workedinteamsever']
X = pd.get_dummies(data[features])
test_set = {
'hackathons':hackathons,
'codingskillsrating': codingskillsrating,
'certifications': certifications,
'talentteststaken': talentteststaken,
'olympiads':Olympiads ,
'memorycapabilityscore': memorycapabilityscore,
'Interestedsubjects': Interestedsubjects,
'Typeofcompanywanttosettlein': Typeofcompanywanttosettlein,
'ManagementorTechnical':ManagementorTechnical,
'workedinteamsever': workedinteamsever}
test_df = pd.DataFrame([test_set])
test_df = pd.get_dummies(test_df)
75
# Matching the columns between the training and test sets
test_df[col] = 0
test_df = test_df[X.columns]
y = data['SuggestedJobRole']
test_size=0.2, random_state=101)
OBJ.fit(X_train, Y_train)
# Making predictions
print(test_df.values)
y_pred = OBJ.predict(test_df.values)
print(y_pred)
print(y_pred[0])
msg = y_pred[0]
else:
76