AISSCE-2024-25
INFORMATICS PRACTICES
PROJECT FILE
Participants : Novina & Deepika
EMPLOYEE DATA ANALYSIS
SYSTEM
SUBMITTED TO:- SUBMITTED BY:-
Mr.K.K Kumawat Deepika Balotiya
(PGT I.P) Class : 12th B
Roll No:
Page | 1
ACKNOWLEDGEMENT
I would like to express my special thanks of
gratitude to my I.P teacher Mr. K.K Kumawat
as well as our Principal Sir Dr.R.K.Vyas who
gave me the golden opportunity to do this
wonderful project on “EMPLOYEE DATA
ANALYSIS SYSTEM”, which also
helped me in doing practical work and I gained
extra knowledge.
And I would also like to thank my parents and
friends who helped me a lot in finishing this
project within the time limit.
DEEPIKA BALOTIYA
XII - Commerce
Page | 2
Page | 3
INDEX
Sr.No. Topic Page.No.
1. Theoretical 5
Background
2. Minimum 11
Requirements
3. CSV File 12
4. Source Code 13
5. Output 19
6. Bibliography 25
Page | 4
THEORITICAL
BACKGROUND
WHAT IS PYTHON?
Python is a High- level, general-purpose programming
language known for its readability and versatility. It supports
multiple programming paradigms and is widely used in various
domains, including web development ,data science, artificial
intelligence, and more. It was created by Guido van Rossum
and first released in 1991.Python’s simple, easy to learn syntax
emphasizes readability and therefore reduces the cost of
programming maintenance. It is free and open source and
more expressive. Python supports modules and packages,
which encourages program modularity and code reuse. Some
of the characteristics:
1. Readability: Python emphasizes code readability with a clear and
expressive syntax. It uses indentation to define code blocks, making it visually
straightforward.
2. Interpretation: It is an interpreted language, meaning that code executed is
line by line by an interpreter without need for compilation.
3. Dynamic Typing: Pythons uses Dynamic Typing, allowing developers to
create flexible and adaptable code.
Page | 5
What is Pandas?
Pandas is Python’s library for data analysis. Pandas has derived
its name from “Panel Data System” and it has become very
popular choice for data analysis.
Data Analysis is the technique of collecting, transforming, and
organizing data to make future predictions and informed data-
driven decisions. It also helps to find possible solutions for a
business problem.
While working with Pandas, it is required to import it.
Data Structure is collection of data values and operations
which can be perform over data.
In Pandas there are two type of Data
Structure-
1. Series
2. Dataframe
Page | 6
What is Data Visualization?
Data Visualization is the technique to present the data in a
pictorial or graphical format. It enables stakeholders and
decision makers to analyze data visually. The data in a
graphical format allows them to identify new trends and
pattern easily.
The main benefits of Data Visualization are as follows:
It simplifies the complex quantitative information.
It helps analyze and explore big data easily.
It identifies the areas that need attention or improvement.
It identifies the relationship between data points and variables.
It explores new patterns and reveals hidden patterns in the data.
Page | 7
Matplotlib Library and pyplot Interface
The Matplotlib is a python library that provides many
interfaces functionally for 2D graphics.
In short, we can call matplotlib as a high –quality plotting library of
python.
The matplotlib library offers many different named collections of
methods,pyplot is one such intereface.
Pyplot is a collection of methods within matplotlib which
allows user to construct 2D plots easily and interactively.
Matplotlib can be installed using pip command in command
prompt: pip install
matplotlib
Pyplot can be import
using : import
matplotlib.pyplot
Types of plot using
matplotlib:
1. Line Graph
Page | 8
2. Vertical Bar Graph
3. Horizontal Bar graph
4. Histogram
Page | 9
CSV Files
CSV stands for comma- Separated values.
This type of file can be opened with excel file or notepad.
CSV file is nothing more than a simple text file. However, it is
the most common, simple and easiest method to store tabular data.
This format arranges tables by a specific structure divided into rows
and columns.
To import data from a CSV file into a DataFrame, you can use the
read_CSV() function.
This function takes the path to the CSV file as its input, and returns
a DataFrame object.
To export a DataFrame to a CSV file, you can use the to_CSV() function.
This function takes the path to the CSV file as its Output,and
writes the DataFrame to the file.
Page | 10
MINIMUM
REQUIREMENTS
OPERATING SYSTEM:-
WINDOWS 7 OR ABOVE
PROCESSOR: -
PENTIUM (ANY) OR AMD ATHALON (DUAL CORE)
RAM:-
4 GB (RECOMMENDED)
HARD DISK:-
6 GB (MINIMUM)
SOFTWARES:-
PYTHON 3.X
Page | 11
CSV FILE
EMP.CSV
Page | 12
SOURCE
CODE
Page | 13
import pandas as pd
import matplotlib.pyplot as plt
ch='Y'
while ch=='Y':
print('1.Read csv file')
print('2.Show all records')
print('3.Show the name of
female employees')
print('4.Search Record')
print('5.Add new record')
print('6.Delete record')
print('7.Modify record')
print('8.Show salary chart using line
graph')
print('9.Show salary chart using bar
graph')
print('10.Save data into csv file')
choice=int(input('enter your choice :'))
Page | 14
if choice==1:
df=pd.read_csv('emp.csv') #read the csv
file
print('file opened')
elif choice==2:
print(df)
elif choice==3:
print(df[df['gender']=='F']['name'])
elif choice==4:
e=int(input('enter emp no to search'))
inx=df[df.empno==e].index.values #to get
index value
if len(inx)==0:
print("record not found")
else:
print(df[df.empno==e])
elif choice==5:
e=int(input('Enter emp no\t'))
Page | 15
n=input('Enter name\t')
d=input('Enter dept\t')
s=int(input("Enter salary\t"))
g=input("Enter gender\t")
df=df.append({'empno':e,'name':n,
'dept':d,'gender':g,'salary':s},ignore_index=True)
print('record added')
elif choice==6:
e=int(input('enter emp no to
delete'))
inx=df[df.empno==e].index.values
if len(inx)==0:
print("record not found")
else:
print(df[df.empno==e])
df=df[df['empno']!=e]
print('record deleted')
df.index=range(len(df)) #rearange index no
Page | 16
elif choice==7:
e==int(input('enter emp no to modify'))
inx=df[df.empno==e].index.values #to get
index value
if len(inx)==0:
print('record not found')
else:
print(df[df.empno==e])
n=input('enter new name')
d=input('enter new dept')
s=int(input('enter new salary'))
g=input('enter new gender')
df.loc[inx,"name"]=n
df.loc[inx,"dept"]=d
df.loc[inx,"salary"]=s
df.loc[inx,"gender"]=g
print("record updated")
elif choice==8:
Page | 17
plt.ylabel('Salary')
plt.xlabel('Empno')
plt.plot(df['empno'],df['salary'])
plt.title('Salary Chart')
plt.show( )
elif choice==9:
plt.bar(df['name'],df['salary'])
plt.title('Salary Graph')
plt.xlabel('Names')
plt.ylabel('Salary')
plt.show( )
elif choice==10:
df.to_csv('emp.csv',index=False)
print('file saved')
ch=input('Do u want to
continue').upper( )
Page | 18
OUtpUt
SCREEnS
Page | 19
CHOICE 1
CHOICE 2
Page | 20
CHOICE 3
CHOICE 4
Page | 21
CHOICE 5
CHOICE 6
Page | 22
CHOICE 7
CHOICE 8
Page | 23
CHOICE 9
CHOICE 10
Page | 24
Google.co.in
Wikipedia.com
Text Book:
Informatics Practices Class 11th Sumita
Arora
Informatics Practices Class 12th Sumita
Arora
NCERT IP Textbook
Page | 25
Page | 26