
Volume - Control - Report (1) - Removed (1) N

The document discusses a project that aims to control a PC's volume using hand gestures. It utilizes computer vision and Mediapipe to recognize hand movements and translate them into volume adjustments. The objectives are to develop real-time hand recognition, enable volume control through gestures, and ensure a reliable user experience. It requires OpenCV, Mediapipe, Numpy, Pycaw and describes the implementation of computer vision algorithms, gesture to volume mapping, and a user interface.

Uploaded by

Tanisha Sood
Copyright
© © All Rights Reserved
MACHINE LEARNING

HAND GESTURE VOLUME CONTROL

Created by: Tanisha Sood

Project Summary:

The Hand Gesture Volume Controller project aims to provide a method for
controlling a PC’s volume using hand gestures. It uses computer vision
and MediaPipe (an open-source framework from Google) to recognize and
interpret specific hand movements and translate them into volume
adjustments for audio playback.

Project Objectives:

• Develop a real-time hand recognition and detection system.
• Enable users to control volume levels through different hand gestures.
• Ensure robustness and accuracy in the project for a reliable user
experience.

Requirements:
• opencv-python
• mediapipe
• comtypes
• numpy
• pycaw
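
Before running the project, it can help to confirm that the packages above are installed. The following is a minimal sketch, not part of the original report: the function name missing_packages and the REQUIRED table are illustrative, and the only assumption about the packages is the module name each one installs (for example, opencv-python is imported as cv2).

```python
import importlib.util

# pip package name -> top-level module name it installs
REQUIRED = {
    "opencv-python": "cv2",
    "mediapipe": "mediapipe",
    "comtypes": "comtypes",
    "numpy": "numpy",
    "pycaw": "pycaw",
}


def missing_packages(required=REQUIRED):
    """Return the pip names whose modules cannot be found (hypothetical helper)."""
    # find_spec only locates a module; it does not import it, so this
    # check also runs on platforms where comtypes cannot be loaded.
    return [
        package
        for package, module in required.items()
        if importlib.util.find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages; try: pip install " + " ".join(missing))
    else:
        print("All required packages are installed.")
```

Any package the script reports as missing can be installed with pip under the name shown in the Requirements list.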

• OpenCV: OpenCV stands for Open Source Computer Vision Library. It is a
cross-platform library that includes hundreds of computer vision
algorithms. It is used in image processing, video processing, object
detection, face recognition, motion tracking, augmented reality, etc.

• MediaPipe: MediaPipe is an open-source framework for building
pipelines to perform computer vision inference over arbitrary sensory data
such as video or audio.

The following libraries and resources provide the core functionality for
each MediaPipe Solution:

• MediaPipe Tasks: Cross-platform APIs and libraries for deploying
solutions.
• MediaPipe Models: Pre-trained, ready-to-run models for use with each
solution.

In this project, the Hand Landmarks Detection model of MediaPipe is used.
• Comtypes: It is a lightweight Python COM package, based on the
ctypes FFI library. Comtypes allows us to define, call, and implement
custom and dispatch-based COM interfaces in pure Python. This package
works on Windows only.

• NumPy: NumPy provides multi-dimensional array and matrix data
structures. It can be used to perform many mathematical operations on
arrays.

• Pycaw: Python Core Audio Windows Library, which works with both
Python 2 and Python 3. It was created by Andre Miras.
The Git repository link for pycaw is given below:
https://github.com/AndreMiras/pycaw.git
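
To show how pycaw and comtypes fit together, here is a minimal sketch of setting the master volume. It is not taken from the report. It assumes the usage pattern long shown in the pycaw README (GetSpeakers, Activate, then a cast to IAudioEndpointVolume); newer pycaw releases may expose the endpoint volume differently, so check the repository above for the installed version. The helper names clamp_scalar and set_master_volume are illustrative.

```python
import sys


def clamp_scalar(level):
    """Clamp a volume level to the 0.0-1.0 range used for scalar volume."""
    return max(0.0, min(1.0, float(level)))


def set_master_volume(level):
    """Set the default output device's master volume (hypothetical helper)."""
    level = clamp_scalar(level)
    if sys.platform != "win32":
        raise OSError("pycaw and comtypes work on Windows only")

    # Imported lazily so this module still loads on other platforms.
    from ctypes import POINTER, cast
    from comtypes import CLSCTX_ALL
    from pycaw.pycaw import AudioUtilities, IAudioEndpointVolume

    # Assumption: classic pycaw README pattern; newer versions may differ.
    speakers = AudioUtilities.GetSpeakers()
    interface = speakers.Activate(IAudioEndpointVolume._iid_, CLSCTX_ALL, None)
    endpoint = cast(interface, POINTER(IAudioEndpointVolume))

    # 0.0 is silent and 1.0 is full volume.
    endpoint.SetMasterVolumeLevelScalar(level, None)
    return level
```

On Windows, calling set_master_volume(0.5) would request half volume. The scalar form is used here because it avoids working with the device's decibel range directly.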

Implementation:
• Computer vision algorithm: Computer vision algorithms analyze
certain criteria in images and videos, and then apply interpretations to
predictive or decision-making tasks.
• Gesture to volume mapping: Establishes a mapping between
recognized gestures and corresponding volume adjustments.
• User interface: Displays the adjusted volume level and personalized
messages, and supports personalized gestures for terminating the program.
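
The gesture to volume mapping can be sketched as follows. The report does not say which gesture drives the volume, so this example assumes a common choice: the distance between the thumb tip and the index fingertip. The landmark indices 4 and 8 are the ones MediaPipe's hand landmark model uses for those two points. The function names and the 30 to 250 pixel range are illustrative assumptions and would need tuning for a real camera.

```python
import math

THUMB_TIP = 4          # MediaPipe hand landmark index for the thumb tip
INDEX_FINGER_TIP = 8   # MediaPipe hand landmark index for the index fingertip


def pinch_distance_px(landmarks, frame_width, frame_height):
    """Pixel distance between the thumb tip and the index fingertip.

    `landmarks` is a sequence of 21 (x, y) pairs normalized to [0, 1],
    which is how MediaPipe reports hand landmark positions.
    """
    thumb_x, thumb_y = landmarks[THUMB_TIP]
    index_x, index_y = landmarks[INDEX_FINGER_TIP]
    # Scale normalized coordinates to pixels before measuring.
    return math.hypot(
        (index_x - thumb_x) * frame_width,
        (index_y - thumb_y) * frame_height,
    )


def distance_to_volume(distance, min_dist=30.0, max_dist=250.0):
    """Linearly map a pinch distance to a volume scalar in [0.0, 1.0].

    min_dist and max_dist are assumed calibration values, not values
    taken from the report.
    """
    if max_dist <= min_dist:
        raise ValueError("max_dist must be greater than min_dist")
    fraction = (distance - min_dist) / (max_dist - min_dist)
    # Clamp so distances outside the calibrated range stay valid.
    return max(0.0, min(1.0, fraction))


if __name__ == "__main__":
    # Synthetic hand: every landmark at the frame center except the two tips.
    hand = [(0.5, 0.5)] * 21
    hand[THUMB_TIP] = (0.25, 0.5)
    hand[INDEX_FINGER_TIP] = (0.75, 0.5)
    distance = pinch_distance_px(hand, 640, 480)
    print("distance:", distance, "volume:", distance_to_volume(distance))
```

A linear mapping with clamping keeps the behavior predictable: fingers pinched together give zero volume, fully spread fingers give full volume, and anything beyond the calibrated range is held at the nearest limit.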
