0% found this document useful (0 votes)

52 views18 pages

Final

The document outlines a project on a Speech-to-SQL Query Generator that enables non-technical users to interact with databases using natural language speech, eliminating the need for SQL knowledge. It details the system architecture, algorithm, performance analysis, and future enhancements for improved usability and accessibility. The project demonstrates significant advancements in Natural Language Processing and aims to bridge the gap between human communication and machine understanding.

Uploaded by

rutuyadav258

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

52 views18 pages

Final

Uploaded by

rutuyadav258

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

SHALAKA FOUNDATION’S

KEYSTONE SCHOOL OF
ENGINEERING

SPEECH-TO-TEXT USING
NLP
By:
OM SHINDE B401180185
UNDER THE GUIDANCE OF:
GAYATRI SHINDE B401180184 GUIDE:- PROF. TUSHAR SURWADE
RITIKA YADAV B401180199 CO-GUIDE:- SONAL CHANDERI
NANDINI YAMALE B401180200
25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 1
INDEX
• Introduction • Hardware & Software
• Objectives Requirements
• Literature survey • Performance analysis
• Problem definition • Result
• System architecture • Conclusion
• Algorithm • Future scope
• Flowcharts • References
• Work breakdown
structure
25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 2
INTRODUCTION

NON-TECHNICAL
USERS

Speech to
To bridge this Gap, we have
SQL Query
Generator

Traditional database
querying requires
knowledge of SQL
(Structured Query
Language)

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 3

OBJECTIVES

Simplify Database Interaction Enhance Accessibility

Speech To Query Conversion Improve Usability

Real Time Query Execution

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 4

LITERATURE SURVEY
SN Paper Title Author(s) Year Methodology Findings of paper

Proposed a deep learning-based

Seq2SQL: Generating [Link] djahantighi approach to generate SQL queries from
1 Structured Queries from 2022 natural language using sequence-to- Implemented using with only Text-To-SQL
Natural Language sequence models, paving the way for
automated query generation from text.

Provided an overview of modern speech

recognition systems, highlighting Google
Speech-to-Text Systems: An Implemented using with only Speech-To-
2 Hinton et al. 2019 Speech Recognition's role in real-time
Overview Text
transcription and its applications in
various fields.

SPEECH-TO-SQL: Towards
Proposed a speech driven interface for Speech signals are not available from text
speech-driven SQL Query
3 Yu and Deng 2018 relational database using cascaded and detailed inner structure of speech
Generation From Natural
methods,SQLNet signals.
Language Question

Proposed a system that accepts the Due to incorrect Grammer or

Speech to SQL Generator- A
4 Zhong et al 2017 spoken query as input and gives SQL Mispronunciation, system fails to give
voice Based Approach
query output

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 5

Highlighted the evolution of deep learning in
Speech-to-Text: Automatic
speech-to-text systems, which serve as a basis
5 Speech Recognition Using Deep Xu et al. 2017 Implemented using with only Speech-To-Text
for modern speech recognition APIs like Google
Learning
Speech Recognition.

Introduced a system that learns to map natural

DBPal: Weak Supervision for
Manjunath, language queries to SQL queries using weak
6 Learning a Natural Language 2016 Implemented using with only Speech-To-Text
Shravankumar supervision, providing insights into handling
Interface to Databases
incomplete or noisy user inputs.

Using Natural Language

Weir et al. Proposed a system to solve the ambuity Implemented using with only Speech-To-Text
7 Processing in Order to Create 2012 between same words with multiple meanings And needs for updated model for the same
SQL Queries

SQLNet: Generating Structured Proposed a novel approach for generating SQL

Queries from Natural Language Yuanfeng, Raymond, queries from natural language input without
8 2008 Implemented using with only Speech-To-Text
Without Reinforcement Xuefang reinforcement learning, improving both
Learning efficiency and accuracy in query generation.

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 6

PROBLEM DEFINITION

Non-technical users often face challenges in interacting with

databases due to the complexity of SQL syntax. This limits their
ability to retrieve or manage data effectively. The Speech to SQL
Query Generator addresses this problem by providing a voice-
based interface that converts natural language speech into SQL
queries, allowing users to interact with databases without needing
to understand SQL.

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 7

SYSTEM ARCHITECTURE

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 8

ALGORITHM
•Start
•User Interaction
Capture voice input from the user via the User Interface (Web Browser).
•Send Voice Data
Send the captured voice input to the Flask Server.
•Voice Recognition
On the Flask Server:
-Send the voice data to the Google Speech Recognition API.
-Receive the transcribed text from the API.
•Text Processing
Send the transcribed text to the NLP & SQL Query Generation Module.
Generate the SQL query based on the processed text.
•Execute SQL Query
Send the generated SQL query to the Sample Database.
Execute the SQL query against the database.
Retrieve the query result.
•Display Results
Send the query result back to the Flask Server.
Display the results to the user via the User Interface.
•End
25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 9
FLOWCHARTS

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 10

WORK BREAKDOWN
STRUCTURE

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 11

HARDWARE AND SOFTWARE
REQUIREMENTS
SOFTWARE REQUIREMENTS
• Operating System : Windows XP/7/Vista on wards
• Coding Language : Python
• IDE : VS Code
• Web Browser : Google Chrome

HARDWARE REQUIREMENTS
• System : Pentium IV 2.4 GHz.
• Hard Disk : 256 GB(Min).
• Monitor : 15 VGA Colour.
• IO Devices : Keyboard and Mouse.
• Ram : 4 GB(Min).

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 12

PERFORMANCE ANALYSIS
Accuracy Performance
93
Speech Recognition Accuracy:
92 Achieved 92% accuracy in converting
91
speech to text using Google Speech
Recognition API.
90

89
SQL Query Conversion Accuracy:
88 Natural language converted to correct
87
SQL syntax with 88% accuracy on
average.
86
Speech Recognition SQL Generation Intent Recognition

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 13

System Performance Overview

Average Time from

Speech to SQL
Pass Rate (%) Output:
1.5 to 2 seconds
depending on input
length.

Avg. Response Time (sec)

0 10 20 30 40 50 60 70 80 90 100

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 14

RESULT

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 15

CONCLUSION
The Speech to SQL Query Generator project represents a significant
advancement in the field of Natural Language Processing and Database
Management, bridging the gap between human communication and machine
understanding. By utilizing voice recognition technology, this application
empowers users to interact with databases through intuitive voice commands,
eliminating the need for manual SQL query writing.

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 16

FUTURE SCOPE

Support Multiple Languages Contextual Understanding

Specific Applications and Use

Enhanced NLP
Cases

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 17

REFERENCES
[1]Jurafsky, D., & Martin, J. H. (2021) Speech and Language Processing: An Introduction to Natural Language
Processing, Computational Linguistics, and Speech Recognition. Prentice Hall.
[2]Graves, A., & Jaitly, N. (2014) "Towards End-to-End Speech Recognition with Recurrent Neural Networks."
International Conference on Machine Learning (ICML).
[3]Baker, J. K. (1975) "Stochastic Modeling for Automatic Speech Understanding." IEEE Transactions on
Audio, Speech, and Signal Processing.
[4]Khan, A., & Hossain, M. S. (2020) "A Survey of Natural Language Processing Techniques for Data
Querying." Journal of Computer and Communications.
[5]Chowdhury, S., & Shaikh, M. (2019) "Voice-Based Database Querying System." International Journal of
Computer Applications.

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 18

Thesis
No ratings yet
Thesis
8 pages
An Algorithm To Transform Natural Languages To SQL Queries For Relational Databases
No ratings yet
An Algorithm To Transform Natural Languages To SQL Queries For Relational Databases
7 pages
Project Report - 7 - Merged
No ratings yet
Project Report - 7 - Merged
46 pages
Python Speech Recognition Guide
No ratings yet
Python Speech Recognition Guide
18 pages
Text-to-Speech Project Report
No ratings yet
Text-to-Speech Project Report
26 pages
UNIT 5 Application AI
No ratings yet
UNIT 5 Application AI
16 pages
Project Final 1
No ratings yet
Project Final 1
55 pages
DesktopAssistant Reoprt
No ratings yet
DesktopAssistant Reoprt
42 pages
Dbms Lab El Report
No ratings yet
Dbms Lab El Report
20 pages
Python Virtual Assistant Project Report
No ratings yet
Python Virtual Assistant Project Report
18 pages
Speech & Text Recognition Report
No ratings yet
Speech & Text Recognition Report
74 pages
Project Report - 1
No ratings yet
Project Report - 1
4 pages
Minor Project Sem 2
No ratings yet
Minor Project Sem 2
35 pages
Shanu Merged
No ratings yet
Shanu Merged
46 pages
Speech Recognition Report
No ratings yet
Speech Recognition Report
46 pages
Speech Recognition Bot Project Proposal
No ratings yet
Speech Recognition Bot Project Proposal
13 pages
AI Desktop
No ratings yet
AI Desktop
14 pages
Wa0002.
No ratings yet
Wa0002.
10 pages
Thesis 1
No ratings yet
Thesis 1
46 pages
Caption Generator
No ratings yet
Caption Generator
18 pages
Project Report
No ratings yet
Project Report
17 pages
Enhancing Speech Recognition Security
No ratings yet
Enhancing Speech Recognition Security
5 pages
Expert System Voice Assistant Project
No ratings yet
Expert System Voice Assistant Project
52 pages
DL Proj Rep
No ratings yet
DL Proj Rep
11 pages
B.tech It Batchno 136
No ratings yet
B.tech It Batchno 136
25 pages
Data Science: Text & Speech Analysis Course
No ratings yet
Data Science: Text & Speech Analysis Course
2 pages
Anurag Synop
No ratings yet
Anurag Synop
9 pages
Similarity 0505064848
No ratings yet
Similarity 0505064848
56 pages
Project Report - 3
No ratings yet
Project Report - 3
3 pages
Current Challenges and Application of Speech Recog
No ratings yet
Current Challenges and Application of Speech Recog
4 pages
Mid Sem Report
No ratings yet
Mid Sem Report
11 pages
Speech Recognition Seminar Report
No ratings yet
Speech Recognition Seminar Report
24 pages
Jasmeet Seminar Report
No ratings yet
Jasmeet Seminar Report
24 pages
Speech Recognition System Project 2023
No ratings yet
Speech Recognition System Project 2023
13 pages
Mini Project Report 3.00000000
No ratings yet
Mini Project Report 3.00000000
21 pages
Natural Language Processing With Some Abbreviation To SQL
No ratings yet
Natural Language Processing With Some Abbreviation To SQL
5 pages
Speech Recognition System Overview
No ratings yet
Speech Recognition System Overview
35 pages
PROJECT-22group (Final) 1
No ratings yet
PROJECT-22group (Final) 1
50 pages
Voice Assistant Project Report
No ratings yet
Voice Assistant Project Report
58 pages
Text-to-Speech for Accessibility
No ratings yet
Text-to-Speech for Accessibility
2 pages
224s 22 Lec1
No ratings yet
224s 22 Lec1
31 pages
Ai CH 5
No ratings yet
Ai CH 5
22 pages
AI Assistant PBL Project
No ratings yet
AI Assistant PBL Project
13 pages
Speech Recognition Seminar Report
87% (97)
Speech Recognition Seminar Report
32 pages
Project Report Rtu
No ratings yet
Project Report Rtu
17 pages
NLP To SQL
No ratings yet
NLP To SQL
1 page
CASE STUDY - Speech Recognition
No ratings yet
CASE STUDY - Speech Recognition
25 pages
Voice Based System Assistant Using NLP and Deep Learning-1
No ratings yet
Voice Based System Assistant Using NLP and Deep Learning-1
82 pages
Python-Based Voice Assistant Project
No ratings yet
Python-Based Voice Assistant Project
11 pages
Impact of Computational Linguistics
No ratings yet
Impact of Computational Linguistics
2 pages
CCS369 TEXT AND SPEECH ANALYSIS - Syllabus
No ratings yet
CCS369 TEXT AND SPEECH ANALYSIS - Syllabus
4 pages
Speech Recognition Final Report (1) - Removed - Removed
No ratings yet
Speech Recognition Final Report (1) - Removed - Removed
62 pages
A Survey of Large Language Model-Based Generative AI For Text-to-SQL: Benchmarks, Applications, Use Cases, and Challenges
No ratings yet
A Survey of Large Language Model-Based Generative AI For Text-to-SQL: Benchmarks, Applications, Use Cases, and Challenges
7 pages
NLP 1.3.1 - Speed Recogmnition
No ratings yet
NLP 1.3.1 - Speed Recogmnition
20 pages
Final 1
No ratings yet
Final 1
39 pages
Large Language Model Enhanced Text-to-SQL Generation - A Survey
No ratings yet
Large Language Model Enhanced Text-to-SQL Generation - A Survey
18 pages
GRIET Assistant: College Info via NLP
No ratings yet
GRIET Assistant: College Info via NLP
2 pages
LLM Model Transform For Short Term Trading On Commodity
No ratings yet
LLM Model Transform For Short Term Trading On Commodity
7 pages
Sanghdipproject 2
No ratings yet
Sanghdipproject 2
22 pages
Dxdiag
No ratings yet
Dxdiag
42 pages
Microeconomics Group Assignment
No ratings yet
Microeconomics Group Assignment
4 pages
The Maltese Education System
No ratings yet
The Maltese Education System
4 pages
Inset Action Plan
No ratings yet
Inset Action Plan
3 pages
Physics Project File
No ratings yet
Physics Project File
8 pages
IMCI
No ratings yet
IMCI
41 pages
ATR 568F Propeller Maintenance Guidelines
100% (3)
ATR 568F Propeller Maintenance Guidelines
12 pages
Jee Main Study Material Syllabus
No ratings yet
Jee Main Study Material Syllabus
1 page
Manual - Shiftconnector API-v73
No ratings yet
Manual - Shiftconnector API-v73
95 pages
9 Drawing Layouts and Simplified Methods 2020 Manual of Engineering Drawin
No ratings yet
9 Drawing Layouts and Simplified Methods 2020 Manual of Engineering Drawin
18 pages
Lesson 4 - Shopping
No ratings yet
Lesson 4 - Shopping
11 pages
Understanding Mark-on, Markup, and Markdown
No ratings yet
Understanding Mark-on, Markup, and Markdown
6 pages
Distribution of Key Natural Resources
50% (2)
Distribution of Key Natural Resources
23 pages
Vocabulary Set 28 - TAX ON FAST FOOD
No ratings yet
Vocabulary Set 28 - TAX ON FAST FOOD
4 pages
CGL Exam Analysis
No ratings yet
CGL Exam Analysis
6 pages
AirTraffic (Version1 0)
No ratings yet
AirTraffic (Version1 0)
10 pages
2024.10.8 VOC CI For Tracklights (AC782024100802)
No ratings yet
2024.10.8 VOC CI For Tracklights (AC782024100802)
1 page
CSEC Economics Exam June 2022 Guide
No ratings yet
CSEC Economics Exam June 2022 Guide
18 pages
Adolescent Developmental Tasks Overview
No ratings yet
Adolescent Developmental Tasks Overview
2 pages
Aip 2025
No ratings yet
Aip 2025
14 pages
English III Lesson Plan on Verbs
No ratings yet
English III Lesson Plan on Verbs
5 pages
Icar-Indian Institute of Pulses Research Kalyanpur, Kanpur - 208 024 (An ISO 9001:2008 Certified Institute)
No ratings yet
Icar-Indian Institute of Pulses Research Kalyanpur, Kanpur - 208 024 (An ISO 9001:2008 Certified Institute)
4 pages
Science Prep.2 Unit One L 1
No ratings yet
Science Prep.2 Unit One L 1
8 pages
Agniveer Best 1000 One Liner Questions - 49585372 - 2025 - 05 - 24 - 19 - 08
No ratings yet
Agniveer Best 1000 One Liner Questions - 49585372 - 2025 - 05 - 24 - 19 - 08
46 pages
Hermite Curves, B-Splines & NURBS
No ratings yet
Hermite Curves, B-Splines & NURBS
10 pages
Bba 4 Sem
No ratings yet
Bba 4 Sem
1 page
Grade 11 Digestive System Lesson
No ratings yet
Grade 11 Digestive System Lesson
5 pages
Differences Between Prenatal Development and Neonatal Development
No ratings yet
Differences Between Prenatal Development and Neonatal Development
2 pages
Insert Name of Project (Here) : Project Initiation Document (PID)
No ratings yet
Insert Name of Project (Here) : Project Initiation Document (PID)
14 pages
Forbidden Park
No ratings yet
Forbidden Park
5 pages

Final

Uploaded by

Final

Uploaded by

SHALAKA FOUNDATION’S

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 3

Simplify Database Interaction Enhance Accessibility

Speech To Query Conversion Improve Usability

Real Time Query Execution

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 4

Proposed a deep learning-based

Provided an overview of modern speech

Proposed a system that accepts the Due to incorrect Grammer or

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 5

Introduced a system that learns to map natural

Using Natural Language

SQLNet: Generating Structured Proposed a novel approach for generating SQL

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 6

Non-technical users often face challenges in interacting with

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 7

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 8

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 10

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 11

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 12

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 13

Average Time from

Avg. Response Time (sec)

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 14

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 15

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 16

Support Multiple Languages Contextual Understanding

Specific Applications and Use

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 17

25/06/2025 DEPARTMENT OF COMPUTER ENGINEERING 18

You might also like