Audio Data Annotation
Audio Data Annotation
TABLE CONCONTENTSS
AUDIO ANNOTATION GUIDELINES 2
AUDIO Annotation 2
DEFINITIONS 2
getting started with lABEL STUDIO 3
process of audio annotation 4
Best Practices AUDIO ANNOTATION 6
1. Clarity and Consistency 6
2. Avoid Bias 6
3. Quality Control 7
4. Collaborative Reviews 7
Annotating filler words 7
1. Understand Filler words 7
2. establish GUIDELINES FOR INCLUSION OR EXCLUSION 8
3. STEPS FOR ANNOTATING FILLER WORDS 8
4. Review and edit 8
Annotating overlapping conversations 9
1. Understanding overlapping speech 9
2. use appropriate notation 9
3. segmenting overlaps 10
4.categorize overlaps 10
5. ANNOTATE WITH CONTEXT 10
6. REVIEW AND EDIT ANNOTATIONS 10
Annotating inaudible Segments 11
CODE SWITCHING 13
FILLER WORDS 13
EXAMPLES 14
1. FILLER WORDS 15
2. OVERLAPPING 16
3. INAUDIBLE 17
4. CODE SWITCHING 17
1
AUDIO ANNOTATION GUIDELINES
AUDIO ANNOTATION
This manual provides a step-by-step guide for annotating audio data using Label
Studio, with instructions tailored to transcription and translation.
DEFINITIONS
1. Transcription
2. Translation
3. Annotation
The process of labeling data in form of text, images or audio so that models can easily
comprehend a given data source and recognize certain formats, objects, information,
or patterns in the future
4. Audio annotation
2
A clean verbatim transcription is also known as edited transcription or intelligent
transcription. This type of transcription does not change the text’s meaning or
paraphrase it. Also, it does not capture unnecessary words in the speaker’s speech.
Non-verbal communication that does not add value to the content is left out,
including filler words and stammering. The ultimate goal of this mode of
transcription is to achieve a balance between readability and completeness.
3
PROCESS OF AUDIO ANNOTATION
3. Listening:
4
4. Labelling:
In the provided text fields:
o Transcribe the audio content accurately.
o Translate the transcription into the required target language.
5
6. Review Annotations:
-Double-check your work to ensure:
o The transcription and translation are accurate.
o The required fields are completed.
o Make corrections if necessary before submitting.
7. Submit the Task:
a. Click on Submit to mark the task as complete.
● Ensure your annotations are clear and follow consistent labeling practices.
● Adhere strictly to the provided annotation guidelines to maintain
uniformity across tasks.
2. AVOID BIAS
● Annotate based solely on the content of the audio and its characteristics.
● Do not make subjective assumptions or infer meanings beyond what
is explicitly stated in the audio.
3. QUALITY CONTROL
4. COLLABORATIVE REVIEWS
● Filler words include expressions like "um," "uh," "like," "you know," and "so."
,”Mmh”,”Eeh”,They are often used in speech to fill pauses but do not add
substantive meaning.
Example:
3. SEGMENTING OVERLAPS
When transcribing overlapping speech, segment the audio into manageable parts:
● Identify Overlap Points: Clearly mark where the overlap begins and
ends using timestamps.
● Sequential Representation: For clarity, you can represent
overlapping speech in a sequential manner, indicating which speaker is
contributing at each point.
4.CATEGORIZE OVERLAPS
Establish categories for different types of overlaps:
8
● Turn Stealing: When one speaker interrupts another.
● Anticipated Turn Taking: When a speaker prepares to take their
turn but overlaps momentarily.
● Include notes about the interaction dynamics (e.g., was the overlap
cooperative or interruptive?).
● Use tags or labels to indicate the nature of the overlap (e.g., "bck" for
back-channel, "tst" for turn stealing).
9
CODE SWITCHING
To annotate code switching we use [cs]: Prefix for words not in the primary language
and [cs] suffix to mark the end of the word not in primary language.
Example
No wĩ ta rĩu ona Nairobi ĩyo ũnjĩĩre, [cs]“Faith, nitumie shamba yako, nipigie simu
shamba yako ikiwa na mahindi ama na maharagwe unitumie,”[cs] nĩ ngũhota
gũtũma na wone ũrĩ o na
Adapted words
Create a catalogue of adapted words that are derived from English or Kiswahili
Transcribe the borrowed word following the phonological rules of the target
language
FILLER WORDS
Ehe- Encouraging
Eeh - [Think
10
EXAMPLES
1. FILLER WORDS
11
2. OVERLAPPING
12
3. INAUDIBLE
13
4. CODE SWITCHING
14
15