Epedimiology and Research Methodology
Epedimiology and Research Methodology
in
Dr JZJ Killewo
Dr SH Kapiga
Dr I Semali
Dr G Kwesigabo
Contents -----------------------------------------------------------------------Page
I. PRINCIPLES OF EPIDEMIOLOGY
Foreword
At last we have come up with a manual of lecture notes in Epidemiology and Research
Methodology after many attempts since 1985. The motivation to do this work was a result of a
chronic shortage of books and even when these became available they were very few compared to
the number of students who needed to use them. There was therefore, a scramble for books with
irresistible urge and temptation by many students to ‘beg, borrow or steal’ the few books in order
not to return them. These ‘lost’ books became a hot cake among the students and were being passed
from one batch of students to another until a fresh supply was ‘begged borrowed or stolen’ again.
As years passed by, the situation became worse, and at times we were not even able to provide
handouts for every lecture given due to the meager resources provided to the department in terms of
stationery and photocopying services. With the advent of cost sharing in which students were
required to contribute towards acquisition of lecture handouts the department provided one copy of
notes for further reproduction by the students themselves. This has been satisfactory to the
department but perhaps not equally so for the students since the notes have been loose and scattered
here and there making revision for examinations difficult.
This set of lecture notes is the answer to some of the problems mentioned above. However, it must
be properly utilized. It is not to be taken as a replacement for books and lectures. Relevant books
and other literature must be consulted as indicated at the end of most notes. It is recommended that
the notes are read before a scheduled lecture is delivered so that students can participate fully during
the lecture by asking questions about concepts which might not have been clear to them during the
time they first read the notes.
The department will endeavour to up-date the notes from time to time to keep students abreast with
new developments in the field of epidemiology and research methodology. We sincerely hope that
our students will enjoy the subject and that this set of notes will become a useful guide towards their
goal.
Dr. J. Z. J. Killewo
HEAD OF DEPARTMENT
SEPTEMBER 1995
4
1
PRINCIPLES OF EPIDEMIOLOGY
5
Introduction to epidemiology
Epidemiology may be defined as the study of the distribution and determinants of disease
frequency. These three closely interrelated components - distribution, determinants and frequency -
encompass all epidemiologic principles and methods.
History of Epidemiology
The concepts of epidemiology have their origin in the works of Hippocrates "On Airs, waters and
places" which stress the importance of considering the variety of environmental influences on
diseases in humans, e.g. the hot, the cold, the winds, the water quality, and whether the inhabitants
are fond of drinking and eating to excess and given to indolence or are fond of exercise and labour
and not given to excess in eating and drinking. This concept provides for appropriate definition,
description and analysis of a variety of situations in efforts to investigate causal relationships in
disease occurrence with the ultimate aim of preventing disease.
In an attempt to investigate and control epidemic diseases in historic times two theories evolved in
the course of time: The contagion vivum and the miasma theories.
According to this theory a living contagion was thought to be involved. This theory necessarily
depended on two other concepts:
(i) The specificity of both disease and its cause
(ii) The existence of microscopic organisms
Although this theory is relatively modern (mid 1800's, and about 25 years before the time of Louis
Pasteur) some practices commonly associated with it are quite old. For example, the custom of
isolating people with contagious diseases is recorded in the bible and was continued by the Roman
Catholic Church during the Middle Ages. Girolamo Fracastoro (1478-1553) was the first to state
explicitly that specific diseases directly result from specific contagions, but he did not have any
notion of micro-organisms. One of his contemporaries, Gilamo Cardano, stated in 1557 that the
seeds of disease were minute animals capable of reproducing their kind. You might describe this
stage of development as the beginning of the science of microbiology.
This theory, dating back to the early 1700's, offered an alternative explanation for the origin of
epidemics. The idea was based on the notion that when the air was of a bad odour or quality the
persons breathing that air would become ill. For example, "malaria" means "bad air". This theory
led to the total abandonment of the contagion vivum theory, but after the 1720 Marseilles plague
epidemic which could only be scientifically explained by the contagion vivum theory and the
discovery of the fungal origin of muscardine, a silkworm disease, by Agostino Bassi in 1816 the
contagion vivum doctrine was adopted by many European scientists. and the miasma theory
subsequently abandoned. Indeed, the contagion vivum theory was the scientific basis for the
6
initiation of the filtration of part of the London water supply in 1829 to remove possible disease
contagions.
In essence, the current era of epidemiology began in the mid 1800's when the contagion vivum
theory and the evolving epidemiologic study merged.
The essence of the epidemiologic study is the comparison of groups of people with regard to a
characteristic of interest in order to determine relationships in disease occurrence. The earliest
account of such a comparison is found in the Old Testament in the first chapter of the Book of
Daniel. Today an epidemiologist reading this account will have much to say regarding the
composition, selection and allocation of the comparison groups into the different food allocation
categories (in modern epidemiology - the treatment groups). Bias resulting from the way the
outcome of treatment was assessed in this biblical account forms yet another subject of
epidemiology we shall study. In epidemiology, comparisons of groups are so common that one can
not avoid the use of the tool even in their simplest situations. For this reason investigators should
be aware of the misleading use of numerators in making epidemiological comparisons.
John Graunt, inspired by Bacons writings collected the bills of mortality which had been initiated in
1603 by the parish clerks in London. After organising the published bills he derived from them
inferences about mortality and fertility in the human population. He noted the following inferences:
From the collected data Graunt constructed the first known life table, summarising the mortality
experience in terms of the number, percent or probability of living or dying over a lifetime.
James Lind used the idea of comparative groups in studies to elucidate the aetiology and treatment
of scurvy. From the results of his study, Lind inferred that citrus fruits cured the scurvy and that this
would also provide a means of prevention.
William Farr was the first compiler of Abstracts of the Registrar General's Office and organised the
first modern vital statistics system.
John Snow conducted a series of classical studies on cholera (1848 - 1854) and inferred the
existence of a "cholera poison" transmitted by polluted water. This led to a legislation mandating
that all of the water companies in London filter their water by 1857. However, It was not until 1883
that Robert Koch identified the cholera vibrio.
Sir Benjamin Ward Richardson, a pioneer of the sanitary movement, was among the first to develop
a disease reporting system and to propose the teaching of preventive medicine in medical schools in
the 1860's.
Much of our epidemiological knowledge on measles is credited to P.L. Panum who studied the
measles epidemic on the Faroe Islands in 1846.
Sir A. Bradford Hill pioneered the application of epidemiologic methods to non-infectious diseases.
7
C. Descriptive epidemiology : studying the distribution of disease in relation to time, place, and
person
The epidemiologic patterns of infectious diseases depend upon factors that influence the
probability of contact between an infectious agent and a susceptible person, known as the
host. This probability of contact is determined by an intricate interplay between factors related
to the triad of agent, host and environment. It is important to approach disease causation in this
manner as it gives us an insight necessary for developing rational preventive measures.
Examples of factors and associated diseases are listed below:
A. Nutritive elements
Factor Disease
Excess or deficiency of cholesterol atherosclerosis
calories Kwashiorkor
protein Marasmus
iodine goitre
iron, folic acid anaemia
B. Chemical Agents
Presence of poisons or allergens
pesticides poisoning
drugs, alcohol intoxication/dependency
allergens eczema
pollen hay-fever
C. Physical Agents
energy, speed accidents
solar radiation sun burns
radioactivity neoplasms
D. Infectious Agents
C. Socio-economic
- occupation occupational hazards
- urbanisation crowding, stress
- development education, poverty, availability of health services
- disruption civil and other wars - Rwanda 1994/5, natural
disasters- earthquake in Kobe Japan, 1995.
If the characteristics of the agent, host and environment are studied in isolation, we will never
reach a full understanding of why diseases are distributed the way they are. The interaction
between agent, host and environment is the object of the study of ecology. The study of this
interaction will give clues to answers of questions like such as:
Ecology is the study of the relationship of organisms to each other as well as to all
other aspects of the environment
Climax state: the final constant condition of flora and fauna. The equilibrium or balance
reached varies, depending on climate and soil. The climax state is not stable,
but a dynamic condition.
The main factor responsible for precluding stability is human activity. Human activity can
change the balance much more rapidly, profoundly and unpredictably than changes in weather,
climate or natural events. Examples are: irrigation projects, behavioural changes (e.g. sexual
behaviour), wars etc.
Food chain: from the smallest plants or animals up to large predators, plants and animals serve
as food for other species higher up in the food-chain. Predators (lion, cheetah and hyena) and
birds of prey (hawks, eagles) are at the end of the food chain. They can be exposed to
dangerously high levels of toxins and chemicals, originating from the environment, and
accumulated through the food-chain.
Habitat: This is the typical environment in which a certain species usually lives. The habitat
of different species is very much interdependent. Some species require very strict conditions,
and are restricted to limited geographic areas. Humans have adapted to an unusually wide
habitat, partially as a result of technology. The habitat of domestic animals naturally coincides
much with the human habitat. Psittacosis and toxoplasmosis are typical zoonoses originating
from pets.
Population density: fluctuations in animal and plant populations affect other species higher in
the food-chain. This may become relevant to human diseases, especially zoonoses (plague,
rabies). The density of human populations is an important variable too, as it affects food
availability, exploration of new areas (possible habitats for vectors: onchocerciasis,
trypanosomiasis, jungle yellow fever), increased numbers of domestic animals, in turn leading
to overgrazing and erosion/desertification. High human population densities (urbanization)
together with human patterns of socialisation provide good opportunities to diseases of close
contact (droplet transmission, sexual transmission).
It has been found, that the probability to contract certain diseases (e.g. measles) does not
depend on the proportion of susceptibles in a community, but rather on the absolute number of
susceptibles. In the USA measles transmission at present is confined to larger cities (over 0.5
million inhabitants.), where adequate numbers of susceptibles are available.
Socialisation: refers to the ways in which humans socialise or come in contact with each other.
Within each community/culture a great number of patterns exist: through work, school,
religion, recreation etc.
Adaptation: A heritable component of the phenotype which confers an advantage in survival and
reproductive success. In another perspective, it is the process by which organisms adapt to
environmental conditions. Two organisms co-existing in a host-parasite relationship, usually show the
phenomenon of adaptation: the parasite becomes less virulent, and the host develops resistance. Due to
selection pressure, more sensitive hosts will be weeded out, or relatively resistant mutants will develop.
This is in the interest of the parasite as well : the worst that might happen to the parasite, is extinction of
its obligate host. If man is only an occasional host, there is no pressure towards a milder course of the
disease in question, as long as an abundant reservoir of infection remains available elsewhere in nature
(e.g. rabies).
11
Development of relative resistance has been described for measles on Fiji after 1875, when
20% of the population died, and this has been postulated as an explanation for the decreasing
mortality from tuberculosis in Europe early this century. The emergence of cholera in 1870 as a
hitherto unknown disease, has been explained by the incorporation of plasmids (small
corpuscles of genetic material outside the nucleus) coding for an enterotoxin, originating from
E. Coli, by innocent vibrios in the estuaries of Bengal. Also Vibrio cholerae seems to adapt to
less virulent strains.
Endemic syphilis, previously highly endemic in many arid regions like Botswana and Saudi-
Arabia, has been disappearing. As the treponemes are very fragile, and require very close
contact between infected and uninfected to get transmitted, increasing personal hygiene and
wearing of clothing might have forced treponemes in a specific 'niche' (special limited habitat
of a particular organism): its only refuge was intimate sexual contact.
Poliomyelitis has showed a remarkable change in its spectrum: only during this century it has
changed towards much feared epidemics of infant paralysis. Presumably due to improved
hygiene and social-economic conditions and opportunities to spread in early infancy (when
polio only causes mild diarrhoea, rarely followed by paralysis). From an endemic disease of
early childhood, it became epidemic amongst school-age children, with frequent sequelae of
paralysis.
At which percentage of immunes herd immunity occurs depends on the disease concerned, and
on socialisation patterns. Measles transmission still occurs in places like Tanzania and other
tropical developing countries. In these places the measles virus may be present in the
community, The number of cases depends upon the number of susceptible persons (children)
exposed to the virus. If there is a cyclic pattern with peaks occurring at intervals of every 2-3
year, this suggests that every 2-3 years there is a sufficient number of new susceptibles, hence
low herd immunity.
Childhood diseases: diseases do occur at that particular age when they are highly infectious.
Such diseases usually provide lifelong immunity and socialisation patterns provide ample
opportunity for transmission at a certain age (e.g. school entrance). As they typically occur in
epidemics, which cease when the number of susceptibles is almost exhausted. They are
anthroponoses and require a large human population, subdivided in fairly large communities. A
disease like measles cannot persist on smaller islands; it only (re-)occurs when imported.
12
These are referred to as factors that must be present (a sine qua non) for the disease to occur.
These are the Agents of disease such as bacteria and viruses in infectious diseases, and fire,
nutrition, radiation or various poisons in non-infectious diseases. Many, but not all of the
known agents of disease are located in the biological and physical environment. A possible
agent in the social environment is maternal deprivation. Many studies have demonstrated that
the quality of parental care in the early years of life is intimately related to normal physical,
emotional, and mental development. From an ecological point of view, an agent is considered
to be a necessary but not sufficient cause of disease because suitable conditions of the host
and environment must be present for the disease to develop.
2. Contributory factors:
These are host and environmental factors which are associated with increased likelihood of
disease occurrence. e.g.: Host factors: immunity, sex, age. Environmental factors: social
(poverty, ignorance), physical (rainfall, temperature) or biological (presence of vectors, animal
reservoir). The interplay of these factors - also called ecology - can be presented in the form of
a model.
(Ecology: study of the inter-relationship between living things (agent and host) and the
environment. Human ecology : the study of human groups as influenced by environmental
factors often including social and behavioural factors). (Model: simplified framework which
can be used to represent more complex systems)
In this model, also called the epidemiological triad, three main components are important in the
chain of disease transmission.
AGENT
ENVIRONMENT HOST
Under stable ecological conditions, the epidemiological triad is in a balanced state, and the
disease can be said to be absent or endemic. If this balance is disturbed and becomes
unfavourable to the agent, the incidence of the disease will decrease. If the situation remains
unfavourable the disease may become sporadic or even disappear. If, on the other hand, the
balance alters to favour the agent, then an epidemic may occur. Although a lot may be known
about agents of disease, the role of host and environmental factors is incompletely understood.
For example, we do not know why only some of the people exposed to large doses of X-rays
develop leukaemia or why not all heavy smokers do develop lung cancer.
Some diseases such as schizophrenia, coronary heart disease, rheumatoid arthritis and essential
hypertension have not been linked to any specific causative agent. In this respect, new models
have been developed which de- emphasise the role of the agent and rather stress the multiplicity
of interactions between host and environment. These model are:
1. the wheel model
2. the web of causation model
PHYSICAL BIOLOGICAL
ENVIRONMENT ENVIRONMENT
- temperature - reservoir
- rainfall HOST - vectors
- altitude - heredity - microbes
- radiation - sex (agents)
- chemicals - age
SOCIAL ENVIRONMENT
- socio-economic conditions
- occupation
- education
- culture
- customs
- beliefs
If we consider road traffic accidents as one of the diseases, there are a number of factors which
influence the occurrence of an accident :
1. Host factors:
- age : young drivers are more prone to accidents than older ones.
- driving experience: less experience, more accidents
- drinking habits : a driver who is under the influence of alcohol is more prone to causing
accidents than the one who is not.
4. Biological environment:
- animals on the road may cause accidents
- pets and insects (tsetse or bees) in the vehicle may cause accidents
Intervention for control: all of the above factors should be intervened in order to reduce the
incidence of accidents.
In coronary heart disease or essential hypertension, the most important role is played by host
and social environmental factors.
1. Host factors:
- hereditary: probable genetic disposition
- age: usually adults are more affected
- behaviour: consumption of large quantities of cholesterol containing foods.
In this model many factors are associated with the occurrence of disease. Effects do not depend
on a single isolated cause but rather develop as a result of chains of causation in which each
link is a result of a complex genealogy of antecedents. The large number of antecedents creates
a condition that may approximately be conceptualized as a web.
Examples:
15
1. Web of causation of cardiovascular diseases, smoking, hormones, stress, diet (fat, total
calories, salt), physical activity, heredity, hyperlipidemia, obesity interact to bring about
atherosclerosis, hypertension and coagulation / clot lysis which in turn lead to coronary heart
disease, cerebral vascular disease and hypertensive disease.
2. Treatment of syphilis and the development of jaundice (serum hepatitis): before the advent of
antibiotics, the treatment of syphilis was by intravenous injection of arsenical compounds.
Many of the syphilitic patients who were so treated later on developed jaundice. In 1967
viral antigens (necessary factor) became associated with hepatitis for the first time.
References:
5 Krieger N. Epidemiology and the web of causation: Has anyone seen the spider ? Soc
Sci Med 39 (7) 1994 pg 887-903.
16
1. Pre-pathogenesis:
• In this stage, disease has not developed but the groundwork has been laid by the
presence of factors that favour its occurrence.
• The stage of pre-pathogenesis involves the period of interaction of agent, host and
environment. Ultimately this interplay may create a disease stimulus, which in turn
might produce disease in man.
• Agent refers to that aetiological factor, which is necessary to bring about a particular
disease, disability or pathological state in a susceptible host.
• Host refers to the organism capable of being infected by the agent.
• Factors whose presence is associated with an increased probability that disease will
develop later are called risk factors.
• Risk factors may be immutable (e.g. age, sex, race) or susceptible to change (e.g.
smoking habit, alcoholism, high cholesterol levels in serum etc.).
2. Pathogenesis:
• The period of pathogenesis is the natural course of the disease in man from the first
interaction with the disease provoking stimulus, until equilibrium is reached, or recovery,
death, defect or disability ensues.
• The following phases can be distinguished:-
(a) Incubation period: this is the period from the moment of infection until the
disease becomes manifest (i.e. discernible early lesions appear).
(b) Pre-patent period: period before any lesions are discernible by any available
laboratory means.
(c) Induction period: the period of time from causal action until disease initiation
(d) Latent period: the time interval between disease occurrence and detection. The
latent period can be reduced by improved methods of disease detection.
• The stage of pathogenesis can result in three distinct outcomes namely; recovery, death, or
disability.
(3) Post-Pathogenesis:
This is the period of deformity or disability, resulting from the pathogenesis of disease.
The deformity or disability may be temporary or permanent.
17
Understanding the natural history of a disease, enables an orderly listing of factors which
predispose to and/or cause a given disease, and of factors which may lead to improvement
or deterioration in the disease process. This information is essential to prevent a disease
from occurring or, once it has occurred, to prevent the disease process from causing
deterioration, defect, disability or death. These factors are called risk factors.
2. Prediction of prognosis.
It enables us to predict the prognosis of a given patient, i.e. to estimate the probability that a
patient with a given disease will die, recover, or suffer various complications. We can use
this knowledge to develop a prognostic model.
Levels of prevention:
In a narrow sense, prevention simply means inhibiting the development of a disease before it
occurs. However, in current usage, the term has been extended to include measures that interrupt or
slow the progression of the disease. For this reasons several levels of prevention are said to exist.
1. Primary prevention
2. Secondary prevention
Refers to detection of people who already have a given disease at the earliest possible stage,
in order to stop the disease from developing further (i.e. early detection and prompt
treatment). examples:-
• Improved medical care e.g. new therapies, specialized care facilities, improved access
to care.
• Screening for early diagnosis and specific treatment. Screening can be carried out at
two stages:-
(a) Detection of subclinical disease using laboratory tests. e.g. screening of syphilis by
VDRL tests or screening of anaemia by Hb estimation.
(b) Detection of early clinical disease, using clinical examinations with laboratory tests
if necessary. e.g. screening for breast cancer by palpation and/or mammography or
screening on tuberculosis by chest X-ray
18
3. Tertiary Prevention
Prevention of advanced disability and death in a patient who has already developed the
disease, and cannot be cured. It includes rehabilitation.
Risk factors:
-No immunity -Immunization with polio vaccine
-Overcrowding -Improved housing
-Poor hygiene -Health education
2. Pathogenesis Secondary prevention
-Early symptoms (e.g. fever, headache, vomiting) -Health education during epidemics to recognize
early symptoms
-Late symptoms (e.g. paralysis) -Treatment of paralysed patients to prevent
permanent paralysis.
3. Post-pathogenesis Tertiary prevention
-Permanent paralysis -Physiotherapy to prevent contractures
-Vocational training
-Provision of walking aids
REFERENCES:
A detailed description of each of the above components is presented in the following sections:
1. Agent
The agent refers to that aetiological factor, which must be present for the disease, disability or
pathological state to occur in a susceptible host. An agent may be defined as the presence, absence,
excess or deficiency of a certain factor. (Refer to the section on ecology of disease and models of
disease causation)
Resistance : this refers to the ability of the agent to survive under adverse environmental conditions. Some
agents are remarkably resistant, such as Mycobacterium tuberculosis, Clostridium tetani etc. Others are
extremely fragile, such as gonococci and influenza viruses. The term resistance is also used with a different
meaning as a characteristic of the host.
Infectivity : this is the capacity of a micro-organism to enter a susceptible host. It can be expressed as the
proportion of the susceptible population, which is infected by a particular organism. Experimentally it can be
thought of as the minimum number of particles or agents necessary to cause infection in 50% of a group of
hosts of the same species. The so called Infective dose to 50% of the hosts (ID50). Susceptibility of an
individual can be measured by serological means (absence of protective antibodies). For certain diseases
susceptibility may be assumed, if a person has no previous history of overt disease.
Pathogenicity : this refers to the capacity of a micro-organism to cause overt disease in the infected host. It
can be expressed as the proportion of the infected population (to be established by serological or other
laboratory techniques), which develops overt disease.
Antigenicity : refers to the ability of the agent to induce antibody production in the host. These antibodies
may not necessarily be protective.
Toxigenicity : refers to the capacity of the agent to produce a toxin or poison that can cause pathogenic
effects on the host. The pathogenic effect of agents in diseases such as botulism and shellfish poisoning
depends on the toxin produced by the micro- organism rather than on the direct effect of the micro-organism
itself.
Virulence : refers to the severity of the disease. It is the degree of pathogenicity of an infectious agent.
One measure of virulence is the Case Fatality Rate (CFR) which can be expressed as follows:
20
The infectiousness of a host differs from disease to disease, but is usually consistent for any given
disease. Many diseases are transmissible even during periods when the disease is not (yet) clinically
manifest. For instance mumps during the incubation period, and gonorrhoea in asymptomatic cases.
Some diseases can be transmitted years after recovery, as in a chronic carrier state: e.g. typhoid and
hepatitis B, or by reactivation of a latent infection e.g. herpes.
Reservoir (of infectious agents) refers to any human beings, animals, arthropods, plants, soil or
inanimate matter or a combination of these in which an infectious agent normally lives and
multiplies, and on which it primarily depends for survival and reproduction in such a manner that it
can be transmitted to a susceptible host. A reservoir of infection may also be termed as the natural
habitat of the infectious agent.
It is important to make a distinction between those diseases in which humans are the only reservoirs
- anthroponoses (e.g. smallpox, measles, cholera etc.) - and those diseases which involve other
animal reservoirs - zoonoses (e.g. plague, rabies).
Portals of exit in man include the respiratory passages, the alimentary canal, the openings in the
genito-urinary system, and skin lesions. Additional portals of exit may be made available through
insect bites, the drawing of blood, surgical procedures, accidents and other injuries.
For the chain of transmission to be continued, the portal of exit must be appropriate to the
particular agent. To produce infection, the agent must exit from the source in sufficient quantity to
survive the environment and the defences surrounding the portals of entry in the new host.
A. Direct transmission
Direct transmission means that the agent is transmitted directly from the infected host (man or
animal) to the new host. e.g. influenza, gonorrhoea etc.. Direct transmission may be horizontal
(between people of the same generation) or vertical (between mother and child).
B. Indirect transmission
21
Indirect transmission requires a vehicle or vector to carry the agent of disease from one host to
another.
Vehicles may be substances such as water, air, food, blood used in transfusion, or formites
(inanimate objects used by an infectious host, such as clothing, handkerchiefs, doorknobs etc.).
Multiplication may or may not take place in or on the vehicle (e.g. in influenza, hepatitis and
streptococcal disease).
Vectors may be either mechanical or biological, but are always living. Mechanical vectors are such
animals as insects, which can carry agents from place to place on their feet, proboscis or other body
parts. (e.g. flies in Shigellosis). Biological vectors must have growth or multiplication of organisms
occurring within the vector (e.g. fleas in plague). Some of these vectors may be necessary to the
life cycle and therefore to the survival of agents of certain diseases (e.g. malaria).
C. Complex cycles
- helminths : ovum > miracidium > cercaria
A portal of entry must be available in the new host. This portal of entry must be appropriate to the
particular agent. Most natural portals of entry in the new host provide barriers to infectious agents,
such as the epithelia in the respiratory passages, acid gastric secretion, etc. The intact skin also
provides an excellent defence against entry of agents. However, as in the case of the portals of exit,
insect bites, other skin lesions, blood transfusions, surgical procedures and accidents may result in
additional portals of entry for disease agents.
6. Host
Host refers to the organism capable of being infected by the agent. One can distinguish between
different types of hosts : Obligate host: if man is necessary for the life cycle or for the continued
existence of an agent, he constitutes the obligate host for this agent. Man is the obligate host for
such agents as Salmonella typhi, measles and influenza virus. Incidental (occasional or accidental)
hosts are not usually involved in the natural cycle of transmission.
22
Infected: an infected person harbours an infectious agent and has either manifest disease or
inapparent infection. He may or may not be infectious.
Infectious: is a person, from whom the infectious agent can be naturally acquired. A person or his
articles or clothing may also be merely contaminated with an infectious agent, without being
infected.
Immune: is a person who possesses specific protective antibodies or cellular immunity as a result
of previous infection or immunisation. Alternatively, these may result from such previous
experiences as to respond adequately with production of antibodies sufficient to prevent clinical
illness following exposure to the specific infectious agent. Immunity is relative: an ordinarily
effective protection may be overwhelmed by an excessive dose of the agent. It may also be
impaired by immuno-suppressive drug therapy or concurrent disease such as AIDS.
Susceptible: a person is considered susceptible if he does not possess sufficient resistance (inherent
and/or acquired) against a particular pathogenic agent to prevent contracting a disease when
exposed to the agent. This definition includes the fact that the person or animal must belong to a
species that is biologically capable of being an efficient host to the agent in question. Susceptibility
of a host may be modified by characteristics, such as age, sex, race, genetic make-up, physiological
state, habits and customs, pathological state and his previous experience with the agent (immunity).
References:
As infectious diseases are due to interaction between agent, host and environment, methods of
control should attempt to modify these factors whenever possible. For this reason, a detailed
understanding of a disease and its interactions is required, in order to develop an effective approach
to its control.
In general, the following methods for prevention and control of infectious diseases are available:
- personal hygiene
- proper food-handling procedures
- protective clothing/repellents for insect bites
- avoid suspect water, foods, animals and insects
2. Chemical methods
- Chlorinate water supplies and sewage effluents
- disinfect (potentially) infectious material
Objectives of Immunisation
1. Produce a good humoral, cellular and local immune response, similar to natural infection
2. Produce protection against clinical disease and re-infection
3. Give protection over several years, preferably a lifetime.
4. Result in minimal immediate side reactions or mild disease and with no delayed effects such as
late reactivation, CNS involvement or cancer
5. Can be administered simply in a form acceptable to the public
6. Cost and benefits of administration should clearly outweigh the cost and risk of natural disease
24
Case - finding
For certain diseases, case-finding and adequate treatment is an important means of source reduction
(e.g. tuberculosis). Case-finding may be active or passive. In passive case-finding, only cases
reporting spontaneously to be facility are treated. In active case-finding, activities are undertaken to
screen a population for early signs of a certain disease.
Surveillance
The term surveillance. may be defined as a system of close observation of all aspects of the
occurrence and distribution of a given disease, through the systematic collection, tabulation,
analysis and dissemination of all relevant data pertaining to that disease. Although the methodology
of surveillance is basically descriptive, surveillance must be dynamic, current and purposeful. It is
fundamental tall to prompted effective control and prevention of disease.
Elimination
Elimination may be defined as the purposeful reduction in prevalence of a communicable disease, to
the point of continued absence of transmission, from a certain geographical area. For instance,
malaria has been eliminated from Europe. As elimination is confined to a limited area, continuous
vigilance against re-introduction of the disease remains important.
Eradication
Eradication means the complete and universal removal of ac certain disease from the world. So far
only eradication of smallpox has been successful. Smallpox virus presently only persists in a few
laboratories around the world. The advantage of eradication over control is the cost of a time-
limited capital investment as against the ever-recurring cost of control measures.
1. Attack phase: aimed at the elimination of sources of infection and prevention of possibly
infectious relapses.
2. Consolidation phase: aimed at the absence of infectious cases for a specified period of time.
The end point of this phase is epidemiological eradication.
3. Maintenance phase: full eradication will be achieved when all old infections have naturally died
out. This is particularly important for diseases where the agent tends to persist (tuberculosis,
syphilis, leprosy).
The phases have to be worked out according to the specific requirements of a given disease. It is
crucial that methods, operational standards and end-points are clearly defined.
1. Numbers
Number of new cases of active Tuberculosis in some regions of Tanzania 1965
Table I
Region:
Dodoma 900
Coastal 850
Singida 700
Tabora 870
Mwanza 450
Other region 8,280
TOTAL 11,750
The limitation of using numbers is, that they do not allow any comparison, as the populations they
refer to are unknown. The numbers in table I above (fictitious data) give the false impression that
Dodoma is the most affected. Nevertheless numbers are useful for national allocation of resources
for instance to determine the amount of tuberculostatics required per region.
2. Ratios:
A ratio is the result of the division of two numbers; it is the basic measure of the relative magnitude
of two numbers. For example
numbers of males
Sex ratio = -----------------------
number of females
Ratio is a fraction whose denominator is independent of the numerator. Two distinct types of ratios
are commonly used; one has dimension while the other is dimensionless. The number of hospital
beds per 1000 population is a ratio possessing dimension; which male to female ratio, is
dimensionless. Still birth per 1000 live births is another example of a ratio with a dimension.
3. Proportion
A proportion is the result of the division of a number by the population it refers to. Unlike a ration
it is a fraction whose numerator is included in the denominator. It is a dimensionless quantity whose
value must range from 0 to 1. A proportion can be expressed as a fraction
( 0.5) or as a percentage (50%).
.
For instance: the number of new TB cases in Tabora Region is 870, and if the population of Tabora
is 870,000 people, the proportion of new TB cases in 1965 is, 870 / 870,000 = 00010 or 0.010% or
0.1 per 100,00. This allows better comparison between the regions regarding the magnitude of the
TB problem in 1965:
26
1
By using this kind of measurement one can know which of the regions had the biggest problem of
new TB cases when the regional population is taken into account. To illustrate this point table II
below (tuberculosis data) shows that despite differing number of new cases, the magnitude of the
problem of new cases of TB was the same in the regions in 1965.
Table II
Table III
Region New Case Proportional distribution
%
PMR shows the relative importance of a disease or factor in causing mortality e.g. Deaths
due to AIDS expressed as a ration of total deaths.
1
*the specified group may be categories of disease, age or sex group etc..
27
6. Index
An index is a percentage of a standard value. In the following table IV the IMR index is calculated
using the 1970 IMR as a reference is observed to decrease with decreasing time which clearly
indicate improvement in the health of the children (infants)
Table IV
7. Rates
Is basically a proportion that includes specification of time period or a ratio in which there is a
distinct relationship between the numerator and the denominator with time being and intrinsic part
of the denominator. Rate refers to the instantaneous change (or potential for change) in one
quantity per unit change in another quantity, in a period of time. A rate indicates the frequency of
events (like births, deaths, illness), occurring in a population per unit of time.
The scale of this measure (percent, per 1000 or per 100,000) should always be stated, as well as the
unit of time (per year, per month). For example a rate can generally be expressed as follows.
The denominator must be clearly defined and the numerator completely counted.
2
Examples of rates are: Crude Birth Rate, Crude Death Rate and Incidence Rate
Many measurements are erroneously called rates, while they are actually proportions; e.g.: Case
Fatality Rate, Prevalence Rate, maternal mortality rate, etc. In many such rates the denominator is
not exactly the population at risk e.g.: Infant mortality rate, where the denominator is composed of
2
Note: Prevalence rate when unspecified will always refer to number of cases existing in a given population
at a given point in time, divided by the total population.
28
live births during the year, which is strictly not the population at risk of death under one year during
the same year.
To avoid dealing with fractions, prevalence rate is often expressed as per 100, per 1000, per 10,000,
or per 100,000 depending on the frequency of the disease. Occasionally reference is made to a term
called period prevalence. This refers to the proportion of individual who ever suffered from the
given condition (old cases + new cases) during a period of time.
Point prevalence rate is used to measure the magnitude of a disease in a community. The following
general formula is used.
Number of people in a group with a certain characteristic
Prevalence = --------------------------------------------------------------------------x 100
Total number of persons in the group
So a prevalence is a proportion. When we say "In 1985 the prevalence of malnutrition in under-5's
was 12%, We means that 12% of all children under the age of 5 years were malnourished in 1985.
Strictly speaking, it is incorrect to speak of 'prevalence rate'.
2. Incidence rate
The incidence rate indicates the number of new events in a certain population at risk of such events
during a specified period of time.
There are two distinct measures of disease incidence RISK and RATE which must be differentiated.
Risk (Cumulative incidence -CI) is defined as the probability of a disease-free individual
developing a given disease over a specific period of time, conditional to that the individuals’ are not
dying of any other disease during that time. As a conditional probability risk varies from 0 to 1, and
is dimensionless. For example a 5 year risk of developing coronary heart disease (CHD) for an
individual aged 55 could be obtained as:
• Population of healthy individual aged 55 years at risk at the beginning of the observation period
is 15,000. This population is followed up regularly for a period of five years. During the five
years all new cases of CHD were identified and counted, which were 500.
• Therefore the 5 year risk of CHD = 500/15000=0.033 or 3.3 percent.
29
A related frequency measure is the risk odds which is the conditional probability of developing the
disease divided by the probability of not developing the disease during a given period of time.
From the above example odds ratio = 0.033 = 0.034
(1-0.033)
In figure 1. the first subject contributed 2.5 person years of observation prior to the onset of illness.
Similarly the contribution made by each of the subject can be calculated and added to form the total
person time of observation as the denominator. The incidence density can therefore be calculated
thus:
ID = I/PT
Where I = the number of new cases
PT = the total person time of observation
N
PT = Σ d(ti)
1:1
R = I/No
Where I = the number of new cases
No = the number of disease free individuals at risk at the beginning of the study.
Numbers of persons in a group developing a disease during a given time period
Incidence Rate = ----------------------------------------------------------------------------------- x1000
Total number at risk in the group at the beginning or midterm population
3
: Midterm or mid year population is used when calculating a rate for large population such as a city or
a country.
30
As the incidence is expressed in years, the average duration of a common cold should be expressed in years
as well: 0.02/ year i.e. cumulative incidence and average duration of disease should be expressed in the same
time unity.
Incidence is used to
(1) study disease aetiology
(2) study efficacy of an intervention
(3) when disease is of short duration due to quick recovery or death incidence is a better measure
than prevalence.
Number of exposed people developing the disease of interest within the range of the incubation period
= -------------------------------------------------------------------------------------------------------------- x 1000
Total number of susceptibles exposed to the primary case.
Gives information on infectivity of the agent, efficacy of a prophylaxis measure, or whether a disease of
unknown aetiology is communicable.
31
Measurement of mortality
Infant mortality rate: number of live born infants under the age of 1 year that died during a year/number of
live births dying during a year, are actually born in the preceding year, the denominator is not exactly the
population at risk, except when the number of births remains constant.
Crude death rate: total deaths in defined population in a given time period divided by the total population.
Uniform application of the definition of birth and liverbirth are essential for obtaining reliable and
comparable data.
Maternal mortality rate:. All maternal deaths occurring during pregnancy or within 42 days often
termination of the pregnancy in a year divided by the total number of live births in that year (per 100,000.
Disability:
It is a measure of the presence of consequences of disease i.e. impairment, disability, and handicap. Several
levels are known and defined as follows.
• Impairment; any loss or abnormality of psychological, physiological, or anatomical structure or function.
• Disability; any restriction or lack of ability to perform in the manner or within the range considered
normal for a human being.
• Handicap; a disadvantage of a given individual resulting from an impairment or a disability, that limits or
prevents the fulfillment of a role that is normal for that individual.
5. Risk
32
Risk difference(attributable risk): Relative risk shows the likelihood of disease amongst exposed compared
to those not exposed. Attributable risk (AR) or a risk difference is a measure of association that provide
information about the absolute effect of exposure on disease occurrence amongst those exposed compared to
those not exposed.
Attributable fraction (AR %): Attributable risk fraction is the fraction or proportion of disease that could be
prevented by eliminating the exposure. It is calculated as:
FIGURE 1
____┌────────────────────────────────────────┐
____│_________________X______________________│
1___│_____________________________________ __│2.5
2___│_______________________X________________│3.5
3___│________________________________________│1.5
4___│____________________________X___________│2.5
5___│________________________________________│4.5
6___│________________________________________│.5
7___│________________________________________│.5
8___│________________________________________│2.5
9___│__________________0_____________________│2.5
10__│__________________X_____________________│2.5
11__│______________0_________________________│1.5
12__│______________X_________________________│1.5
____│________________________________ _______│
____└────────────────────────────────────────┘
0 1 2 3 4 5
Time (years)
Key
X first occurrence of disease X
O Death
- Censored
33
Registers :
Information can also be available from registers when available, the registers can be institutional or
community based. Community based registers include those for surveillance of diseases, industrial
safety etc.
Institutional based registers include cancer registers (in Tanzania it is currently being maintained at
Muhimbili, department of pathology). The tuberculosis and leprosy registers are maintained at each
district. Registers for birth malformations, accidents, toxicity, and other infectious diseases can be
established
1. MCH
Maternal and child health services are available in all health care delivering units. Special
cards are available for the mother and the child, the state of health, type of treatment or
procedure for each mother and child is recorded regularly.
4. Mental health
Maintains records of people who are mentally ill
6. Eyes
Eye disease control programmes keeps records of people having eye problems. This is currently
being done in areas with high incidence of infectious diseases of the eye i.e. Onchocerciasis in
Morogoro, Mtwara, and Iringa regions, and trachoma in Dodoma
34
7. Nutrition
The nutrition programmes maintains records of individuals having/developing malnutrition, this
includes special nutritional deficiencies like iodine.
8. Others sources
(a) Health requirements
Recruitment to new employment require someone to undergo medical examination
prior to being recruited. The same is needed for school entrance, insurance, military
conscription etc.
(c) Census
Census may collect information on people who are sick, disabled or quality of
living.
Death certificate contain the following information about the deceased:(see death certificate
specimen below)
Census
Census may aim at collecting information on mortality for the entire population or part of the
population.
One of the uses of routine mortality statistics will be to compare mortality between different areas.
Such comparison is only possible if the criterion for acertaining cause of death is uniform, such
uniformity is achieved by the use of the International Classification of Diseases and Injuries (ICD).
This is internationally used and it is revised every 10 years to accommodate developments in
disease diagnosis and occurrence of new diseases.
From the death certificates causes of death are identified and are selected for tabulation. When only
one cause of death is entered it is selected for tabulation of mortality statistics. When more than one
causes are recorded the following rules should guide the selection of cause of death.
General:
Select the condition entered alone in the lowest used line of part I.
Rule I
If there is a reported sequence terminating in the condition first entered on the certificate select the
underlying cause of this sequence. If more than one such sequence select for the first.
Rule II
If there is no reported sequence terminating in the condition first entered on the certificate, select
the first e.g.
• Pernicious anaemia and gangrene of foot
• Arteriosclerosis
• Select: pernicious anaemia
• Rule III
If the condition selected by the general rule or rule 1 or rule II can be considered a direct sequel or
another reported condition whether in part I or part II select this primary condition. If there are two
or more of these, select the first mentioned of these. e.g. cerebral hemorrhage, hypertension, chronic
pyelonephritis and prostatic obstruction.
Variation by Place
Where are the rates of disease highest or lowest? Characteristics of place could explain variation of
disease occurrence or mortality.
Variation by time
• When does the disease occur commonly or rarely?
• Is the disease frequency at present different from the corresponding frequency in the past?.
On answering such questions will lead to development of hypothesis. Rarely one will be able to
prove a hypothesis using routine data.
A. Artifactual
1. Errors in the numerator due to
(a) changes in the recognition of disease.
(b) changes in the rules and procedures for classification of causes of death
(c) changes in the classification code of causes of disease.
(d) changes in the accuracy of reporting age at death.
B. Real
1. Changes in age distribution in the population
2. Changes in survivorship due to treatment.
3. Changes in incidence due to
(a) Genetic factors
(b) Environmental factors
Other uses of routine morbidity/mortality statistics
• Administration & planning
• Disease monitoring
• Epidemiological
• Generation of hunches
Completeness
Some information is not filled, or is wrongly filled for reasons that the information was not
collected, or the informant could not provide it.
Delay
Sometimes there is delay in transmitting such information to centres, hence being out of date.
37
Accuracy
Information though filled may be inaccurate, i.e. recording of sex as male instead of female,
recording of age as adult instead of actual age.
Coding
In the process of coding different coding systems may be used or not coded.
Retrievability
It is always not easy to retrieve all routine data due to missing data for various reasons. One would
desire to get information more than what is available on the form, this is not possible.
Publication
Publishing information from routine data is not easily acceptable by some readers.
Confidentiality
Routine data contains information on individuals, retrieval and publication might be divulging
information of the diseased , you might require ethical legal clearance to overcome that. Errors
could occur at any of these stages, diagnosis, recording of particulars, certification, coding, data
processing, interpretation and reporting.
References:
The crude death rate in the table below is 14.2 per 1000 population, which the age-specific rate
range from 10 to 20 per 1000 population.
Crude rate is weighted average of the individual category (specific) rates, weight being the
proportion of the population in each category.
Crude rates assume that all individuals in the population giving rise to the event of interest are at
equal risk of the event.
i.e. CDR - all individual are at equal risk of death.
CBR - all individuals are at equal risk of giving birth.
One way to evaluate crude rates of two populations or more is to work out the population
proportion distribution and strata specific rate and find out how they differ between the two
populations. Another way is to compare crude death rates.
However if you are comparing crude death rates, differences in age composition virtually always
occur, thus confounding the difference observed between two populations being compared. If a
39
From the table above 50.1 percent of the population in district A are 44 years or younger while in
district B 52.1 % are 34 years or younger. Hence the population in district A is more aged than that
of B. This difference in the age distribution between the two districts could account for an
observed difference in rates between the two. Two ways to account for a confounding effect of a
characteristic/variable between two populations being compared are:
When comparing crude rates of two populations difference observed could be due to (other than true
difference) differences in the population age structure, and sex composition etc. or sampling error of
the study population. The best ways to account for differences in population characteristics are one,
comparing category specific rates which is more accurate than crude rates, however this requires
comparing many numbers and at times can be very cumbersome. This then calls for a summary rate
which will take into consideration the differences in the characteristics of populations being
compared. This is achieved by standardizing (adjusting) for those characteristics hence
standardized (adjusted) rate. When comparing rates adjusted for a age, any remaining observed
difference between the groups cannot be attributed to the confounding effect of age.
Standardized(adjusted) rates:
A standardized (adjusted) rate is a statistically constructed summary rate which accounts for the
difference between populations being compared with respect to other variables.
Standardization is either by the direct or indirect, results in one summary measure, which is adjusted
for differences in population composition i.e. sex, age etc. and thus removes the confounding effect
of that variable adjusted for in the observed difference between the two populations being
compared.
Direct standardization
Direct standardization by age is used when:
- age specific rates in the index population are known
- the index populations and number of deaths are large enough to calculate stable
age specific rates.
40
The directly standardized death rate=-- expected deaths= 8,164 = 8.16 per
standard population 1,000,000 1000 population
Indirect method:
• Define the set of standard age specific rates
• Apply the standard age specific rates to the population in the corresponding age group of the index
population to get the expected number of cases or deaths in each group
• Add the expected cases or deaths over the age groups
• Divide the total observed cases or deaths by the total expected to get (SMR)
• The crude rate in the standard population multiplied by the SMR gives the standardized rate in the index
population.
41
Example:
Age group standard index - populations expect Nr. of deaths
rates per Distr.A Distr.B Distr.A Distr.B
1,000
Reading References
2. Hill, A. B (Sir)
Short textbook of medical statistics 11st ddn Holder and Stoughton London in association
with The Lancet 1984.
42
The basic purpose of screening for disease is to separate from a large group of apparently well
persons those who have a high probability of having the disease under study, so that they can have a
diagnostic work up and if diseased, brought to treatment. Screening is carried out with the
assumption that early detection of disease before development of symptoms will lead to a more
favourable prognosis because treatment began before the disease becomes clinically manifest will
be more effective than later treatment.
We would ideally want this test to identify correctly all those with the disease under investigation
and to exclude this disease amongst all non-diseased.
(b) For some conditions (e.g. angina pectoris) no gold standard is available
(c) Comparisons with an imperfect gold standard may lead to the erroneous conclusion that the
new test is worse when in fact it is better. The following are examples:
(i) If the new test detects diseased individuals more accurately than the gold
standard, these patients will be mistakenly labelled false positives.
(ii) If the new test is negative in more disease free individuals these patients will be
mistakenly labelled false negatives
The results of a screening test and disease status, can be examined conveniently by use of the four
fold contingency table.
DISEASE STATUS
SCREENING TEST Positive Negative
From a population of N people, D+ have the disease, while D- do not have the disease.
D+
---- Denotes the prevalence of disease in this population.
N
From this population, T+ persons are positive on the screening test, while T- persons are negative
on the screening test.
Diseased individuals with a positive screening test are the True Positives TP
Healthy individuals with a positive screening test are the False Positives FP
Diseased individuals with a negative screening test are the False Negatives FN
Healthy individuals with a negative screening test are the True Negatives TN
An ideal screening test has as few false positives and false negatives as possible. The capacity of a
screening test to identify correctly the diseased against the non-diseased is expressed by its
SENSITIVITY and SPECIFICITY which are a measure of the test's validity.
Sensitivity is the ability of a test to give a positive finding when the tested person truly has the
disease under the study. In other words it is the probability that the test will be reactive in a diseased
individual. A test with a high sensitivity will detect a high percentage of diseased individuals.
TP
Sensitivity = ---------X 100 %
D+
Specificity is the ability of a test to give a negative finding when the tested person is truly free of
the disease under study. In other words it is the probability that a test result will be non-reactive in
an individual who is not diseased.
44
TN
Specificity = ---------X 100 %
D-
In practice, sensitivity of a test is usually determined on a group of proven cases of a given disease,
while its specificity is determined on a group of healthy people, or people with diseases other than
the one under investigation.
Sensitivity and specificity are test characteristics and are not very much influenced by properties of
the population such as age, sex, etc. However, the pattern and prevalence of other diseases will
influence specificity of the test. The mix of subclinical and clinical cases in a population might
affect the sensitivity of a test.
Predictive Value
When interpreting a screening test result, or a sign or symptom, under practical circumstances, it is
important to measure its diagnostic accuracy in the population in which it will be used. This is the
predictive value which is used to measure the accuracy of a test in classifying individuals as
diseased or not diseased when the test is positive or negative respectively.
The predictive value of a positive test (PVP), denotes the proportion of diseased amongst all
positive tests. In other words, it signifies the probability that a given patient with a positive test,
indeed does have the disease. PVP is also known as the posterior probability or post-test probability
of disease.
The predictive value of a negative test (PVN), denotes the proportion of healthy individuals
amongst all negative tests. In other words, it signifies the probability that a given individual with a
negative test, indeed does not have the disease.
The predictive value is influenced by the sensitivity and specificity of the test as well as the
prevalence of the disease in the population. The influence of prevalence on predictive value, and the
independence of sensitivity and specificity on prevalence is illustrated by the following table:
DISEASE DISEASE
10 90 100 50 50 100
In both cases sensitivity and specificity are 90% but the predictive value of a positive test is
different and depends on the prevalence of the disease in the study sample.
LOW HIGH
PREVALENCE (10%) PREVALENCE (50%)
9 45
-- x 100 = 50 % -- x 100 = 90 %
18 50
In a quantitative test, however, the test result is a continuous variable, which can assume all values
within a certain range, with a typical distribution (normal, log-normal, Poisson, bimodal etc.).
Classifying individuals as diseased or disease free is on the basis of whether they fall above or
below a preselected cut-off value known as the positivity criterion. This cut-off value is also
referred to as the critical value or referent value. The same applies to semi-quantitative tests like
serological tests (e.g. VDRL) which use titre values.
Depending on the chosen cut-off point, the sensitivity and specificity of a test will vary as
illustrated in the following table :
1. Is it harmful or serious to miss a case ? If this is so then choose a value for the positivity
criterion that minimizes the false negatives and hence sensitivity should be high, usually at
the expense of specificity. (e.g. neonatal phenylketonuria screening).
2. Is treatment risky? (e.g. side-effects, or operative mortality), then the number of false
positives should be low, and specificity high usually at the expense of sensitivity. This also
applies when a false positive diagnosis may have deleterious effects on a patient's lifestyle,
self-image, or financial situation (e.g. AIDS, mental illness or learning disorders).
46
3. Is a highly specific and sensitive confirmatory test available? Then one would go for a very
high sensitivity in the preliminary (screening) test.
Sensitivity however is always inversely proportional to specificity: by increasing sensitivity one has
to accept a loss in specificity and vice-versa.
In serial testing we first apply one test to a certain population and all people identified as having a
positive test are then submitted to a second test (so called re-testing of reactive individuals with
another test). As an example we will take the VDRL test as a screening test for syphilis, and the
TPHA (Treponema Pallidum Haemoagglutination Assay) as a confirmatory test. We will assume a
population with a prevalence of syphilis of 20 %.
SYPHILIS
VDRL Positive Negative Total
In the serial testing the 260 positive on the VDRL (above table) will then be subjected to a
second series of testing with TPHA which is known to have the following test characteristics:
sensitivity 95 % and specificity 99 %. The result will be as shown in the table below.
SYPHILIS
TPHA Positive Negative Total
The overall sensitivity does decrease, while overall specificity increases when using tests in series.
Most importantly is that the predictive value always improves because the prevalence is increased
for the second test to 69.2% (180/260x100).
In general one can use the following general formulae to calculate overall sensitivity and
specificity:
As far as sensitivity and specificity are concerned, it does not matter in which order the tests are
carried out, but efficiency and total cost may differ considerably. In general, one benefits most from
serial testing if the most sensitive test is used for screening, and the most specific test as
confirmatory.
47
Summary:
1. Parallel tests result in:
a. Greater sensitivity
b. Increased PVN (decreased False Negative Rate)
c. Decreased specificity (increased False positive Rate)
Costs of a screening programme must be balanced against the number of cases detected and the
consequences of not screening, the cost too, should be economically balanced in relation to the total
expenditure on medical care.
In a screening programme differentiate between lead time bias and length bias:
Lead time is defined as the interval between the diagnosis of a disease at screening and when it
would have been detected due to development of symptoms. In other words it represents the time by
which the diagnosis has been advanced as a result of screening. It is important to take into account
lead time when evaluating a screening programme since survival may erroneously appear to be
increased among screen detected cases simply because the diagnosis was made earlier in the course
of the disease. See Fig 1
Age (years)
35 40 41 43 46
_______|_____________|_________________|_______________|_______________|______
Fig 1: Apparent increase in the duration of survival due to lead time in hypothetical screened and
symptom-diagnosed cases of breast cancer.
48
Length bias is the overrepresentation among screen detected cases of those with a long preclinical
phase and hence favourable prognosis. This occurs because diseases with long preclinical phase are
readily detected than rapidly progressing cases with a short preclinical phase. A programme may
seem successful when in fact observed differences in mortality were a result merely of the detection
of less rapidly fatal cases through screening while those more rapidly fatal are diagnosed after
development of symptoms.
References:
1. Griner, PF., Mayeuski, RJ., Mushlin, AI., Greenland, P. Selection and interpretation of
diagnostic tests and procedures Ann. Intern. Med. 1981: 94 557-93.
2. Wilson JMG and Juguer F. Principles and practice of screening for disease (Public Health
papers No 34) WHO, Geneva, 1968.
3. Galen and Gambino Beyond Normality. The predictive value and Efficiency of medical
Diagnoses. John Wiley and Sons (1975).
4. Gurick DS. Efficacy of screening of cervical cancer; a review Am. J. Public Health 68: 125,
1978.
5. Elveback LR. How high is high? A proposed alternative to the normal range. Mayo Clinic
Proceedings 47: 93. 1972.
6. Charles H Hennekens and Julie E Buring
Epidemiology in Medicine
Little Brown and Company
Boston/Toronto 1987 pg 327-347
7. Lewis H Roht, Beatrice J Selwyn, Alfonso H Holguin JR
Bobbe L Christensen
Principles of Epidemiology: A self teaching Guide
Academic press Inc, Harcourt Brace Jovanovich, Publishers
San Diego, New York, London 1982 pg 217-230.
8. Rebecca G. Knapp, M Clinton Miller III
Clinical Epidemiology and Biostatistics
NMS National Medical Series
Harwal Publishing Company, Malvern, Pennsylvania
Baltimore, Hong Kong, London Sydney 1992. pg 31-50
9. Sackett, DL. Holland, WW. Contraversy in the detection of disease. The Lancet August 23
1975 pg 357-359
10. Loefler IJP. Screening for cancer. Is it useful ? East Afr Med J 71 (5) May 1994 pg 323-7
49
Introduction
When disease occurs in a population there is always an attempt for investigators to determine the
cause for such occurrence. One of the first steps towards finding the cause is to collect information
that will enable the investigator to associate suspected or potential causal factors with the disease. In
epidemiology a variety of methods are available to determine associations between suspected causal
factors and disease. However, such associations even though they may be statistically significant do
not by necessity mean that they are causal. A causal association is defined as an association
between a factor and a disease in which an alteration of the factor is followed by a change in the
incidence of the diseases.
We shall illustrate the concept of cause by using the electric circuit with which most of you are
familiar. We all know that the flick of a light switch will make the lights go on, but the unseen
causes that also operate to produce the effect are usually unappreciated. The need for a good light
bulb in the socket, appropriate wiring from the switch to the bulb, and adequate voltage to produce
a current when the circuit is closed. To achieve the effect of turning on the light each of these is
equally as important as moving the switch because absence of any of these components of the
causal constellation will prevent the effect from occurring.
In the health field, it is important to understand the causes of disease in order to look for rational
means of preventing disease. Prevention here is taken to be broad and includes primary, secondary
and tertiary prevention. In secondary prevention for example, correct treatment can only be given
when the cause is known from early diagnosis.
In clinical sciences a cause can be defined as an event, condition, characteristic or their combination
which plays an important role in producing the disease in an individual. Logically therefore, a
cause must precede a disease in time.
In epidemiology, two models of disease causation have been described. The single factor model
and the multifactorial model.
Multifactorial model:
The disease may have neither a single sufficient cause nor a single necessary one. The sufficient
cause is not usually a single factor but often comprises several components. Each sufficient cause
has a necessary cause as a component. For example, there are different components in the causation
of tuberculosis but the tubercle bacillus is a necessary cause. The disease therefore, is a result of a
complex interaction of factors, some tending to promote the disease and others tending to oppose it.
This model is consistent with current views on chronic disease aetiology.
In general, it is not necessary to identify all the components of a sufficient cause before effective
prevention can take place since the removal of one component may interfere with the action of
others and thus prevent the disease. For example cigarette smoking is one component of the
sufficient cause of lung cancer. Smoking is not sufficient in itself to produce the disease; other
factors, mostly unknown, are required. However, the cessation of smoking reduces the number of
lung cancer cases in a population even if the other component causes are not altered.
The following rules were formulated by Henle and Koch in an attempt to determine whether a
specific living organism causes a particular disease. Anthrax was the first disease demonstrated to
meet these rules. Henle and Koch formulated these rules after they had been spurred by Pasteur's
work on micro-organisms.
Even though the above rules may be applicable to some diseases, they are inadequate for
determining causation for most diseases, both infectious and non-infectious. Many causes are
usually operating in the causation of a disease, while a single factor such as cigarette smoking may
be a cause of many diseases. In addition, the causative organism may disappear when a disease has
developed making it impossible to demonstrate the organism in the sick person. Rules such as
Koch's are of value only when the specific cause is an overpowering infectious agent, a situation
which is uncommon. Susceptibility due to other factors, and a sufficient dose of the agent are
usually required before clinical disease ensues.
Types of associations
2. Causal association which is said to be present between factor A and disease B, when a change
in A produces a change in B.
For example, in situation (a) above a positive statistical association has been demonstrated between
CHD mortality rates and coffee drinking habits. (See Table 1)
Table 1: CHD mortality rates for males aged 55-64 years by coffee drinking habits.
From the above table it can be seen that as coffee consumption increases the CHD mortality also
increases. However, it has been shown that people who drink coffee also tend to be cigarette
smokers, while cigarette smoking is known to be strongly associated with CHD mortality. (See
Table 2).
Table 2: CHD mortality rates for males aged 55-64 years by cigarette consumption.
Thus, to isolate the effect of coffee drinking, from that of cigarette smoking we cross-tabulate CHD
mortality according to both variables as shown in Table 3. For this process, additional analysis of
the raw data would be required.
Table 3. CHD mortality rates (per 1,000 per year) for males aged 55-64 years by coffee drinking
habits and cigarette consumption.
Table 3. shows that when cigarette smoking is held constant the effect of coffee drinking on
mortality disappears. Thus, the association between coffee drinking and CHD mortality is non-
causal (Figure 1). In general when an outcome is affected by multiple variables, in order to
examine the influence of a single one it is necessary to adjust for the effects of the others
52
Cigarette
smoking
Causal Non-causal
CHD coffee
mortality Non-causal association drinking
Causal inference is the term used for the process of determining whether observed epidemiological
associations are likely to be causal; the use of guidelines and the making of judgements are
involved. Before an association is assessed for the possibility that it is causal, other explanations,
such as chance, bias and confounding, have to be excluded. The steps in assessing the nature of the
relationship between a possible cause and an outcome are shown in Fig. 2.
OBSERVED ASSOCIATION
NO
Could it be due to
confounding?
NO
Could it be a result of
chance?
PROBABLY NOT
Could it be causal?
APPLY GUIDELINES
AND MAKE JUDGEMENT
A systematic approach to determining the nature of an association was used by the United States
Surgeon General to establish that cigarette smoking caused lung cancer. This approach was further
elaborated by Hill (1965) and on the basis of these concepts, a set of "guidelines for causation" has
been prepared. In Table 4 the concepts are listed in the sequence of testing that the epidemiologist
should follow to reach a conclusion about a cause of disease.
Temporal relationship
Does the cause precede the effect? (essential)
Plausibility Is the association consistent with other knowledge? (mechanism of
action; evidence from experimental animals)
Consistency Have similar results been shown in other studies?
Strength What is the strength of the association between the cause and the
effect? (relative risk)
Dose-response Is increased exposure to the possible cause associated with
relationship increased effect?
Reversibility Does the removal of a possible cause lead to reduction of disease
risk?
Study design Is the evidence based on a strong study design?
Judging the evidence How many lines of evidence lead to the conclusion?
To establish whether or not an association is causal may not be easy. If feasible or ethical this could
be investigated by an experiment, i.e. a randomized controlled trial. However, this may not be
feasible in many situations, and therefore an indirect approach is used by asking the following
questions:
Relative risks greater than 2 can be considered strong. For example, cigarette smokers have an
approximately two-fold increase in the risk of acute myocardial infarction compared with non-
smokers. The risk of lung cancer in smokers, compared with non-smokers, has been shown in
various studies to be increased between four-fold and twenty-fold. However, such very strong
associations are rare in epidemiology.
The fact that an association is weak does not preclude it from being causal; the strength of an
association depends on the relative prevalence of other possible causes. For example, weak
associations have been found between diet and risk of coronary heart disease in observational
studies; and although experimental studies on selected populations have been conducted, no fully
satisfactory trials have been completed. Despite this, diet is generally thought to be a major
causative factor in areas with high rates of coronary heart disease as in many industrialized
countries.
54
For example, the association between smoking and lung cancer was found in 29 case control studies
and 7 cohort studies in the USA. Broadly, the same answer has been reached in quite a wide variety
of situations and techniques. In this case it is very likely that the association between cigarette
smoking and lung cancer is causal. Consistency is particularly important when a variety of designs
are used in different settings, since the likelihood that all studies are making the same mistake is
thereby minimized. However, a lack of consistency does not exclude a causal association, because
different exposure levels and other conditions may reduce the impact of the causal factor in certain
studies. Furthermore, when the results of several studies are being interpreted the best-designed
ones should be given the greatest weight.
Techniques are available for pooling the results of a number of studies that have examined the same
issue, particularly randomized controlled trials. This technique is called meta-analysis, and it
combines the results of a number of well-designed trials, each of which may deal with a relatively
small sample, in order to obtain a better overall estimate of effect (Sacks et al., 1987).
Fig. 3 illustrates the results of 11 trials on the use of β-blockers for the prevention of death after
myocardial infarction. One important reason for the apparent inconsistency of the results is that
several of the early studies were on small samples. The estimated relative risk in each study is
marked by a cross; the horizontal lines indicate the 95% confidence intervals. For the aggregated
data from all the trials, covering a very large number of events, the 95% confidence interval is very
narrow. Overall, treatment with β-blockers after myocardial infarction is seen to reduce the death
rate on average by about 35%; the 95% confidence interval shows that the reduction in death rate is
at least 20% and could be as much as 50%.
x
x
x
Individual x
x
trials x
x
x
x
x
x
All trials x
<80 0 0 0
85 1 3 10
90 4 10 21
95 7 17 29
100 12 29 41
105 18 42 54
110 26 55 62
115 36 71 64
The study of the health consequences of low-level lead exposure is an example of the opposite
situation. Animal experiments indicate an effect of lead on the central nervous system. Similar
effects in an epidemiological study of children are therefore plausible but, because of potential
confounding factors and measurement difficulties, epidemiological studies have shown conflicting
results. However, assessment of all the available epidemiological data leads to the conclusion that
effects do occur in children at a low level of exposure to lead.
56
6. Reversibility
When the removal of a possible cause results in a reduced disease risk, the likelihood of the
association being causal is strengthened. For example, the cessation of cigarette smoking is
associated with a reduction in the risk of lung cancer relative to that in people who continue to
smoke. This finding strengthens the likelihood that cigarette smoking causes lung cancer. If the
cause leads to rapid irreversible changes that subsequently produce disease whether or not there is
continued exposure (as with HIV infection), then reversibility cannot be a condition for causality.
7. Study design
The ability of a study design to prove causation is the most important consideration (Table 6). The
best evidence comes from well-designed, competently conducted randomized controlled trials.
However, evidence is rarely available from this type of study, and usually only relates to the effects
of treatment and prevention campaigns. Other experimental studies, such as field and community
trials, are seldom used to study causation. Evidence comes most often from observational studies,
almost all the evidence on the health consequences of smoking comes from observational studies.
Cohort studies are the next best design because, when well conducted bias is minimised. Again,
they are not always available. Although case-control studies are subject to several forms of bias,
the results from large well-designed investigations of this kind provide good evidence for the causal
nature of an association; judgements often have to be made in the absence of data from other types
of study. Cross-sectional studies are less able to prove causation as they provide no direct evidence
on the time sequence of events.
Ecological studies provide the least satisfactory type of evidence on causality because of the danger
of incorrect extrapolation to individuals from data on regions or countries. However, for certain
exposures that cannot normally be measured individually (such as air pollution, pesticide residues in
food, fluoride in drinking water), evidence from ecological studies is very important. Yet only very
rarely has it been considered adequate for establishing causation. In 1968 the sale of
bronchodilators without prescription in England and Wales was stopped because the increase in
asthma deaths in the period 1959-66 had been shown to coincide with a rise in bronchodilator sales.
Despite the fact that only very limited evidence was available linking the use of bronchodilators
with death in asthmatics, the ecological evidence was deemed sufficient; two decades later this
relationship continues to be debated and has relevance to a recent increase in asthma deaths among
young people in New Zealand Crane et al., 1989).
9. Is there coherence?
Coherence implies that a cause and effect interpretation of an association does not conflict with
what is known of the natural history and biology of the disease. Any cause and effect interpretation
of, for example , cigarette smoking and lung cancer should be coherent with the histopathologic
effects of smoking on the bronchial epithelium. In this regard, the difference between coherence
and consistency is unnecessarily fine.
10. Analogy
The premise that, if one drug can cause birth defects, perhaps another one can also, could
conceivably enhance the credibility that an association is causal.
Regrettably, there are no completely reliable criteria for determining whether an association is
causal or not. Causal inference is usually tentative and judgements must be made on the basis of the
available evidence: uncertainty always remains. Evidence is often conflicting and due weight must
be given to the different types when decisions are being made. In judging the different aspects of
causation referred to above, the correct temporal relationship is essential; once that has been
established, the greatest weight may be given to plausibility, consistency and the dose-response
relationship. The likelihood of a causal association is heightened when many different types of
evidence lead to the same conclusion. Evidence from well-designed studies is particularly
important, especially if they are conducted in a variety of locations.
2
METHODS OF EPIDEMIOLOGY
59
Research:
Activity aimed at the advancement of knowledge (scientific or non-scientific)
Methodology:
The body of methods in a discipline
Purpose of Research
Collection of information that will contribute to the solution of a problem and therefore, provide a
basis for action whether immediately or in the long run. Research begins when an investigator
perceives a problem which in his/her view requires a solution. He/she then decides that a particular
study will contribute to this end, and embarks upon it through a research process..
Study- a project designed to yield evidence for the advancement of knowledge (in science).
Once the problem has been identified, it should be described clearly and appropriate hypotheses
formulated and methods for testing them designed. This is important particularly if funds or
facilities are applied for from funding agencies, in order to justify expenditure of public funds. In
order to do this well the following must be done:
• Review of previous work on the subject
• Describe the present state of knowledge
• Explain the significance/rationale of the proposed investigation
In identifying the problem preconceived ideas must be avoided at any cost lest they introduce bias
in the findings. If you think a service is bad you should not collect data that will help you to
condemn it. You should be honest and ensure objectivity in the collection and interpretation of
information
With a clear purpose in mind (why you want to conduct the study) you can now formulate the topic
of study in general terms.
If the reason for the investigation is that Infant Mortality is unduly high in a given population and
there is insufficient information on its causes for the planning of an action program the topic can
broadly be stated as:
At this stage the topic may be regarded as provisional and may frequently change as more
information becomes available. Study objectives are then listed and research methods identified
and described.
Quantitative methods in research are the back bone of epidemiology. They generate data that are to
be analysed quantitatively. They are also known as epidemiological methods because they are
designed to determine the extent and distribution of diseases and their determinants (causes) in
human populations with the aim of identifying effective management and preventive measures.
60
Due to the quantitative nature of epidemiologic studies, statistical techniques for measuring these
‘quantities’ have been developed and extensively used. Examples of measurements commonly used
in epidemiology are prevalence and incidence. These are used to measure the magnitude and
‘speed’ of spread of the disease respectively.
Prevalence represents the proportion of the population that is affected by the disease under study at
a given point in time, while incidence represents the rate of new cases of disease occurring in a
defined population within a specified period of time.
In quantitative research, measures of comparison are used to compare the risks of disease in
different population groups. These are either absolute or relative measurements. Of particular
interest to disease aetiology are the relative measurements which are used to determine whether or
not suspected causes are related to the disease (or health problem) being investigated. These
measurements are the relative risks (RR) or odds ratios (OR) which are used to measure the degree
of association such that the further away the calculated quantity is from unity (RR=1) the stronger is
the evidence for causation.
Depending on the type of health problem to be investigated a variety of study designs have been
developed and grouped into two broad categories. These are observational and experimental studies.
Observational studies involve only observational techniques and no manipulations like those
employed in experiments are used to study the population. Figure 1 depicts a common classification
of the methods.
Research methods discussed in this module are those which fall within the realm of epidemiology.
Other research methods e.g. socio-anthropological techniques will be discussed in the relevant
disciplines. Epidemiological studies are designed to determine the extent and distribution of
diseases and their determinants (causes) in human populations with the aim of identifying effective
management and preventive measures.
Cross sectional
Descriptive
Longitudinal
Surveys are investigations in which information is systematically collected but in which the
experimental method is not used. These are also called observational studies.
A) Descriptive studies
Descriptive surveys set out to describe disease situation in a particular area. e.g. the
distribution of a disease in a population in relation to age, sex, time, place and other
characteristics. Studies of this kind are not necessarily time limited. Registries of diseases in
a particular area are examples of descriptive studies that are not time-limited. They record
various characteristics of the affected individuals including age, sex, occupation, duration
of symptoms etc. Cross-sectional studies are examples of time-limited descriptive studies
but can sometimes be used as analytical studies. Descriptive studies are useful in studying
the natural history of diseases.
61
B) Analytical studies
Analytical surveys (or explanatory surveys) try to explain the situation. These are studies designed
specifically to explain the determinative processes of diseases. They answer the questions; why
does the disease occur in the persons having it and not in the persons not having it? Why do certain
persons fail to make use health services? Can the decreased incidence of the disease be attributed to
the introduction of preventive measures? To answer these questions hypotheses are formulated and
tested that may help to explain the situation. Examples of analytical studies are ecological, cross-
sectional, cohort and case-control. These are briefly described below:
Ecological studies
These studies are often referred to as correlational studies. They frequently initiate the
epidemiological process and allow for more detailed analysis of observed correlations. The units of
analysis are groups of people rather than individuals. For example, in one country, a relationship
was demonstrated between average sales of an anti-asthma drug and the occurrence of an unusually
high number of asthma deaths. Such relationships may be studies by comparing populations in
different countries at the same time or the same population in one country at different times. The
latter approach may avoid some of the socio-economic confounding.
Although simple to conduct and thus attractive, ecological studies are often difficult to interpret
since they usually rely on data collected for other purposes and hence essential exposure data may
not be available. In addition, since the unit of analysis is a population or group, the individual link
between exposure and effect cannot be made. One attraction of ecological studies is that data can be
used from populations with widely differing characteristics e.g. correlation between oesophageal
cancer rates in communities with different patterns of salt consumption.
Ecological fallacy or bias may result from ecological studies if one erroneously infers that
relationships established between two or more variables, measured at an aggregate level, will also
hold at the individual level.
Cross-sectional studies
Cross-sectional studies or prevalence studies are carried out at a certain point in time and in a given
population or geographical area. They depend on a single examination of a cross-section of the
population in which sick and healthy, or exposed and unexposed are not distinguished until results
are examined.
The prevalence of a risk factor or a disease is expressed as the proportion of the affected individuals
in the study population in a given geographical and area at a given point in time. An important
limitation of cross-sectional studies is their inability to sort out cause and effect relationships since
both are found in the study population at the same time.
Cohort studies
Cohort studies are synonymous with prospective studies, longitudinal studies, follow-up studies or
incidence studies. They can be carried out prospectively or historically ("retrospectively").
These studies are carried out on a sample of the population to determine the rate at which groups of
the population develop disease or die from it when differentially exposed. This is one way of testing
a hypothesis in disease causation.
Basic data from cohort studies are represented in Table 1
62
Table 1
Disease status
diseased healthy Total incidence
Exposure status
exposed a b a+b a/a+b)
not exposed c d c+d c/c+d)
When the incidence of disease among the exposed (a/a+b) is divided by the incidence of disease
among the unexposed (c/c+d), a measure of effect of exposure (RR) is obtained and an association
is said to exist between exposure and development of disease if the measure is significantly
different from unity. Standard statistical techniques are available to test this difference. Although
cohort studies may take long to accomplish, they have the advantage that they are more reliable in
providing evidence for causation than other analytical studies.
Case-control studies
Synonyms are case-referent studies. These studies involve the comparison of cases of the disease
under study with comparable controls for levels of exposure. The effect of exposure in such studies
is measured by the odds ratio (OR = ad/bc) as shown in Table 2 which is an approximation to the
relative risk (RR).
Case-control studies are by far the simplest in determining cause and effect relationships. They take
a short time to complete and have the advantage that they use cases of the disease under study and
are especially useful for studying rare diseases. Important disadvantages of case-control studies are
that they are more susceptible to selection bias and that information on exposure is less accurately
ascertained than in cohort studies.
Table 2
Disease status
Cases Controls Total
(diseased) (healthy)
Exposure status
exposed a b a+b
not exposed c d c+d
Cross sectional surveys provide information concerning the situation that exists at a given time. e.g.
weights of children at a given time.
Longitudinal surveys provide data concerning events during a period of time. e.g. Weights of
children as they grow older, each child being weighed more than once during the study period. In
determining the natural history of a disease e.g. tuberculosis or AIDS individuals are followed up
through acquisition of the disease and appearance of signs and symptoms to recovery chronicity,
development of various complications or death.
63
Group based:
In these surveys groups of populations are compared e.g. groups of countries. Group Information is
obtained from individual countries or from the same country at different time periods.
Example: National Heart disease rates and consumption of animal fat in the country.
AIDS case rates and importation of used clothes.
Inferences from such surveys are usually regarded as hints because they often
suffer from Ecological fallacy: e.g. Proportions affiliated to certain religions
and suicide rates in a country. Proportion of the population in mining
occupations and lung cancer rates in the country..
Individual based:
These are also surveys of groups but utilize information from individuals. Such surveys are
performed to test hypotheses that a specific factor is related to a specific disease.
1. Experimental studies
An experimental study is an investigation in which the researcher, wishing to study the
effects of exposure to or deprivation of a defined factor, decides which subjects (persons,
animals, communities etc.) will be exposed to, or deprived of, the factor. If the investigator
compares subjects exposed to the factor with subjects not exposed to it, the study is a
controlled experiment. Intervention studies and clinical trials are examples of experimental
studies.
2. Other studies
A Evaluative Studies:-These are carried out to appraise the value of health care. They set out
to measure how good care is.
(ii) Inbuilt evaluation - is that evaluation which is planned in advance and the required
information is collected in a systematic way as an integral part of the provision of
service.
(iii) Medical Audit: another type of evaluation in which the quality of a service is
evaluated by appraising the quality of care given to individuals. If a medical audit
discovers that some useful procedure is not being done in the course of patient care,
then it recommends that it should be done. e.g. testing of blood for Hepatitis B
virus before transfusion.
(B] Surveillance - Systematic collection analysis, and use of information for the control of a
specific disease. Generally it is the maintenance of an ongoing watch over the health status
of a community.
64
(C] Pilot Study: - A dress rehearsal of an investigation performed in order to identify defects in
the study design.
(D] K.A.P Studies:- Studies of knowledge, attitudes and practices towards health care. These
are important in health education methods
Example: We know vaccination against measles is effective and could eradicate measles, but
there are many constraints.
Cold chain maintenance problems
Faulty immunization techniques
Lack of proper supervision
Inadequate facilities and resources
Lack of staff motivation
Lack of community motivation
Operational research determines which one of these constrains is most important in the control of
measles
EPIDEMIOLOGICAL
STUDIES
OBSERVATIONAL EXPERIMENTAL
STUDIES STUDIES
Classical design
Natural Exper'mts
Randomized
controlled trials
DESCRIPTIVE ANALYTICAL
Clinical
STUDIES STUDIES
trials
Ecological
Person (Who) Prophylactic
Cross-sectional trials
Place (Where)
Case-control
Time (When) Interventions
Cohort
NB: The above figure is a very simplified one and is meant to assist the reader in understanding the
possible relationships in epidemiological methods. In practice there will be overlaps between certain
methods.
66
Determine
Cross-sectional study
prevalence
Determine
Cases
Past Case Control study
Controls
Exposure
Exposed
Determine Incidence
Descriptive studies
Descriptive studies describe pattern of disease occurrence in relation to, place, time and person.
There are three types of descriptive studies, namely ecological (correlational) and cross-sectional
studies (which can be considered partially analytical), case reports, case series, registries, etc.
Correlation coefficient(r)
Quantifies the extent to which there is linear relationship between exposure and disease (it ranges
from -1 to +1)
Limitations:
• Unable to link exposure with disease in particular individuals, i.e. increasing pap smear rate with
decreasing mortality from cancer of cervix. In this case it is not easy to prove that those who
underwent pap smear are the same who benefited.
• Lack of ability to control for the effects of potential confounding factors.
It has been shown that there is a very strong correlation between per capita number of colour
television sets and coronary heart disease in various countries. Owning a colour television is an
indicator of economic well being and related to other factors which are more likely to increase
the risk of CHD than ordering a colour television
• Lack of correlation does not mean lack of statistical association, some relationships can only be
detected by analytical studies.
• Correlational data represent average exposure levels rather than actual individual values.
e.g. CHD showed a strong negative correlation with alcohol consumption, countries with high
alcohol consumption showed a low risk of coronary heart disease, from analytical studies it
was shown that the relationship was J-shaped. Thus consuming moderate amounts of alcohol
was beneficial.
Case reports:
Describes the experience of a single patient or a group of patients with similar diagnosis.
Case reports: document unusual medical occurrences and can represent the first clue in the
identifications of new disease or adverse effects of exposure: e.g. drugs
Case series:
Collection of individual case reports which may occur within a fairly short period of time.
e.g. Pneumocystis carinii pneumonia was initially described in young homosexuals who were
initially healthy, otherwise it was previously described amongst elderly patients suffering from
debilitating disease like cancer. This then led to the idea that this was a probably new disease, and
so was Kaposi sarcoma occurring amongst young people.
Limitation:
. Cannot test for the presence of valid statistical association.
68
Uses:
Useful for hypothesis formulation.
Descriptive studies tell us about the distribution of disease and disease determinants in human
populations. In descriptive studies we commonly describe disease distribution by:
1. Person (who ?): The kind of persons afflicted by the disease; what are their characteristics?
e.g. age, sex, race, socio-economic status, genetic constitution, immunological status etc.
These are then related to those of the population to which such persons belong, i.e. census
data to determine if there is any predisposition to the disease.
69
2. Place (where ?): The kind of places where diseased individuals are found. Important
characteristics of place are: geographical placement, altitude, latitude, climate, vegetation
etc. A classical example of the use of place in descriptive studies is found in the studies of
Burkitt lymphoma in Africa. The influence of place in determining disease occurrence can
also be studied by using migrant populations. Such studies are called migrant studies. e.g.
Occurrence of cancer by site for Japanese of different migratory status. Relative risks of
mortality from cancer of the stomach, liver and colon among Japanese men in Japan,
Japanese immigrants to California and sons of Japanese immigrants compared to while men
in California (aged 45-664)
3. Time (when ?) : Does disease have any time trend? Many infectious disease occur during
certain periods of the year (seasonal distribution). At some instances there are clear-cut
explanations for the seasonality, especially when this can be related to the development of
breeding sites for disease vectors, and hence increased disease transmission. Other diseases
have a secular trend, i.e. changes in disease incidence occurring over periods of many years.
If disease trend can be predicted it may be possible to institute preventive measures. Other
diseases shows a cyclic trend, this is change in disease occurrence over a short period of
time usually two to three years. This is a characteristic of infectious diseases which confers
long lasting immunity, when it occurs all people who are susceptible develops the disease if
they get infected. The disease will die out after infecting all those who are susceptible and
will recur again when they are sufficient number of susceptibles. It is then possible to
predict disease outbreak, and also evaluate preventive measure like vaccination
4. Space-time clustering: disease incidence can very simultaneously with both place and
time. If disease can be described by both place and time, it will be possible to determine the
dynamics of spread in a population. Many infectious diseases occurring in one geographical
area tend to occur at around the same time. This phenomenon is called space-time
clustering. Such a phenomenon can occur by chance in non-infectious diseases. Cluster
analysis techniques are used to determine whether the degree of space-time clustering
observed for any disease is pertinent to infectious disease aetiology. Time-space clusters are
therefore a useful due to the aetiology of infectious diseases.
- changes or differences in the level of arsertainment of cases (are all cases found ?)
- changes or differences in diagnostic criteria between two points in time or place.
- changes or differences in population structure between two points in time or place.
Secular trends can be due to one or more of the following:
- Changes in diagnostic technique.
- Changes in the accuracy of enumerating the population at risk of developing the disease.
- Changes in the age distribution of the population between two time periods.
- Changes in survival
- Changes in actual incidence due to
70
Hypothesis formulation
(a) Method of difference; if the disease frequency is significantly different between two sets of
circumstances the disease might a causal association with a particular factor that differs
between the two
(b) Method of agreement; if a single factor is common in a number of circumstances in which the
disease occur causal association can be suspected.
(c) Method of concomitant variation; if a factor varies in proportion to the frequency of disease
causal association can also be suspected.
Case-control studies
(Also called case-referent, case-comparison or case history studies)
A case control-study is an epidemiologic investigation which involves comparing the characteristics
of diseased persons (the cases) with those of non-diseased persons (the controls). The purpose of
this comparison is to identify factors which occur more (or less) frequently in the cases as compared
to the controls and hence provide clues regarding the role of such factors in elevating (or reducing)
the risk of the disease under investigation.
Advantages
- Efficient in time and cost.
- Efficient for the study of rare diseases.
- Efficient for the study of chronic diseases.
- Tends to require a smaller sample size as opposed to other designs.
- Allows exploration of a large number of exposures for the disease under investigation.
- May be completed more rapidly than alternative designs.
- Ethical problems are minimal.
- Subjects need not be volunteers.
- Attrition problems are minimal.
- Used to test the hypothesis that 'disease in the cases is or is not associated with exposure to
some factor' hence can provide suggestive evidence of a causal relationship that warrants
public health intervention to reduce exposure to the risk factor
Disadvantages:
- Both the exposure and disease have already occurred at the time the participants are
recruited into the study, hence the design is susceptible to bias arising from differential
selection of cases or controls into the study on the basis of their exposure status as well as
from differential reporting or recording of exposure information among study groups based
on their disease status
- Inefficient in ascertainment of rare exposures
- Rates can not be computed directly unless a modified design is adopted
- Temporal relationships usually difficult to ascertain
Retrospective design:
All the cases have been diagnosed by the time the investigator initiates the study (prevalent cases
are used) Disadvantage: (1)Sampling bias (systematic differences between the study population
and the target population that pose a threat to external validity) Why ? Because patients who die or
recover rapidly, as well as those with mild symptoms whose illness goes undetected are not
represented in the study sample. As a result, only survivors are studied. These survivors are unlikely
to be representative of the total patient population. (2) Temporal relationship is difficult to establish.
Prospective design
The study is began and all new cases diagnosed within a specified period of time are included into
the study. This design tries to avoid studying survivors and hence addresses etiological factors as
close as possible to the commencement of the disease process.
72
CASES:
Cases should represent as far as possible a homogeneous disease entity as e.g. cervical cancer or
cancer of the body of the uterus and not uterine cancer (which includes both types), to ensure that
the cases selected represent a homogeneous disease entity :Establish a strict diagnostic criteria for
the disease. Depending on the certainty of the diagnosis and the amount of information available it
is often useful to perform the analysis and present the results separately for cases classified as
definite, probable or possible.
Selection of cases: Should be selected from a well defined population called: “source population”.
The following are possible sources of cases:
2. General Population:
All diseased individuals or a random sample from a defined population . Avoids bias from selection
factors that led the affected individual to utilise a certain health care facility.
CONTROLS:
Controls are necessary in order to allow the evaluation of whether the frequency of an exposure or a
specified characteristic observed in the case group is different from that which would have been
expected based on the experience of a series of comparable individuals who do not have the disease.
To select controls consider:
Sources of controls:
1. Hospital/Health care facility
These may be patients admitted at the same hospital as the cases for conditions other than the
disease being studied
Advantages
- easily identified
- readily available and minimizes assembling costs
- minimizes potential for recall bias in both cases and controls
- provides for similar hospital selection factors for cases and controls (if it is not a referral
hospital for that particular disease-patients may be drawn from a wider area in comparison
to the controls)
- likely to co-operate than healthy individuals hence minimizing non-response
Disadvantages
They are ill by definition and differ from the healthy in a number of ways that may be
associated with the exposure. Thus the experience of these patients may not accurately
represent the exposure distribution in the population from which the cases are derived.
73
2. General population
This is the source to be used when the cases have been selected to represent affected individuals in a
defined general population, and also when hospital based controls are not scientifically desirable or
feasible
Though controls from the general population may represent the non-ill individuals in the
community they are not short of problems
- usually costly and time consuming
- sampling frame may not be available
- difficult to get hold of busy people with a lot of scheduled activities
- may not recall exposures at the same level as the cases
- individuals who have not experienced an adverse health effect, are less motivated to
participate, hence non-response may be higher than in hospital based controls
3. Special groups:
Advantages:
They are healthy like other members of the population.
Likely to be co-operative due to the interest they have in the health of the case.
May offer some degree of control of confounders such as ethnic background, socio-economic status,
environmental factors and the like.
Disadvantages:
Due to the closeness of the controls to the cases, distribution of the factor under study, may be
similar in the cases as in the controls, and hence an underestimate of the true effect of the exposure
may occur
Results from a case control study can only provide an estimate of the relative risk, and this can be
done under the following assumptions;
- The controls are representative of the general population with respect to the frequency of
exposure.
- The assembled cases are representatives of all the cases of the disease
- Frequency of the disease in the population is small.
Under these assumptions the odds ratio can be used to estimate the relative risk and can be
calculated as the ratio of odds of exposure among the cases to that among the controls (for
unmatched designs).
Case Control
Exposed a b
Not exposed c d
Odds ratio=
a/c ad
----- = ------
b/d bc
In interpreting results from case control studies and indeed from any other epidemiological study,
the following should always be considered:
* Bias (minimized by adopting adequate design and implementation)
* Chance (evaluated by statistical methods)
* Confounding (may be minimized by adopting adequate design and adjusting for it in the
analysis).
When the above are taken care of, the ratio may be interpreted as follows: An odds ratio of 1
signifies lack association between exposure and disease. Odds ratios different from 1 indicate the
possibility of an association between exposure and disease. If the odds ratio is greater than 1
exposure to the factor will lead to an increased risk of disease and if the odds ratio is less than 1 it
shows a protective effect of the exposure under investigation.
Basic data from 1:1 matching may be represented in the following way:
CONTROLS
Exposed Not exposed
CASES
Exposed a b
Not exposed c d
Odds ratio estimate = number of pairs with unexposed control and exposed case
number of pairs with exposed control and unexposed case
b
Odds ratio= ------
c
i.e. only the discordant pairs contribute to the association between factor and disease.
The interpretation of the odds ratio is the same as already mentioned earlier.
References
1. Lewis H Roht, Beatrice J Selwyn, Alfonso H Holguin JR
Bobbe L Christensen
Principles of Epidemiology: A self teaching Guide
Academic press Inc, Harcourt Brace Jovanovich, Publishers
San Diego, New York, London 1982 pg 217-230.
2. Charles H Hennekens and Julie E Buring
Epidemiology in Medicine
Little Brown and Company
Boston/Toronto 1987 pg 327-347
3. Rebecca G. Knapp, M Clinton Miller III
Clinical Epidemiology and Biostatistics
NMS National Medical Series
Harwal Publishing Company, Malvern, Pennsylvania
Baltimore, Hong Kong, London Sydney 1992. pg 31-50
76
Cohort studies
Below is a list of hypothetical studies which illustrate the main characteristics of cohort studies:-
- From these studies we can identify three key characteristics of cohort studies:-
• Start with "healthy" subjects (i.e. before they develop outcome of interest) who are free of the
disease of interest.
• Classify them on the basis of exposure (or risk factor).
• Follow subjects and assess the occurrence of outcome of interest among exposure groups
• Compare incidence of outcome in both groups
The term "cohort" was originally a Roman military term: a cohort was one tenth of a legion. The word
"cohort" has come to be used in epidemiology to designate individuals sharing similar experiences
which we observe in order to learn about the occurrence of disease. These studies involve assembling
healthy cohorts which are followed forward in time for development of disease. A cohort is the group
of persons who share a common experience within a defined time period. For example, a birth cohort
consists of all persons born within a given period of time. A marriage cohort would consist of all
persons married within a certain period of time.
Group or groups of individuals are defined on the basis of presence or absence of exposure to a
suspected risk factor for a disease. At the time exposure status is defined, all potential subjects must be
free from the disease under investigation, and eligible participants are then followed over a period of
time to assess the occurrence of that outcome.
The group of individuals selected to form a cohort can come from a variety of sources. The choice of a
particular group will depend on the following scientific and feasibility considerations:
a) Frequency of the exposures under study.
- For relatively common exposures, (e.g. cigarette smoking or coffee drinking) a sufficiently large
number of exposed individuals could probably be identified from a number of possible populations.
77
- For rare exposures, however, such as those related to particular occupation in specific locations, it is
more efficient to choose a group specifically because they have undergone some unusual exposure or
experience, the effects of which are to be evaluated. This allows to obtain sufficient numbers of
exposed group within the shortest possible time.
- The key issue is to have variability of exposure with sufficient numbers in various exposure groups.
b) Accessibility of cohort members for measurements and follow-up because of the need to obtain
complete and accurate exposure and follow-up information on all study participants. Most cohort
studies are therefore conducted in populations where loss to follow-up is likely to be minimal.
Incomplete follow-up affects validity of the study.
d) An attempt should be made to make the results of the study generalizable. This requires assembling
of the cohort which is representative of the population where the study is being conducted.
Once the source of exposed subjects has been determined, the next consideration is the selection of an
appropriate comparison group of non-exposed individuals. The major principle underlying this decision
is that the groups being compared should be as similar as possible with respect to all other factors that
may be related to the disease except the exposure under investigation. This means we expect to observe
the same disease rates in the two exposure groups if there is no real association between exposure and
disease.
When several risk factors (or exposures) are being considered simultaneously, the non-exposed group
should be defined as those with none of the risk factors (or exposures) under evaluation. In many cohort
studies, it may be useful to have multiple comparison groups, especially when no single group appear
sufficiently similar to those who are exposed to provide assurance about the validity of the comparison.
Example: In evaluating potential adverse health outcomes associated with the use of oral
contraceptives (OC), choice of an appropriate comparison group is of critical importance.
Some studies have used for comparison a group of women not using OC, while others have
selected a group using some form of contraception other than OC. Women not using
contraception may differ substantially from users of any type of contraception in terms of
ability or desire to become pregnant or the nature of their sexual practices. On the other hand,
women using various forms of contraceptives are likely to differ from OC users with respect to
religion, socioeconomic status, and other life style factors. Thus, it may be that no one
comparison group is clearly superior, and information on the role of OC and disease can best be
elucidated by comparing the results from various comparison groups.
NOTE:
Consistent results from multiple comparison groups reinforces the believe that exposure under
observation is related to outcome or disease under investigation.
78
Selection of subjects:
- Options:-
a) Random samples of populations
- not useful when exposure is rare
b) Special exposure groups (e.g. atomic bomb survivors)
- groups with high exposure prevalence
- useful for occupational or environmental exposures
c) Within special information groups (e.g. doctors, students)
- groups with high anticipated quality of information
Data analysis:
Basic analysis of data from a cohort study involves the calculation of rates of the incidence of a
specified outcome among the cohorts under investigation. The basic layout of data at the end of the
study is in the form of a 2x2 table as shown below:-
Measure of effect:
- Effect of exposure is measured by the ratio Ie/Io, known as Relative Risk (RR).
Relative risk is the measure of strength of the association between exposure and disease. It measures
the risk of a disease in exposed group when compared to the unexposed group. A relative risk of 1
shows no increased risk in the exposed group (i.e. no association between exposure and disease); while
a RR greater than 1 indicates increased risk among exposed group. A RR of less than 1 indicates a
decreased risk in the exposed group.
NOTE:
There is another measure of association which is commonly used. It is called Risk difference (RD) or
absolute difference which is the difference of the incidence rate in the exposed and that in the unexposed
group.
Suppose;
Ie = Incidence of disease in exposed
Io = Incidence of disease in unexposed
RD = Ie - Io
Risk difference estimates the amount of the disease which is attributable to a certain exposure
(sometimes called "attributable risk"). Therefore, it specifies the amount of disease which can be
reduced from the study population if certain exposure is removed.
79
1. Expensive and time consuming and therefore takes a long time before completion. This is true for
most prospective cohort studies.
2. Liable for attrition or loss to follow-up among subjects in the study. This can affect validity of the
results obtained.
3. Inefficient for rare diseases.
4. Not suitable for diseases with long latency period as these require long period of follow-up.
References:
Experimental studies:
Introduction:
Initially descriptive studies offer a tentative explanation of relationship between a suspected cause
and an effect. This relationship is further analysed in cheap and rapid analytical studies i.e. case
control studies. Based on the strength of the relationship established, decision is made to embark on
more analytical studies such as cohort studies. Should the relationship still be strong and convincing
decision is made to embark on an experiment. An experiment on human beings should only be
based on removal or addition of a factor which will eventually improve health of an individual or a
community. In the course of an experiment an investigator should not cause suffering to any one in
order to prove cause and effect relationship.
Types of experiments
Natural experiments:
These are observations made by comparing two similar populations in which one of them has been
exposed to some disease factor as a result of some natural disaster such as famine, earthquakes and
wars. The circumstances of exposure and observation for effects have been likened to experiments.
Differences in physical environment i.e. geochemical characteristics leading to low or high levels of
iodine and development of goitre, thus providing a proof that low levels of iodine or no iodine will
lead to development of goitre. This condition will be rectified by improving the supply of iodine
thus proving the cause and effect relationship. NOTE: It is unethical to deprive people of iodine to
establish the relationship between iodine and goitre, hence nature alone has been able to provide the
clue.
outcome. As such subjects should be assigned to intervention or control group by a way which
mostly control for those factors known or unknown so as to produce comparable groups other than
for the intervention. The comparability between the two groups is achieved by
RANDOMIZATION. This procedure ensures that all individual have equal probability to be
assigned to either control or intervention group.
Randomization process:
Randomization produces two or more study groups comparable with respect to known and unknown
risk factors. It also removes investigators bias in allocating of subjects to treatment groups, and also
guarantees that statistical tests will have valid significance levels.
Randomization methods:
Simple Randomization.
The simplest method is to toss unbiased coin each time a subject is eligible to be randomized. You should
decide before hand which of the sides i.e. head or tail will represent which trial group, and hence whichever
side appears the subject will be assigned the respective treatment group. This procedure is not feasible for
large studies where you will need to randomize a large number of subjects, in such cases the use of random
number tables is more convenient. When using random number tables you should use even and odd numbers
to decide which subjects will be assigned to which treatment group. However you should decide before hand
whether even or odd numbers should represent which group. When you are starting the randomization
process, record the page of the random number tables, the column, and the starting number, for validity of the
assignment and objectivity. The advantage of this simple method is that it is easy to carry out. The
disadvantage is that there could be a substantial imbalance between the two groups, the solution to such
problem is the use of Blocked Randomization.
Blocked randomization:
Blocked randomization (permuted block randomization) ensures that at no time during randomization there
will be imbalance, and at certain points in time the numbers in each group are equal, if they are unequal the
difference does not exceed the block length. When assigning subjects on equal probability to two treatment
groups, given blocks of even sizes (4, 6 or 8 etc.), one half will be assigned to control and the second half to
the other. The order in which the interventions are assigned in each block are randomized. One of the
disadvantages of blocked randomization is that if staff happen to know the blocking factor (block length) they
can predict the next subject and thus bias the allocation, one way to get around that, is to vary the blocking
factor(block length) randomly.
Stratified randomization:
The major reason for doing randomization is to achieve comparability between groups as much as
possible. However at base line there may be factors which are crucial in determining the prognosis
(outcome) of the disease process. In such situation stratifying the subjects according to those
prognostic factors is done, thus subjects with certain prognostic factors are pulled together and
randomized. This ensures that there is a balance of prognostic factors between the intervention
groups. These prognostic factors need to be measured during the baseline.
Advantages of randomization
-Removes potential bias in allocation of subjects
-Produces comparable groups
-Guarantees validity of statistical tests.
82
Phase I
This phase involves clinical trials which are usually performed on human volunteers. The objective
of such trials is to determine how much of the drug can be given without causing serious side
effects. Secondly it is done to understand drug metabolism, and availability. Thirdly it is to
determine appropriate dose schedules. The sample size is usually small involving 20-30
individuals.
Phase II
It requires close monitoring of each patient. The objective is to determine the effectiveness and
safety of a drug. A second objective is to select drugs with genuine effect from those without
beneficial effect or are seriously toxic. The sample size ranges from 100 to 200 patients on each
drug.
Phase III
After establishing that a drug is safe and effective it is then compared with a standard treatment for
the same condition. This is usually a large trial involving a substantial number of patients. It is a
very vigorous and extensive type of clinical investigation of a new drug. Many times when people
take of clinical trials they are referring to trials in this phase..
Phase IV
These are also referred to as postmarketing surveillance. This phase is conducted after the trials in
the previous phases having established the suitability of the drug for marketing. They are usually
large studies and of long term on morbidity and mortality. The major objective is monitoring for
adverse effects of the drug. It is also an exercise to promote the drug by introducing it to a large
number of clinicians.
• Intervention trials: Intervention is done before a disease develops in those with disease risk
factors to reduce the risk for disease.
• Prophylactic trials: Determine the efficacy of prophylactic agents such as vaccines to prevent
occurrence of disease.
Comparability in assessment
One of the requirements of experimental studies is that all groups must be assessed comparatively.
Such requirement can be achieved at two levels:
(c) Autosuggestion, beliefs that one is receiving an effective treatment can influence the
outcome. Placebo can also demonstrate beneficial effect and is referred to as placebo effect
(d) Clinicians effect by way of heterosuggestion
To avoid all this and ensure comparability in assessment and disease progress employ Blinding.
5
Blinding
In any clinical trial, bias is one of the major concerns. Bias may be defined as a systematic error or
difference between the true value and that actually observed due to all causes other than sampling
variability. Its causes can be due to conscious factors, subconscious factors or both.
Bias can occur at a number of places in a clinical trial from the initial design, through data analysis
and interpretation. A general solution to the problem of bias is blinding. Blinding is to keep the
subject, observer, or data analyst or all unaware of the assigned intervention.
1. No blinding(unblinded) :
The investigator and the subject know to which intervention group one belongs. Some studies can
only be done that way, i.e. most surgical procedures, and changes in life styles.
Advantages
-It is simpler
-Easier to design
-Easier to carry out
Disadvantages:
There is much risk that allocation of subjects to control or intervention group or assessment will be
biased, secondly subjects may drop out if they know that they are receiving
placebo.
2. Single blinding:
Only the investigator knows to which group the subject is assigned thus the patient is blinded. This
is very common in surgical procedures i.e. chest skin incision as placebo versus a therapeutic
surgical procedure like coronary vascular surgery, or comparing two therapeutic procedures i.e.
4
Bilroth I anastomosis as compared to Bilroth II anastomosis etc. In such situation the surgeon
knows to which surgical procedure a subject has been assigned, but the subject is not aware of the
intervention.
Advantages
-simpler than double blind
-easier to design and carryout
-knowledge of the intervention may help investigator while caring for patients
-enables participation since some investigators are reluctant to participate in studies which they
don't know the study group assignment.
Disadvantage:
-It increases the likelihood that the investigator will be biased in assigning subjects to intervention
group, as well as when doing the assessment, and analysing the results.
-concomitant and compensatory therapies, could be imbalanced between the two groups
5
Bilroth I II are both surgical treatments for peptic ulcer
84
3. Double blind
Neither the subject nor the investigator responsible for follow up knows to which group the subject
is assigned.
Assigning subjects to intervention and control group is done by a person other than the investigator.
Advantages are;
The risk of bias is much reduced.
The effect of preconceived ideas by the investigator is much reduced, so is the subject.
Disadvantages;
- Mainly ethical, a placebo- control study is justified if there is no standard treatment superior to
placebo, and that subjects should be informed that a placebo is in use and what their probability of
being on placebo is.
-some investigators are not willing to participate in trials which they are blinded.
4 Triple blind:
In a triple blind study the committee monitoring response variables is not told which group is
allocated to intervention or control. It gives the advantage that the committee will evaluate the
response variable more objectively. On the event that the monitoring committee has an ethical
responsibility to monitor subjects safety, such design may be unfavourable. Many will be unlikely
to participate if they don't know to which intervention each study group is assigned, since many will
like to intervene in case the subject is not benefiting or is getting worse.
-it is rare that one accepts beneficial outcome unless statistical significance has been reached, but
rarely an investigator will continue a study in order to achieve a significant difference in an adverse
direction.
Strengths
• No subject is denied the new treatment
• same subjects are on new treatment, as a result size of study is cut by half.
• Historical data easily obtainable
• Rapid & inexpensive
• feasible when large benefits are expected i.e. for a disease which always ends fatally and you
expect survival.
Disadvantages:
• Many historical studies claim benefit but? consider such benefit could be attributed to:
(a) Changes in factors affecting survivor
(b) Changes in patient management
(c) Excluding seriously ill patients
(d) Migration of patients.
(e) Limitation of data collected historically.
Could explain the observed benefit.
The comparison groups run parallel and members of each group are given treatment at
the same time under similar conditions.
The subjects are initially recruited and then randomly allocated to control or intervention group. For both
groups initial evaluation or measurement is done. Intervention group then receives the intervention, while the
control group receives a placebo or another treatment. The strength of this design is that it ensures validity.
4. Sequential design:
Patients enter the study one pair after the other. Treatment A and B are allocated to either individual
of the pair and results of treatment are analysed as, and when they become available. When a pair
gives similar treatment outcome the pair is ignored in the analysis. Not suitable for chronic illness.
5. Cross-over design:
This design is started as in the parallel group design. Treatment is exchanged between the groups in the
course of study. Suitable for chronic diseases and is unsuitable if there is carryover effect.. There should also
be no order effect. When there is no carryover effect it has the advantage of requiring small sample size. If
you are not certainly sure that there is no carryover effect avoid the use of cross-over design.
6. Factorial design:
Several trials are combined into one, in order to study the effect of several treatments for the same
disease. Assuming that there is no interaction between the drugs tested. It has the advantage that
two or more trials can be combined in one with a small increase in sample size.
Treatment A Control C1
Treatment B a b
Control 2 c d
Those in cell (a) are receiving both treatments (A and B), cell (b) are receiving treatment B and
control, cell (c) are receiving treatment A and a control, while those in (d) are receiving control
only.
7. Withdrawal design;
A subject doing well on an intervention the treatment is withheld or the dosage reduced to find the
outcome. This is done to asses the response to discontinuation or dose reduction. The benefit will be
to evaluate the duration of an effect already known to be effective. This is only feasible for chronic
illness, and investigators should seriously consider ethics before such a study is done.
8. Hybrid design.
Instead of randomizing equally the number of subjects to be assigned to the control and intervention groups,
fewer are assigned to placebo or the alternative treatment, while more subjects are randomized to the new
intervention. This becomes feasible only when information on controls is fairly recent. It should be done by
the same individual or clinic using the same entry criteria and evaluation factors.
Advantage
- It requires smaller sample size
Disadvantage
Bias from the use of historical controls
Intervention Trials:
Individuals with disease risk factors are randomized into two groups and whilst an attempt is made
to remove or reduce the risk factor in one of the groups, the other group is not intervened.
Observations are then made on both groups, to determine the effect of such intervention. Such trials
are limited to circumstances where ethical issues are unlikely to arise. e.g. To confirm the causal
association between smoking and lung cancer, one could select a group of smokers and randomize
them in two groups. The first group continues to smoke and an attempt is made to discourage the
second from smoking. If lung cancer mortality in the second group is lower than in the first, the
causal implication is obvious.
Prophylactic Trials
1. Individual allocation: Individuals are randomly allocated to the vaccination and control groups.
2. Group allocation (Community trials): communities such as villages, towns, schools or
factories are allocated wholly either to the vaccination or to the control group. Group
allocation is the method of choice when testing some live vaccines, in which persons in the
control group might inadvertently become vaccinated by contact with members of the
vaccinated group, if individual allocation was used.
3. No allocation: This method is used when there are ethical objections to leaving certain
individuals or groups of individuals unvaccinated. In such a situation, before - and-after
vaccination comparisons are made.
4. Matched allocation: Careful matching of the comparison individuals is done for all possible
factors except for the vaccination status. This method is commonly used in laboratory animal
experiments.
Disease incidence: by comparing the incidence of disease among the vaccinated with that among the
unvaccinated:
(C - V)
% Protection = ------- * 100
C
In immunological tests cut-off points are established and individuals are categorized as immune or
non-immune to the disease. It should be understood that these cut off points are often arbitrary and
may not be synonymous to protection against the disease. Protection is therefore determined by
comparing the proportion non-immune to the disease among the vaccinated with that among the
unvaccinated.
C. Sample size
• Before calculation of sample size one should initially identify the primary response variable
which will be used to judge effectiveness. Primary response variable are of two types
a)qualitative (dichotomous) response variables and b) quantitative (continuous) response
variable
The formula is; n = [P1 x (100 - P1) + P2 x (100 - P2)] x f(aB)/(P2 - P2) 2
N= 2σ2 x ƒ(&, B)
(M2-M1)2
a) Sample size
b) Blinding procedure
c) Randomization procedure
d) Types, dose, and duration of treatment.
Ethical considerations.
Every proposed trial must be exhaustively weighed in the ethical balance - each according to its
own circumstances and its own problems. On every occasion certain general questions must be
answered within the specific and particular circumstances of one trial he wishes to undertake.
89
3
RESEARCH METHODOLOGY
91
The magnitude of a particular disease present in a specific population may remain stable for long
periods of time or it may alternatively rise and fall due to fluctuations in the number of susceptible
individuals and the nature and extent of their exposure to disease agents. Diseases
continually/habitually present in populations are considered to be endemic to that population.
Endemicity denotes the habitual presence of a disease in a community. An epidemic is defined as
the occurrence of a disease in a community clearly in excess of the expected incidence
characteristically revealing itself in a relatively short period of time. In order to say that there is an
epidemic, it is necessary to know the level of endemicity of the disease. Does a single case
constitute an epidemic ? The number of cases that constitute an epidemic will vary with the type of
disease, the population, and its previous experience with that disease. An epidemic cannot be
determined strictly in absolute terms. Rather its a relative concept. In the USA a disease such as
cholera is not normally present in the population and many persons would be susceptible if exposed
to the cholera organism. If the disease reappeared, the source of the outbreak would be investigated
in order to prevent rapid and serious spread of the disease. Therefore, even one case of cholera
would constitute an epidemic in the USA. But in a country like India where cholera is always
present in some population subgroups, a few hundred cases a year may be the usual or expected
incidence. For cholera to be considered an epidemic in India several hundred or thousands of cases
(i.e. cases above the endemic level or frequency) would have to occur. Diseases like Ebola (Ref:
Zaire May 1995) do not occur habitually in human populations: a single case will also constitute an
epidemic in any part of the world.
Modern epidemiology arose out of the study of the so-called classical epidemics, such as plague,
smallpox, cholera, typhus, typhoid fever and dietary deficiencies. Some of these epidemics remain
an important threat to many tropical countries. Other frequently encountered epidemics are:
Although infectious diseases are the most important cause of epidemics in Tanzania and other
tropical countries, road accidents, drug addiction, poisoning, suicide, coronary heart disease and
lung cancer are important examples of epidemics elsewhere.
Disease outbreaks such as those of organic mercury poisoning with resultant deaths and
neurological disability have been reported from Iraq, Pakistan and Guatemala as a result of
ingestion of wheat products treated with methyl- and ethyl-mercuric compounds. The wheat was
intended only for use as seed and was so treated to prevent fungus growth.
Other disease outbreaks involving the nervous system (Konzo) have been reported from
Mozambique and Tanzania and were later found to be associated with the consumption of certain
types of cassava with a high content of cyanide.
92
(New) diseases such as Lassa fever, Legionnaire's or Veteran's disease continue to pose problems
from time to time in certain parts of the world, and now AIDS is a recognized world-wide problem.
Lassa fever: a viral disease transmitted from rodents and was first recognized in 1969. Three nurses
contracted it in Nigeria and two of them died.
Ebola, which is a viral disease was first recognized in Southern Sudan and Zaire in 1976, and
subsequently in Southern Sudan again in 1979. An Ebola virus Haemorrhagic Fever outbreak has
also been reported in Zaire in May 1995
AIDS: An immune deficiency disorder (Acquired Immuno deficiency Syndrome) brought about by
infection with the Human Immunodeficiency Virus (HIV). It was described for the first time in
1981 among homosexual men and intravenous drug abusers in USA. Later the epidemic was found
to spread among heterosexual populations in Africa and elsewhere. The disease has a long
incubation period (5-10 years) and is transmitted through sexual contact, blood transfusion and
unsterile skin piercing instruments including injections. Vertical transmission is also an important
route.
Principles of investigation:
In a clinical case, investigation and treatment must go side by side for the successful management of
the patient. Likewise an epidemic is always an emergency where action to counteract it must begin
even before complete investigation. For example, in 1854 Dr. John Snow showed that the outbreak
of cholera around the Broad Street in London resulted from contamination of drinking water with
excrement from cholera sufferers. The epidemic was well managed and put under control 30 years
before the identification and description of the cholera vibrio, the causative organism.
Types of epidemics.
1. Common Source. Occurs when a group of people are exposed to the same causative agent. If the
period of exposure to the agent is brief and essentially simultaneous for all persons contracting the
disease, the epidemic is called a point source epidemic. All persons are affected by the same source
and person -to-person transmission does not occur. Common source epidemics are not necessarily
caused by infectious agents, they may also result from common exposure to noxious agents in the
environment. The Bopal disaster involving gaseous explosions, the Chenobyl outbreak involving
radiation leakage, children swimming in a chemically polluted river or factory workers exposed to
extreme heat or volatile chemicals are some of the examples which do not involve infectious agents.
Generation time: Time interval between receipt of infection and maximal infectivity for both
clinical and subclinical infection.
presence of a high proportion of immunes although there may be a certain number of persons
who are individually susceptible to the agent.
Secondary attack rate: Proportion of contacts who get a communicable disease as a consequence
of contact with the index case within the accepted incubation period.
In general the two types of epidemics (common source vs propagated) can be differentiated by
plotting the distribution of cases by time of onset of symptoms. This is the epidemic curve - the rate
at which new infections occur.
The curve of onsets for a common source epidemic shows a rapid rise and fall within one incubation
period, whereas new cases in a propagated epidemic continue to develop beyond one incubation
period. If you look at the epidemic curve you can see that those affected have different times of
onset of symptoms. This is because of individual differences in the level of immunity and the taking
of different doses of the infective agent.
In the curve of a propagated epidemic, usually a gradual rise to a peak may be observed, followed
by a gradual fall in the number of new cases. This is because, as the number of cases increases, the
number of susceptibles falls below a critical level so that the number of new cases begins to fall.
The shape of the epidemic curve in this type of epidemic reflects several factors including the
population size and composition, the proportion of susceptibles in the population, the number of
cases at the start of the epidemic, the contact rate between the infected persons and the susceptible
individuals, the infectivity or pathogenicity of the disease agent and the incubation period of the
disease.
Sometimes it may be difficult to identify the nature of an epidemic from the shape of the epidemic
curve alone. The typical common source epidemic curves may be affected by the continued
development of cases through persistent contamination of the source, or exposure occurring
repeatedly or by a long and variable incubation period. The shape of the curve may also vary
depending on the size of the population exposed, the type of source distribution and extent of use or
the extent of contact with the susceptible population. The typical shape of a point source epidemic
may be modified by presence of more than one disease agent, each with a different incubation
period, or if secondary cases (person to person transmission) follows exposure to the original point
source.
Conversely, a propagated epidemic can create a rapidly rising and rapidly falling epidemic curve
similar to that of a common source epidemic. This is especially so when the disease has a short
incubation period and is highly infectious. (e.g. cholera).
94
Investigation of an epidemic starts with a study of the affected as well as the unaffected persons. For
this reason it is necessary to establish criteria for labelling persons as “cases” or “non-cases”.
Next clinical and laboratory studies should be done to confirm the diagnosis. Always consider
whether the initial reports are correct, otherwise you may start a large scale investigation of a "false
epidemic". You should always consider new, enthusiastic or inexperienced staff and the enthusiastic
use of new equipment. Also consider the possibility of using faulty reagents in the laboratory which
can produce false positive results.
Existence of an epidemic is established by comparing the current disease incidence with past levels
of the disease. If this is clearly in excess of the expected incidence, then there is an epidemic.
Sometimes it may be difficult to determine whether an epidemic exists due to the following reasons:
(a) Recent and marked fluctuations in the number of cases and populations although the
incidence rate may remain constant
(b) Gross exaggeration in the number of cases due to misdiagnosis, duplicate reporting by
hospitals, health centres, etc.
(c) Normal seasonal variation in the occurrence of a disease may give an impression of an
epidemic when few or no cases occurred until recently. Comparison of incidence rates
during the same season in previous years will reveal the expected frequency of disease and
clarify whether the observed frequency is unusual.
(d) Completeness of recording
(e) Manufactured epidemic - special surveys and efforts to find cases and innaparent infections
The cases are then plotted by time of onset of symptoms to determine the epidemic curve; and by
location to determine their geographical distribution (spot map). One may also draw/plot occurrence
of cases in relation to presumed antecedent exposure, e.g. place where lunch was taken, or where
they went for recreation etc.
95
(i) From the description of the epidemic; i.e. the epidemic curves, spot maps and the personal
characteristics, identify the type of epidemic - common source against propagated
epidemics.
(ii) Consider possible source or sources from which disease may have been contracted, e.g.
contaminated food or water supplies, breeding sites of insect vectors, etc.
(iii) Design an epidemiological study to test the hypothesis that the source is/or is not a
particular factor. Usually a case control study or a retrospective cohort study design is used
for this type of investigation depending on the nature of the epidemic.
(a) Use a case-control study design when only some of the population members of the
affected community can be identified and interviewed. e.g. large communities such as
villages, districts etc.
(b) Use a retrospective cohort study design when all the population members of the
affected community can be identified and interviewed, e.g. small and closed
communities such as schools, hostels, army barracks etc.
(iv) Compare cases with controls with regard to exposure to the postulated source by using a
case-control study design, or compare exposed populations with non-exposed populations
with regard to attack rates by using a retrospective cohort study design.
(v) Carry out statistical tests to determine the most probable source (Odds ratio or relative risk
and their 95% confidence intervals)
(vi) When appropriate confirm the epidemiologic findings by laboratory tests: i.e. on samples of
blood, faeces etc. or samples of suspect food, water etc.
Up until this point in time the epidemic has been studied passively by the use of reports and
incoming information. For proper control of the epidemic the next step should include active
investigation of the cases in more detail and of the remainder of the population.
These three groups can be important sources of continued spread of the disease agent to new
susceptible hosts. Further investigation may utilize the following methods, note that the methods are
not exhaustive:
This is done right from the time the existence of the epidemic is established.
- The principles of control are:
- controlling the source of the agent
- interrupting the methods of transmission
- enhancing the defence mechanisms of the host
There are as many research proposal formats as there are donors who support research activities,
and the format and information required usually varies considerably. Many of the larger donors
insist on strict adherence to given formats, which must be filled in. Some donors distribute
guidelines which the applicant follows in preparing the proposal. Yet some donors, especially the
smaller ones offer little or no guidelines.
Despite the considerable variation in formats, there are however, key sections which are
encountered in most research proposal formats. Even where there are no strict formats or
guidelines, it is advantageous to be familiar with the key elements of research proposal
development, so that essential information is not excluded and, the proposal is presented in a logical
sequence and is comprehensible to the reviewers who may often represent the views of the donor.
The Table below (from WHO/EDUC/87.187) presents modules leading to a comprehensive project
proposal. Where a donor agency requires less detail it usually suffices to only include the problem
statement, goals and objectives, methods or strategy and the resources required.
THE FORMAT
1. Summary
The summary of a research proposal, which in most formats appears at the very beginning, is
perhaps the most critical section of any proposal. It must be short (often a page or less), precise, yet
interesting and must state why the research project should be carried out and not what will be done.
The summary aims at convincing the donor agency that the proposal is relevant to his concerns and
yet meets the concerns, needs, and interest of the investigator, the institution and even the country.
Although short, the summary must also clearly state the problem, the objectives, strategy or
methods, and the total budget.
The summary among other things, guides executives receiving the proposals whether it is relevant to their
funding priorities, where to refer it to for detailed evaluation, etc. A busy executive may reject a proposal
98
merely on the basis of the summary. Later at committee level, some members may only read the summary.
Considerable effort must therefore be invested in preparing the summary, which although it usually comes
first, it must be written last.
One author says, "Goals are broad statements of what is ultimately to be accomplished. Objectives
are more specific aims which the project is to achieve" another author says, "Research objectives
are the goals to be achieved by a research project. The general objective or purpose of the research
identifies in general terms what is to be accomplished by the research project and why". "The
specific research objectives identify, in greater detail, the specific aims of the research project, often
breaking down what is to be accomplished into smaller clear, logical and measurable components."
Therefore, research objectives are stated as questions, or positive statements, or hypotheses
In clinical studies attention is paid to detailed descriptions of study protocols, criteria for selection
of study subjects, description of investigations to be made, patient care, patient follow up and a
description of the clinical facilities available. Clinical trials are even more demanding in that the
protocols are much more detailed - they include for example a description of the type of protocol
(e.g. controlled, single, double or even triple - blind, etc.), the number of treatment groups, the
proposed trial agent or drug, the criteria for selection, admission procedures, criteria for
discontinuation, follow up, etc.
Field Studies naturally must describe the study area (including sites, geography, climate, seasons,
etc.) and the human population, its activities and socio-cultural background. In such studies, one
must also describe the epidemiological method to be used (descriptive, analytical or experimental),
relevant epidemiological information available, and whether control measures (of the diseases to be
studied) have been undertaken and their impact if any.
99
Whatever the type of study, the following areas must be well documented in the methodology
section of the research proposal..
• Study design
• Study population
• Sample size
• Research instruments
• Definition of terms (i.e. what is a case and what is a control?)
• Data collection and management
• Ethical issues
• Perceived constraints, biases and limitations in achieving the set objectives
Then a workplan which outlines the activities, targets and schedules is presented. Workplans,
especially in large projects, may be subdivided into their component activities, and a timetable or
schedule is drawn showing when each activity will be carried out. Gantt charts are often used to
show when particular activities are to be done. You may for example show the periods for: pre-
testing of questionnaires, carrying out pilot studies, data collection, data analysis, etc. Remember to
consider allocating time for staff recruitment, staff training, acquisition of supplies and equipment.
Make sure you also include a description of the project organization (e.g. project organization
chart), explaining lines of linkage to other units or external institutions. Bio-data or CV of principal
investigators and their co-investigators is essential to increase the donors’ confidence in the strength
of the team.
Monitoring is undertaken by all concerned, especially the principal investigator, the members of the
research team, representatives of collaborating units or institutions, and a project officer from the
donor agency. Monitoring is often based on reports, field visits and regular meetings of the project
team. Workplans are invaluable in project monitoring and should always be prepared and utilized.
Evaluation on the other hand involves a systematic collection and analysis of data to assess whether
the objectives (i.e. goals) of the research project have been met. Data gathered in monitoring may
be used in final evaluation. Evaluation often focuses on the methodology used (was it appropriate,
valid, reliable), the design, data analysis, implementation, dissemination of research results, use of
findings (by government, communities, health workers, research team itself, etc.) financing (was the
investment worth it?), and follow up (has it led to further research activities, implementation etc.).
Evaluations are often scheduled in the middle and towards the end of the project period, and may
involve donor representatives.
100
Some donors stipulate listing of other contributions, especially those by your government or
institute to ensure commitment. Examples here will include personnel already employed in
government service who may not need salaries to do the study. Other recipient government
contributions may include office space and utility bills such as electricity, water and other well
established services e.g. accounts and general administration.
(ii). Consultants
-fees and honoraria may be required for consultants.
(iii). Allowances
-Subsistence allowances, hardships, housing, mileage, etc. Will be required by project staff.
(vi). Transportation
• for consultants
• for researchers and their assistants, etc.
• for project managers and administrators
• for participants to seminars on the research project
• fuel, maintenance and repair of project vehicles.
Depending on the type of project and donor, 5-10% contingency budget item is reasonable. Given
the ever escalating costs of living, funds for any recurring budget item should be increased every
year to cater for inflation.
Most donor agencies insist on dollar budgets, yet others want local currency budgets. Some require
two columns one for local and the other for foreign currency. It is therefore essential to consult the
particular guidelines of your target donor agency.
Where more than one donor is requested to fund a proposal, it is advisable to present a
comprehensive budget showing the amount you are requesting from each donor as well as that to be
contributed by your institution (and government), even if the latter are in kind. A budget Check List
from WHO/EDUC/87.187 is attached for further guidance.
102
9. References
A reference list must be prepared and included in the proposal. All references cited in the text must
be listed here in one of the acceptable formats. Alternatively and in the early stages of a proposal a
reference list must be kept for future use during report writing or manuscript preparation.
Capital Expenditure
1. Construction Office buildings; staff housing; training schools;
health units; other construction. (Few donors support
this budget item)
2. Commodities Equipment; furniture; vehicles etc.
Recurrent Expenditure
3. Personnel Technical staff; support staff; consultants; casual
labourers
4. Training Fellowships; study travel; workshops; refresher
courses; correspondence courses; extension courses;
other courses.
5. Travel costs Train/bus/air fares; per diem, etc. for personnel.
6. Supplies Medical supplies; training supplies; vehicles; office
supplies; printing and photocopying
7. Maintenance Equipment; furniture; buildings
8. Vehicles running /maintenance Operation and maintenance of vehicles
9. Other costs/ institutional overhead Utilities (electricity, fuel, water, rent);
communications (telephone, telex, fax, postage);
licences and permits
10. Contingency Usually 5% of the budget
11. Inflation
12. Budget justification
103
Some "scientists" prepare articles for publication just because they feel it is time their name
appeared in print, or because there is a forthcoming staff review, or they think they have hit a new
finding and can therefore, submit a preliminary report. Whatever the circumstances, scientific
papers should describe significant findings, that extend scientific knowledge or advance application
of existing tools or known principles. Remember "you are what you write" - many who will read
your paper will judge you so, and you will usually never have the chance to correct your paper.
It is advisable to first write down your conclusions clearly, precisely and economically, and relate
them to the statement of the problem or the hypothesis that was being advanced at the time you
were developing the research proposal. Measure these conclusions against existing reports, and
discuss them with experienced scientists for advise before deciding whether your findings are worth
publishing. Remember papers likely to be read are those that are useful to readers because they
answer questions for them. An effective paper deals with an important question clearly answered
with an adequate evidence for the answer. Sometimes it may be important to apply the ‘So what
test’ How important will the message be ? What if the paper’s message is correct, ‘So what’. The
‘so what’ look at the message of the paper, is a powerful, but crude sounding measure of its
importance. What effect will the message have ? Will it change concepts or practices ? If so, How ?
If you are satisfied with the answers to these questions then proceed.
Only submit for publication previously unpublished work (except in a review article) nor work
being considered for publication elsewhere. But originality is not invalidated by previous
publication of a preliminary communication, although early release of results (e.g. to the press, in
conference proceedings etc.) may lead to their rejection in scientific journals by some editors.
At this early stage, consider who will read the paper, and this will determine the "target journal" to
which you submit the manuscript. The author should draw up a list of journals that seem to be
suitable choices, the list should be short enough to allow for considering each journal carefully.
Apply these questions to each journal.
• Is the topic of my proposed paper within its scope ?
• Is the topic represented in the journal frequently or only rarely ?
• Would it offer the best match of audience with that topic ?
• What formats does it accept ?
• Does it publish an information for authors page or issue a similar sheet or booklet that may give
better answers to these questions ?
On narrowing down the possible journal after applying the above questions, read their purpose and
scope and/or their instructions to authors. Your final choice may also be influenced by such
considerations as: the scientific quality of their previous publications, the quality of their print, the
time taken for articles to appear in print, their reach (e.g. number published per issue), appearance in
major abstracting journals, etc. On choosing the target journal, carefully study its most recent
"Instructions to Authors" so that you can start preparing the article according to their style.
104
You should decide on authorship (unless you worked alone!) at a very early stage. Include as
authors all those who contributed substantially to the theoretical or practical aspects, the following
four criteria may be considered when justifying authorship:
1. An author should have generated at least a part of the intellectual content of the paper: initially
conceived of the study it reports, if it is a research paper or case report; or developed the plan for
the paper, if it is a review or an editorial.
2. An author should have collected reported data (including clinical observations) and interpreted
them for the papers message.
3. An author should have taken part in writing the paper or revising its intellectual content.
4. An author should be able to defend publicly in the scientific community that intellectual content
of the paper for which he or she can take responsibility.
Despite meeting any of the above criteria, never include anybody before obtaining his/her approval.
Arrangement of authors' names usually reflects the amount of contribution to the work reported,
starting with the researcher who contributed most. If several researchers made equal contributions,
arrange their names alphabetically. If you work in a team, and publish several papers together, take
turns at being first author.
Besides style, many journals have other requirements, especially relating to ethics where humans
and other animals are investigated. You should be familiar with the policy of your "target journal",
or write and ask the editor. Now compose a working title and draft abstract (i.e. a short version of
the article). The working title must cover only one topic. In the working abstract express concisely
the research hypothesis, the approach adopted, the findings and the conclusions reached.
Next determine the organization of the paper - again refer to the "Instructions to Authors". Most
biomedical articles run through four sub-headings viz.: Introduction, Materials and Methods,
Results and Discussions. Other journals, especially in a descriptive field may run; "Introduction,
Population, Materials and Methods, Data Analysis, Results, Discussion, and Conclusion. A paper
describing a new method may run: Introduction, Description of Procedures, Test of New Method,
and Discussion. These will form the first-order headings of your article.
Write each first-order heading at the top of a sheet of paper, and on each write all the relevant
findings or thoughts as they occur to you, referring to your laboratory, field, or other notes and
records. Then add the tables, illustrations, and all the reprints (or relevant bibliographical cards)
necessary for the article. Then take the tables and illustrations, and improve their design so that each
item conveys a clear message. Pay close attention to the title, legend, and design; make sure they
provide maximum information in minimum space. The title of the table should reflect the contents
of the table and should guide the reader on how to interpret them. Model the tables in accordance
with the style of its "target journal".
Next number the tables in the order they are expected to appear. Then on a separate sheet list in
order, the tentative titles of the tables. Within each table, arrange the columns so as to allow readers
to easily understand the contents of the table.
Remember, in tables; do not give numbers implying greater accuracy than the methods used; give
numbers to the nearest significant figure; do not give percentages to more than one decimal place;
eliminate surplus zeroes by choosing appropriate factors in the column heading; and avoid dashes in
columns (make a notation, e.g. *, then explain in footnote). Make sure that tables are not crowded,
and can preferably fit across a column or at most across one journal page only. Try to omit certain
information from an over-large table, or even split such a table; many journals reject large tables.
105
Do not include columns of data that can be calculated easily from data given in other columns. For
example, never include percent positive and percent negative in the same table.
For illustrations most journals prefer line drawings (diagrams, histograms, graphs) to photographs.
Illustrations must be functional, they should not repeat materials presented in tables or text. Since
illustrations are expensive, only submit them if essential. Many journals will accept colour
illustrations only at the author's expense. In making reference to tables and figures always use upper
case as in, “see Table 3 or see Figure 1; not see table 3 or see figure 1”.
In case material is borrowed from other people’s work, the author must request and get written
permission for its reproduction, from the copyright holder. Even when material is quoted from
unpublished work, permission to reproduce or quote it must be obtained.
At this moment you already have drafted a working title and abstract, tables and illustrations and the
outlines. The notes collected earlier are then arranged to form topic and sentence outlines. Quickly
write the first draft following the already prepared outlines. At this stage use the name-and-date
system for citing references, and do not worry much about the grammar, but try to use the
appropriate person, tense and choice of verbs.
Introduction
The introduction must be brief - two or three paragraphs - indicating the aim and scope of the paper.
The introduction must, therefore, open with a succinct statement of the subject, orient the report in
its field, and should explain the purpose. The introduction thus serves as a link between the past
and the present. Generally all aspects of the literature cannot be covered for a relatively brief
research paper. Therefore the review of past research is done with a bias towards only those aspects
of the problem which are of direct relevance to your report. A good introduction starts out by
making a few general statements about the field of research, leading down logically to a narrow and
specific statement of the aim or hypothesis. The last paragraph of the introduction should state the
precise aims or the problem being investigated or the hypothesis being tested.
Methods
Details of how research was carried out include description of the study population, its size and
selection. Description of research instruments such as questionnaires, surgical and laboratory
equipment is essential in this section. Adequate description of research procedures must be given
here to provide the reader with enough information to allow replication (of course, replications may
not be feasible for a unique event such as a case study of a specific individual). Do not describe
familiar procedures, use references instead. Follow a logical order in describing the methods -
many authors follow a chronological order . For chemicals and drugs use systematic chemical
names, or international non-proprietary names. Where laboratory animals are used, give the genus,
species, race, strain, breed or line name. Ethical considerations must be discussed in this section.
Unusual equipment must be described.
Results
The results section must be comprehensive on its own, and results must be presented in a logical
order. This section presents the findings of the investigation and draws attention to points of
interest. Raw data and statistical calculations are not presented in this section. Rather, we use the
principles of descriptive and inferential statistics to present the summerized and analysed data: i.e.
graphs, tables and outcomes of statistical tests and also provide their interpretations.
Discussion
In the discussion section the validity of the present work is assessed, these are comments on the
significance of the results, which are also compared to previous work. In writing a good discussion
section consider the following questions,
106
"Did your research find what you expected?" "How do the present results relate to those of
previous research?". If unexpected results were obtained, possible reasons for the outcome should
be discussed such as faulty designs and inappropriate selection of controls in previous studies.
In the conclusion, usually the last paragraph of the Discussion section, you summarize your main
findings and make recommendations on action to be taken as a result of your findings. Here you
also point out new lines of study and suggestions concerning further research.
A summary is included where the "target journal" wants it. The summary then states an outline of
the methods, the main findings and conclusions, referring briefly to new hypotheses and future
work - it may (unlike the abstract) include references to figures and tables. Some journals request
for a non-English summary.
Remember to acknowledge any substantial help received e.g. from a donor agency, from individuals
who gave money, materials, specimens, referred patients, assayed materials, advice on the execution
of the study and for typing the manuscript.
First examine the draft to make sure that paragraphs and sentences are in the right order, that all
essential points are included, superfluous ones removed, and that arguments run logically. Check
the paragraphs - they should not be too long (usually not more than 125 words) nor too short to be
adequately descriptive. In the results, make sure that no sentences begin with a number; e.g. 25% of
the population....! And that all numbers below 10 are written in words; e.g. four, six, nine etc.
Instead of 4, 6, 9 etc. Check the tables and illustrations. Some may need modifications, some may
be superfluous, some may be combined etc. Now have the tables and illustrations made in their
final form. The paper size used for illustrations must not exceed A4 paper (210 x 297 mm). Follow
the instructions given by the "target journal" and the "Uniform Requirements for Manuscripts
submitted to Biomedical Journals".
Check the legends against the illustrations and make sure they agree. Now re-read (completely) all
the papers cited to make sure you do not misquote or misrepresent the work cited. Next check for
consistency. Make sure you adhere to the style of the "target journal". Try to reduce the number of
abbreviations and footnotes.
Now examine the first draft and from it check the style. Make sure you use concise language; be
precise, clear and avoid very lengthy construction. Also avoid verbosity and pomposity; you should
write (and speak) to express, not to impress. To avoid misuse of English words, use a thesaurus at
this stage. After reviewing and editing the first draft (for style and structure) prepare the second
draft by adding more materials to it and editing it. Have the paper reviewed by critical but
sympathetic colleagues and co-authors.
Now go over carefully the comments from the co-authors and colleagues, and make changes where
you feel they are needed. Then polish the article, paying very particular attention to the title and
abstract, and prepare the final reference list. The title must be concise, accurate and informative.
107
Some journals specify the permitted or preferred title length. Aim at keeping the title below 15-20
words or 120 characters (including spaces). It is advisable to omit such phrases as "Observations
on.....", "A study of ....", "An approach to ...", and the like.
After the title, supply a short title called the "running head" which usually consists of 45 to 60
characters. The final abstract must summarize the contents and conclusions of the paper, pointing
out new information and indicating the relevance of the work. The abstract should be completely
self-explanatory and intelligible in itself. Do not write footnotes nor quote references in the abstract
itself. This abstract, which is on a separate page, will be submitted by the editor to an abstracting
journal. Next select "index terms", i.e. key works or phrases suitable for use in the journal's subject
index and for information retrieval systems.
The next stage is to assemble the completed manuscript. The title page bears the new title and the
by-line (i.e. names of the authors and the addresses where the work was done). The abstract follows
on a separate page; depending on the target journal's instructions, index terms may be on the title
page or on the page with the abstract.
Now assemble the text, adhering to the target journal's instructions. In the text use symbols for
units of measure whenever a number precedes them, but spell out the name of a unit if the number is
spelt out, for example at the beginning of a sentence. Alternatively you may recast the sentence so
that the number falls in the middle of the sentence. Never begin a new sentence with a number.
In case there is an appendix, it should go either after the text and before the acknowledgements and
references, or after the list of references. Place the acknowledgements on a separate page at the end
of the paper, before the references. References should conform to the guidelines of the target
journal. Assemble the tables and illustrations (and legends) making sure each is lightly marked in
pencil with the name of the first author. This will prevent mixing of illustrations of one paper with
those of another during publication. Make sure that references in the text and the list of references
follow the instructions in the target journal, and that they are correct.
Then have the whole paper typed or edited by a competent typist. In the advent of computers or
word processing equipment this is only a simple editing task. Aim at creating a good first
impression (although ultimately the article will be judged by its contents). The article must be neat,
clean and accurately typed, on good quality paper. Where journals are photographically reproduced
directly from the author's typescript, perfect typing is absolutely essential. Only use white A4 size
(210x297mm) paper. The original copy must be on strong good quality paper to withstand frequent
handling. Do not get disheartened when you find that you are typing and re-typing your article
many times. This is the only way to obtain a final and refined manuscript. When the typescript is
ready, read it word for word, and make necessary corrections on all the copies.
Regulations regarding permission to publish differ widely. Some academic institutions, lack such
regulations. But other organisations, require their staff to obtain clearance before the article is
submitted to the editor. It is probably more convenient to submit the manuscript for clearance
before the final typescript is made. Under all circumstances it is wise to obtain approval from a
responsible official in order to safeguard the interest of the staff member, and his/her institution;
this is more so with technical data, and where sensitive issues are the subject of the report.
Now the final typescript is ready for submission for publication in the target journal. Write a short
covering letter to the editor, following the instructions given in the "Uniform Requirements for
Manuscripts Submitted to Biomedical Journals" and the instructions to authors from a recent issue
of the target journal.
On receiving your manuscript, the editor will submit it to referees, who will review it and return it
to the editor with their comments. The editor will then compile the comments, and mail them to the
author. Your article may on very rare circumstances be accepted without revision, it may be
rejected outright ! In case the editor rejects the article, study his comments carefully, and consider
submitting it to another journal; contesting the editor's decision is often fruitless.
Where the editor requests for revision, consider the suggestions carefully, and modify the article if
you agree with him/her. If you do not agree, write a polite letter explaining your position.
Proofs
After you have corrected your manuscript the editor will then send it to the printer, through a
publisher. The printer will usually prepare galley proofs, send them to the editor who will in turn
send them to you (the author) with appropriate printer's and editor's correction marks. Read the
proof carefully, word for word against the typescript, to detect errors, omissions or other mistakes.
Mark all corrections and instructions in the margins of the proof, with corresponding marks in the
text. You must, therefore, be familiar with the basic symbols used in proof corrections. Make sure
the proofs are returned to the editor immediately.
References
1. Council of Biological Editors (1972) CBE Style Manual 3rd Edition, American Institute of
Biological Sciences, Washington, D.C.
2. International Committee of Medical Journal Editors (1986) Uniform Requirements for
Manuscripts submitted to Biomedical Journals. Annals Internal Medicine 108 (2): 258-65.)
or BMJ 296: 401-405
3. O'Connor, M. and Woodford, F. P. (1975) Writing Scientific papers in English: An HLSE-
Ciba Foundation Guide for Authors Associate Scientific Publishers, P. O. Box 211,
Amsterdam.
4. Introduction to research in the Health Sciences, Stephen Polgar and Shane A. Thomas
Churchill Livingstone, Melbourne 1992.
5. Huth E,J., How to write and publish papers in the Medical Sciences, Second edition,
Williams and Wilkins, London 1990.
109
It is imperative that in order to evaluate and critique a scientific article (or a research proposal) you
must know how a scientific article is written. We have already had a session on how to write a
paper for publication. Knowledge of how to write a scientific paper for publication is therefore a
pre-requisite for evaluating critically such communication. This section of the lecturer notes
therefore assumes that you have this knowledge. Before proceeding with this set of notes you are
therefore advised to recapitulate on the steps in writing a paper for publication.
This section deals mainly with scientific articles which are a result of original research work.
General knowledge in this area will also enable you to evaluate research reports, case reports,
scientific reviews and editorials including letters to editors of scientific journals.
We may scan journals as we do newspapers to keep in touch with what is going on in our fields.
But in reading some papers closely we are looking for more. i.e. answers to questions; solutions to
problems. For example, a clinician may wish to know the best antibiotic for treatment of a
pneumonia. A nursing instructor may wish to know the best way to teach the surveillance of
obstetrical labour. The microbiologist may wish to know what characteristics of Legionella
pneumophila justify creating a new genus and species in bacterial taxonomy. The physiologist may
wish to know how substance X mediates renal vasoconstriction. The clinical pharmacist wants to
know the pharmacokinetic variables by which the dose of drug Y can be adjusted when its clearance
by the kidney is reduced by half.
Scientific readers are looking for more than just answers to questions and solutions to problems. If
they only want to get the answers and the solutions, they need not read entire papers because the
answers and solutions are usually stated clearly in a summary at the head of the paper, or at the end
of the paper’s Introduction or Conclusion section. What we need as readers is to be convinced that
the message of the paper - its answer to our question, its solution for our problem- is correct, is
valid, is reliable. We are not going to adopt the message into our way of thinking and working
unless we are persuaded that we should. Scientific papers are not just baskets carrying unconnected
fats, like the telephone directory; they are instruments of persuasion. Scientific papers argue you
into believing what they conclude; they are built on the principles of critical arguments. Rather,
“argument” is defined as “a coherent series of reasons, statements, or facts intended to support or
establish a pint of view.” And “critical” means, of course, assessing evidence carefully for its
validity, what evidence to accept and what evidence to reject.
Consider a typical paper reporting clinical investigation of antibiotic treatment for the infectious
disease Q. The investigators know that antibiotic A is often effective but that it has a high rate of
adverse effects. A new antibiotic, antibiotic B, comes along and gives in vitro evidence of
bactericidal effect against the causative organism. What is the question to be answered? “Is
antibiotic B more effective that antibiotic A in the treatment of disease Q?”
The investigators draw up a protocol for a study properly designed to answer the question. The
study gets under way. Antibiotics A and B appear for a while to have the same efficacy, but then
antibiotic B begins to look better. The investigators carefully monitor results of treatment with an
appropriate statistical procedure so that the comparative trial can be stopped when a statistically
significant answer is reached. Finally, antibiotic B is found to be better, a conclusion reached with
statistically supported evidence; the trial is brought to an end. Then the investigators write and get
published a paper reporting the trial. The paper makes clear what was studied (what question was to
be answered), the findings (evidence), their validity (statistical statements, assessment of possible
alternative conclusions), and the answer, “Antibiotic B is better.” The reader who wants of know
110
the answer and to know whether the answer is reliable gets in the paper all that he or she needs to
know. The reader is persuaded; the reader has been argued into accepting the answer. This is what
we call critical reading.
In its simplest form the basic structure of a scientific paper is summarised by the acronym
IMRAD, which stands for
However, when evaluating a scientific article the following aspects should be critically examined.
1. Title
2. Abstract
3. Introduction
(a) Review of literature
(b) Objectives of the study
4. Methodology
5. Results and analysis
6. Discussion
7. Conclusion(s)
8. References
The process of evaluating a scientific article must naturally follow the order of the sections above.
Below is a set of questions that keep reminding you what you should do to evaluate properly the
article.
1. Title
The title should have the following characteristics. If it lacks any of them you should be in
position to suggest a good title for the article after reading through the entire paper when you
know what it is all about.
3. Introduction
(a) Review of literature
Has the author outlined the background of the problem?
Has he reviewed prior knowledge on the problem?
Has he identified properly the relevant variables selected and not selected in the
preceding studies?
Has he outlined the significance or public health importance of the problem and
therefore the purpose of this particular study?
111
(b) Objectives
Broad objectives
Specific objectives
-free of value statements not amenable to testing
-meet the purpose of the study
-clear and unambiguous
-stated in measurable terms
4. Methodology
Study design
Type of study design; prevalence survey, case-control study controlled trial case reports etc.
Is study design appropriate for the study objectives?
Study population
is the study group defined?
-what criteria were used for selection?
-was any kind of control or comparison group necessary
-Does this control group adequately control for variables which are not being studied?
-was any form of sampling used?
-type and sample size:- how representative is it of the target population?
-any factors which might have biased the selection of study and/or control group?
Methods
Type of measurements: what measurements were used? were they appropriate?
-notified cases
-attitudes
-laboratory tests
-examination
instruments used to collected the data are a variety of specific equipment, registers,
observation records and questionnaires
Type of errors to be expected in the measurements and how they were dealt with:
-observer variation
-subject variation
-instrument variation
6. Discussion
Has the author assessed properly how the results have demonstrated the evidence (or lack of
it) for a new scientific contribution. Does author show value of this contribution in disease
112
control programmes? What are the recommendations if any? Do they really come from this
piece of research work? And if so what kind of studies and what possible methods?
Conclusions
Are the conclusions justified from the data and evidence given in view of any problem you
have encountered with methods, results or analysis? Can you draw any alternative
conclusions from the same data? To what extent can these conclusions be generalized?
Have we really gained any important new knowledge or insights from this study, and are they
of immediate practical value, or are more studies required?
References
Are references properly cited? Are they relevant? Are they up to date? Are they retrievable?
i.e. are they complete and properly written according to requirements for manuscripts
submitted to biomedical journals?
Further reading
1. Huth E. J. How to write and publish papers in the Medical Sciences. ISI Press
Philadelphia 1982.
2. Hall G.M ed. How to write a paper BMJ Publishing group London 1994
113
There are three alternative explanations which we need to consider before we conclude that there is a
real association between exposure and a certain disease. These includes:-
a.) Chance - observed association may be due to chance, or the "luck of the draw".
b.) Study results may be explained by some form of bias or systematic errors due to
problems related to the way subjects were selected into the study or the way information was
obtained.
c.) Findings of the study may be due to other factors associated with both exposure and
outcome of interest or confounding.
It is therefore important to consider all these alternative explanations before we can conclude that there
is a real association between exposure and outcome of interest. Most errors in epidemiological studies
results from the failure to consider these alternative explanations during the period of study design and
data analyses.
Before focusing in detail on various sources of error, we must broadly distinguish between the terms
"validity" and "precision", especially since different approaches to the assessment of each are
advocated. Validity is the extent to which the study measures what it is intended to measure. Precision is
the reproducibility of a study result, that is, the degree of resemblance among study results, were the
study to be repeated under similar circumstances. In general, these terms are concerned with two
different sources of inaccuracy that can occur when estimating an effect: systematic error (a validity
problem) and random error (a precision problem).
TYPES OF ERRORS:
I: Random Errors:
Refers to part of our experience that we cannot predict (chance variations). In epidemiologic studies,
random errors has many components, but a major contributor is the process of selecting the specific
study subjects. This process is usually referred to as sampling; the attendant random error is known as
sampling error. Case-control studies literally involve a sampling process, but all epidemiologic
studies, including follow-up studies, which do not involve actual sampling of subjects, are said to have
sampling errors.
Precision in epidemiologic measurements corresponds to the reduction of random errors. Precision can
be improved in two ways:-
114
a) The primary means is to increase the size of the study population (increase sample size).
b) Precision can also be improved by modifying the study design to increase the efficiency with which
information is obtained from a given number of study subjects. Study efficiency can be assessed as:-
Bias may be defined as a systematic error in an epidemiologic study that results in an incorrect estimate
of the association between exposure and risk of disease.
Various types of biases affect validity of the study findings. There are a lot of possible biases that can
distort the estimation of an epidemiologic measure. The distinction among these biases is occasionally
difficult to make, but three general types can be identified:-
a) Selection bias
b) Information (observation) bias
c) Confounding
These categories are not always clearly demarcated. A useful practical distinction between confounding
and other biases is to consider a bias as confounding if it can be controlled during data analysis stage.
SELECTION BIAS:
Can occur wherever the procedures used to select individual subjects for inclusion into the study on the
basis of either exposure (cohort) or disease (case-control) status leads to an effect estimate among
subjects included in the study being different from the estimate obtainable from the entire population
theoretically targeted for the study.
Selection bias can result from a number of circumstances related to the way in which individuals are
ascertained and selected for the study. Some of the major forms of selection bias include:-
a) Self-selection bias:
- Self-referral of subjects is ordinarily considered a threat to validity, since the reasons for
self-referral may be associated with the outcome under study.
- e.g. Centers for Disease Control (CDC) investigated subsequent leukemia incidence among
troops working in the atomic plants. 76% of the troops identified as members of that cohort
were traced; out of which 18% of the subjects contacted the investigators on their own initiative
in response to publicity about the investigation. In this study, there were 4 leukaemia cases
among the 18% of subjects who referred themselves and 4 among the 82% of subjects traced by
the investigators, indicating that self- referral was indeed biased towards leukaemia cases.
Selection bias can occur when there is differential surveillance, diagnosis or referral of
individuals into the study. e.g. When the first hospital-based case-control studies of the
relationship between oral contraceptives (OC) use and thromboembolism were reported, there
was some concern that since physicians were already aware of a possible relationship of these
agents with thromboembolism;
i) OC users were more likely to be hospitalized than non-users for evaluation of this condition.
ii) Among cases, those with history of OC use were more likely to be subjected for intensive
investigation to rule out this condition than those without such a history (i.e. detection bias).
115
Hence increased frequency of current OC use among women hospitalized for thromboembolism might
actually be due, at least in part, to the fact that hospitalization and the determination of the diagnosis
were both influenced by a history of OC use.
Similarly, criticism concerning the increased risk of uterine cancer associated with use of exogenous
oestrogens observed in early case-control studies of this hypothesis was based on the argument that
women who used oestrogens experienced uterine bleeding and were, therefore, more likely to seek
medical attention and have a diagnostic evaluation than women not on oestrogens.
d) Selective survival:
a) In follow-up study, the comparison group serves to provide an estimate of what the disease
incidence would have been in the exposed group had the exposure been absent or without effect.
- e.g. assessment of cardiovascular mortality among factory workers.
? Compare mortality of workers with that in the general population?
? Compare mortality of sub-groups of workers based on level of activity?
b) In case-control studies, control group provides an estimate of the exposure distribution in the
source population from which the cases originate.
• Since completely identical groups will never be found, the process of determining which
factors are relevant in choosing a comparison group is a critical feature of epidemiologic
study design.
• Use of two or more comparison groups drawn in different ways. If the effects do not differ,
it means there is no bias in selection. Consistent results from several comparison groups
reinforce belief about the results.
3. Diagnostic surveillance:
- Uniform procedures in ascertainment of exposures and diseases.
Refers to a distortion in the estimate of effect due to measurement error or misclassification of subjects
on one or more variables. Major sources of information bias include:-
116
- Invalid measurements
- Incorrect diagnostic criteria
- Omissions, imprecisions, or other inadequacies in previously recorded data.
- Unequal diagnostic surveillance among exposure groups (cohort studies). Note that in case-control
studies, selective surveillance can lead to selection bias rather than information bias.
NOTE:
RECALL BIAS is the commonest type of information bias in case-control studies. It occurs if there is
differential recall of exposure in the cases and controls. e.g. in a study to assess exposure to certain
drugs during antenatal period, mothers of malformed infants recall exposures more thoroughly than
mothers of healthy infants.
Recall bias is a possibility in any case-control study that uses an anamnestic response since the cases and
controls by definition are people who differ with respect to their disease experience, and this difference
may affect recall.
1. Precise measurements
- well defined methods for taking measurements.
- Intensive training of data collection personnel
2. Diagnostic tests
- Uniformity in criteria used in diagnosis
- Methods with high sensitivity and/or specificity required.
4. Use of appropriate tools for data collection (research instruments which provides information
required).
Confounding
Confounding refers to mixing of the effect of the exposure under study (first factor) on the disease
(second factor) with that of a third factor. The third factor must be:-
b) independent determinant of the outcome (or a risk factor for the disease under investigation
independent of exposure of interest).
Example: In a study it was found that subjects with a high consumption of alcohol (first factor) had a
higher risk of developing lung cancer (second factor) than those who did not drink alcohol. The reason
for this association was that subjects with a high consumption of alcohol were often heavy smokers.
Confounding from smoking explained the observed association between alcohol consumption and lung
cancer.
117
References:
1. Hennekens CH and Buring JE. Epidemiology in Medicine. Little, Brown and Company, 1987.
3. Abramson JH. Survey Methods in Community Medicine (4th Edition). Churchill Livingstone, 1990.
118
Introduction
1. You have now completed the part of the curriculum on principles and methods of
Epidemiology. You are now ready to learn how to apply this knowledge and skill to
describe and investigate major public health problems in Tanzania. In order to do this the
class will be divided into the usual number of discussion groups so that you can gain some
practical skills on the subject. This exercise will begin during the third term of the
academic year and will stop a few weeks before the final university examination.
2. Each discussion group will be assigned or will select a specific disease for class
presentation. Members of each group must consultant the relevant instructor for references
and relevant literature and seek advice on how to prepare and give the presentation on the
selected disease. The value of this exercise is to expose you to the kind of experience
which is hard to lean from a library. It is expected to provide you with the skills of
preparing scientific materials for public presentation. Talking with confidence in a
scientific forum is an experience you can only get through this kind of exercise.
3. Students will be required to use all available knowledge on the disease and discuss its
epidemiology using the format below. Students in a group are advised to assign each other
particular sections of the presentation and consult relevant specialists and institutions on the
subject. The library is expected to be particularly useful for up todate knowledge on the
epidemiology of diseases. Copying text books alone without relating the materials to your
presentation will not be accepted. The Epidemiology unit and the planning section of the
Ministry of Health headquarters as well as disease specific national control programmes eg.
TB & Leprosy, AIDS etc. are also good sources of information on health problems in
Tanzania. You should also consult these to improve your presentations. In this case, letters
of introduction from the department may be useful. It must be stressed that getting
information from busy government officials is often difficult and so you have to be patient
and show that you are really interested in the information you are looking for. Remember
that it is easier to obtain information when you know exactly what you are looking for than
when you only have a vague idea of your problem.
4. The presentations must be brief and to the point. Students will be encouraged to use
overhead projections during presentations. Each group will be allocated a maximum of 10
overhead sheets and water soluble overhead felt pens from the department all of which must
be returned at the end so that they can be re-used by other groups. The group leader should
organize the presentations so that they are interesting and not boring. In order that each
group gets a sufficient audience, attendance for these sessions will be mandatory and those
who will be absent shall be penalized. The current penalty is 5 marks each time you don’t
appear except when it is your turn to make a presentation.
Presentation format
B. Levels of Prevention
Classify the preventive measures into primary, secondary and tertiary levels of
prevention, showing the effect each measure will have on the disease(s) natural
distribution (natural history). This, like in “A” above, must be supported by the
most current research findings properly quoted and where possible by
demonstrations with tables and graphs.
Indicate the country’s policy towards this disease and discuss it relative to the
previous discussion on the disease’s epidemiology: What is the current stage (s) of
control in Tanzania? What control options are in operation in Tanzania?
And what is the basis for the options taken? If control is indicated or exists, what
would be the ideal control strategy in the preparatory, attack, consolidation and
surveillance phases for this particular disease considering the natural history of the
disease and available control measures. In this section participant’s comments,
speculations and suggestions are appropriate in addition to basic factual
information. Participants may differ in many of these aspects, though differences
ought to be governed also by scientific evidence.
Together with the presentation on the epidemiology and control of major diseases in
Tanzania, students will be required to critique a published paper reporting on research
results relating to epidemiology and control of diseases. To achieve this, one of the
participants in a group should look for a relevant published paper on the subject and
critique it. The participant should present a summary of the paper and then critique it as
follows:
The following is the preliminary list of major diseases assigned to groups. More
diseases and other relevant problems may be added whenever it becomes necessary.
At the end of the sessions all presented materials should be handed in for assessment and grading.
Groups must ensure that all members participate and any materials handed in must contain a list of
all participating members. A hard working group will be awarded high marks and a lazy group will
get low marks. These marks will contribute to the continuous assessment grade. Remember that
the work must be neat and tidy. Typing of materials will be encouraged but not essential.