Artificial Intelligence in Biomedical Engineering
Artificial Intelligence in Biomedical Engineering
Engineering
DRAGOS AROTARITEI
[email protected]
May 2016
Introduction
Artificial intelligence (AI) is an area of computer science, which has been
developed since the 1950s, specialized in dealing with problems considered
difficult by traditional computer scientists through the use of knowledge and
of probabilities and other kinds of uncertainties.
AI techniques can be applied to solve complex problems in biomedical
engineering.
AI subfields
Neural Networks
Evolutionary Computation,
Vision
Robotics
Expert Systems
Speech Processing
Planning,
Machine Learning Algorithms can create new medical knowledge
(supervised and unsupervised)
Natural Language
Fuzzy systems and approximate reasoning
Hybrid systems
Machine Learning algorithms
UC Irvine Machine Learning Repository
http://archive.ics.uci.edu/ml/
Pattern recognition - a branch of machine learning that focuses on the
recognition of patterns
Feature extraction
Feature extraction is an intermediary step between preprocessing data and the
process (classification, clusterization, rule generation, etc.)
Two problems arise after data is collected: dimensionality of each item from
data and the number of items. Sometimes both problems are possible to be
solved in a single step.
The digital information is usually structured in vectors and matrix having
sometimes very large dimension. As consequence, manipulation of these types
of data requires a dimensionality reduction: principal component analysis
(PCA), linear discriminant analysis (LDA), canonical correlation analysis
(CCA), singular value decomposition (SVD), dimensionality reduction via
kernels.
The second problems refers to selection of features from all the features that
are relevant for our goal.
independent component analysis (ICA), ISOMAP with extensions, kernel
principal component analysis (kernel PCA), Latent semantic analysis (LSA),
Partial least squares regression (PLS regression), ortogonal transformations
orthogonal transformation is a linear transformation T : V → V on a real inner
product space V, that preserves the inner product), nonlinear dimensionality
reduction (NLDR) - manifold learning algorithms
The are many overlapping algorithms that map the both objectives.
The feature extraction is can be completely automated very rarely, the ability
of author plays an important role in identification and selection of possible
relevant feature and the method (new or classic one) that leads to results.
Evolutionary Algorithms
Subset of evolutionary computation
The most frequently usage: optimization (single objective and multiobjective)
Genetic algorithms (based on Darwinian evolution) – the most used in
numerical optimizations
Differential evolution – Based on vector differences
Ant colony optimization – suitable for path optimization problems
Particle swarm optimization
Genetic programming – suitable for finding a mathematical formula for
relationship among variables based on experimental data
Memetic algorithms - most MAs can be interpreted as a search strategy in
which a population of optimizing agents cooperate and compete using local
strategies
Other non-Darwinian algorithms: Baldwinian Evolutionary Algorithms,
Lamarckian Evolutionary Algorithms, Genetic Local Search
GA (Genetic Algorithms)
Used most frequently for numerical optimization problems
GA belong to heuristic search that mimics the natural evolution (Darwinian
evolution)
A large area of applications including Biomedical Engineering, Clustering,
Computational Biology, Image processing (Dense pixel matching).
Explanations of the power of genetic algorithms is given by Holland's schema
theorem (fundamental theorem of genetic algorithms). Low-order schemata
with above-average fitness increase exponentially in successive generations.
A schema is a template that identifies a subset of strings with similarities at
certain string positions1.
The chromosome is used to code de information. The usually conversion is
value to binary string of „0‟s and „1‟s. Each value „0‟/‟1‟ is named allele.
A general schema for GA
f
i 1
i
2D. Arotaritei,
C. Rotariu, Parameter Optimizations in Detection of Atrial Fibrillation Based on Heart Rate Interval,
BIOMEP 2015 - Conference on Bio-Medical Instrumentation and related Engineering and Physical Sciences, 2015,
Athenes. Greece
The set of parameters are focused on length of window L because an optimal
window can produce a relevant value of randomness of time series that can be
interpreted as AF
A set of parameters can be defined as P= [L, p1, p2,..., ph]. Optimization of
some parameters can be conflictual that is if optimization of objective using
one parameter can produce a decrease of optimized value for other
parameters. A tradeoff using optimized curve of values can be used to choose
a correct tradeoff (e.g. Pareto front).
The selected parameters are3: L – length of window, Pth1 – threshold from
where the segment is considered to be AF (number of AF segments/number of
total segments), RMSt – threshold of Root Mean Square of Successive
Differences, TPRt threshold of Turning Point Ratio, SEt - threshold of
Shannon Entropy and TKt - threshold of Teager-Kaiser energy (six
parameters).
An vector of HRV descriptors are good for detection of AF if sensitivity Se
and specificity Sp are both enough close to 1.0 for a correct classification.
Genetic Algorithms proved to be a good choice for optimization problem,
single objective or multi-objective.
The parameters are set to be limited by lower values [32, 0.1%, 0.01, 0.01%,
0.01, 0.01] and upper values [480, 99.99%, 0.999, 99.99%, 0.999, 0.999].
The chromosome coded for first value is mapping linearly into integer values
between 32 and 480 (logarithmic mapping is a solution for some types of
problems).
We start with a population of 50 individuals and after 40 iterations the
solution is given in next figure. The multi-objective function is minimization
of bot functions 1- Se and 1-Sp. The results are Se=90.17 and Sp = 90.12.
Single objective GA, ffitness= 1-min(Se, Sp)
fusion of two objectives (if we have more than two
objectives, the graphic become more difficult).
Artificial Neural Networks (ANN or shortly NN)
Neural networks are models inspired by biological neural networks. Main
features: neuron transfer function, architecture, neuron connections.
Main usage: function approximation, classifier, pattern recognition,
prediction.
ANN proven to be universal approximators (Haykin S.).
In 1989, Cybenko G. demonstrated, using a Kolmogorov‟s older result that
multilayer feed-forward network with a single hidden layer, which contains
finite number of hidden neurons, is a universal approximator. The
demonstration has been made for sigmoid activation function.
In the most common sense, the universal approximator can approximate any
non-linear function with a desired precision if its architecture is large enough.
Other NNs architectures have been proved to be also universal approximators:
radial basis function (RBF) (Park J., and Sandberg I.W.), recurrent neural
networks (RNN) (Schäfer A.M., and Zimmermann H.G.) and Kohonen maps.
Recurrent Neural Networks (RNN) in various architectures and connection
among neurons are the subject of dynamic nonlinear modelling and prediction
with very good results.
The most common architecture is feedforward Neural network (multilayer
perceptron) a (axial depth of cut [mm]), f (feed per tooth
p z
[mm/tooth]), and Vc (cutting speed [m/min]).
The outputs layer has six neurons
corresponding to average cutting forces and
maximum measured forces
S. Dash, K.H. Chon, S. Lu, E.A. Raeder, “Automatic Real Time Detection of Atrial Fibrillation”, Annals of
Biomedical Engineering, vol. 37, issue 9, pp. 1701-1709, 2009.
Conclusions
A gene is a segment of DNA containing the code used to synthesize a protein.
A chromosome contains hundreds to thousands of genes.
The number of individuals in population at each generation is fixed GA. Some
researchers proposed GA schemes using variable size of population during
evolution
The length of chromosome are the same for all individuals. There are some
proposals that use a variable length genotype (the organism itself), SAGA.
Algorithms from Artificial Intelligence are not a universal solution for
optimized problems - sometime other solution based on non-heuristic models
can be more efficient.
Σας εσταριστώ για την προσοτή σας