A Candidate Lattice Refinement Method For Online Handwritten Japanese Text Recognition
Abstract-This paper presents a candidate lattice refinement method for online handwritten Japanese text recognition. In the integrated segmentation-recognition framework, we first over-segment a character string pattern into primitive segments at least at their true boundaries, so that each primitive segment composes either a single character or a part of a character. A candidate lattice is then constructed from the primitive segments, and we search the lattice for the optimal path, which gives the recognition result. To achieve high recognition accuracy, however, this approach must generate many candidate lattice nodes, which increases recognition time. To solve this problem, we refine the candidate lattice to eliminate unnecessary nodes before path search and text recognition. For the refinement, we evaluate all segmentation hypotheses by combining the probability of a character verifier using noncharacter samples, the class-independent unary and binary geometric context, and character segmentation. We retain the N-best paths by beam search to reduce the complexity of the candidate lattice. Experiments on horizontal text lines extracted from the Kondate database show that the proposed method maintains recognition accuracy while halving recognition time.

Keywords-online handwritten text recognition; integrated segmentation and recognition; lattice refinement; noncharacter patterns.

I. INTRODUCTION
Online handwritten text recognition has received considerable attention owing to the development and growing popularity of pen-based input devices with large writing areas, such as tablet PCs, electronic whiteboards and digital pens (e.g., Anoto pen). With these devices, people tend to write text continuously with few constraints. To meet the needs of real applications, however, further work is still needed to improve the performance of handwritten text recognition.

Because the space between characters varies and many characters in the languages of Chinese origin are composed of multiple radicals with internal gaps, online handwritten Japanese text recognition faces the difficulty of character segmentation, which isolated character recognition does not. The integrated segmentation and recognition method has been widely used to overcome the ambiguity of character segmentation [1]. Such methods are classified into segmentation-free methods and over-segmentation-based methods [2]. The over-segmentation-based method can exploit character shapes during recognition, and it has therefore been applied successfully to handwritten Chinese/Japanese text recognition [3], [4].

In the over-segmentation-based recognition framework, a handwritten text (character string) is first over-segmented into primitive segments, and consecutive segments are concatenated into candidate character patterns. A character classifier associates each candidate pattern with a number of candidate classes and confidence scores. Then, a segmentation-recognition candidate lattice is constructed by combining all candidate patterns and character classes. Finally, by integrating the scores of character recognition, geometric context and linguistic context, the optimal segmentation-recognition path is found in the constructed lattice by Viterbi search.

In over-segmentation, we usually keep all possible segmentation points to guarantee a high recognition rate, although this complicates the lattice and consequently increases the computation of string recognition, especially for partially overlapped characters [5]. As a result, there are many candidate character patterns. Because Japanese has thousands of character classes, Zhu et al. [6] proposed applying a coarse classifier to candidate patterns before fine classification to speed up online handwritten Japanese character recognition.

Since the candidate patterns in the lattice include both true character and noncharacter patterns, the character classifier or path evaluation should be resistant to noncharacter patterns. Liu et al. [1] evaluated several classifiers for candidate patterns with noncharacter training in the context of handwritten numeral strings, and showed that training with noncharacter samples improves neural classifiers and support vector classifiers. However, noncharacter training has limited influence on the performance of discriminative density models: the learning vector quantization classifier and the discriminative learning quadratic discriminant function classifier.

Moreover, Li et al. [7] proposed a probabilistic model to evaluate segmentation hypotheses of a text line by
Figure 1. Flow of handwritten Japanese text recognition system: candidate lattice construction; candidate lattice refinement (character verifier, class-independent geometric context); path search and recognition (character recognition, linguistic context, class-dependent geometric context); output of the optimal result string.
Figure 2. Over-segmentation of the same text line: (a) strict case, (b) loose case.
TABLE I. FEATURES EXTRACTED FROM AN OFF-STROKE.
No.        Definition
f1         DBx / acs
f2, f3     Dbx / acs, Dby / acs
f4, f5     Dbsx / acs, Dbsy / acs
f6         Dbs / acs
f7         Loff / acs
f8, f9     sine(Loff), cosine(Loff)
f10        DBx / the maximum DBx in text line
f11        D / acs

TABLE II. TERMS TO DERIVE FEATURES.
Term       Description
acs        Average character size of text line
Bbp        Bounding box of immediately preceding stroke
Bbs        Bounding box of immediately succeeding stroke
BBp_all    Bounding box of all preceding strokes
BBs_all    Bounding box of all succeeding strokes
BSp        Bounding box of primitive segment which includes immediately preceding stroke
BSs        Bounding box of primitive segment which includes immediately succeeding stroke
DBx        Distance between BBp_all and BBs_all in x-axis
Dbx        Distance between Bbp and Bbs in x-axis
Dby        Distance between Bbp and Bbs in y-axis
Dbsx       Distance between centers of Bbp and Bbs in x-axis
Dbsy       Distance between centers of Bbp and Bbs in y-axis
Dbs        Distance between centers of Bbp and Bbs
D          Distance between BSp and BSs in x-axis
Loff       Length of off-stroke

For SVM classification, we set the target value of a segmentation point to 1 and that of a non-segmentation point to -1, and train the SVM using training patterns of off-strokes.

B. Candidate lattice construction
The combination of all possible candidate character patterns is represented by a candidate lattice, as shown in Fig. 3, where each node (rectangle) denotes a candidate character pattern. Each path from the start segmentation point to the end segmentation point in the lattice denotes a segmentation hypothesis for a handwritten text line.

Figure 4. Refined lattice for the loose case of the candidate lattice as shown in Fig. 3(b).

C. Candidate lattice refinement
Because the constructed candidate lattice contains many noncharacter patterns, especially under loose over-segmentation, text recognition becomes computationally expensive. To speed up text recognition, we keep only the N-best segmentation paths by beam search, combining the probability of a character verifier using noncharacter samples, the class-independent unary and binary geometric context, and character segmentation. We use the retained N-best segmentation paths to reconstruct a downsized candidate lattice with few candidate character patterns. Fig. 4 shows the lattice refined from the lattice in Fig. 3(b), where the thickly marked path is the correct segmentation path. Sec. III describes the lattice refinement process in detail.

D. Path search and recognition
Each candidate character pattern in the refined candidate lattice is associated with a number of candidate classes with confidence scores by character classification. Then, all retained segmentation paths and recognition candidate classes are represented by a segmentation-recognition candidate lattice.
We use the path evaluation criterion proposed by Zhu et al. [3] to re-evaluate paths in the refined candidate lattice and search for the optimal string result by the Viterbi algorithm. This criterion combines the scores of character recognition, character size, inner gap, single-character position, pair-character position, linguistic context, and character segmentation, with weighting parameters estimated by the genetic algorithm.
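The candidate lattice construction and path search described above can be illustrated with a minimal sketch. It assumes that a node is simply a run of consecutive primitive segments, that at most `max_span` segments form one character, and that each node has a single log-score returned by a hypothetical `toy_score` function. None of these names come from the paper, and the actual criterion of [3] also includes class-dependent and linguistic terms that are omitted here.

```python
from typing import Callable, List, Optional, Tuple

# A node is a candidate character pattern covering primitive segments [start, end).
Node = Tuple[int, int]


def build_lattice(n_segments: int, max_span: int) -> List[Node]:
    """Enumerate every run of 1..max_span consecutive primitive segments."""
    return [
        (start, end)
        for start in range(n_segments)
        for end in range(start + 1, min(start + max_span, n_segments) + 1)
    ]


def best_path(
    n_segments: int,
    nodes: List[Node],
    node_score: Callable[[Node], float],
) -> Tuple[float, List[Node]]:
    """Viterbi-style dynamic programming over segmentation points 0..n_segments."""
    neg_inf = float("-inf")
    best = [neg_inf] * (n_segments + 1)  # best score reaching each point
    back: List[Optional[Node]] = [None] * (n_segments + 1)
    best[0] = 0.0
    # Sorting by start guarantees best[start] is final before it is extended.
    for start, end in sorted(nodes):
        if best[start] == neg_inf:
            continue  # this segmentation point is unreachable
        candidate = best[start] + node_score((start, end))
        if candidate > best[end]:
            best[end] = candidate
            back[end] = (start, end)
    # Trace back from the end point to recover the best segmentation.
    path: List[Node] = []
    pos = n_segments
    while pos > 0:
        node = back[pos]
        if node is None:
            raise ValueError("lattice has no complete path")
        path.append(node)
        pos = node[0]
    path.reverse()
    return best[n_segments], path


# Toy data (assumed): four primitive segments, three true characters.
TRUE_CHARS = {(0, 2), (2, 3), (3, 4)}


def toy_score(node: Node) -> float:
    """Hypothetical log-score: high for true characters, low otherwise."""
    return -0.1 if node in TRUE_CHARS else -2.0


lattice = build_lattice(n_segments=4, max_span=3)
score, path = best_path(4, lattice, toy_score)
print(len(lattice), score, path)
```

Each start-to-end path in `lattice` corresponds to one segmentation hypothesis. The dynamic program visits every node once, so its cost grows with the number of nodes, which is why removing nodes before this step shortens recognition time.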
Figure 3. Candidate lattice based on the over-segmentation as shown in Fig. 2: (a) strict case, (b) loose case.

III. CANDIDATE LATTICE REFINEMENT
This section describes the process of candidate lattice refinement.
Consider a handwritten text pattern that is over-segmented and grouped into a sequence of candidate character patterns $X = x_1, x_2, \ldots, x_n$. Using the following evaluation criterion, we obtain the N-best segmentation paths in the candidate lattice.
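\[
f(X) = \sum_{i=1}^{n} \left( \lambda_1 k_i \log P(\mathrm{true} \mid x_i) + \lambda_2 \log P(p_i^{u} \mid x_i) + \lambda_3 \log P(p_i^{b} \mid x_{i-1}, x_i) + \lambda_4 \log P(g_{j_i} \mid S_b) + \lambda_5 \sum_{j=j_i+1}^{j_i+k_i-1} \log P(g_j \mid S_w) \right) + \lambda n \tag{1}
\]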
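The N-best retention by beam search can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function `pair_score(prev, node)` is a hypothetical stand-in for the per-character terms of the evaluation criterion (it receives the preceding node so that a binary, pairwise term could be included), and the beam is applied at every segmentation point. The toy scores are invented for the example.

```python
import heapq
from typing import Callable, Dict, List, Optional, Set, Tuple

Node = Tuple[int, int]                    # primitive segments [start, end)
Path = Tuple[float, Tuple[Node, ...]]     # (accumulated score, nodes so far)


def n_best_paths(
    n_segments: int,
    nodes: List[Node],
    pair_score: Callable[[Optional[Node], Node], float],
    beam_width: int,
) -> List[Path]:
    """Keep at most beam_width partial paths per segmentation point."""
    by_start: Dict[int, List[Node]] = {}
    for node in nodes:
        by_start.setdefault(node[0], []).append(node)

    beams: List[List[Path]] = [[] for _ in range(n_segments + 1)]
    beams[0] = [(0.0, ())]
    for pos in range(n_segments):
        # Prune: only the best beam_width partial paths ending here survive.
        beams[pos] = heapq.nlargest(beam_width, beams[pos], key=lambda p: p[0])
        for total, path in beams[pos]:
            prev = path[-1] if path else None
            for node in by_start.get(pos, []):
                extended = (total + pair_score(prev, node), path + (node,))
                beams[node[1]].append(extended)
    return heapq.nlargest(beam_width, beams[n_segments], key=lambda p: p[0])


def refine_lattice(paths: List[Path]) -> Set[Node]:
    """The refined lattice keeps only nodes that lie on a retained path."""
    return {node for _, path in paths for node in path}


# Toy data (assumed): four primitive segments, spans of up to three segments.
all_nodes = [(s, e) for s in range(4) for e in range(s + 1, min(s + 3, 4) + 1)]
true_chars = {(0, 2), (2, 3), (3, 4)}


def toy_pair_score(prev: Optional[Node], node: Node) -> float:
    """Hypothetical score; a real one would combine verifier and geometry."""
    return -0.1 if node in true_chars else -2.0


kept = n_best_paths(4, all_nodes, toy_pair_score, beam_width=2)
refined = refine_lattice(kept)
print(len(all_nodes), len(refined), kept[0])
```

A small beam width removes more nodes but raises the risk that the correct segmentation path is pruned, so the width trades recognition time against accuracy.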
TABLE III. STATISTICS OF TEXT LINES.
Dataset        Text lines   Character patterns   Character classes   Characters per line
Training set   10,174       104,093              1,106               10.23
Testing set    2,058        26,863               686                 13.05

\[
F = \frac{2}{1/R + 1/P}, \qquad
R = \frac{\text{number of correctly detected segmentation points}}{\text{number of true segmentation points}}, \qquad
P = \frac{\text{number of correctly detected segmentation points}}{\text{number of detected segmentation points (including false)}} \tag{2}
\]
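As a worked example of the segmentation measures, the sketch below computes recall R, precision P and the F-measure from three counts. The function name and the counts in the usage lines are hypothetical and are not results from the paper.

```python
from typing import Tuple


def segmentation_f_measure(
    n_correct: int, n_true: int, n_detected: int
) -> Tuple[float, float, float]:
    """Return (R, P, F) for detected segmentation points.

    n_correct:  correctly detected segmentation points
    n_true:     true segmentation points
    n_detected: all detected segmentation points, including false ones
    """
    if n_true <= 0 or n_detected <= 0:
        raise ValueError("n_true and n_detected must be positive")
    recall = n_correct / n_true
    precision = n_correct / n_detected
    if recall == 0.0 or precision == 0.0:
        return recall, precision, 0.0  # harmonic mean is defined as 0 here
    f_measure = 2.0 / (1.0 / recall + 1.0 / precision)
    return recall, precision, f_measure


# Hypothetical counts, chosen only to illustrate the arithmetic.
r, p, f = segmentation_f_measure(n_correct=90, n_true=100, n_detected=120)
print(r, p, f)
```

F is the harmonic mean of R and P, so it stays low unless both are high. Over-segmenting loosely raises R at the expense of P.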
Figure 6 shows some misrecognition examples of handwritten Japanese text. The recognition errors are mainly due to the following reasons. First, the character classifier does not give the correct class within its top 10 candidates. Second, the correct segmentation-recognition path is not found in the path search and text recognition, although the refined lattice includes the correct segmentation path. Third, the correct segmentation path is not among the N-best paths retained by the lattice refinement.

V. CONCLUSION
In this paper, we proposed a method to refine the candidate lattice for online handwritten Japanese text recognition. In the lattice refinement, we evaluate segmentation hypotheses by combining the scores of the character verifier using noncharacter samples, the class-independent unary and binary geometric context, and character segmentation. The experiments on horizontal Japanese text lines showed the effect of the candidate lattice refinement.
In future work, we will pursue the following to make the proposed text recognition method more robust. First, we will optimize the parameters in the evaluation criterion of lattice refinement by the genetic algorithm. Second, to better verify candidate character patterns, we will cluster true characters into several super-classes according to geometric features such as size and position, and then classify them together with noncharacters using another classifier such as MQDF.

ACKNOWLEDGMENT
This research is partially supported by Grant-in-Aid for Scientific Research C-15K00225 and by NEDO under the number 27J1103.

REFERENCES
[1] C.-L. Liu, H. Sako and H. Fujisawa, “Effects of classifier structures and training regimes on integrated segmentation and recognition of handwritten numeral strings,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 26, no. 11, pp. 1395-1407, 2004.
[2] B. Zhu and M. Nakagawa, “Online handwritten Chinese/Japanese character recognition,” Advances in Character Recognition, InTech, Chapter 3, pp. 51-68, 2012.
[3] B. Zhu, X.-D. Zhou, C.-L. Liu and M. Nakagawa, “A Robust Model for On-line Handwritten Japanese Text Recognition,” International Journal on Document Analysis and Recognition (IJDAR), vol. 13, no. 2, pp. 121-131, 2010.
[4] D.-H. Wang, C.-L. Liu, and X.-D. Zhou, “An approach for real-time recognition of online Chinese handwritten sentences,” Pattern Recognition, vol. 45, no. 10, pp. 3661-3675, 2012.
[5] J. Liang, B. Zhu, T. Kumagai and M. Nakagawa, “Character-Position-Free On-line Handwritten Japanese Text Recognition by Two Segmentation Methods,” IEICE Trans. on Information and Systems, vol. E99-D, no. 4, pp. 1172-1181, 2016.
[6] B. Zhu and M. Nakagawa, “A Robust Method for Coarse Classifier Construction from a Large Number of Basic Recognizers for On-line Handwritten Chinese/Japanese Character Recognition,” Pattern Recognition, vol. 47, pp. 685-693, 2014.
[7] N.X. Li and L.W. Jin, “A Bayesian-based method of unconstrained handwritten offline Chinese text line recognition,” International Journal on Document Analysis and Recognition (IJDAR), vol. 16, no. 1, pp. 17-31, 2013.
[8] X.-D. Zhou, C.-L. Liu, and M. Nakagawa, “Online handwritten Japanese character string recognition using conditional random fields,” in Proc. 10th ICDAR, pp. 521-525, 2009.
[9] F. Yin, Q.-F. Wang, and C.-L. Liu, “Transcript mapping for handwritten Chinese documents by integrating character recognition model and geometric context,” Pattern Recognition, vol. 46, no. 10, pp. 2807-2818, 2013.
[10] Y.-C. Wu, F. Yin and C.-L. Liu, “Evaluation of geometric context models for handwritten numeral string recognition,” in Proc. 14th ICFHR, pp. 193-198, 2014.
[11] T. Matsushita and M. Nakagawa, “A database of on-line handwritten mixed objects named “Kondate”,” in Proc. 14th ICFHR, pp. 369-374, 2014.
[12] C.-L. Liu and X.-D. Zhou, “Online Japanese character recognition using trajectory-based normalization and direction feature extraction,” in Proc. 10th ICFHR, pp. 217-222, 2006.
[13] B. Zhu, J. Gao and M. Nakagawa, “Objective function design for MCE-based combination of on-line and off-line character recognizers for on-line handwritten Japanese text recognition,” in Proc. 11th ICDAR, pp. 594-599, 2011.
[14] M. Nakagawa and K. Matsumoto, “Collection of on-line handwritten Japanese character pattern databases and their analysis,” International Journal on Document Analysis and Recognition (IJDAR), vol. 7, no. 1, pp. 69-81, 2004.