L. Schomaker c.s., Abstracts
(These are handwriting recognition-related papers, there are
others)
Schomaker, L.R.B., & Thomassen, A.J.W.M. (1986).
On the use and limitations of averaging handwriting signals.
In H.S.R. Kao, G.P. van Galen, & R. Hoosain (Eds.),
Graphonomics: Contemporary research in handwriting
(pp. 225-238). Amsterdam: North-Holland.
The averaging of handwriting signals is subject to a
number of special restrictions that in general do not apply to
other signals such as electrophysiological recordings.
Specific problems are presented by the choice of the entities
to be averaged, by the choice of time-reference points and by
duration variability. Knowledge of the signal production
mechanism and of temporal and spatial characteristics of the
handwriting signal is needed to solve these questions.
It is noted that the signal is deterministic at the local
level, which justifies the use of averaging techniques.
The problem of stroke duration variability is dealt with
by applying time-axis normalization prior to averaging.
Examples of averaging at the stroke, letter and word
level are presented. Results indicate that at up to four
letters can be averaged without noticeable distortion in the
spatial domain. Eventually, however, variability in stroke
duration will dictate the choice of time-reference points
for the time-axis normalization in multi-letter handwriting
segments. This will 'warp' the original time path. If care is
taken in the selection of time references, time-axis
normalization and averaging can be useful in movement analysis,
pattern matching and simulation of handwriting.
Thomassen, A.J.W.M., & Schomaker, L.R.B. (1986).
Between-letter context effects in handwriting trajectories.
In H.S.R. Kao, G.P. van Galen, & R. Hoosain (Eds.),
Graphonomics: Contemporary research in handwriting
(pp. 253-272). Amsterdam: North-Holland.
In fluent, cursive handwriting, context effects may appear at
various levels. First, at the highest, central level, specific
allographs (defined as alternative shapes for the same character, or
grapheme) may be selected from motor memory when they are to be
produced in the neighbourhood of specific other allographs,
or following a space. Second, at an intermediate level, there
may be mutual influence amongst neighbouring character representations
during their temporary storage immediately preceding their execution.
Third, the dynamics prevailing during the actual performance of
a writing task may further influence the execution of the selected
allographs. The present study investigates context effects of the
two latter kinds. Specific hypotheses regarding the modification of
spatial features of strokes as a function of adjacent strokes are
tested both within letters and between letters in cursively written
words. Testing is done by using a newly developed averaging technique.
The results are interpreted in terms of performance effects
similar to those coarticulation in speech.
Teulings, H.-L., Schomaker, L.R.B., Morasso, P., & Thomassen, A.J.W.M.
(1987). Handwriting-analysis system. In R. Plamondon, C.Y. Suen, J.-G.
Desche^nes, & G. Poulin (Eds.), Proceedings of the Third International
Symposium on Handwriting and Computer Applications (pp. 181-183). Montreal:
Ecole Polytechnique. ISBN 2-553-00197-5.
Teulings, H.-L., Thomassen, A.J.W.M., Schomaker, L.R.B.,
& Morasso, P. (1987).
Experimental protocol for cursive script acquisition: The use of motor
information for the automatic recognition of cursive script.
Report 3.1.2., ESPRIT Project 419.
Maarse, F.J., Schomaker, L.R.B., & Teulings, H.-L. (1988).
Automatic identification of writers.
In G.C. van der Veer & G. Mulder (Eds.),
Human-Computer Interaction: Psychonomic Aspects
(pp. 353-360). New York: Springer.
A major problem in the automatic recognition of handwriting, as in
speech recognition, appear to be the inter-individual differences.
Although the use of feature extraction and abstract coding can
alleviate the problem to some extent, the range of idiosyncratic
handwriting properties is so wide that an early-stage narrowing of the
domain of possible recognition solutions is necessary. This can be done
by extracting global features from the handwriting and automatic
identification of the person or class of hand writers before the actual
recognition stage starts. In this paper, a list of useful handwriting
parameters for person identification is presented. It is shown in what
way person identification can be added to a handwriting recognition
system. As an illustration, a method is described in which persons where
identified on the basis of global characteristics of a single line of
handwriting.
Teulings, H.L., Schomaker, L.R.B., & Maarse, F.J. (1988).
Automatic handwriting
recognition and the keyboard-less personal computer. In F.J. Maarse, L.J.M.
Mulder, W.P.B. Sjouw, & A.E. Akkerman (Eds.),
Computers in psychology: Methods, instrumentation,
and psychodiagnostics
(pp. 62-66). Amsterdam: Swets & Zeitlinger.
Thomassen, A.J.W.M, Teulings, H.-L., Schomaker, L.B.R., Morasso, P.,
& Kennedy, J. (1988)
Towards the implementation of cursive-script understanding in an on-line
handwriting-recognition system. In D.G. XIII (Ed.), ESPRIT '88:
Putting the technology to use (pp. 628-639). Amsterdam: North-Holland.
As a preliminary to the prototype of an on-line handwriting-recognition
system, a flexible series of low-level analysis procedures has been
developed for the processing of totally unconstrained cursive script.
The present paper describes these procedures in a non-technical fashion.
A distinction is made between (1) continuous procedures yielding estimates
of the global properties of the handwriting signal and (2) discrete procedures
which deal with appropriately parse segments of handwriting. The latter
procedures are based in part on knowledge of the human motor system
which is responsible for the generation of natural cursive script.
Low-level analysis is achieved by a set of independent modules in a
parallel control structure and acting in an iterative mode. Both bottom-up
and top-down information streams are incorporated in this approach.
Thomassen, A.J.W.M., Teulings, H.-L., & Schomaker, L.R.B. (1988).
Real-time processing of cursive writing and sketched graphics.
In G.C. van der Veer & G. Mulder (Eds.),
Human-Computer Interaction: Psychonomic Aspects
(pp. 334-352). New York: Springer.
Thomassen, A.J.W.M., Teulings, H.L., Schomaker, L.R.B., Morasso, P., &
Kennedy, J. (1988). Towards the implementation of cursive-script understanding
in an on-line handwriting-recognition system. In Commission of the European
Communities: D.G. XIII (Ed.),
ESPRIT '88: Putting the technology to use. Part 1
(pp. 628-639). Amsterdam: North-Holland.
Schomaker, L.R.B., Thomassen, A.J.W.M., & Teulings, H.-L. (1989).
A computational model of cursive handwriting. In R. Plamondon, C.Y. Suen,
& M.L. Simner (Eds.), Computer Recognition and Human
Production of Handwriting (pp. 153-177). Singapore: World Scientific.
This paper presents a computational model for the production
of handwriting, starting with allograph codes as input and
ending with a target pen-tip trajectory as output. In the model,
a distinction is made between a symbolic level of processing and
a quantitative level of processing. At the symbolic level, a
grammar for the connection of cursive allographs determines
abstract codes for connecting strokes. At the quantitative
level, a translation of symbols into a sequence of parametrized
strokes takes place. A parsimonious stroke parametrization in
the velocity domain is used, that is based on planning in Cartesian
space and allocation of time to movement components along the
spatial axes. With the basic model settings used, simulation
results already show a satisfactory correspondence with original
handwriting samples.
Schomaker, L.R.B. & Plamondon, R. (1990).
The Relation between Pen Force and Pen-Point Kinematics in Handwriting.
Biological Cybernetics, 63,
277-289.
This study investigates the spectral coherence and time-domain correlation
between pen pressure (axial pen force, APF) and several kinematic variables in
drawing simple patterns and in writing cursive script. Two types of theories are
prevalent: "biomechanical" and "central" explanations for the force variations
during writing. Findings show that overall coherence is low $(<0.5)$ and
decreases with pattern complexity, attaining its lowest value in cursive
script. Looking at subjects separately, it is found that only in a small
minority of writers "biomechanical coupling" between force and displacement
takes place in cursive handwriting, as indicated by moderate to high negative
overall correlations. The majority of subjects displays low coherence and
correlation between kinematics and APF. However, APF patterns in cursive script
reveal a moderate to high replicatability, giving support to the notion of a
"centrally" controlled pen pressure. The sign of the weak residual average
correlation between APF and finger displacement, and between APF and wrist
displacement is negative. This indicates that small biomechanical effects may
be present, a relatively higher APF corresponding to finger flexion and wrist
radial abduction. On the whole, however, variance in APF cannot be explained by
kinematic variables. A motor task demanding mechanical impedance control, such
as handwriting, apparently introduces a complexity that is not easily explained
in terms of a passive mass-spring model of skeleto-muscular movement.
Schomaker, L.R.B., & Teulings, H.-L. (1990).
A Handwriting Recognition System based on the Properties and Architectures
of the Human Motor System.
Proceedings of the International Workshop on Frontiers in Handwriting
Recognition (IWFHR) . (pp. 195-211). Montreal: CENPARMI Concordia.
ISBN: 1-895193-00-1.
The human reader of handwriting is unaware of
the amount of back-ground knowledge that is constantly being used
by a massive parallel computer, his brain, to decipher cursive
script. Artificial cursive script recognizers do not have access
to a comparable source of knowledge or of comparable
computational power to perform top-down processing. Therefore, in
an artificial script recognizer, there is a strong demand for
reliable bottom-up processing. For the recognition of
unrestricted script consisting of arbitrary character sequences,
on-line recorded handwriting signals offer a more solid basis
than the optically obtained grey-scale image of a written pen
trace, because of the temporal information and the inherent
vectorial description of shape. The enhanced bottom-up processing
is based on implementing knowledge of the motor system in the
handwriting recognition system. The bottom-up information will
already be sufficient to recognize clearly written and
unambiguous input. However, ambiguous shape sequences, such as
$m$ vs $n..$ or $d$ vs $cl$, and sloppy
stroke patterns still
require top-down processing. The present paper discusses the
handwriting recognition system as being developed at the NICI.
The system contains six major modules: (1) On-line digitizing,
pre-processing of the movements and segmentation into strokes. (2)
Normalization of global handwriting parameters. (3) Extraction of
motorically invariant, real-valued, feature values per stroke to
form a multi-dimensional feature vector and subsequent feature vector
quantization by a self-organizing two-dimensional Kohonen network.
(4) Allograph construction, using a second network of transition probabilities
between cell activation patterns of the Kohonen network. (5) Optional word
hypothesization. (6) The system has to be trained by supervised
learning, the user indicating prototypical stroke sequences and their
symbolic interpretation (letter or N-gram naming).
Teulings, H.L., Schomaker, L.R.B., Gerritsen, J., Drexler, H., &
Albers, M. (1990).
An on-line handwriting-recognition system based on unreliable modules.
In R. Plamondon, & G. Leedham (Eds.),
Computer Processing of Handwriting (pp. 167-185).
Singapore: World Scientific.
In the automatic recognition of
unrestricted handwriting the ambiguities can be solved by
top-down processing. However, automatic systems never have
access to the extended background knowledge available to human
readers. In order to replace this higher-level information we
need to improve the reliability of the bottom-up processing. A
handwriting-recognition system can be split up into six discrete
blocks: (1) digitizing, word segmentation, pre-processing, and
segmentation into strokes, (2) normalization of global
handwriting parameters, (3) extraction of features per stroke,
(4) allograph recognition, (5) optional word hypothesization,
and, in order to allow recognition (6) a learning phase. The
present paper discusses the design of three of these processing
blocks: normalization, allograph recognition, and learning and
briefly specifies feature extraction. Normalization concerns
orientation, size, and slant. However, various alternative
algorithms can be chosen and some algorithms yield more reliable
results than others. A mechanism is proposed that will, sooner or
later, find the most appropriate normalization algorithms.
Consequently, the features extracted from each stroke in the
handwriting pattern will be more uniform within a writer and even
between writers. In the recognition phase, handwriting patterns
are segmented into allographs using an algorithm that can handle
allographs with various numbers of strokes and with optional
connection strokes between them. In order to teach the recognizer
the allographs a method has been designed that builds
non-interactively a lexicon of allographs by automatically
discovering the allographs in a large corpus of cursive script.
Schomaker, L.R.B. (1991).
Simulation and Recognition of Handwriting Movements,
Doctoral Dissertation (NICI TR-91-03).
Nijmegen University, The Netherlands.
This study concerns the processes that take place from the moment that a
writer wants to write down a given word, until one can inspect the
finished result. The approach followed is based on
the assumption that new insights can be gained by trying to build a
working generative computer model of handwriting. Chapter 1 deals with
the theoretical aspects of modeling processes of motor control.
Chapter 2 discusses pen-tip kinematics during
cursive writing: How reproducible are replications of writing movements
recorded on different occasions? Only if movements are actually
reproducible, it makes sense to develop a handwriting production model.
Chapter 3 presents a symbolic generative model of handwriting.
The problem that has to be solved concerns the transformation of
discrete entities, i.e., the symbolic representation of a planned letter
shape (allograph), into a continuous multi-dimensional time function:
Pen-tip movement. This problem is tackled with the
assumption that velocity-based strokes are the basic segments in handwriting.
In the model, a parsimonious parametrization of the strokes is
used, which is based on transforming a shape factor into differential
timing. Consequently, a grammar
providing rules for generating connecting strokes between two planned
letters is proposed. In Chapter 4, a kinetic aspect of writing is studied:
What happens to axial pen force during the production of several types
of movement patterns. It appears that pen-force
fluctuations are not a passive biomechanical phenomenon.
Pen-force control and compliance appear to be embedded in the
"motor programs" for letter production, in an idiosyncratic,
writer-dependent fashion. In Chapter 5, a change of perspective takes
place. It is noted that there are some limitations inherent to a
symbolical modeling approach, especially with respect to low-level
processes in handwriting control. A review of basic artificial
neural-network models is presented and their potential use both in
modeling handwriting movement control and in handwriting recognition is
assessed. In the chapters 6-8, three basic issues are raised with
respect to motor modeling: The coding of quantity, the representation of
time, and the representation of the effector system by neurally inspired
models. A final interesting and relevant problem is computer recognition
of handwriting movements which is the focus of Chapter 9.
An algorithm is proposed which performs recognition by actively
constructing letter (allograph) hypotheses on the basis of chains of
individual strokes, instead of storing prototypical allographs and
performing template matching.
Schomaker, L.R.B., (1991).
Stroke- versus Character-based Recognition of On-line, Connected
Cursive Script. In G.E. Stelmach (Ed.), Proceedings of the Fifth
Handwriting Conference of the IGS: Motor Control of Handwriting (pp. 213-217).
Tempe AZ: Arizona State University.
Schomaker, L.R.B., & Teulings, H.-L. (1992).
Stroke- versus Character-based Recognition of On-line, Connected
Cursive Script. In J.-C. Simon & S. Impedovo (Ed.),
From Pixels to Features III (pp. 313-325),
Amsterdam: North-Holland.
Teulings, H.-L., & Schomaker, L.R.B. (1992).
Un-supervised learning of prototype allographs in cursive script recognition
using invariant handwriting features.
In J.-C. Simon & S. Impedovo (Ed.),
From Pixels to Features III (pp. 61-73), Amsterdam: North-Holland.
An on-line cursive-script recognizer must first obtain knowledge about
the letters to be distinguished. This knowledge may be based upon a common
cursive-script style or it may be extracted from a large corpus of real
handwriting. The latter approach seems more flexible when different styles
of handwriting and large vocabularies are concerned. In order to train
a recognizer, the (velocity-based) strokes belonging to separate letters
in their context have to be identified. For that purpose, a corpus of
someones normal handwriting is available together with the letters of
the words. In this paper, various methods of segmenting on-line recorded
cursive script are briefly reviewed. A promising method of unsupervised
learning, inspired by simulated annealing or relaxation has been implemented
and tested. Only those variations that tend to yield consistent patterns
of stroke features for each letter, are accepted. The simulated-annealing
procedure is valuable but not perfect. It is concluded that a combination
of knowledge sources may be combined to obtain acceptable reliability.
Schomaker, L.R.B. (1992).
A neural-oscillator model of temporal pattern generation.
Human Movement Science, 11, 181-192.
Most contemporary neural network models deal with essentially static,
perceptual problems of classification and transformation. Models such
as multi-layer feed-forward perceptrons generally do not incorporate
time as an essential dimension, whereas biological neural networks are
inherently temporal systems. In modelling motor behaviour, however, it
is essential to have models that are able to produce temporal patterns
of varying duration and complexity. A model is proposed, based on a
network of pulse oscillators consisting of neuron/interneuron (NiN)
pairs. Due to the inherent temporal properties, a simple NiN net,
taught by a pseudo-Hebbian learning scheme, could be used in simulating
handwriting pen-tip displacement of individual letters.
Keywords: spike oscillator, spiking oscillator, neural pulse oscillator network
Abbink, G.H., H.-L. Teulings, & L.R.B. Schomaker (1993).
Description of on-line script using Hollerbach's generation model.
Proceedings of the Third International Workshop on Frontiers in
Handwriting Recognition (IWFHR-3), Buffalo, USA: CEDAR,
May 25-27, (pp. 217-224).
In most on-line cursive script recognizers, the pen trajectory is
chopped into smaller basic elements. These basic elements are identified
and combined in order to recognize the pen-tip trajectory as a whole.
Much research seems to be devoted to the question of how these basic
elements should be represented. In the present research, the basic elements
are so-called 'dual strokes', two successive velocity-based strokes
which form shapes that can be described well with Hollerbach's model
for cursive-script generation. Once such a dual-stroke description is
retrieved, various features can be estimated by analytical expressions.
As an example, a derivation of an expression of the curvature at the middle
segmentation point of a 'dual stroke' is given. A preliminary version of
a dual-stroke cursive script recognizer showed good results for
oscillating types of connected-cursive handwriting.
Helsper E & Schomaker LR (1993). Off-line and on-line handwriting
recognition: the role of pen movement in machine reading.
Optical Character
Recognition in the Historical Discipline: Proceedings of an International
Work group organized by NHDA and NICI. Goettingen: Max-Planck Institut fuer
Geschichte, pp. 39-51.
In computer recognition of text, there are a number of levels of
increasing difficulty: (1) printed text, (2) handwritten and fully
separated letters and digits, and (3) connected-cursive handwriting.
Software for machine reading of printed text is commercially available.
Techniques for reading isolated handwritten characters and digits are
already in use for the processing of bank checks and postal codes on
envelopes. Most writings, however, including
all historical material are produced with the purpose of being read
by humans, not machines. The most challenging area in this respect
is the automatic processing of scanned cursive script. This paper
shows illustrations of stroke-order reconstruction from off-line
script samples.
Helsper, E., Schomaker, L., & Teulings, H.L. (1993).
Tools for the recognition of handwritten historical documents.
History and Computing, 5(2), 88-93.
An optical cursive script recognition system for historical
documents faces the problem of recognizing handwritten
words on paper, including interfering lines and blots.
Recognition would be greatly simplified if the original
pen movements were available. Therefore, a technique is
proposed to recover pen movements from a scanned image.
The image of a writing pattern is thus translated into
an ordered sequence of strokes, where each stroke can be
classified by its shape. We use a simulated neural network
to learn the associations between pen position and current
direction on the one hand, and its desired direction and
speed on the other hand. The network uses no knowledge about
what letters are present. For the training phase, we use words
of which both bit images and recorded pen trajectories are
available. After training, the network can retrace the
strokes, given some starting points. Having retraced all
strokes,, one or more chains of strokes can be reconstructed.
Within these chains, the order of the strokes is known, and
chains are assumed to be ordered in time from left to right.
Improving robustness to cope with different kinds of scripts
will be the next step.
Schomaker, L.R.B. (1993).
Using Stroke- or Character-based Self-organizing Maps
in the Recognition of On-line, Connected Cursive Script.
Pattern Recognition , 26(3), 443-450.
In this study, comparisons are made between a number of
stroke-based and character-based recognizers of connected cursive script. In
both approaches a Kohonen self-organizing neural network is used as a
feature-vector quantizer. It is found that a "best match only" character-based
recognizer performs better than a "best match only" stroke-based recognizer at
the cost of a substantial increase in computation. However, allowing up to
three multiple stroke interpretations yielded a much larger improvement on the
performance of the stroke-based recognizer. Within the character-based
architecture, a comparison is made between temporal and spatial resampling of
characters. No significant differences between these resampling methods were
found. Geometrical normalization (orientation, slant) did not significantly
improve the recognition. Training sets of more than 500 cursive words appeared
to be necessary to yield acceptable performance.
Schomaker, L.R.B., Teulings, H.-L., Helsper, E.H., & Abbink, G.H. (1993).
Adaptive recognition of on-line, cursive handwriting.
Proceedings of the Sixth International Conference on Handwriting and
Drawing. Paris, July, 4-7: Telecom, (pp. 19-21).
Teulings, H.L., & Schomaker, L.R.B. (1993).
Invariant properties between stroke features in handwriting.
Acta Psychologica, 82, 69-88.
A handwriting pattern is considered as a sequence of ballistic strokes.
Replications of a pattern may be generated from a single, higher-level
memory representation, acting as a motor program. Therefore, those
stroke features which show the mot invariant pattern are probably
related to the parameters of the higher-level representation, whereas
the more noisy features are probably related to the parameters derived
at the lower levels (top-down hierarchy). This hierarchy of invariances
can be revealed by the signal-to-noise (SNR) ratio, the between-parameter
correlations and the between-condition correlations. Similarly, at the
higher level, a sequence of strokes may act as a unit from which
individual strokes are derived (sequence hierarchy). This second type of
hierarchy of invariances can be revealed by the between-stroke correlation
patterns. The present research largely confirmed the top-down
hierarchy, even for upward and downward strokes, separately. Downward
strokes were more invariant than upward strokes in terms of vertical
stroke size. However, contrary to vertical stroke size, the horizontal
stroke size cannot be considered to be an invariant. Furthermore,
both vertical and horizontal sizes shows substantial between-stroke
correlation. In contrast, the stroke durations did not show any between-stroke
correlation. This suggests that velocity-based stroke segmentation
is reliable, in spite of its discrete partitioning of the handwriting movements.
Guyon, I., Schomaker, L., Plamondon, R., Liberman, M. and Janet, S. (1994).
UNIPEN project of on-line data exchange and recognizer benchmarks,
Proceedings of the 12th International Conference on Pattern Recognition,
ICPR'94, pp. 29-33, Jerusalem, Israel, October 1994. IAPR-IEEE.
In this paper, we report on the status of the UNIPEN project of data exchange
and recognizer benchmarks for on-line handwriting. This project started two
years ago at the initiative of the International Association of Pattern
Recognition (Technical Committee 11). The purpose of the project is to propose
and implement solutions to the growing need of handwriting samples for on-line
handwriting recognizers used by pen-based computers. Researchers from several
companies and universities have agreed on a data format, a platform of data
exchange and a protocol for recognizer benchmarks. The on-line handwriting
data of concern may include hand print and cursive from various alphabets
(including Latin and Chinese), signatures and pen gestures. These data will be
compiled and distributed by the Linguistic Data Consortium. The benchmarks will
be arbitrated by the US National Institute of Standards and Technologies.
A brief introduction to the UNIPEN format is given. Furthermore, we explain the
protocol of data exchange and benchmarks.
Schomaker, L.R.B. (1994).
User-interface aspects in recognizing connected-cursive handwriting.
Proceedings of the IEE Colloquium on Handwriting and Pen-based input,
11 March, London: The Institution of Electrical Engineers,
Digest Number 1994/065, (ISSN 0963-3308).
There are at least two major stumbling blocks for user acceptance of pen-based
computers: (1) the recognition performance is not good enough, especially on
cursive handwriting, and (2) the user interface technology has not reached a
mature stage. The initial reaction of product reviewers and potential user
groups to pen-based computers varies. The realistic assessment is that this
is a technology with a large potential, but still too brittle for serious
real-world use. The application area of form filling, for instance, is
characterized by acceptable digit recognition rates, but the essential
punctuations such as decimal points and commas are still handled poorly.
Similarly, there are problems in handling text input. Novice users seem
to have a very high expectation of the pen technology. They start
quickly writing a paragraph of text in their style (mostly a mix of hand print
and cursive) and are amazed at the resulting strings of question marks and
characters which they do not remember having entered. Also, whereas users
seldomly blame the keyboard for their typing errors, the pen computer is blamed
for not recognizing script which shows evident shape errors after close
inspection. It seems to be clear that there are problems both at the system and
at the user side.
Schomaker, L, Abbink, G. & Selen, S. (1994). Writer and Writing-Style
Classification in the Recognition of On-line Handwriting.
Proceedings of the
European Workshop on Handwriting Analysis and Recognition: A European
Perspective, 12-13 July, 1994, London: The Institution of Electrical Engineers,
Digest Number 1994/123, (ISSN 0963-3308).
The Kohonen self-organized map of velocity-based strokes was found
to be a useful processing stage in earlier work. However, a problem is
posed by the huge variation in writing styles. In order to develop
style-specific recognition of cursive handwriting, it is useful to
apply a form of writer or writing-style classification in order to
restrict the set of possible and relevant character shapes. In this
paper, the histogram of stroke usage in the Kohonen self-organized
map of velocity-based strokes was used to classify writers. Using
average-linkage hierarchical clustering, the smallest clusters(n > 1)
consisted of individual writers.
Schomaker, L.R.B, Muench, S. & Hartung, K. (Eds). (1995).
A Taxonomy of Multimodal Interaction in the Human Information
Processing System. Report of the Esprit
Project 8579 MIAMI (187 p.), Nijmegen: NICI.
This document has been prepared in the ESPRIT/BRA project 8579,
Multimodal Integration for Advanced Multimedia Interfaces (MIAMI), in
order to serve as a basis for future work. The basic terms which will
be used in MIAMI are defined and an overview on man-machine-interfaces
is given. The document consists of six chapter and extended
appendices. In the Introduction, a general model for human-computer
interaction is given. Four components are identified: Human Output to
Computer (HOC), Computer Input Modalities (CIM), Computer Output Media
(COM), and Human Input from Computer (HIC). Furthermore, we identify
the necessary cognitive modules of abstract knowledge representation in
both man and machine. Chapter 2 deals with human Perception
characteristics (HIC). Chapter 3 describes Control and Manipulation by
the user (HOC). Chapter 4 introduces aspects of Interaction design and
the effects of 'modality blending'. Chapter 5 addresses the central or
or Cognitive component, which is needs to be described in
human-computer interaction, and Chapter 6, finally describes some
possible future scenarios in multimodal interaction. The Appendices
give an introduction into the uni- and bimodal interaction expertise
present in the MIAMI project: Binaural Technology, Audio-Visual Speech
Synthesis, Audio-Visual Speech Recognition, Gesture Taxonomies, and
Two-dimensional Movement in Time. The report is completed with an
extensive bibliography in the relevant scientific and technological
fields.
Schomaker , L.R.B, Nijtmans, J., Camurri, A., Lavagetto, F., Morasso, P., Benoît,
Guiard-Marigny, T., Le Goff,
B., Robert-Ribes, J., Adjoudani, A., Defée, I., Muench, S., Hartung, K., Blauert, J. (1995).
DI2 - Progress Report of the Esprit Project 8579 MIAMI
(122 p.), Nijmegen: NICI.
A number of experiments in multimodal control are described.
While in the fields of perception research and experimental
psychology, many findings on unimodel interaction are studied, our approach is
oriented towards practical multimodal solutions. Three types of
experiments were performed.
- Spatiotemporal integration of simple visuo-acoustical patterns: The
aim is to derive a numerical description about to what extent the
spatial and temporal attributes of visual and acoustic representations
may deviate in order to form an integrated single-object perception.
- Visual and acoustical integration in speech perception and automatic
recognition: A comparison is made between unimodal (acoustical) and
bimodal (sound + lip movement) recognition of speech by humans and by
existing recognition algorithms.
- Visual and haptic integration: In this set of experiments, haptic
and visual integration is tested. A manipulator with tactile feedback
is used to perform simple tele-operating manipulations with controlled
visual participation via camera and monitor. The images were varied in
detail, quality and amount of noise. Results are compared for a number
of control-interface devices.
Schomaker, L.R.B., & Van Galen, G.P. (1996).
Computer models of handwriting. In: Dijkstra & De Smedt (Eds.),
Computational Psycholinguistics: AI and connectionist models of
human language processing
(pp. 386-420). London: Taylor & Francis.
In this chapter, a general introduction is given in the area
of modeling handwriting. It is noted that most existing
models of handwriting focus on the peripheral aspects of the
handwriting process. Also, the majority of models is directed
at a superficial regeneration of experimental handwriting
data. In such an approach, the essence of handwriting as a
process that evolves in real time is lost, and there is the
risk that the model is more an exercise in curve fitting
rather than being a tool for gaining insight in the
psychomotor aspects of the handwriting process. An essential
problem in handwriting production is the fact that there are
symbolical levels of processing, in which discrete
representations or system states are involved, as well as
analog or continuous levels of processing, in which force,
amplitude and time play a central role. A conceptual model of
the handwriting process is presented, addressing aspects of
seriality and parallelism, and taking into account the
existing empirical psychomotor knowledge. Finally, two
simulation models are presented, which address two different
levels of chaining and shaping processes in handwriting
production.
Vuurpijl, L. & Schomaker, L. (1996). Coarse writing-style clustering
based on simple stroke-related features.
Proceedings of the 5th International
Workshop on Frontiers in Handwriting Recognition (pp. 29-34),
Sept. 2-5, 1996, Colchester, England.
Two methods are presented for the automatic detection of generic writing
styles like e.g. {\em mixed}, {\em cursive} and {\em hand print}.
Such techniques can be used for e.g. a dispatch process which assigns specialized$
recognition systems to a writer with unknown writing style.
Based on a set of handwritten words, three features are determined:
a {\em cursivity index} $c$, which indicates the tendency of a writer to write
cursive, and two distance measures $d_c$ and $d_h$. The distance measures
represent the distance between the stroke feature vectors in the input handwriting$
and the strokes stored in two style-specific Kohonen Self-Organizing Maps (SOM)$
SOM is tuned for the writing style {\em hand print}, and the other for {\em curs$
The first method uses a linear decision criterion to classify
feature vector $\{c,d_c,d_h\}$ into one of the three writing
styles. The second method uses non-linear decision boundaries found via
agglomerative hierarchical clustering of the three-dimensional feature vectors.
This method is able to make a more fine-grained distinction between writing sty$
classifications, especially within the lumped category 'mixed'.
Hartung, K., Muench, S., Schomaker L., Guiard-Marigny, T., Le Goff, B. MacLaverty, R.,
Nijtmans, J., Camurri,
A., Defee, I., and Benoit, C. (1996).
DI3 - Development of a System Architecture for the
Acquisition,
Integration, and Representation of Multimodal Information.
Report of Esprit Project
8579/MIAMI, March 25,
1996. Nijmegen: NICI. (121 p.).
Vuurpijl, L. & Schomaker, L. (1997). Coarse writing-style clustering based on simple
stroke-related features. In:
A.C. Downton & S. Impedovo, Progress in Handwriting Recognition (pp. 37-34). London: World
Scientific.
ISBN 981-02-3084-2
Mackowiak, J., Schomaker, L. & Vuurpijl, L. (1997). Semi-automatic determination of
allograph duration and position in on-line handwriting words based on the expected number of
strokes. In: A.C. Downton & S. Impedovo, Progress in Handwriting Recognition (pp.
69-74). London: World Scientific. ISBN 981-02-3084-2
In this paper, a semi-automatic character labeling ('truthing') procedure
is presented which uses an initial unsupervised algorithm to estimate the
starting stroke and number of strokes per allograph, followed by minimal
user interaction to improve the estimates (in this study, a stroke is
defined as the trajectory of the pen tip between two consecutive minima in
the absolute pen-tip velocity). Such a procedure is very useful since the
complete manual labeling of handwriting takes in the order of one hour per
one-hundred written words. Even without making use of
detailed shape information the proposed algorithm already yields a usable
segmentation which facilitates the manual labeling process.
Vuurpijl, L. & Schomaker, L. (1997). Finding structure in diversity: A hierarchical
clustering method for the categorization of allographs in handwriting, Proceedings of the
Fourth International Conference on Document Analysis and Recognition, Piscataway (NJ): IEEE
Computer Society, p. 387-393. ISBN 981-02-3084-2
This paper introduces a variant of agglomerative hierarchical clustering
techniques. The new technique is used for categorizing
character shapes (allographs) in large data sets of handwriting
into a hierarchical structure. Such a
technique may be used as the basis for a systematic naming scheme of
character shapes. Problems with existing methods are described and the proposed
method is explained. After application of the method to a very large set
of characters, separately for all the letters of the alphabet, relevant
clusters are identified and given a unique name. Each cluster represents
an allograph prototype.
Schomaker, L.R.B. (1998).
Entre écrire des formes sur une ardoise
et le traitement d'information utilisable.
Actes du 1er Colloque
International Francophone sur l'Écrit
et le document, p. 9-13. Montreal: Acfas. ISBN 2-89245-148-5.
L'introduction commercial du concept de 'pen computing' ou les petits
ordinateurs qui sont equippé d'un stylo pour l'entrée des données
etait un processus assez pénible pour l'industrie impliqué. Dans ce
tableau, les origines de cet échec sont analysés, partant de trois
points de vue de recherche. Aprés un introduction concernant
l'histoire recemment du ordinateur á stylo, le premier perspectif
donné concerne le processus psychomotorique gouvernant l'ecrire avec
le stylo. Le deuxième aspect concerne l'interface utilisateur á
stylo (IUS). La dernière perspective est celle-ci du reconaissance
des formes dans l'écriture registré en-ligne. Á la base de cet
tableau sont deux observations: (1) notre connaissance des formes qui
se trouvent dans l'écriture Occidentale et multi-culturelle est assez
superficiel et doit etre élargi et approfondi; et (2) la
développement des algorithmes pour la reconnaissance automatique de
l'écriture dans le contexte des agendas électroniques ne peut pas
réussir si l'interface utilisateur et l'application ne sont pas
connus précisement dès le début de cette entreprise. Les
solutions sont évident: (1) le construction d'une taxonomie explicite
des formes (allographes) dans l'écriture Occidentale, et (2)
l'integration des recherches sur le terrain de la reconnaissance
automatique avec ceux dans le terrain de la ergonomie cognitive des
interface utilisateurs sur les ordinateurs miniaturisés.
Schomaker, L.R.B. (1998).
From handwriting analysis to pen-computer applications.
IEE Electronics Communication Engineering Journal, 10(3), pp. 93-102.
In this paper, pen computing, i.e., the use of computers
and applications in which the pen is the main input device,
will be described from four different angles. In the first
section, a brief overview will be given on the hardware
developments in pen systems. After concluding that the
technological developments in this area did not lead to the
expected user acceptance of pen computing, the reasons
underlying this market failure are explored: The second part
deals with Pen-User Interfacing (PUI) aspects. Problems of
pen-user interface design are described. Existing and new
applications are summarized. The third part is concerned with
the handwriting process and product. The last part deals with
automatic recognition methodologies. Four basic factors
determining handwriting variation and variability are
identified. A handwriting recognition approach using
segmentation into velocity-based strokes is handled in somewhat
more detail. A large-scale project (UNIPEN) concerns the
benchmarking of the performance of on-line handwriting
recognition algorithms which is crucial for the advancement of
the state of the art in this area.
Schomaker, L. & Segers, E. (1998).
A method for the determination of features used in
human reading of cursive handwriting.
Proceedings of IWFHR'98, 12-14 August, Taejon, Korea, pp. 157-168.
Schomaker, L., Hoenkamp, E. & Mayberry, M. (1998).
Towards collaborative agents for automatic on-line handwriting recognition.
Proceedings of the Third European Workshop on Handwriting Analysis and
Recognition, 14-15 July, 1998, London: The Institution of Electrical Engineers,
Digest Number 1998/440, (ISSN 0963-3308), pp. 13/1-13/6.
A important problem in many areas of complex information processing is to
integrate heterogeneous information from different sources. For handwriting
recognition we accomplished such an integration through an agent architecture
of three co-operating agents. One agent represents the bottom-up knowledge as
generated by a shape-based classifier of unistrokes. The second agent
contains top-down (syntactic) knowledge about the language to be
interpreted. The third agent is the user who produces written symbol shapes
and who can be queried by the other agents. The chosen application paradigm
is the real-time input and execution of Scheme code. Although this is not a
practical domain, it contains all the essential elements of a full text-entry
application. Notable improvements can be gained over a pure bottom-up
approach to pen-based computing.
Vuurpijl, L. & Schomaker, L. (1998).
A framework for using multiple classifiers in a multiple-agent
architecture. Proceedings of the Third European Workshop on Handwriting
Analysis and Recognition, 14-15 July, 1998, London: The Institution of
Electrical Engineers, Digest Number 1998/440, (ISSN 0963-3308), pp. 8/1-8/6
Novel pattern recognition techniques using multiple agents for the recognition
of handwritten text are proposed in this paper. The architecture of a
distributed system dispatching recognition tasks to a set of recognizers and
combining their results is discussed. The concept of intelligent agents and
innovative multi-agent architectures for pattern recognition tasks is
introduced for combining and elaborating the classification hypotheses of
several classifiers. This concept is being developed in the IART project,
where intelligent agent architectures are built for pattern recognition tasks.
Vuurpijl, L. & Schomaker, L. (1998).
Multiple-agent architectures for the classification of handwritten text.
Proceedings of IWFHR'98, 12-14 August, Taejon, Korea, pp. 335-346.
(Loe?)
Hoenkamp, E.C.M., Stegeman, O., and Schomaker, L.R.B. (1999).
Supporting content retrieval from WWW via 'basic level categories'.
In Proceedings of the 22rd International Conference on Research and
Development in Information Retrieval (pp. 311-312).
New York: ACM Press (SIGIR forum, 33 (3)).
Plamondon, R., Lopresti, D.P., Schomaker, L.R.B. and
Srihari, R. (1999). On-line handwriting recognition.
In: J.G. Webster (Ed.). Wiley Encyclopedia of Electrical & Electronics
Engineering, 123-146, New York: Wiley, ISBN 0-471-13946-7.
This chapter gives an overview of the field of on-line handwriting
recognition. The introduction explains some backgrounds of the field:
on-line vs off-line recognition; recognition vs verification; occidental vs
oriental languages; and pen-pad applications. A section on
handwriting, i.e., process modeling and understanding deals with
psychophysical aspects. A section on preprocessing elaborates on
the following techniques: filtering, stroke detection, character and
word segmentation, restoration and beautification. Furthermore
data compression and ink preprocessing are elucidated. The next section
of the chapter focuses on the handwriting recognition itself:
Gesture, character and word classification; the differences between
recognizing cursive vs hand-printed styles, and processing of sentences.
Structural and statistical methods are explained, where a distinction is
made between explicit statistical methods and implicitly statistical
methods (such as neural-network classifiers). Finally, hidden-Markov
models as applied to on-line handwriting recognition are explained.
The final section is concerned with post processing of classification
results: combination of methods, lexical, syntactical and
semantic approaches.
Schomaker, L., de Leau, E. & Vuurpijl, L. (1999).
Using pen-based outlines for object-based
annotation and image-based queries.
In: D.P. Huijsmans and A.W.M. Smeulders
(Eds.). Visual Information and Information Systems,
New York: Springer, pp. 585-592.
A method for image-based queries and search is proposed which is based on the
generation of object outlines in images by using the pen, e.g., on color pen
computers. The rationale of the approach is based on a survey on user needs,
as well as on considerations from the point of view of pattern recognition and
machine learning. By exploiting the actual presence of the human users with
their perceptual-motor abilities and by storing textually annotated
queries, an incrementally learning image retrieval system can be developed. As
an initial test domain, sets of photographs of motor bicycles were used.
Classification performances are given for outline and bitmap-derived feature
sets, based on nearest-neighbour matching, with promising results. The benefit
of the approach will be a user-based multimodal annotation of an image database$
yielding a gradual improvement in precision and recall over time.
Schomaker, L., Vuurpijl, L. & de Leau, E. (1999).
New use for the pen: outline-based image queries.
Proceedings of the 5th International Conference on Document
Analysis and Recognition (ICDAR '99). Piscataway (NJ): IEEE.
pp. 293-296.
Schomaker, L. (1999).
New pen-based applications and the multimodal annotation of raw
multimedia .
Tutorial, presented at the ICDAR'99, Bangalore, Sept. 19.
Schomaker, L. & Segers, E. (1999).
Finding features used in the human reading of cursive handwriting
International Journal on Document Analysis and Recognition, 2, 13-18.
This paper first summarizes a number of findings in human
reading of handwriting. A method is proposed to uncover more
detailed information about geometrical features which human
readers use in the reading of Western script. The results of an
earlier experiment on the use of ascender/descender features were
used for a second experiment aimed at more detailed features
within words. A convenient experimental setup was developed, based
on image enhancement by local mouse clicks under time pressure.
The readers had to develop a cost-effective strategy to identify
the letters in the word. Results revealed a left-to-right
strategy in time, however, with extra attention to the initial,
leftmost parts and the final, rightmost parts of words in a range
of word lengths. The results confirm high hit rates on ascenders,
descenders, crossings and points of high curvature in the
handwriting pattern.
Christian Benoit, Catherine Pelachaud, Lambert Schomaker & Bernhard Suhm (1999).
Audio-visual and Multimodal Speech Systems.
Esprit project EAGLES.
(under revision: will appear as a chapter in a book).
This chapter on multimodal interfaces and multimodal speech systems (as defined
above) is structured as follows. After this introductory section, Section 1
presents results from a literature review and survey of multimodal systems that
the authors of this chapter performed. Section 2 discusses evaluation of
multimodal systems. Challenges in evaluating multimodal systems are identified,
known evaluation methodologies are reviewed, and issues in evaluating certain
kinds of multimodal systems are discussed, including talking heads and
synthetic conversational agents. The next four sections discuss aspects of the
various types of multimodal systems, according to what speech- and non-speech
modalities speech input and output is associated with: Section 3 describes
systems that combine speech input with information from the visual channel
(face detection, face recognition, tracking of facial features, and
lip-reading), Section 4 describes systems that combine speech with visual
output (e.g., talking heads), Section 5 describes systems that combine speech
input with other input modalities (defined as multimodal interface above), and
Section 6 describes systems that combine speech output with other modalities
(defined as multimedia systems above). These sections focus on concepts and
issues; the details of the technology necessary to implement such systems is
reviewed in Section 7. Finally, Section 8 presents established standards and
common resources for multimodal systems.
M. van Erp and L. Schomaker (2000).
Variants of the Borda count method for combining ranked classifier hypotheses.
Proceedings of the Seventh International Workshop on Frontiers in Handwriting
Recognition (7th IWFHR), Nijmegen: iUF, p. 443-452.
ISBN 90-76942-01-3
The Borda count is a simple yet effective method of combining rankings.
In pattern recognition, classifiers are often able to return a ranked
set of results. Several experiments have been conducted to test the
ability of the Borda count and two variant methods to combine these
ranked classifier results. By using artificial data, domain-specific
results were avoided. The results show the strength of the Borda count
when many errors occur in the results, but also show its weakness in
case of a limited number of large ranking errors.
L. Schomaker, D. Mangalagiu, L. Vuurpijl and M. Weinfeld (2000).
Two tree-formation methods for fast pattern search
using nearest-neighbour and nearest-centroid matching.
Proceedings of the Seventh International Workshop on Frontiers in Handwriting
Recognition (7th IWFHR), Nijmegen: iUF, p. 261-270.
ISBN 90-76942-01-3
This paper describes tree-based classification of character images,
comparing two methods of tree formation and two methods of matching:
nearest neighbor and nearest centroid. The first method, Preprocess
Using Relative Distances (PURD) is a tree-based reorganization of a
flat list of patterns, designed to speed up nearest-neighbor matching.
The second method is a variant of agglomerative hierarchical clustering
(HCLUS) which aims at finding a hierarchical structure of centroids in
the pattern space. Results indicate that the PURD method is a very
fast, effective and convenient method for the speedup of 1NN search,
from which it is, however, difficult to derive usable character
prototypes. HCLUS can be used to obtain very fast search with
acceptable classification rate while providing character prototypes,
however, at the cost of significant training efforts.
Vuurpijl and L. Schomaker (2000).
Two-stage character classification: A combined approach of
clustering and Support Vector Classifiers
Proceedings of the Seventh International Workshop on Frontiers in Handwriting
Recognition (7th IWFHR), Nijmegen: iUF, p. 423-432.
ISBN 90-76942-01-3
This paper describes a two-stage classification method for (1)
classification of isolated characters and (2) verification of the
classification result. Character prototypes are generated using
hierarchical clustering. For those prototypes known to sometimes
produce wrong classification results, a "support vector classifier"
(svc) is trained. The svc can be used to increase the confidence that a
classification is correct and furthermore decide on a classification if
the confidence using the standard method is too low. Experiments with
the iUF UNIPEN database yield 94% recognition rate. In cases where both
classifiers agree, the error rate is zero.
F. Wang, L. Vuurpijl and L. Schomaker (2000).
Support Vector Machines for the classification of Western
Handwritten Capitals
Proceedings of the Seventh International Workshop on Frontiers in Handwriting
Recognition (7th IWFHR), Nijmegen: iUF, p. 167-176.
ISBN 90-76942-01-3
In this paper, new techniques are presented using Support Vector
Machines (SVMs) for multi-class classification problems. The issue of
decomposing a N-class classification problem into a set of 2-class
classification questions is discussed. In particular, the technique for
normalizing the outputs of several SVMs is presented. Based on these
techniques, support vector classifiers for the recognition of Western
handwritten capitals are realized. Comparisons to several other
classification methods are also presented.
Schomaker, L.R.B. (2001) Image Search and Annotation: From Lab to
Web.
Document Electronique: Methodes, demarches et techniques cognitives,
pp.373-375, ISBN 2-909285-17-0.
In earlier work, a method for image-based
queries and search was proposed which is based on drawing of object outlines
on photos, by the users of an image-search system. By exploiting the actual
presence of the
human users with their perceptual-motor abilities and by storing textually
annotated queries, an incrementally learning image retrieval system can be
constructed. As a proof of concept, a large collection of paintings
of the Dutch Rijksmuseum was put on an Internet website, embedded in Java
software which allowed for the drawing of object outlines and the typing of
textual annotation [4]. The proposed outline-matching scheme can be used to search
and organize the annotated objects in a growing and enriched database of
objects which are present within the painting collection. Figures 1
display a query for 'dogs', based on the outline drawn in a painting by
the Dutch painter Jan Steen. The object can be extracted from the
image context. Different annotators produce multiple selections of
objects on different paintings in the database. Both at the level of perception
(geometry) and cognition (semantics) the human input is invaluable.
It is believed that such an approach will ultimately lead to better 'machine
intelligence': It has become clear in several domains (cf., the openMind
initiative) that only large
amounts of labeled data will provide the performance levels which are
required if these techniques (speech recognition, handwriting recognition,
object recognition) need to evolve from their current rudimentary stage.
Schomaker, L.R.B. (2002).
[Book Review on] "Plausible Neural Networks for Biological Modelling (2001).
H.A.K. Mastebroek & J.E. Vos (Eds.)",
BCN Nieuwsbrief, 46. Groningen: BCN - School of
Behavioral and Cognitive Neuroscience [in Dutch].
van Erp, M., Vuurpijl, L. & Schomaker, L. (2002).
An overview and comparison of voting methods for
pattern recognition, Proc. of the 8th IWFHR, Piscataway: IEEE,
ISBN 0-7695-1692-0, pp. 195-200.
In pattern recognition, there is a growing use of multiple
classifier combinations with the goal to increase recognition
performance. In many cases, plurality voting is a part of the
combination process. In this article, we discuss and test several
well known voting methods from politics and economics on classifier
combination in order to see if an alternative to the simple plurality
vote exists. We found that, assuming a number of prerequisites, better methods
are available, that are comparatively simple and fast.
Schomaker, L. (2003). Patronen en symbolen: een wereld door
het oog van de machine
[inaugural speech, 10/12/2002, English translation].
MUON, 87, pp. 8-13.
In this inaugural address, the incommensurability of methods from
logic and geometry are illustrated, using the example of automatic
recognition of cursive script. The schism started with the separation
of the AAAI and the IAPR research fields, currently leading to inelegant
hybrid systems and models. It is noted that the current state
of the art is far from being able to produce reliable and autonomous
systems. Furthermore, it is noted that systems have to be completely and
manually re-engineered for each application variant and context of usage.
A second theme of this inaugural address concerns the distinction between
builders (engineers), 'knowers' (scientists) and thinkers (philosophers).
It is noted that there exists a pecking order in which the builders are
in a most unfortunate position.
L. Schomaker, M. Bulacu & M. van Erp (2003).
Sparse-parametric writer identification
using heterogeneous feature groups.
ICIP'2003: IEEE International Conference on Image Processing (Vol. I),
pp. (I) 545-548.
This paper evaluates the performance of edge-based directional probability
distributions as features in writer identification in comparison to a number of
non-angular features. It is noted that angular features
outperform all other features. However, the non-angular features provide
additional valuable information. Rank-combination was used to realize a
sparse-parametric combination scheme based on nearest-neighbor search.
Limitations of the proposed methods pertain to the
amount of handwritten material needed in order to obtain reliable
distribution estimates. The global features treated in this study
are sensitive to major style variation (upper- vs lower case), slant,
and forged styles, which necessitates the use of other features in
realistic forensic writer identification procedures.
M. Bulacu, L. Schomaker & Vuurpijl, L. (2003).
Writer identification using edge-based directional features.
ICDAR'2003: International Conference on Document Analysis and Recognition,
pp. 937-941.
This paper evaluates the performance of edge-based directional
probability distributions as features in writer identification in
comparison to a number of non-angular features. It is noted that the
joint probability distribution of the angle combination of two "hinged"
edge fragments outperforms all other individual features. Combining
features may improve the performance. Limitations of the method pertain
to the amount of handwritten material needed in order to obtain reliable
distribution estimates. The global features treated in this study are
sensitive to major style variation (upper- vs lower case), slant, and
forged styles, which necessitates the use of other features in realistic
forensic writer identification procedures.
M. Bulacu & L. Schomaker (2003).
Writer Style from Oriented Edge Fragments.
Proc. of the 10th Int. Conference on Computer Analysis
of Images and Patterns (CAIP'03),
pp. 460-469.
In this paper we evaluate the performance of edge-based directional probability
distributions extracted from handwriting images as features in forensic writer
identification in comparison to a number of non-angular features. We compare
the performances of the features on lowercase and uppercase handwriting. In an
effort to gain location-specific information, new versions of the features are
computed separately on the top and bottom halves of text lines and then fused.
The new features deliver significant improvements in performance. We report
also on the results obtained by combining features using a voting scheme.
L. Schomaker & M. Bulacu (2004).
Automatic writer identification using connected-component
contours and edge-based features of upper-case Western script.
IEEE Transactions on Pattern Analysis and Machine Intelligence,
pp.xx-xxx.
In this paper, a new technique for off-line writer identification is
presented, using connected-component contours (COCOCOs or CO3s) in
upper-case handwritten samples. In our model, the writer is considered
to be characterized by a stochastic pattern generator, producing a
family of connected components for the upper-case character set. Using
a codebook of CO3s from an independent training set of 100 writers,
the probability-density function (PDF) of CO3s was computed for an
independent test set containing 150 unseen writers. Results revealed a
high-sensitivity of the CO3 PDF for identifying individual writers
on the basis of a single sentence of upper-case characters. The
proposed automatic approach bridges the gap between image-statistics
approaches on one end and manually measured allograph features of
individual characters on the other end. Combining the CO3 PDF with
an independent edge-based orientation and curvature PDF yielded very
high correct identification rates.
R. Ekker, E.C.D. van der Werf & L.R.B. Schomaker (2004).
Dedicated TD-learning for stronger gameplay: applications to Go
In: A. Nowe, T. Lennaerts, K. Steenhaut (Eds.),
Proceedings of Benelearn 2004 Annual Machine Learning Conference of
Belgium and The Netherlands (Brussels, BE, January 8-9), pp. 46-52.
This paper presents a study of several dedicated Temporal-Difference (TD)
reinforcement learning algorithms for deterministic zero-sum games of perfect
information such as the game of Go. The algorithms include TD(mu) by
Beal (2002), which separates good play from bad play, TD-leaf(lambda) and
TD-directed(lambda) by Baxter et al. (1998), which exploit game tree
searching, as well as Baird's residual algorithm (1995) for preventing
instability during training. We show that dedicated TD learning
algorithms provide faster training and the acquisition of more 'genuine'
knowledge of the game resulting in significantly higher playing strength
than players trained by standard TD.
L. Schomaker (2004). Anticipation in cybernetic systems: A case against mindless anti-representationalism. Proc. of IEEE Systems,
Man & Cybernetics (SMC'04), pp. xx-xxx.
The developments in behavior-based robotics and
in ecological psychology have had a strong effect on theoretical
development in some research communities. A new belief has emerged
under the name of anti-representationalism, which is strongly opposed to
the notion of representations in cognition. This notion is spurred by
the inarguably fruitful insight that behavioral complexity can be
brought about by simple mechanisms at a low systemic level. Although
there are many problems with constructed formal representations in
the toy models of traditional artificial intelligence, there is a
fundamental problem with extreme anti-representationalism, as well.
Representations actually do exist in the biological neural-information
processing system. In
this paper, a review of neural representation mechanisms will be given,
looking at perception and motor control in biological systems.
Subsequently, it will be illustrated that already in simple animal behaviors,
simple 'representation-less' reactivity does not suffice. Anticipation
exists even in jumping spiders, requiring the existence of a representation
as the computational basis for the prediction of future system states.
Niels, R., Vuurpijl, L. & Schomaker, L.R.B. (2006, in press).
Automatic allograph matching in forensic writer identification.
International Journal of Pattern Recognition and Artificial Intelligence,
x(x), p. xxx-xxx.
A well-established task in forensic writer identification focuses on the comparison of
prototypical character shapes (allographs) present in the handwriting. In order for a
computer to perform this task convincingly, it should yield results that are plausible and
understandable to the human expert. Trajectory matching is a well-known method to
compare two allographs. This paper assesses a promising technique for so-called human-
congruous trajectory matching, called dynamic time warping. In the first part of this
paper, experiments are shown that indicate that DTW yields results that correspond to
the expectations of human users. Since DTW requires the dynamics of the handwritten
trace, the on-line dynamic allograph trajectories need to be extracted from the \off-line"
scanned documents. For the second part of this paper, handwritten images are employed
generated from relatively large on-line data sets, which provide the true trajectories. This
allows for a quantitative assessment of the trajectory extraction techniques rather than
a qualitative discussion of a small number of examples. Our results show that DTW can
significantly improve the results from trajectory extraction when compared to traditional
techniques.
Schomaker, L.R.B., Franke, K. & Bulacu, M.
(2006, in press).
Using codebooks of fragmented connected-component contours
in forensic and historic writer identification,
Pattern Recognition Letters, x(x), p. xxx-xxx.
Recent advances in 'off-line' writer identification allow for new
applications in handwritten text retrieval from archives of scanned
historical documents. This paper describes new algorithms for forensic
or historical writer identification, using the contours of fragmented
connected-components in free-style handwriting. The writer is
considered to be characterized by a stochastic pattern generator,
producing a family of character fragments (fraglets). Using a codebook
of such fraglets from an independent training set, the probability
distribution of fraglet contours was computed for an independent test
set. Results revealed a high sensitivity of the fraglet histogram in
identifying individual writers on the basis of a paragraph of text.
Large-scale experiments on the optimal size of Kohonen maps of fraglet
contours were performed, showing usable classification rates within a
non-critical range of Kohonen map dimensions. The proposed automatic
approach bridges the gap between image-statistics approaches and purely
knowledge-based manual character-based methods.
Bulacu, M. & Schomaker, L.R.B. (2007, in press).
Text-independent Writer Identification and Verification Using
Textural and Allographic Features, IEEE PAMI, July 2007, x(x), p. xxx-xxx.
The identification of a person on the basis of scanned images of handwriting is a useful biometric
modality with application in forensic and historic document analysis and constitutes an exemplary study
area within the research field of behavioral biometrics. We developed new and very effective techniques
for automatic writer identification and verification that use probability distribution functions (PDFs)
extracted from the handwriting images to characterize writer individuality. A defining property of our
methods is that they are designed to be independent of the textual content of the handwritten samples.
Our methods operate at two levels of analysis: the texture level and the character-shape (allograph)
level. At the texture level, we use contour-based joint directional PDFs that encode orientation and
curvature information to give an intimate characterization of individual handwriting style. In our analysis
at the allograph level, the writer is considered to be characterized by a stochastic pattern generator of
ink-trace fragments, or graphemes. The PDF of these simple shapes in a given handwriting sample is
characteristic for the writer and is computed using a common shape codebook obtained by grapheme
clustering. Combining multiple features (directional, grapheme and run-length PDFs) yields increased
writer identification and verification performance. The proposed methods are applicable to free-style
handwriting (both cursive and isolated) and have practical feasibility, under the assumption that a few
text lines of handwritten material are available in order to obtain reliable probability estimates.
PostScript Papers, directory
Up