Publications by Lawrence B. Holder

<TITLE> Publications by Lawrence B. Holder </TITLE>

<H1> Publications by Lawrence B. Holder </H1>

<HR>

<A NAME="mr96"></A>

<H2> An Empirical Approach to Solving the General Utility Problem in
Speedup Learning </H2>

A. Chaudhry and L. B. Holder.  An Empirical Approach to Solving the
General Utility Problem in Speedup Learning.  To appear in F. D. Anger
and M. Ali (eds.) <I>Machine Reasoning</I>, Gordon and Breach Science
Publishers, 1996. <P>

<B>Abstract:</B> The utility problem in speedup learning describes a
common behavior of machine learning methods: the eventual degradation
of performance due to increasing amounts of learned knowledge. The
shape of the learning curve (cost of using a learning method
vs. number of training examples) over several domains suggests a
parameterized model relating performance to the amount of learned
knowledge and a mechanism to limit the amount of learned knowledge for
optimal performance.  Many recent approaches to avoiding the utility
problem in speedup learning rely on sophisticated utility measures and
significant numbers of training data to accurately estimate the
utility of control knowledge.  Empirical results presented here and
elsewhere indicate that a simple selection strategy of retaining all
control rules derived from a training problem explanation quickly
defines an efficient set of control knowledge from few training
problems.  This simple selection strategy provides a low-cost
alternative to example-intensive approaches for improving the speed of
a problem solver.  Experimentation illustrates the existence of a
minimum (representing least cost) in the learning curve which is
reached after a few training examples.  Stress is placed on
controlling the amount of learned knowledge as opposed to which
knowledge. An attempt is also made to relate domain characteristics to
the shape of the learning curve. <P>

<B>Available as:</B> <A HREF = "pubs/mr96.ps">mr96.ps</A> <P>

A shorter version of this article appears in our <A HREF =
"#ieaaie94">IEA/AIE-94 paper</A>. <P>

<HR>

<A NAME = "jiis95"></A>

<H2> Knowledge Discovery from Structural Data </H2>

D. J. Cook, L. B. Holder, and S. Djoko.  Knowledge Discovery from
Structural Data.  <I>Journal of Intelligence and Information
Sciences</I>, Volume 5, Number 3, pages 229-245, 1995.  <P>

<B>Abstract:</B> Discovering repetitive substructure in a structural
database improves the ability to interpret and compress the data.
This paper describes the SUBDUE system that uses domain-independent
and domain-dependent heuristics to find interesting and repetitive
structures in structural data.  This substructure discovery technique
can be used to discover fuzzy concepts, compress the data description,
and formulate hierarchical substructure definitions.  Examples from
the domains of scene analysis, chemical compound analysis,
computer-aided design, and program analysis demonstrate the benefits
of the discovery technique.  <P>

<B>Available as:</B> <A HREF = "pubs/jiis95.ps">jiis95.ps</A> <P>

<HR>

<A NAME = "ijcai95"></A>

<H2> Intermediate Decision Trees </H2>

L. B. Holder.  Intermediate Decision Trees.  In the <I>Proceedings of
the 14th International Joint Conference on Artificial
Intelligence</I>, pages 1056-1062, 1995. <P>

<B>Abstract:</B> Intermediate decision trees are the subtrees of the
full (unpruned) decision tree generated in a breadth-first order.  An
extensive empirical investigation evaluates the classification error
of intermediate decision trees and compares their performance to full
and pruned trees.  Empirical results were generated using C4.5 with 66
databases from the UCI machine learning database repository.  Results
show that when attempting to minimize the error of the pruned tree
produced by C4.5, the best intermediate tree performs significantly
better in 46 of the 66 databases.  These and other results question
the effectiveness of decision tree pruning strategies and suggest
further consideration of the full tree and its intermediates.  Also,
the results reveal specific properties satisfied by databases in which
the intermediate full tree performs best.  Such relationships improve
guidelines for selecting appropriate inductive strategies based on
domain properties. <P>

<B>Available as:</B> <A HREF = "pubs/ijcai95.ps">ijcai95.ps</A> <P>

<HR>

<A NAME = "kdd95"></A>

<H2> Analyzing the Benefits of Domain Knowledge in Substructure
Discovery </H2>

S. Djoko, D. J. Cook and L. B. Holder.  Analyzing the Benefits of
Domain Knowledge in Substructure Discovery.  In the <I>Proceedings of
the First International Conference on Knowledge Discovery and Data
Mining</I>, pages 75-80, 1995. <P>

<B>Abstract:</B> Discovering repetitive, interesting, and functional
substructures in a structural database improves the ability to
interpret and compress the data.  However, scientists working with a
database in their area of expertise often search for predetermined
types of structures, or for structures exhibiting characteristics
specific to the domain.  This paper presents a method for guiding the
discovery process with domain-specific knowledge.  In this paper, the
SUBDUE discovery system is used to evaluate the benefits of using
domain knowledge to guide the discovery process.  The domain knowledge
is incorporated into SUBDUE following a single general methodology to
guide the discovery process.  Results show that domain-specific
knowledge improves the search for substructures which are useful to
the domain, and leads to greater compression of the data. To
illustrate these benefits, examples and experiments from the computer
programming, computer aided design circuit, and artificially-generated
domains are presented. <P>

<B>Available as:</B> <A HREF = "pubs/kdd95.ps">kdd95.ps</A> <P>

<HR>

<A NAME = "jair94"></A>

<H2> Substructure Discovery Using Minimum Description Length and
Background Knowledge </H2>

D. J. Cook and L. B. Holder.  Substructure Discovery Using Minimum
Description Length and Background Knowledge.  In <I>Journal of
Artificial Intelligence Research</I>, Volume 1, pages 231-255, 1994.
<P>

<B>Abstract:</B> The ability to identify interesting and repetitive
substructures is an essential component to discovering knowledge in
structural data.  We describe a new version of our SUBDUE substructure
discovery system based on the minimum description length principle.
The SUBDUE system discovers substructures that compress the original
data and represent structural concepts in the data.  By replacing
previously-discovered substructures in the data, multiple passes of
SUBDUE produce a hierarchical description of the structural
regularities in the data.  SUBDUE uses a computationally-bounded
inexact graph match that identifies similar, but not identical,
instances of a substructure and finds an approximate measure of
closeness of two substructures when under computational constraints.
In addition to the minimum description length principle, other
background knowledge can be used by SUBDUE to guide the search towards
more appropriate substructures.  Experiments in a variety of domains
demonstrate SUBDUE's ability to find substructures capable of
compressing the original data and to discover structural concepts
important to the domain.  <P>

<B>Available as:</B> <A HREF = "pubs/jair94.ps">jair94.ps</A> <P>

<HR>

<A NAME = "kdd94"></A>

<H2> Substructure Discovery in the SUBDUE System </H2>

L. B. Holder, D. J. Cook and S. Djoko.  Substructure Discovery in the
SUBDUE System.  In <I>Proceedings of the AAAI Workshop on Knowledge
Discovery in Databases</I>, pages 169-180, 1994.  <P>

<B>Abstract:</B> Because many databases contain or can be embellished
with structural information, a method for identifying interesting and
repetitive substructures is an essential component to discovering
knowledge in such databases.  This paper describes the SUBDUE system,
which uses the minimum description length (MDL) principle to discover
substructures that compress the database and represent structural
concepts in the data.  By replacing previously-discovered
substructures in the data, multiple passes of SUBDUE produce a
hierarchical description of the structural regularities in the data.
Inclusion of background knowledge guides SUBDUE toward appropriate
substructures for a particular domain or discovery goal, and the use
of an inexact graph match allows a controlled amount of deviations in
the instance of a substructure concept.  We describe the application
of SUBDUE to a variety of domains.  We also discuss approaches to
combining SUBDUE with non-structural discovery systems.  <P>

<B>Available as:</B> <A HREF = "pubs/kdd94.ps">kdd94.ps</A> <P>

<HR>

<A NAME="ieaaie94"></A>

<H2> An Empirical Approach to Solving the General Utility Problem in
Speedup Learning </H2>

A. Chaudhry and L. B. Holder.  An Empirical Approach to Solving the
General Utility Problem in Speedup Learning.  In the <I>Seventh
International Conference on Industrial and Engineering Applications of
Artificial Intelligence and Expert Systems</I>, pages 149-158,
1994. <P>

<B>Abstract:</B> Many recent approaches to avoiding the utility
problem in speedup learning (the eventual degradation of performance
due to increasing amounts of learned problem-solver control knowledge)
rely on sophisticated utility measures and significant numbers of
training problems to accurately estimate the utility of control
knowledge.  Empirical results presented here and elsewhere indicate
that a simple selection strategy of retaining all control rules
derived from a training problem solution quickly defines an efficient
set of control knowledge from few training problems.  This simple
selection strategy provides a low-cost alternative to
example-intensive approaches for improving the speed of a problem
solver.  Experimentation illustrates the existence of a minimum
(representing least cost) in the learning curve which is reached after
a few training examples.  Stress is placed on controlling the amount
of learned knowledge as opposed to which knowledge.  An attempt is
also made to relate domain characteristics to the shape of the
learning curve. <P>

<B>Available as:</B> <A HREF = "pubs/ieaaie94.ps">ieaaie94.ps</A> <P>

An extended version of this paper appears in our <A HREF =
"#mr96"><I>Machine Reasoning</I> book chapter</A>. <P>

<HR>

<A NAME = "iuw94"></A>

<H2> Sensor Planning and Coordination in Multi-Agent Systems </H2>

D. J. Cook and L. B. Holder.  Sensor Planning and Coordination in
Multi-Agent Systems.  In the <I>Proceedings of the Image Understanding
Workshop</I>, 1994.  <P>

<B>Abstract:</B> This paper describes the application of Artificial
Intelligence planning techniques to sensor pointing and control.  In
particular, the SAUSAGES plan execution system has been adapted to
execute and control plans generated for the guidance of autonomous
vehicles and control of camera movement associated with the vehicle.
Additional capabilities have been added to allow for multiple vehicle
and sensor coordination.  <P>

<B>Available as:</B> <A HREF = "pubs/iuw94.ps">iuw94.ps</A> <P>

<HR>

<A NAME = "tkde93"></A>

<H2> Discovery of Inexact Concepts from Structural Data </H2>

L. B. Holder and D. J. Cook.  Discovery of Inexact Concepts from
Structural Data.  In <I>IEEE Transactions on Knowledge and Data
Engineering</I>, Volume 5, Number 6, pages 992-994, 1993.  <P>

<B>Abstract:</B> Concept discovery in structural data requires the
identification of repetitive substructures in the data.  We describe a
method for discovering substructures in data using an inexact graph
match.  A previous implementation of our SUBDUE system discovers
substructures based on the psychologically-motivated criteria of
cognitive savings, compactness, connectivity and coverage.  However,
the instances in the data must exactly match the discovered
substructures.  We describe a new implementation of SUBDUE that
employs an inexact graph match to discover substructures which occur
often in the data, but not always in the same form.  This inexact
substructure discovery can be used to formulate fuzzy concepts,
compress the data description, and discover interesting structures in
data that are found either in an identical or in a slightly convoluted
form.  Examples from the domains of scene analysis and chemical
compound analysis demonstrate the benefits of the inexact discovery
technique.  <P>

<B>Available as:</B> <A HREF = "pubs/tkde93.ps">tkde93.ps</A> <P>

<HR>

<A NAME="kcsl93"></A>

<H2> Simple Selection of Utile Control Rules in Speedup Learning </H2>

L. B. Holder and A. Chaudhry. Simple Selection of Utile Control Rules
in Speedup Learning.  In <I>Proceedings of the Third International
Workshop on Knowledge Compilation and Speedup Learning</I>, pages
77-82, 1993. <P>

<B>Abstract:</B> Many recent approaches to avoiding the utility
problem in speedup learning rely on sophisticated utility measures and
significant numbers of training data to accurately estimate the
utility of control knowledge.  Empirical results presented here and
elsewhere indicate that a simple selection strategy of retaining all
control rules derived from a training problem explanation quickly
defines an efficient set of control knowledge from few training
problems.  This simple selection strategy provides a low-cost
alternative to example-intensive approaches for improving the speed of
a problem solver. <P>

<B>Available as:</B> <A HREF = "pubs/kcsl93.ps">kcsl.ps</A> <P>

<HR>

<A NAME = "aaai92"></A>

<H2> Empirical Analysis of the General Utility Problem in Machine
Learning </H2>

L. B. Holder. Empirical Analysis of the General Utility Problem in
Machine Learning.  In <I>Proceedings of the Tenth National Conference
on Artificial Intelligence</I>, pages 249-254, 1992. <P>

<B>Abstract:</B> The overfit problem in inductive learning and the
utility problem in speedup learning both describe a common behavior of
machine learning methods: the eventual degradation of performance due
to increasing amounts of learned knowledge.  Plotting the performance
of the changing knowledge during execution of a learning method (the
performance response) reveals similar curves for several methods.  The
performance response generally indicates an increase to a single peak
followed by a more gradual decrease in performance. The similarity in
performance responses suggests a model relating performance to the
amount of learned knowledge.  This paper provides empirical evidence
for the existence of a general model by plotting the performance
responses of several learning programs.  Formal models of the
performance response are also discussed.  These models can be used to
control the amount of learning and avoid degradation of performance.
<P>

<B>Available as:</B> <A HREF = "pubs/aaai92.ps">aaai92.ps</A> <P>

<HR>

<A NAME = "ml92"></A>

<H2> Fuzzy Substructure Discovery </H2>

L. B. Holder, D. J. Cook and H. Bunke. Fuzzy Substructure Discovery.
In <I>Proceedings of the Ninth International Conference on Machine
Learning</I>, pages 218-223, 1992.  <P>

<B>Abstract:</B> This paper describes a method for discovering
substructures in data using a fuzzy graph match.  A previous
implementation of the {\sc Subdue} system discovers substructures
based on the psychologically-motivated criteria of cognitive savings,
compactness, connectivity and coverage.  However, the instances in the
data must exactly match the discovered substructures.  We describe a
new implementation of {\sc Subdue} that employs a fuzzy graph match to
discover substructures which occur often in the data, but not always
in the same form.  This fuzzy substructure discovery can be used to
formulate fuzzy concepts, compress the data description, and discover
interesting structures in data that are found either in their pure
form or in a slightly convoluted form.  Examples from the domains of
scene analysis and chemical compound analysis demonstrate the fuzzy
discovery technique.
<P>

<B>Available as:</B> <A HREF = "pubs/ml92.ps">ml92.ps</A> <P>

<HR>

<A NAME = "kcsl92"></A>

<H2> Unifying Empirical and Explanation-Based Learning by Modeling the
Utility of Learned Knowledge </H2>

L. B. Holder. Unifying Empirical and Explanation-Based Learning by
Modeling the Utility of Learned Knowledge.  In <I>Proceedings of the
ML92 Workshop on Knowledge Compilation and Speedup Learning</I>, 1992.
<P>

<B>Abstract:</B> The overfit problem in empirical learning and the
utility problem in explanation-based learning describe a similar
phenomenon: the degradation of performance due to an increase in the
amount of learned knowledge.  Plotting the performance of learned
knowledge during the course of learning (the performance response)
reveals a common trend for several learning methods.  Modeling this
trend allows a control system to constrain the amount of learned
knowledge to achieve peak performance and avoid the general utility
problem.  Experiments evaluate a particular empirical model of the
trend, and analysis of the learners derive several formal models.  If,
as evidence suggests, the general utility problem can be modeled using
the same mechanisms for different learning paradigms, then the model
serves to unify the paradigms into one framework capable of comparing
and selecting different learning methods based on predicted achievable
performance.  <P>

<B>Available as:</B> <A HREF = "pubs/kcsl92.ps">kcsl92.ps</A> <P>

<HR>

<A NAME = "msl91"></A>

<H2> Selection of Learning Methods Using an Adaptive Model of
Knowledge Utility </H2>

L. B. Holder.  Selection of Learning Methods Using an Adaptive Model
of Knowledge Utility.  In <I>Proceedings of the International Workshop
on Multistrategy Learning</I>, pages 247-254, 1991.  <P>

<B>Abstract:</B> Experiments measuring knowledge utility during the
course of learning reveal a common behavior among several learning
methods.  As the amount of learned knowledge increases, the utility
(performance) of the knowledge initially increases, but eventually
degrades.  A multistrategy learning system based on a model of this
common behavior selects a learning method according to the model's
prediction of achievable performance.  The model-based adaptive
control (MBAC) approach provides this capability by maintaining models
for each learning method.  MBAC also predicts the amount of knowledge
to be acquired by the method in order to avoid the degradation of
performance.  MBAC unifies multiple learning strategies based on the
common behavior of the utility of their knowledge during the course of
learning. <P>

<B>Available as:</B> <A HREF = "pubs/msl91.ps">msl91.ps</A> <P>

<HR>

<A NAME = "phdthesis"></A>

<H2> Maintaining the Utility of Learned Knowledge Using Model-Based
Adaptive Control </H2>

L. B. Holder.  Maintaining the Utility of Learned Knowledge Using
Model-Based Adaptive Control.  Ph.D. Thesis, Department of Computer
Science, University of Illinois, Urbana, IL, October 1991. <P>

<B>Abstract:</B>
The overfit problem in empirical learning and the utility problem in
analytical learning both describe a common behavior of machine
learning methods: the eventual degradation of performance due to
increasing amounts of learned knowledge.  Plotting the performance of
the changing knowledge during execution of a machine learning method
(the performance response) reveals similar curves for several methods.
The performance response generally indicates a single peak performance
greater than that attained by popular pruning techniques.  The
similarity in performance responses suggests a parameterized model
relating performance to the amount of learned knowledge.  Given this
model, a model-based adaptive control (MBAC) approach can be used to
update the model based on feedback from the performance element and
make control decisions regarding the amount of knowledge to be learned
or unlearned. <P>

In view of the large number of alternative learning methods, a more
general utility problem exists in determining not only the correct
amount of learned knowledge, but also the correct method for learning
this knowledge.  Relying too heavily on one particular learning method
may result in less than optimal performance achievement.  Overcoming
this <I>general utility problem</I> requires a new control mechanism
for determining the correct learning method and amount of learned
knowledge in order to achieve the performance objectives of the task.
Maintaining models for several learning methods allows the MBAC
approach to decide the appropriate type of learning, in addition to
the amount. <P>

Experimentation analyzes the ability of the MBAC approach to converge
upon the peak of the performance response and avoid generation of low
utility knowledge.  Results indicate that a quadratic model is
sufficient to fit the peak of the performance response and that MBAC
using the quadratic model performs well at selecting the best learning
method for a given learning task.  More formal analysis of the
performance response supports the quadratic model for controlling
how much knowledge to learn as opposed to which knowledge.
<P>

<B>Available as:</B> <A HREF = "pubs/phdthesis.ps">phdthesis.ps</A> <P>

<HR>

<A NAME = "ieaaie90"></A>

<H2> Application of Machine Learning to the Maintenance of Knowledge
Base Performance </H2>

L. B. Holder.  Application of Machine Learning to the Maintenance of
Knowledge Base Performance.  In <I>Proceedings of the Third
International Conference on Industrial and Engineering Applications of
Artificial Intelligence and Expert Systems</I>, pages 1005-1012, 1990.
<P>

<B>Abstract:</B> Integration of machine learning methods into
knowledge-based systems requires greater control over the application
of the learning methods.  Recent research in machine learning has
shown that isolated and unconstrained application of learning methods
can eventually degrade performance.  This paper presents an approach
called performance-driven knowledge transformation for controlling the
application of learning methods.  The primary guidance for the control
is performance of the knowledge base.  The approach is implemented in
the PEAK system.  Two experiments with PEAK illustrate how the
knowledge base is transformed using different learning methods to
maintain performance goals.  Results demonstrate the ability of
performance-driven knowledge transformation to control the application
of learning methods and maintain knowledge base performance.  <P>

<B>Available as:</B> <A HREF = "pubs/ieaaie90.ps">ieaaie90.ps</A> <P>

<HR>

<A NAME = "ml90"></A>

<H2> The General Utility Problem in Machine Learning </H2>

L. B. Holder.  The General Utility Problem in Machine Learning.  In
<I>Proceedings of the Seventh International Conference on Machine
Learning</I>, pages 402-410, 1990.  <P>

<B>Abstract:</B> Experiments have revealed that uncontrolled
application of the analytical learning paradigm results in knowledge
having low utility.  Because the performance element must consider low
utility knowledge along with high utility knowledge, the proliferation
of low utility knowledge eventually defeats the goal of improved
performance.  Experiments in empirical learning have demonstrated a
similar phenomenon.  Uncontrolled application of an empirical learning
paradigm may result in inaccurate knowledge, and a post-processing
stage is typically needed to repair the degradation in performance.
The results from experimentation in both analytical and empirical
learning imply a general utility problem in machine learning.  This
paper presents evidence for such a perspective and recommends a closer
dependence between the learning paradigm and the performance goals for
which it is designed.  A new approach is presented along with
experimentation that illustrates the applicability of the approach to
the general utility problem. <P>

<HR>

<A NAME = "pdp90"></A>

<H2> Accelerated Learning on the Connection Machine </H2>

D. J. Cook and L. B. Holder.  Accelerated Learning on the Connection
Machine.  In <I>Proceedings of the Second IEEE Symposium on Parallel
and Distributed Processing</I>, pages 448-454, 1990.  <P>

<B>Abstract:</B> The complexity of most machine learning techniques
can be improved by transforming iterative components into their
parallel equivalent.  Although this parallelization has been
considered in theory, few implementations have been performed on
existing parallel machines.  The parallel architecture of the
Connection Machine provides a platform for the implementation and
evaluation of parallel learning techniques.  The architecture of the
Connection Machine is described along with limitations of the language
interface that constrain the implementation of learning programs.
Connection Machine implementations of two learning programs,
Perceptron and AQ, are described, and their computational complexity
is compared to that of the corresponding sequential versions using
actual runs on the Connection Machine. Techniques for parallelizing
ID3 are also analyzed, and the advantages and disadvantages of
parallel implementation on the Connection Machine are discussed in the
context of machine learning.  <P>

<B>Available as:</B> <A HREF = "pubs/pdp90.ps">pdp90.ps</A> <P>

<HR>

<A NAME = "flairs90"></A>

<H2> Performance-Driven Knowledge Transformation </H2>

L. B. Holder.  Performance-Driven Knowledge Transformation.  In
<I>Proceedings of the Third Florida Artificial Intelligence Research
Symposium</I>, pages 149-153, 1990.  <P>

<B>Abstract:</B> Integration of machine learning methods with
knowledge based systems requires sophisticated control mechanisms for
applying methods appropriate to the performance task.
Performance-driven knowledge transformation controls the application
of learning methods based on their ability to achieve desired
performance goals while preserving the performance on other tasks.  A
means-ends approach to performance-driven knowledge transformation is
presented along with experimental results from an implementation.  The
results indicate that performance-driven knowledge transformation is
able to maintain multiple performance goals by applying appropriate
machine learning methods to transform a knowledge base.  <P>

<HR>

<A NAME = "ml89"></A>

<H2> Empirical Substructure Discovery </H2>

L. B. Holder.  Empirical Substructure Discovery.  In <I>Proceedings of
the Sixth International Workshop on Machine Learning</I>, pages
133-136, 1989.  <P>

<B>Abstract:</B> This paper describes the substructure discovery
method used in the SUBDUE system.  The method involves a
computationally constrained best-first search guided by four
heuristics: cognitive savings, compactness, connectivity and coverage.
Each of the four heuristics are described along with their role in the
evaluation of a substructure.  An example demonstrates SUBDUE's
ability to discover substructure and the advantages to be gained by
other learning systems from the discovery of substructure concepts.
<P>

<HR>