Download as .pdf or .zip

 

All Books are in Adobe Acrobat PDF Format.
You may download a free plug-in here.

Download Winzip for
[Windows] or [Mac]
 
BOOK VIII - Page 6
 
  HERBERT SIMON, PAUL THAGARD AND OTHERS ON
DISCOVERY SYSTEMS
 
 

 

BACON and Other Discovery Systems

          In his Novum Organon (Book I, Ch. LXI) Francis Bacon had expressed the view that with a few easily learned rules or a method it may be possible for anyone undertaking scientific research to be successful.  And he proposed a method of discovery in the sciences which will leave little to the sharpness and strength of men’s wits, but will bring all wits and intellects nearly to a level.  For as in drawing a straight line or in inscribing an accurate circle by the unassisted hand, much depends on its steadiness and practice, but if a rule or pair of compasses be applied, little or nothing depends upon them, so exactly is it with his method.  Today Bacon’s agenda is called proceduralization for mechanization, and it is appropriate therefore that a discovery system should be named BACON.
          The BACON discovery system is actually a set of successive and increasingly sophisticated discovery systems that make quantitative empirical laws and theories.  Given sets of observation measurements for two or more variables, BACON searches for functional relations among the variables.  The search heuristics in earlier versions of each BACON computer program are carried forward into all later ones, and later versions contain heuristics that are more sophisticated than those in earlier versions.  In the literature describing the BACON systems each successive version is identified by a numerical suffix, such as BACON.1.  The original version, BACON.1, was designed and implemented by Pat Langley in 1979 as the thesis for his Ph.D. dissertation written in the Carnegie-Mellon department of psychology under the direction of Simon, and titled Descriptive Discovery Processes: Experiments in Baconian Science.  He published descriptions of the system in "BACON.1: A General Discovery System" in The Proceedings of the Second National Conference of the Canadian Society for Computational Studies in Intelligence (1978) and as a co-author with Simon and others in Scientific Discovery (1987).  BACON programs are implemented in a list processing computer language called LISP, and its discovery heuristics are implemented in a production-system language called PRISM.  The system lists the observable measurement data monotonically according to the values of one of the variables, and then determines whether the values of some other variables follow the same (or the inverse) ordering.  Picking one of these other variables, it searches for an invariant by considering the ratio (or the product) of these variables with the original one.  If the ratio or product is not constant, it is introduced as a new variable, and the process continues the search for invariants.  Examples of some of the simpler search heuristics expressed in the conditional form of a production are as follows: (1) If the values of a variable are constant, then infer that the variable always has that value.  (2) If the values of two numerical variables increase together, then examine their ratio.  (3) If the values of one variable increase as those of another decrease, then examine their product.  The general strategy used with these heuristics is to create variables that are ratios or products, and then to treat them as data from which still other terms are created, until a constant is identified by the first heuristic.
          BACON.1 has rediscovered several historically significant empirical laws including Boyle's law of gases, Kepler's third planetary law, Galileo's law of motion of objects on inclined planes, and Ohm's law of electrical current.  A similar system, BACON.3 has rediscovered the ideal gas law and Coulomb's law of electrical current.  For making these rediscoveries, Simon and his associates used measurement data actually used by the original discoverers, and published by W.F. Magie in A Source Book in Physics (1935).  BACON.4 is a significant improvement over earlier versions.  It was developed and firstly described by Gary Bradshaw, Pat Langley, and Herbert Simon in "The Discovery of Intrinsic Properties" in The Proceedings of the Third National Conference of the Canadian Society for Computational Studies in Intelligence (1980), and it is also described in their 1987 book.  The improvement is the ability to use nominal or symbolic variables that take only names or labels as values.  For example the nominal variable "material" may take on values such as "lead", "silver", or "water.”  Values for numerical properties may be associated with the values of the nominal variables, such as the den­sity of lead, which is 13.34 grams per cubic centimeter.  BACON.4 has heuristics for discovering laws involving nomi­nal variables by postulating associated values called "intrinsic properties", by inferring a set of numerical values for the intrinsic properties for each of the postulated nominal values, and then by retrieving the numerical values when applying its numerical heuristics to discover new laws involving these nominal variables.  The laws rediscovered by BACON.4 include: (1) Ohm's law of electrical circuits, where the intrinsic properties asso­ciated with the nominal variables are voltage and resistance, (2) Archimedes law of displacement, where the intrinsic properties are density and the volume of an irregular object, (3) Black's law of specific heat, where specific heat is the intrinsic property, (4) Newton's law of gravitation, where gravitational mass is the intrinsic property, and (5) the law of conservation of momentum, where the inertial mass of objects is the intrinsic property.  BACON.4 was further enhanced so that it could rediscover the laws describing chemical reactions formulated by Dalton, Gay-Lussac, and Comizzaro.  For example it rediscovered Gay-Lussac's principle that the relative densities of elements in their gaseous form are proportionate to their corresponding molecular weights.  Rediscovering these laws in quantitative chemistry involved more than postulating intrinsic properties and noting recurring values.  These chemists found that a set of values could be expressed as small integer multiples of one another.  This procedure required a new heuristic that finds common divisors.  A common divisor is a number which, when divided into a set of values, generates a set of integers.  BACON.4 uses this method of finding common divisors, whenever a new set of dependent values is about to be assigned to an intrinsic property.
          BACON.5 is the next noteworthy improvement.  It uses analogical reasoning for scientific discovery.  BACON.1 through BACON.4 are driven by data in search for regularities in the data.  Furthermore the heuristics in these previous BACON systems are almost entirely free from theoretical presuppositions about domains from which the data are drawn.  BACON.5 incorporates a heuristic for reducing the amount of search for laws in certain special cases, in which the system is given very general theoretical postulates, and then it reasons by analogy by postulating symmetries between the unknown law and a theoretical postulate given to the system as an input.  The general theoretical postulate that Simon gave to BACON.5 is the law of conservation.  The laws rediscovered by BACON.5 using analogy with the conservation law include the law of conservation of momentum, Black's law of specific heat, and Joule's law of energy conservation.
          The BACON discovery system was not the first system developed around Simon's principles of human problem solving with heuristics.  In 1976 Douglas B. Lenat published his Ph.D. dissertation written at Stanford University and titled AM: An Artificial Intelligence Approach to Discovery Mathe­matics as Heuristic Search.  Allen Newell was one of his dissertation advisors, and Lenat acknowledges that he got his ideas from Herbert Simon.  Lenat has since accepted a faculty position in the computer science department of Carnegie-Mellon University.  In 1977 he published "The Ubiquity of Discovery" in The Proceedings of the Fifth International Joint Conference on Artificial Intelligence, (IJCAI) in which he relates Simon's theory of heuristic problem solving in science and describes the specific heuristics in his AM discovery system.  While Lenat's article includes discussion of artificial intelligence in empirical science, his AM system is not for empirical science, but is a computer system which develops new mathematical concepts and conjectures with these concepts.  Also in the 1977 IJCAI Proceedings he published "Automated Theory Formation in Mathematics", which offers a more detailed description of the system’s two-hundred fifty heuristics, and which also discusses his application of the AM system in elementary mathematics.  He reports that in one hour of processing time AM rediscovered hundreds of common mathematical concepts including singleton sets, natural numbers, arithmetic, and also theorems such as unique factorization.  In 1979 Simon published "Artificial Intelligence Research Strategies in the Light of AI Models of Scientific Discovery" in The Proceedings of the Sixth International Joint Conference on Artificial Intelligence in which he considers Lenat's AM system and Langley's BACON systems as useful for illuminating the history of the discovery process in the domain of artificial intelligence itself, and for providing some insight into the ways to proceed in future research and development aimed at new discoveries in that field.  He says that AI will proceed as an empirical inquiry rather than as a theoretically deductive one, and that principles for the discipline will be inferred from the computer programs constituting the discovery systems, although he also notes that in the scientific profession the community members' work in parallel, while in the machines the work proceeds serially.
          BACON created quantitative empirical laws by examina­tion of measurement data.  Simon and his associates also designed and implemented discovery systems, that are capable of creating qualitative laws from empirical data, and three such systems are described in Scientific Discovery.  They are named GLAUBER, STAHL and DALTON.  The GLAUBER discovery system is named after the eighteenth century chemist, Johann Rudolph Glauber, who contributed to the development of the acid-base theory.  Langley developed the discovery system in 1983.  For its historical reconstruction of the acid-base theory GLAUBER was given facts very similar to those known to eighteenth century chemists, before they formulated the theory of acids and bases.  These facts consist of informa­tion about the tastes of various substances and the reac­tions in which they take part.  The tastes are "sour", "bitter", and "salty.”  The substances are acids, alkalis and salts labeled with common names, which for purposes of convenience are the contemporary chemical names of these substances, even though GLAUBER makes no use of the analytical information in the modern chemical symbols.  Associated with these common names for chemical substances are argument names, such as "input" and "output” that describe the roles of the chemical substances in the chemical reactions in which the substances partake.  Finally the system is given names for the three abstract classes: "acid", "alkali", and "salt.”  When the system is executed with these inputs, it examines the chemical substances and their reactions, and then correlates the tastes to the abstract classes, and also expresses the reactions in a general law that states that acids and alkalis react to produce salts.
          The second discovery system is STAHL, which creates a type of qualitative law that Simon calls "componential", because it describes the hidden structural components of substances.  System STAHL is named after the German chemist, Georg Ernst Stahl, who developed the phlogiston theory of combustion.  STAHL recreates the development of both the phlogiston and the oxygen theories of combustion.  Simon states that discovery systems should be able to arrive at laws that have been rejected later in favor of others in the history of science.  And he says that since a discovery system's historical reconstruction aims at grasping the main currents of reasoning in a given epoch, then reproducing the errors that were typical of that epoch is diagnostic.  Like GLAUBER, STAHL accepts qualitative facts as inputs, and generates qualitative statements as outputs.  The input is a list of chemical reactions, and its initial state consists of a set of chemical substances and their reactions represented by common names and argument names, as they are in GLAUBER.  When executed, the system generates a list of chemical elements and of the compounds in which the elements are components.  The intermediate states of STAHL's computation consist of transformed versions of initial reactions and inferences about the components of some of the substances.  When the system begins running, it is driven by data, but after it has made conjectures about the hidden structures, it is also driven by these conjectures, which is to say, by theory.  Simon concludes from the rediscovery of the phlogiston and oxygen theories by STAHL, that the proponents of the two theories reasoned in essentially the same ways, and that they differed mainly in their assumptions.  He also applied STAHL to the rediscovery of Black's analysis of magnesia alba, and he maintains that the same principles of inference were used by chemists quite widely in their search for componential explanations of chemical substances and their reactions.  The principal significance of this diversity to Simon is the demonstration that the reasoning procedures in STAHL are not ad hoc, and that STAHL is a general system.
          The third discovery system that creates qualitative laws is DALTON, which is named after John Dalton.  Like Dalton the chemist, the DALTON system does not invent the atomic theory of matter; it employs a representation that embodies the hypothesis, and that incorporates the distinction between atoms and molecules invented by Avogado.   DALTON is a theory-driven system for reaching the conclusions about atomic weights that BACON.4 derived in a data-driven manner.  And DALTON creates structural laws in contrast to STAHL, which creates componential laws.  DALTON is given information that is similar to what was available to chemists in 1800.  The input includes a set of reactions and knowledge of the components of the chemical substances involved in each reaction.  This is the type of information outputted by STAHL, and DALTON uses the same common-name/argument-name scheme of representation used by STAHL.  DALTON is also told which of the substances are elements having no components other than themselves.  And it knows that the number of molecules in each chemical substance is important in the simplest form of a reaction, and that the number of atoms of each element in a given molecule is also important.  DALTON's goal is to use this input to develop a structural model for each reaction and for each of the substances involved in each reaction, subject to two constraints.  The first constraint is that the model of a molecule of a substance must be the same for all reactions in which it is present.  The second constraint is that the models of the reactions display the conservation of particles.  Simon applied DALTON to the reaction involving the combination of hydrogen and oxygen to form water, and the system outputted a model giving a modern account of the water reaction.  He also considers applying DALTON to elementary particle physics and to classical genetics, but he states that the current version is not adequate to this task.
          Since the publication of Scientific Discovery Simon and his associates have continued their work on discovery systems and have pursued their work into new directions.  While BACON and the other systems described in the 1987 book are concerned mainly with the ways in which theories can be generated from empirical data, the question of where the data come from has largely been left unanswered.  In "The Process of Scientific Discovery: The Strategy of Experimentation" (1988) in Models of Thought Simon and Deepak Kulkarni describe their new KEKADA discovery system, which examines not only the process of hypothesis formation, but also the process of designing experiments and programs of observation. The KEKADA discovery system is constructed to simulate the sequence of experiments carried out by Hans Krebs and his colleague, Kurt Henseleit, between July 1931 and April 1932, which produced the elucidation of the chemical pathways for synthesis of urea in the liver.  This discovery of the ornithine cycle was the first demonstration of the existence of a cycle in the metabolic biochemistry.  Simon and Kulkarni's source for this episode is "Hans Krebs and the Discovery of the Ornithine Cycle" in Federation Proceedings (1980) by Frederic L. Holmes of Yale University.  Holmes also made himself available to Simon and Kulkarni for consultation in 1986 when their study was in progress.  The organization of KEKADA is based on a two-space model of learning proposed earlier by Simon and Lea in "Problem Solving and Rule Induction: A Unified View" in Knowledge and Cognition (1974).  The system searches in an "instance space" and a "rule space", each having its own set of heuristics.  The instance space is defined by the possible experiments and experimental outcomes, and it is searched by performing experiments.  The rule space is defined by the hypotheses and other higher level descrip­tions coupled with associated measures of confidence.  The system proceeds through cycles in which it chooses an experiment from the instance space to carry out on the basis of the current state of the rule space, and the outcome of the experiment modifies the hypotheses and confidences in the rule space.
          One of the distinctive characteristics of KEKADA is its ability to react to surprising experimental outcomes, and to attempt in response to explain the puzzling phenomenon.  Prior to carrying out any experiment, expectations are formed by expectations setters, which are a type of heuristic for searching the rule space, and the expectations are associated with the experiment.  The expectations consist of expected output substances of a reaction, and expected upper and lower bounds on the quantities or the rates of the outputs.  If the result of the experiment violates these bounds, it is noted as a surprise.  Comparison of the course of work of Krebs as described by Holmes and of the work of KEKADA in its simulation of the discovery reveals only minor differences, which Simon and Kulkarni say can be explained by focus of attention shifts and small differences in the initial knowledge with which Krebs and KEKADA started.  The authors also say that a man­ual simulation of the path that Krebs followed in a second discovery, that of the glutamine synthesis, is wholly consistent with the theory set forth by KEKADA.  They therefore conclude that the structure and heuristics in KEKADA constitute a model of discovery that is of wider applicability than the episode used to develop the system, and that the system is not ad hoc.

Simon's Philosophy of Science

          Simon's literary corpus is rich enough to contain a philosophy of science that addresses all four of the basic questions addressed by academic professional philosophers. 

Aim of Science

          What philosophers of science call the aim of science may be taken as a rationality postulate for basic scien­tific research.  Simon explicitly applies his thesis of bounded rationality developed for economics to scientific research in his autobiography in an "Afterword" titled "The Scientist as Problem Solver", although this explicit state­ment would not have been necessary for the attentive reader of his literary corpus.  Simon describes his theory of discovery as a special case of his theory of human problem solving, because both theories are based on his theory of heuristic search.  And he views his theory of heuristic search in turn as a special case of his postulate of bounded rationality.  To this metatheory one need only add that Simon's application of his postulate of bounded rationality to scientific discovery amounts to his thesis of the aim of science.  The function of heuristics is to search efficiently a problem space of possible solutions, which is too large to be searched exhaus­tively.  The limited computational ability of the scientist relative to the size of the problem space is the "computational constraint", that is the factor that bounds the scientist's rationality, constraining the scientist from aiming for anything like global rationality.  The research scientist is therefore a satisficer, and the aim of the scientist is satisficing within both empirical and computational constraints.

Explanation

          Simon's views on the remaining philosophical topics, explanation and criticism, may also be considered in rela­tion to the discovery systems.  Consider firstly his state­ments on scientific explanation including the topic of theoretical terms.  The developers of the BACON systems make a pragmatic distinction between observation variables and theoretical variables in their systems.  Simon notes that contemporary philosophers of science maintain that observa­tion is theory-laden, and his distinction between observational and theoretical terms does not deny this semantical thesis.  He calls his distinction "pragmatic", because he makes it entirely relative to the discovery system, and it is also pragmatic in the sense understood in the contemporary Pragmatist philosophy of science. Those variables that have their associated numeric values before input to the system are considered to be observational variables, while those that receive their values by the operation of the discovery system are considered to be theoretical ones.  He states that in any given inquiry we can treat as observable any term whose values are obtained from an instrument that is not itself problematic in the context of that inquiry.  Thus Langley considers all the values created by the BACON programs by multiplication or division for finding products or ratios to be theoretical terms.  And Simon accordingly calls the values for nominal variables that are postulated intrin­sic properties theoretical terms.
          Unfortunately Simon does not follow through with this Pragmatist relativizing to problem-solving discovery sys­tems, but reverts to the Positivist concept of explanation.  In his exposition of DALTON, which create structural theories, Simon comments that as an area in science matures its researchers progress from "descriptions" to "explanations", and he cites Hempel's Aspects of Scientific Explanation and Other Essays (1965).  Examples of explanations cited by Simon are the kinetic theory of heat, which provides an explanation of both Black's law and the ideal gas law, and Dalton's atomic theory, which provides explanations for the law of multiple proportions and Gay-Lussac's law of combin­ing volumes.  He notes that each of these examples involves a structural model in which macroscopic phenomena are described in terms of their inferred components.  Simon contrasts explanation to the purely phenomenological and des­criptive analyses carried out by BACON.4, when it redis­covered the concepts of molecular and atomic weight, and assigned correct weights to many substances in its wholly data-driven manner.  He affirms that BACON.4's analyses involved no appeal to a particulate model of chemical ele­ments and compounds, and that what took the place of the atomic model were the heuristics that searched for small integer ratios among corresponding properties of substances. This concept of explanation is a reversion to the hypothetical-deductive concept of explanation in which theories are said to “explain” empirical laws by deductive connection, and in which theory and empirical or descriptive generalizations are distinguished by their semantics.              This is what Hanson referred to as the almanac view of science.  On the Pragmatist view theory and empirical description are not distinguished semantically, but are distinguished pragmatically by their use in the problem-solving activity that is scientific research.  Theory is what is proposed for empirical testing, and description is what is presumed for testing.  Explanation is language that is theory after it has been empirically tested and not falsified; or one who speaks of "theoretical explanation" is merely speaking of a proposed explanation.  This is the functional view of the language of science instead of the Positivist almanac view.  Thus given that the discovery systems are problem-solving systems, defining "theory" and "explanation" relative to the discovery system is to define them in a manner consistent with the contemporary Pragmatist philosophy.  And on this Pragmatic view the outputted laws generated by BACON.4 are no less theoretical or explanatory than the outputs of DALTON.

 

Pages [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [Next 10 >>]
NOTE: Pages do not corresponds with the actual pages from the book

 

Web Design by Global Nexchange Solutions