subscribe to our mailing list:
|
SECTIONS
|
 |
 |
 |
Science in the Eyes of a Scientist
By Mark Perakh
Posted on November 3, 2001
Updated August 11, 2003
Contents
Introduction
Features of good, bad and pseudo-science
The language of science
Definitions
The building blocks of science
Methods of experimentation
Data
General discussion of data
Reproducibility and causality
Ceteris Paribus
Errors
Complex systems
Bridging hypotheses
Laws
Models
Cognitive hypotheses
Theories
Science and technology
Science and the supernatural
Conclusion
1. Introduction
In many articles on this site, especially those discussing books and
papers which are devoted to the most recent incarnation of the creationism,
disguised as the so-called "intelligent design theory," or which purportedly
assert the harmony between science and the Bible, the question will repeatedly
arise of what constitutes legitimate science and how to distinguish it from
either bad science or pseudo-science. In this paper I intend to discuss the
above question in general, thus laying foundation for the subsequent discussion
of that question in particular books and articles.
Obviously, this distinction is
possible only if a certain definition of science has first been agreed upon. Unfortunately, it seems that a rigorous definition of science is hard to
offer, and perhaps even impossible, for reasons which may become clear from the
following discourse. In a
non-rigorous and most general way, science can perhaps be defined as a
human endeavor consciously aimed at acquiring knowledge about the world in a systematic and
logically consistent manner, based on factual evidence obtained by observation
and experimentation.
As it should become clear from
the following discourse, it is hard to consistently adhere to the above
definition of science because the latter seems to inevitably encompass elements
beyond the limitations of that definition.
The subject of this article is traditionally discussed within
the framework of the philosophy of science. My intention is to look at the
problem from a different standpoint – that of a scientist rather than of a
philosopher. I envision this as a
glance from within science rather than from a philosophical outside. Such an approach entails considerable difficulty. Whereas philosophers may legitimately avoid discussing the specific nuts
and bolts of scientific work, a glance from within requires analyzing specific
details of the process of scientific inquiry and this may pose a serious
obstacle to non-scientists trying to understand my thesis. I will have, on the
one hand, to avoid the Scylla of engaging in arcane specifics of the modern highly
sophisticated scientific methodology and, on the other hand, the Charybdis of
oversimplification by limiting myself only to the most general notions and thus
actually becoming a dilettante philosopher.
I don't know if I can succeed in such an act of
balance on a thin wire between the two approaches and probably will not be able
to eschew the trap of becoming an
amateur philosopher anyway.
I also realize that it may be
simply impossible to analyze the essence of science as though all of its
branches were characterized by the same set of features. Science is very complex and multifaceted so an analysis which is quite
adequate for some fields of science may happen to be irrelevant in other fields. My approach will be relevant first and most to such fields as physics,
chemistry, engineering disciplines, and biology, while many aspects of my
discourse may be less applicable to medical science, and even less to history,
esthetics, literary criticism and the like.
One of the first questions in the discussion of science is what is the
driving force of science. While
different scientists may be driven by different motivations, I submit that the
main reason for the existence of scientific research is the curiosity which
seems to be inherent in human
nature. This statement does not
imply that every man and woman necessarily possesses such curiosity. On the
contrary, the vast majority of people lack it. As the great Russian poet Aleksandr Pushkin said (he meant only the
Russian people, but his words can probably be applied to humanity as a whole)
"We are lazy and not curious." However,
there has always existed a fraction of people differing from others in that they
are driven by insatiable curiosity. Apparently,
some ethnic groups, at different historical periods and for unknown reasons, had
a larger fraction of such people than others.
It seems that historically attempts to satisfy the curiosity in
question were made in three distinctive ways. These are religion, philosophy and
science.
(As any simple scheme, the above classification does not fully
represent all aspects of the human drive to understand the world. For example,
the above three-part scheme leaves out such important forms of human endeavor as
art. However, this scheme is
convenient for the purpose of the main subject of this article).
At the dawn of civilization, when humankind's knowledge of the
world was extremely limited by the narrow experience within a small locality,
the curiosity about the surrounding world could be satisfied only by means of
religion. For
example, until recently, the Indian tribes in America, as well as native tribes
in Africa and Polynesia, all had certain religious systems, but none of them had
developed either philosophy or science.
Religion is a way of
explaining the world which is not based on evidence but largely on imagination
and fantasy. The religious explanation of the world does not require either
factual evidence or logic. The holy
books of the world's largest religions not only are incompatible with each
other but are also each full of internal inconsistencies and descriptions of
improbable events, but this has in no way impaired their impact on large masses
of people. Although religions often borrowed concepts from each other,
most of them claim the monopoly on truth. They
have an enormous power of survival despite any evidence against their tenets.
As the boundaries of the explored world expanded, the next steps in the
attempts at explaining the world were philosophy and science. It is hard to
establish a definite chronological sequence of their emergence. In some cases
science preceded philosophy. Indeed, well developed science is known to have
existed in the ancient Babylon and Egypt while philosophy does not seem to have
left signs of existence in that remote past. In other cases philosophy and
science seemed to emerge simultaneously, often being for a while
indistinguishable from each other. Whatever
simple scheme can be suggested, it necessarily will be incomplete and contradict
at least some features of the complex relationship between the three ways of
comprehending the world I listed above.
Like religion, philosophy does not necessarily require a factual evidence
(although it may) but, unlike religion, philosophy requires logic. Unlike religion, philosophy rarely claims the monopoly on truth (although
some philosophical theories did just that – an example is the Marxist
philosophy).
Finally, as more factual evidence was accumulated about the world,
science took over as a way to gain knowledge in a systematic matter, trying to
base its conclusions both on factual evidence and logic.
In the course of millennia science underwent multiple
modifications, for many centuries remaining indistinguishable from philosophy.
The modern science which is quite distinctive from philosophy developed
largely during the few most recent centuries.
In a certain sense, the relation between science, philosophy and religion can be
represented, necessarily simplifying the actual situation, by three concentric circles. The inner circle envelopes scientific
knowledge. Its radius is constantly
increasing, as more and more knowledge is accumulated. Beyond that inner circle of scientific knowledge is the concentric circle
of a larger radius, the area between the two circles representing the domain of
philosophy. Philosophy reigns where
science has yet not acquired sufficient knowledge, thus leaving room for
speculations limited only by logic. Finally,
an even larger concentric circle surrounds the two mentioned circles – that of
science and that of philosophy. This is the domain of religion where neither science nor
philosophy has so far managed to offer good explanations of the world. The inner
circle – that of science – is continually expanding at an ever increasing
rate, as science accumulates more and more knowledge. This expansion of science pushes philosophy into a gradually
narrowing domain where it still has the freedom of logical explanations not
based on rigorously collected factual data. This, in turn, pushes religion even
further to the margins of the world's comprehension. The enormous power of survival obviously inherent in
religions is due not to their gradually shrinking explanatory abilities but
rather to their ability to satisfy the emotional needs of man.
2. Features of good, bad, and pseudo-science
We have to distinguish between
"good" and "bad" science as well as between genuine and pseudo-science. While these two dichotomies are of different nature, sometimes it is
difficult to distinguish between bad science and pseudo-science.
Bad science differs from good
science in that it employs the same legitimate approach to inquiry as good
science but at some stage of that inquiry fails, often inadvertently, to follow
the path of an objective and/or logical process and goes astray in its
conclusions. Generally speaking,
good science serves the progress of science itself and of its derivatives in
engineering, medicine, and technologies of various kinds whereas bad science is
rather promptly discarded. Since bad science is usually recognized rather
swiftly, it is often just a nuisance and is rarely so harmful as to seriously
impede the progress of good science.
A case of bad science can be
exemplified by the story of the so-called cold fusion. The discovery of cold fusion was announced by two groups of
scientists, of which one consisted of professors of electrochemistry Martin
Fleischman and B. Stanley Pons and
their associate M. Hawkings, and the other was a group headed by Professor
Steven Jones.
Whereas I never heard of Pons,
Hawkins, and Jones before they announced their discovery, I was
acquainted with Professor Fleischman. In the seventies, we both served as
members of the Council of the International Society of Electrochemistry (of
which Fleischman later served as president). I also twice visited the University of Southampton where Fleischman was a
professor of electrochemistry, and gave a talk there. Since Fleischman's research was in a field rather remote from my own, I
was not really familiar with his scientific production, but I knew of his
reputation as a serious electrochemist. Therefore, when I first heard about cold
fusion I was both greatly impressed and puzzled. I was impressed by the mere claim that two electrochemists, using an
experimental set-up which only could be characterized as primitive, reportedly
achieved a revolutionary result of immense importance. I was puzzled because
that result seemed rather unlikely (in particular because these researchers did
not observe emission of neutrons from an alleged nuclear fusion process, and,
moreover, their description of the experimental procedure seemed to be too
vague.) On the one hand, it seemed hardly probable that
experienced scientists would have made such an extravagant claim without having
meticulously studied the claimed phenomenon, and on the other hand, it seemed
equally hardly probable (although not impossible) that their claim was true. Soon, several
groups of researchers were busy with verification of the sensational cold fusion
reports. Most of them rather rapidly came
to the conclusion that the cold fusion effect could not be reproduced. The sensation died and most of the researchers discarded the cold fusion
report as bad science. (Despite the disappointment by the evident
irreproducibility of the initial results by Fleischman and Pons, some
research in that direction is still going on, and regular conferences
devoted to that subject are being held; although no actual cold fusion has been
reported so far, these efforts may be useful if we remember a saying attributed
to Rutherford. This famous physicist was once asked why he allowed one of the
researchers in his lab to conduct an obviously hopelessly futile study.
Reportedly, Rutherford answered: "Let him do it. He hardly will find what
he is looking for but maybe he'll find something else which happens to be
useful.")
Pseudo-science,
on the other hand, is an endeavor which essentially is not real science but
something else disguised in quasi-scientific clothes for the sake of a certain
agenda, usually having nothing to do with science and often hostile to science. It may be quite detrimental to genuine science because of its ability to
successfully disguise itself as genuine science and thereby forcing on science
an unnecessary defensive effort requiring the spending of valuable human
resources needed for the progress of genuine science. A little later in this article I will suggest some criteria for
discriminating between genuine and pseudo-science and suggest appropriate
examples.
It seems useful to distinguish
between the language of science (and definitions
as a part of it) and its building blocks.
3. The language of science
The language of science is
overwhelmingly mathematical. First,
this statement can justifiably be viewed as a simplification. The
interaction between science and mathematics is neither straightforward nor
one-dimensional. Second, an adequate analysis of mathematics as a
predominant language of science would require a preliminary analysis of what is
a language. This would, though, lead well beyond the scope of my topic. Third,
this
statement may be disparaged as allegedly dismissing non-mathematically expressed
knowledge as non-science. Of
course, it is just a question of semantics. Is psychology a science? Is history a science? Surely, these and many
other fields of inquiry are legitimately listed among sciences even though
mathematics has so far found only a limited room for itself within those
sciences. (A different story is that mathematics finds its way into
those sciences via the ever increasing penetration of computers and artificial
intelligence.) What is more
important, the path these sciences follow seems to repeat that traveled by
physics, chemistry, and biology. Centuries
ago, actually before Johannes Kepler and Galileo
Galilei, physics used little mathematics and was largely a descriptive science,
much like psychology or history are today. It took time for the marriage of physics and mathematics to occur, but
when it happened, the impetus for the development of powerful theories of
physics was enormous. "Mathematics
is a language," said the great American physicist Josiah Willard Gibbs. Modern mathematics is so multifaceted that it is probably more proper to
speak about mathematical languages in plural.
Mathematics is, though, more than a mere language (or languages). It is a compressed logic immensely facilitating a theoretical
comprehension of experimental facts. Without
mathematics, contemporary physics would not exist. Hence, while saying that the language of science is mathematics, this is
fully applicable to the physics of today but also, probably, to the psychology
or history of tomorrow.
The mathematical language is overwhelmingly used in science not
because it is more powerful than the polymorphous natural vernacular. The opposite is true – the polymorphous natural language
is, generally speaking, more
powerful. The use of mathematical
language in science is dictated not by its power but by necessity, as will be
discussed in more detail when we turn to the concept of definition. Actually, whereas mathematical language immensely facilitated
and accelerated the development of science, toward the middle of the 20th
century it has become evident that the scientific study of the so-called large
or poorly-organized systems (which will be discussed in subsequent sections) is
sometimes impeded by a strict adherence to the rigorously enforced use of a
uniform mathematical language. The
sharp demarcation between natural language with its inherent ambiguity of terms
and the rigorous uniform mathematical language started gradually diffusing.
Hence, although the mathematical language is still the prevalent,
largely preferred and often simply necessary way of scientific discourse,
polymorphous language gradually finds its way into that discourse, a phenomenon
which is often viewed simply as a modification of the mathematical language
itself. Indeed, mathematics
constantly develops new tools (such a chaos theory, non-standard logic, and the
like) which help it catch up with the needs of physics and other disciplines in
refining their mathematical representation.
The question of why mathematics
is applicable in physics puzzled many scientists and philosophers alike.
Physics is a discipline in
which mathematics is so intertwined with the body of that science that often it
seems hard to see where physics ends and mathematics starts. Physics and mathematics have historically developed, constantly
exchanging ideas and methods. The greatest mathematicians often happened to be also
outstanding physicists and vice versa. Isaac Newton developed some of the most
fundamental theories in physics and also invented a revolutionary part of
mathematics – differential calculus. Leonard
Euler made enormous contributions both to mathematics and to physics. New mathematical methods which seemed to be of a highly abstract nature
(like functional analysis) swiftly found their way into specific fields of
physics while methods developed to tackle specific problems of physics (like
vector algebra and then vector calculus originally invented in hydrodynamics)
very soon were generalized to become chapters of abstract mathematics.
There is something fascinating
in that mysterious usefulness of mathematics in physics despite the obvious
difference in their treatment of objects. Here
is a simple example. We all know that matter has a discrete structure. It
consists of tiny particles. It is impossible to visualize these particles since
their behavior is enormously different from the behavior of the macroscopic
bodies we see, hear, taste, smell and touch. Whatever these particles really are like, we know that matter is not
continuous. However, we routinely
apply in physics an abstract mathematical concept of infinitesimally small
quantities, like infinitesimally small electric charge and the like. This mathematical trick works very effectively despite its
obvious contradiction of our knowledge of the discrete structure of matter. The concept of an infinitesimally small charge is contrary to the
reliably established facts of the discrete nature of electric charge.
I believe the described
discrepancy between mathematics and physics does not undermine the extreme
usefulness of mathematics in physics for the same reason that the use of
physical models does not undermine the study of real objects. One of the following sections will discuss the use of models in science
in detail. Preempting that
discussion, we may note that models differ from the real objects in that the
models share with the real objects they represent only a small fraction of
properties while ignoring the overwhelming majority of the latter. A good model shares with the real object only those few properties which
are crucial for the problem at hand. For example, in Newton's celestial
mechanics the model of a planet is a point mass. It has only one property common
with a real planet – its mass. Replacing
a real object with its model, while introducing a certain, usually insignificant
imprecision, enables scientists to tackle otherwise enormously complex problems. Similarly, applying an abstract mathematical concept such as
infinitesimally small charge, is similar to using a model in which the discrete
structure of the real electric charge is ignored as inconsequential for the
problem at hand. Since those properties of an electric charge which are
crucial for the problem at hand are accounted for in the theory assuming a
continual structure of the electric charge, the powerful mathematical machine
works very well, immensely facilitating the analysis of the behavior of real
charges.
Therefore
the contradiction between abstract mathematical concepts and the real properties
of the objects of study in physics is more of philosophical than of practical
significance.
Still, the amazing usefulness
of mathematics in physics seems to be a vexing problem for curious minds. It has
so far not received a commonly accepted interpretation, although more than once
various explanations have been suggested.
I believe that the fact of
mathematics being the language of science can be understood through a rather
simple and general idea. One of the
requirements for an approach to be genuinely scientific, which is not limited to
science alone, but is equally valid for any discourse which may be viewed as
reasonable, is to be logical. Logic
itself cannot serve to distinguish between correct and incorrect statements,
because logic is simply the proper path from a premise (or premises) to a
conclusion. The logical conclusion
is only as good as the premise. However,
if the premise is correct the conclusion reached via a logical procedure is
reliable. Therefore it is
imperative to use logic at every stage of a scientific inquiry. Mathematics is the most powerful and the most concise form of logic. Naturally, science had to resort to mathematics as an extremely powerful
tool of logical discourse.
Leaving this philosophical
problem aside, we may assert, that, generally speaking, mathematics is the
proper language for science.
4. Definitions
In
many of the following articles, the question will arise of what constitutes a
proper definition of the concepts under discussion (see, for example, A Consistent Inconsistency,
sections on probability and complexity). Therefore it seems appropriate to
provide first a general discussion of what the features of proper definitions
are and which types of definitions can be acceptable in a scientific discourse.
Obviously,
a fruitful scientific discourse is possible only if certain definitions are
agreed upon. Proper definitions are therefore of the utmost importance and
introducing them requires the maximum possible care.
It
seems a platitude to say that a definition is essentially an
agreement regarding the meaning of a term.
In the everyday discourse, we are rarely concerned
with the precise definition of words
used in the vernacular. The vagueness of the vernacular is actually its
advantage, supplying flexibility to the discourse and enlivening the latter. The vagueness of the vernacular is also connected to its redundancy,
which facilitates communication. Redundancy,
providing a certain amount of excessive information (in the layman's sense of
that term), helps to compensate for the possible distortion of the message which
is due to the unavoidable noise accompanying the transmission of a message.
Scientific terms, on the other
hand, usually require precise definitions and this is one of the features which
distinguish scientific discourse from an everyday chat.
A term may have quite different
definitions depending on the branch of science in which it is being utilized.
For example the term vector in
mathematics and its applications in physics means a quantity fully determined by
its magnitude and direction (as opposite to a scalar which is determined by its magnitude alone, without any
reference to direction). On the other hand, in pathology the same term, vector,
means an organism, such as a mosquito or tick, that carries disease-causing
microorganisms from one host to another. This example illustrates the utmost
importance of rigorous definitions of concepts used in each field of science.
Science is a collective
enterprise. It is impossible without communication within the community of
participants in that enterprise. Communication
is predicated on the acceptance of certain common definitions of concepts of
which the edifice of a particular science is composed.
Formulating a definition of a
concept is a logical procedure which, although it may vary in particular
details, is subordinated to certain general rules common for all the variations
of definitions. One of the most
salient features of a definition is that it presupposes a certain preliminary
knowledge relevant to the subject of definition. No concept can be defined
unless the meaning of certain underlying concepts has already been agreed upon. If the definition of a concept is attempted in the absence of some
underlying relevant knowledge, such an attempt will not provide a legitimate
definition, but at best a quasi-definition which will necessarily be ambiguous
and logically deficient.
A corollary to the last
statement is that every field of science starts by introducing a seminal concept
for which no real definition can be given. The word "starts" in that sentence implies a logical rather than
chronological order of steps in building up a field of science.
There are several possible
classifications of definitions.
One such classification is
based on the logical structure of a definition. According to that
classification, three types of definition can be distinguished. They are a) Deductive definition; b) Inductive definition, and c)
Descriptive definition.
A deductive definition has the
form of a triad. It necessarily
comprises three elements which are: a) a general
concept which encompasses a certain set A of elements and which is assumed to be
commonly understood within the appropriate circle of the participants of a
discourse; b) a particular concept
which is the target of the definition and which encompasses a certain subset B
of elements of the set A, and c) qualifiers
which specify those features of the elements of subset B distinguishing the
latter from all other elements of set A not included in subset B. Obviously, for a deductive definition to be meaningful, the participants
of the discourse must have a preliminary understanding, assumed to be identical
for all participants of the discourse, first of the general concept which
determines set A and second of the qualifiers characterizing subset B.
A deductive definition also
usually contains sentential elements, either explicitly or implicitly, in the form of "such.... that," "that (those)...which (who)" and the
like. These sentential elements can be omitted but they are nevertheless
implied.
For example here is a deductive
definition of the term "colt."
A
colt is such a
horse that is a)
young, and b) male.
In this definition the general concept is that of
"horse." It is assumed to be
known to all participants of the discourse. If it were not known, the definition
would make no sense as it would define the target - "colt" – through an
unknown concept requiring its own preceding definition. By the same token, the qualifiers, of which there are two in
this case (young and male) are assumed to be commonly understood by all participants of
the discourse, otherwise the definition of a colt would be useless. All horses,
regardless of their being colts or not colts (i.e. fillies, geldings, stallions
and mares) are elements of set A, while only those horses which meet the
qualifiers (i.e. are male and young) are
elements of subset B.
Here is another example.
Daughters
of Ann and Andrew are those of their
kids who are female.
I have intentionally chosen an
example which sounds primitive in the extreme because, despite its simplicity,
it is representative of a deductive definition.
According to the goal
of a definition, deductive definitions can be of three basic types, a)
phenomenological, b) taxonomic, and c) explanatory definitions. The distinction between these three types of definitions may sometimes be
quite clear, while in some other cases this distinction may be blurred. For example, a taxonomic definition may be at the same time also either
phenomenological or explanatory. Also, a definition may be viewed as either
phenomenological or explanatory depending on the requirements to the degree of
its explanatory power.
The distinction in question is
determined not as much by the contents of a definition as by its goal.
A taxonomic definition is
always a member of a set of "parallel" definitions all of which contain the
same general concept but differ in qualifiers. For example, the definition of a colt
discussed above is taxonomic, as it is a member of a set, including also
definitions of filly, mare, stallion
and gelding.
From another standpoint,
deductive definitions can be either qualitative
or quantitative.
Let us review an example of a
definition taken from the elementary course of physics: Semiconductors are those
materials whose electric conductivity
increases with temperature.
In this case set A comprises all materials whereas subset B comprises only semiconductors, while the qualifier is a
certain feature of the behavior of the elements of subset B which distinguishes
them from all other materials – namely the temperature dependence of their
electric conductivity. This definition belongs to the phenomenological type. It determines the target of definition by specifying a certain behavior
of elements of subset B without providing an explanation of why they behave as
they do. Assume now that we
need a definition of "semiconductor" which is of explanatory type, i.e.
containing a reference to the mechanism underlying the behavior of conductivity
described in the phenomenological definition. We discover that the mechanism in question is not the same for all
semiconductors, so that there are three distinctive classes of semiconductors,
in all of which electric conductivity increases with temperature, but the
underlying mechanisms of that phenomenon are different. Therefore, in order to
provide an explanatory rather than a
phenomenological definition of a semiconductor, we have to first introduce
taxonomic definitions for each of the three classes of semiconductors. One such class is the so-called intrinsic semiconductors (the two other
classes are the donor-type or n-type semiconductors, and the acceptor-type or
p-type semiconductors). A taxonomic
definition of the intrinsic semiconductors can be given as:
An intrinsic semiconductor is such a semiconductor that
is not doped by either donor or acceptor elements. (The term doping
means deliberately inserting into a material A small amounts of another
material, B, which alters the properties of A in a desired manner). Now,
as the concept of an intrinsic
semiconductor has been taxonomically defined, we can give an explanatory definition
of that concept: Intrinsic semiconductors are such
materials in which the energy gap between the conduction and the valence energy bands is narrow.
This
definition is better in that it specifies the intrinsic property of elements of
subset B which predetermines their behavior. However this definition, while
explanatory, is only qualitative. A
still better scientific definition would be quantitative. It would numerically specify
the meaning of the term "narrow."
If either a general concept
encompassing set A or the qualifiers distinguishing subset B from the rest of
set A are unknown, the triad-like deductive definition is impossible.
The first type of situation is
common if a seminal concept of a science is to be chosen. Since at the starting point of a new field of science there are yet no
defined concepts in the arsenal of that science, its starting concept cannot be
deductively defined. In this case the first element of the triad is absent, hence
the triad cannot be logically constructed. A good example is the concept of energy in physics.
Energy is the most fundamental concept in physics. It is also its
most seminal concept (not in a chronological but in a logical sense). In a
certain sense, physics is a science about energy transformations. The law of
energy conservation is the most fundamental basis of the science of physics.
However, since physics has no concept which is more general than energy, the
latter cannot be deductively defined.
Of course, one can find in textbooks on physics, especially those
published many years ago, any number of alleged definitions of energy, but most
of them are logically deficient and are not real rigorous definitions but rather
attempts at explaining the meaning of that concept without really defining it in
a logically consistent way. Some of
those quasi-definitions are plainly absurd. One can see in some old textbooks such quasi-definition of energy as one
asserting that energy is "the ability of a body to perform work," which is a
meaningless statement, appealing to the laymen's understanding of the term
"work."
The famous physicist Richard
Feynman, in his excellent course "The Feynman Lectures on Physics,"
prudently avoided attempts to define energy in a rigorous way, providing instead
an explanation of the term energy as
"a certain quantity, which we call
energy, that does not change in the manifold changes which nature undergoes."
(R. P. Feynman, R.B. Leyton and M. Sands, "The Feynman Lectures on Physics,
Addison-Wesley Publishing Co, 1963, v.1, p. 4-1). This is a descriptive
definition which cannot be replaced by a rigorous triad because we lack any
concept which is more general than energy, so the vague term a
certain quantity is utilized to avoid the unsolvable problem of defining the
seminal concept.
Another case when the rigorous triad-like deductive
definition is impossible is when no qualifiers are known.
In such cases we know the general concept and know that, in principle,
the target of the definition is a sub-category within that general concept, but
we don't know what distinguishes that sub-object from other sub-objects within
the same general concept. For
example, let us say John and Mike
are identical twins. Obviously they are two different persons, both being,
though, sons of Andrew and Mary. However, we don't know how to distinguish
between John and Mike so we can't provide a triad forming the proper
definition of either who Mike is or who John is. Any attempt to provide a
definition, say, of Mike, will result in a tautology which, although necessarily
true, is of no substance (for example, an assertion that Mike is he of the twins
whose name is Mike). At best, such
a definition can be of what can be called "accidental" type, which is not
really useful because it refers to some irreproducible (accidental) qualifiers
and therefore cannot be applied under conditions which are in any way different
from the specific situation used for generating such qualifiers. For example,
such a definition can sound like: Mike is that
son of Andrew and Mary who on
Tuesday, January 3, 1978 opened the door of their house when we rang the bell. Although
that definition formally is in the form of a deductive triad, its qualifier is
of an accidental type and cannot be used to distinguish between the twins at any
time other than Tuesday, January 3. 1978.
While a deductive quantitative
explanatory definition is the most desirable scientific definition, quite often
we have to satisfy ourselves with different types of definitions. The reason for that is not necessarily the principal impossibility of
formulating a deductive triad, but often simply the history of the development
of a certain range of concepts. If
an experimental work generates a set of data, this often provides the basis for
formulating an inductive definition of a new concept which becomes
indistinguishable from a specific type of law
of science. (We will discuss
the concept of laws of science in detail a little later, but at this point we
can note that a law of science which results from a process of induction is not
equivalent to those laws which constitute parts of a scientific theory. Inductive laws, which are often no more than inductive definitions, belong to the phenomenological type. They state facts without explaining them.)
Inductive definitions which
spell out a law can often be characterized also as relational definitions – they state a certain relation between two
or more concepts. For example,
assume a scientist studied the behavior of the metal Gallium at various
temperatures. Repeating the tests
many times, she found that at a pressure of 105 Pascal,
pure Gallium melts at 302.5 Kelvin. She postulates that her data, consistently showing that result, are not
accidental but reflect a law. She
formulates a law: "Pure Gallium melts at 302.5 K
if the pressure is 105 Pascal."
Her postulate resulted from the procedure of induction wherein she proceeded
from a set of individual events to generalizing it as a law. Her formulation defines a law
and therefore is also a definition. It
is a relational definition as it establishes a relation between a certain event
(melting of Gallium) and two factors – temperature and pressure. It is also a phenomenological definition since its states a fact without
explaining its nature.
Even if an inductive definition
is not stating a law, the path to such a definition is similar to the procedure
of scientific induction wherein the results of observations or experiments
repeated many times are generalized, postulating that the data in question are
the manifestation of a law of nature. We will discuss the transition from a set
of observational or experimental data to a postulate of a law in detail in a
subsequent section. At this point,
let us simply illustrate the idea of inductive definition by an example.
Imagine that a zoologist whose name is Johns went to explore
jungles in, say, some part of Africa, and discovered there a number of animals
hitherto unknown. Let's say all these newly discovered animals happened to be
various kinds of cats – cat M, cat N, cat L, etc. Each of these cats differs
in certain characteristics from other cats, but all of these cats have certain
features in common distinguishing them from cats found in other parts of the
world. The zoologist suggests a name for this newly discovered subfamily of
cats, say Johns's cats. Johns
provides a detailed description of each of the kinds of John's cats – M, N,
L etc. He has then to provide a
definition of the new subfamily of cats as a whole. He defines them as follows:
"Johns's cats is a subfamily of cats comprising cats M, N and L." He may also provide a description of the features which are common for
all Johns's cats, but are absent in all other cats (for example, all Johns's
cats may have a peculiar shape of their ears, which is not observed in any other
cats). This is an inductive definition, generalizing subcategories
into a broader super-category.
Inductive
definitions, by their nature, are open
definitions, in that the list of their elements can be expanded if new elements
are discovered which can be included into the definition. On the other hand, deductive definitions are closed
in that they encompass all possible elements which can be included in the
definition.
The
lack of a rigorous definition, agreed upon by the participants of a discourse,
may render an otherwise sophisticated discourse too vague to be genuinely
scientific and may deprive even a very ingenious argument of its evidentiary value.
5. The building blocks of science
The principal building blocks of science are
1) Methods of
experimentation,
2) Data,
3) Bridging hypotheses,
4) Laws,
5) Models,
6) Cognitive hypotheses,
7) Theories.
A warning seems to be in order.
The above list may create the impression that science can be presented by a neat
straightforward scheme, as a combination of clearly distinguishable building
blocks. No such neat scheme exists. The above listed building blocks of science can overlap, intersect and
emerge in an order different from the above list. However, the division of the body of science into supposedly separate
building blocks, besides providing convenience in analyzing the extremely
complex and multifaceted form of human endeavor we call science, also reflects
the real composition of science, even though the boundaries between its
constituents may sometimes be diffuse.
We will discuss the above
concepts in detail. Before doing that, note that for each of the above elements
of good science there is a corresponding element of bad science, i.e. bad
methods, bad data, bad hypotheses, bad laws, bad models, and bad theories. On
the other hand, pseudo-science differs from genuine science not just because one
or several of the above building blocks can be characterized by adding the term pseudo
(which, of course, is possible in principle) but because pseudo-science
typically lacks legitimate data. Since data are that part of science which provides evidence, the absence
of real data means the absence of evidence which would support the hypotheses,
laws, models and theories. A
hypothesis, law, model, or theory suggested without the real supporting data is
typical of pseudo-science.
The statements in the preceding
paragraph require clarification of what distinguishes good data from bad data
and from wrong data. This question
will be discussed in the section on data.
Whereas bad science is usually
swiftly recognized as such, pseudo-science sometimes features a strong power of
mimicry, disguising it as genuine science. The litmus test enabling one to distinguish genuine science from
pseudo-science is in looking for the data underlying the hypotheses, laws,
models, and theories of the supposed science. Quite often, discovering no data, i.e. no evidence supporting those
hypotheses, laws, models and theories, serves as an indication that we are
dealing with pseudo-science, regardless of how sophisticated the hypotheses,
laws, models and theories in question may seem to be and regardless of how
eloquently they are presented.
Review a few examples of
pseudo-science. Probably the most
egregious example of a pseudo-science which had tragic consequences on an
enormous scale is that of Marxism. Created in the 19th century Europe
mainly by Karl Marx and Friedrich Engels, it so successfully disguised as
science that it won a large number of adherents despite its obvious
contradiction with facts. Marxism
comprised several parts which can be roughly defined first as a philosophy of
the so-called dialectical materialism, second as the alleged historical analysis
of society's development (the so-called historical materialism) and third, as
the Marxist political economy. The dialectical materialism was an eclectic
philosophical theory combining elements of Feuerbachs's materialism and
Hegel's dialectics. Neither Marx
nor Engels contributed much to the development of their predecessors'
philosophical ideas but they reformulated those ideas in the simplified form of
a neat set of simple statements which could be much easier comprehended by
non-philosophers (such as a vulgarized version of Hegel's principles of the
transition of quantity into quality or negation of negation, etc). (It is
interesting to note that a frequent feature of pseudo-scientific theories is
that they often suggest a neat and simple scheme allegedly representing complex
reality). The core of Marxism was though their theory of the class struggle
whose essence they themselves succinctly expressed by the maxim that the history
of mankind is the history of the struggle of classes. That theory reduced all the complexity of human history to one factor –
the struggle of economic classes. Marx
and Engels did not seem to be worried by the fact that their representation of
history was completely contrary to the histories of such countries as, for
example, China or India, and
reflected even the history of Europe only in a rather strained way. Such absurdities as interpreting religious wars as the results of the
struggle of economic classes did not seem to make the creators of "scientific
Marxism" pause and allow for the role of any factors besides the class
struggle in the history of the mankind. Their extremely narrow concept ignored a
multitude of various factors that affected the history of the human society. In other words, they arbitrarily chose a very narrow subset of data from
the much wider set of available data in order to fit the data to their
preconceived theory. This is a
classic example of pseudo-science, whose main fault is the absence of a
sufficient scope of data so that the theory is built accounting for only a
deliberately selected tiny fraction of factual evidence. Since, however, Marxism
had all the appearance of a scientific theory, its predictions won wide
popularity, with one of the consequences being the bloody revolution in Russia. Although the actual revolution and its consequences quite obviously did
not fit the blueprint predicted by Marxism, the latter acquired the status of a
godless religion, believed in with a fanatical stubbornness despite its obvious
futility and the blood bath it caused. Indeed, like in religions, one of the
persistent claims by Marxism was that it was "omnipotent because it was
true."
Another example of a
pseudo-science is Lysenko's pseudo-biology which was imposed by decree of the
Soviet communist rulers as the only true biology compatible with
Marxism-Leninism. Lysenko's
theory was a complete hogwash with no basis in factual data whatsoever. Its
imposition also had tragic consequences, as many genuine scientists were
arrested and perished in the Gulag because they tried to refute Lysenko's
pseudo-science by pointing to the numerous data which it plainly contradicted.
One more, this time rather
comic, example was the allegedly revolutionary theory of viruses suggested in
the late forties in the USSR by a semi-literate veterinary physician Boshian. According to that theory, which was officially approved by the communist
party, viruses constantly convert into bacteria and vice versa. Of course,
Boshian's theory was based on fictional data. When,
after many unsuccessful attempts, a commission of scientists finally gained
access to Boshian's secret laboratory, all they found, instead of samples of
the alleged virus-bacteria colonies, was plain dirt.
Other examples of
pseudo-science include the so-called "creation science," with its arrogant
distortion and misuse of facts, as well as its more recent and more refined
reincarnation under the label of the "intelligent design theory."
This theory, promoted by a large group of writers,
including many with scientific degrees from prestigious universities and with
long lists of publications, and propagated on various levels of sophistication,
has all the appearance of scientific research, as it offers definitions,
hypotheses, laws, models, and theories like a genuine science. What is, though, absent in the "intelligent design theory," is
evidence. No relevant data which
support hypotheses, laws, models, and theories could be found in the papers and
books by proponents of intelligent
design, only unsubstantiated assumptions. Therefore
it can justifiably be viewed as pseudo-science. I will also touch on this
subject in the section on the admissibility of the supernatural into science, as
well as in other articles on this website.
6. Methods of experimentation
The statement that methods of experimentation are the engine of the
progress of science seems to be quite trivial Whereas this statement is especially transparent in physics, chemistry,
biology and in engineering sciences, it is valid as well for any other science
even if it may sometimes be not as obvious as it is for the listed fields of
inquiry.
When Galileo built his telescope, this provided an enormous
impetus to the development of astronomy. When
the Hubble telescope was put in orbit, astronomy underwent another powerful push
forward. When Antony van Leeuwenhoek built his microscope, it revolutionized
biological science. When the first
electron microscope was built, it again revolutionized several fields of
science, leading to the amazing modern achievements
like actually seeing and even manipulating individual atoms.
These well known examples are only the tip of the iceberg, because
the progress of science is pushed forward by the multiple everyday inventions of
various ingenious methods of experiment or observation.
In thousands of research laboratories scientists whose names are
usually unknown to anybody except for the narrow circle of their colleagues,
every day apply their ingenuity and inventiveness to creating new, ever more
subtle methods to question nature. Without
these often extremely ingenious experimental methods and highly sophisticated
experimental set-ups there would be no progress of scientific penetration into
objective reality, which often guards its secrets very jealously.
There are two paths experimental science takes to progress.
One path is the ever increasing capabilities of devices designed
as versatile tools of scientific inquiry and of engineering or medical
development, such as electron microscopes, telescopes, and electronic devices
(signal generators, potentiometers, recorders, oscilloscopes, etc).
The other path is the design of unique experimental contraptions
designed for specific experiments and measurements.
These two path intersect at many points, as designing a specific
new experimental contraptions is often mightily assisted by the availability of
the more advanced tools of a versatile type.
Whereas inventions such as a microscope or radio transmitters and
receivers justifiably gain wide publicity, many amazingly ingenious methods
invented by scores of not-very-famous scientists usually remain unknown except
for a narrow circle of researchers working in that particular field of science.
Here is a typical example. In the late seventies, at one of the
foremost research institutions in the USA, a group of highly qualified and
trained scientists was developing certain types of magnetic memory for
computers. In one part of that
study a magnetic wire was pulled at a certain speed through a tube filled with
an electrolyte, and certain electrochemical processes on the surface of the wire
were investigated. The researchers
encountered a problem: as the electrolyte was pumped through the tube, the
friction between the electrolyte and the walls of the tube caused the
pressure's drop along the wire and this distorted the data. Considerable
effort was invested in order to somehow eliminate or at least alleviate the
described effect, but the effect persisted. Then a guest scientist arrived who
has never before dealt with the experiment in question. At the very first meeting where the detrimental effect of the pressure
drop was discussed, the guest scientist looked at the diagram of the tube and
the wire moving along the latter, and said: "Why, instead of pumping the
electrolyte through the tube, just eliminate the pump, make a rifle-type spiral
groove on the inner surface of the tube and put the tube into rotation. The groove will push the liquid with a pressure which will be
automatically uniform all along the wire."
This is a very simple example of how a fresh look at a problem may
suddenly solve a seemingly difficult problem in a very simple way, but this
simple example is typical and exemplifies those thousands and thousands of small
and sometimes not very small inventions and innovations which occur every day in
numerous research laboratories.
The unstoppable development of experimental methods, from designing giant
accelerators of elementary particles to small improvements in measuring and
observational methods, is what underlies the progress of science.
Therefore, the statement that a revolution in science starts when the
experimental technique proceeds to the next digit after the decimal point
reflects an important aspect of the progress of science.
7. Data
a) General
discussion of data
In the words of the famous Russian physiologist Ivan Pavlov, facts are
the bread of science. The term data is
synonymous with facts. It is also
synonymous, in a certain sense, with evidence.
Without data there is no science, only pseudo-science. In order to function as evidence, data must be reliable. There are two main types of data, observational
and experimental. Both are
equally legitimate, the difference being in that observational data are usually
obtained in a passive manner, by merely recording whatever nature deigns to
display by itself, whereas experimental data are gained as the result of a
deliberate procedure of a planned research in which specific conditions are
artificially created to force nature to display answers to specific questions. The demarcation between observation and experiment are not quite sharply
defined since observation may be a natural part of a designed experiment.
Observation, and even some primitive form of experimentation, may
happen to occur in a completely non-scientific activities. In a popular comic Russian poem for children, a Russian orthodox priest
counts ravens sitting on trees, just for the heck of it. It is an observation and it is accompanied by a measurement, but this
does not make the priest's activity scientific. Scientific observation or
experimentation, while driven mainly by curiosity, always has an either
conscious or subconscious purpose – to establish facts which may shed light on
the intrinsic structure and functioning of the real world.
A crucial element of a scientific experiment is measurement. Measurement makes data quantitative, thus enormously
enhancing the data's cognitive value. There is a theory of measurement which
teaches us about the precautions necessary to ensure the reliability of the
measured quantities and about the proper estimate of unavoidable errors of
measurement. This theory is beyond the scope of this essay, but we have to
realize that, on the one hand, the data obtained via a properly conducted
experimental procedure are reasonably reliable, but on the other hand they are
always true only approximately and cannot be relied upon to an extent exceeding
that determined by the margin of a properly estimated error.
Consider some examples.
Astronomy is a science wherein observation is by far more
prevalent than experimentation. A
classic example of highly valuable observational data is the tables of the
planets' positions compiled by Tycho de Brahe over the course of many years of
painstaking observations accompanied by meticulous measurements. These tables, after they wound up in the possession of
Johannes Kepler, served as the data the latter used to derive his famous three
laws of planetary motion. We will
discuss later the transition from data via hypotheses to laws, and from laws to
theories. My point now is to
illustrate how reliable data,
gained via a proper observational procedure accompanied by measurement became a
legitimate part of a scientific arsenal.
A classic example of the experimental acquisition of data is the
discovery, in the early years of the 20th century, of
superconductivity by a group of researchers in Leyden, the Netherlands, headed
by Heike Kamerlingh Onnes. These
researchers systematically measured the electric resistivity of various
materials at ever lower temperatures. In
a specially designed experiment, samples of material were gradually inserted
deeper and deeper into a Dewar flask on whose bottom there was a puddle of
liquid helium, while their electric resistance was measured. Analysis of the obtained data led to the conclusion that at certain very
low temperatures the electric resistivity of certain materials dropped to zero. This unexpected result seemed contrary to the common understanding of
electric resistance prevalent at that time. However, data take precedence over theories. The scientists of Leyden, however puzzled and astonished by their data,
came to believe in those data's reliability and thus announced the discovery
of superconductivity. Their data
were then reproduced by other scientists and a new law asserting the fact of
superconductivity was formulated. The
formulation of a law did not mean a theory of superconductivity was forthcoming. It took half a century until a theory of superconductivity was developed. Neither the law in question nor the theory of that phenomenon would be
possible without first acquiring reliable data on the behavior of
superconducting materials, these data acquired in a deliberately designed
specific experiment.
As we will see, the path from data acquisition to a law can be
quite arduous and prolonged. It involves steps necessarily requiring imagination
and inventiveness, because no law automatically follows from data. This is even
more true for the path from data, via law, to theory. However, only those laws and theories which stem from
reliable data are constituents of genuine science. (The question of what constitutes reliable
data will be the subject of subsequent sections wherein the reproducibility of
data and the problem of errors will be discussed.)
I would like to illustrate the last statement by an example. In
1994, three Israelis, Witztum, Rips, and Rosenberg (WRR), published a paper in
the mathematical journal "Statistical Science." In that paper they claimed to have discovered in the Hebrew text of the
book of Genesis a statistically significant effect of the so-called "codes." According to WRR, the codes in question contained information about the
names and dates of birth and death of many famous rabbis who lived thousands of
years after the book of Genesis was written. If WRR's claim were true, its only explanation could be that the author
of the book of Genesis knew the future. This
would be a powerful argument for the divine origin of that book. The paper by WRR caused a prolonged and heated discussion. As a result of a thorough analysis of WRR's methodology, the
overwhelming majority of experts in mathematical statistics concluded that the
WRR's data were obtained in a procedure which in many respects was contrary to
the rules of a proper statistical analysis. In other words, the community of
experts concluded that WRR's claim stemmed from bad
data. Therefore, WRR's work was rejected as bad science. It was
not, though, originally rejected as pseudo-science,
because they based their law on certain data to which they applied a statistical
treatment (although the latter was partially flawed as well). However, despite the scientific community's almost unanimous rejection
of WRR's work as based on unreliable data, two of them (Witztum and Rips) as
well as a small circle of their adherents, stubbornly continued to insist that
they made a genuine discovery, thus converting their theory from bad science to
pseudo-science. Because the story
of the alleged Bible code is quite educational with regard to how pseudo-science
appears and persists, it is discussed at length in a separate article (see B-Codes Page).
On the other hand, claims such as those by Immanuel Velikovski
have been rejected from the very beginning not as bad science but rather as
pseudo-science because his claims did not stem from any data but only from
arbitrary assumptions. In
a book published in 1950 and titled "Worlds in Collision" Velikovski offered
a whole bunch of wild theories allegedly explaining many mysteries that
contemporary science could not explain. For example, one of his suggestions was
that when, according to the biblical story, Yehoshua (Joshua) stopped the sun in
the sky, the earth indeed stopped its rotation. Another hypothesis by Velikovski
postulated a near-collision of Venus and Mars with Earth, thus allegedly
explaining numerous biblical
miracles. Of course, there are no
data whatsoever which would serve as evidence for such theories, therefore they
were justifiably relegated to pseudo-science. While Velikovski acquired
substantial notoriety and was compared in non-scientific publications to Newton,
Einstein, and other great scientists, no scientific magazine accepted his papers
because they, while plainly contradicting Newtonian mechanics, did not offer a
shred of evidence which would support his claims.
b)
Reproducibility and causality
To be legitimately useful in science, data must meet several requirements, one of which is reproducibility. Neither the reputation of the scientists claiming certain experimental
results nor the impressive appearance of their data seemingly conforming to the
strict requirements of a properly conducted experiment are sufficient for the
data to be accepted as a contribution to science. Data become a part of science only after they have been reproduced by
other scientists. Indeed, the
demise of the cold fusion (at least as it stands for now) was due precisely to the fact that other groups of
researchers could not reproduce the data claimed by Pons-Fleischman and Jones. Likewise, the data claimed by Witztum and Rips could not be reproduced by
other scientists, which was the
main reason their theory was rejected by the scientific community.
The requirement of reproducibility is based on the assumption of
causality as a universal law of nature. This
assumption presupposes that reproducing certain experimental conditions must
necessarily lead to reproducing the outcome of the experiment. This supposition is of course a philosophical principle to be accepted a
priori. This principle has an ancient origin. It was already discussed in detail
by Aristotle who introduced the concept of a hierarchy of four causes, the
so-called material causes, the efficient causes, the formal causes, and the
final causes. In more recent times,
the principle of causality known as the principle of determinism was formulated
by Laplace, and has been universally accepted in science for at least two
centuries.
The advent of quantum mechanics seemed to have shattered that
principle. The very fact that many
outstanding scientists were prepared to abolish a principle that had been a
foundation of experimental science for so long testifies against the claims by adherents of the so-called intelligent design theory who assert that
scientists are dogmatically adhering to "icons" of metaphysical concepts
rather than keeping open minds.
However, the interpretation of quantum-mechanical effects as a
breach of causality is by no means unavoidable. Rather, it testifies to the insufficient understanding of submicroscopic
processes wherein causality is allegedly absent.
First of all, even in the case of quantum-mechanical events,
causality is obviously present as long as the macroscopic manifestations of
those events are observed. Recall
the example of the alleged absence
of causality, namely experiments with microscopic particles passing through
slits in a partition. If only one
slit is open, on the screen behind the partition a diffuse image of the slit is
observed. If, though, two slits are
open, the image on the screen is a set of fringes. On macroscopic level, the outcome of the experiment is reliably
predictable and fully consistent with the principle of causality. Indeed, the image on the screen is always the same if only
one slit is open, and can be easily reproduced at any time, any number of times,
anywhere in the world. If two slits
are open, the picture on the screen is different from the case of only one
opened slit, but it is reliably predictable and can be easily reproduced at any
time anywhere in the world. Hence,
causality is present as long as the macroscopic effects of the experiment are
the issue. A question about the validity of causality is raised when the details
of the event on a microscopic level are considered.
If an electron is a particle,
it cannot pass simultaneously through two slits. Therefore we could expect that in the case of two opened slits, the image
on the screen would be two separate diffuse images of slits rather than a whole
set of fringes. Indeed, how can an electron passing a particular slit "know"
whether the other slit is open or not? However,
the behavior of individual electrons is different, depending on whether the
other slit is open or not.
Thus, the results of the
described experiments are sometimes interpreted as indicating that the same microscopic conditions lead to different
outcomes, depending on the variations in macroscopic conditions. On the microscopic level, as this argument goes, the same conditions may
result in different outcomes thus causality is absent.
Quantum mechanics tells us that
the explanation of the described phenomena is in that electrons are not
particles like those we can see, touch, etc. They are very different entities, which under certain conditions behave
like waves rather than like particles. It
does not seem possible to visualize
an electron, because it is unlike anything we can interact with by means of our
senses in our macroscopic world. Of
course, it is well known that macroscopic waves (for example, sea waves) can
very well pass several openings simultaneously. Therefore, the two slit experiment is not really an
indication of the absence of causality on a microscopic level, but rather an
indication of our inability to visualize a subatomic particle.
Richard Feynman said that
nobody understands quantum mechanics. To interpret this statement, we have first
to agree what the term "understanding" means. We can't visualize an electron, and in this sense we may say that we
don't understand its behavior. However,
we can reasonably well describe the electron's behavior using the mathematical
apparatus of quantum mechanics, and can successfully predict many features of
that behavior. In that sense, we
can say that we indeed do understand quantum mechanics.
Since we can't visualize the
intrinsic details of the electron's behavior, we are actually uncertain of
whether, on the microscopic level, its behavior is indeed non-deterministic, or
we simply don't know what causes this or that seemingly random path of the
electron. And since we are
uncertain, there is no reason to
assert that the principle of causality breaks down at the microscopic level.
Review another situation
wherein causality is often claimed to be absent. If we have a lump of a
radioactive material, we can easily measure the rate of its atoms' decay. This rate is a constant for a specific material. For example, we can
experimentally find that, in the course of a prolonged experiment, the fraction
of the atoms which decayed every second was, on the average, 0.01. Thus, if initially the lump consisted of N0 atoms, one second
later the number of atoms of that material was N1< N0, one more second later, the number of atoms was N2
< N1, etc, whereas the ratios N1/N0, N2/N1,
etc, averaged over the duration of the experiment, were 0.99. This result is well reproducible, thus, again, on the macroscopic level
causality seems to be intact. However,
if we discuss the phenomenon on microscopic level, the question arises of why at
any particular moment of time a particular atom decays while other atoms do not. All atoms within the lump of material are, from a macroscopic
viewpoint, in the same situation. There
is a certain probability of an atom's decay, which is exactly the same for
every atom in that lump of material. However,
at every moment some atoms decay whereas others do not. Therefore, the argument goes, causality does not seem to be at work in
that process. All atoms are in the
same situation, with the same probability of decay, but only a certain fraction
of them decays without any evident reason which would distinguish the decaying
atoms from those still waiting their turn.
The
described problem of the possible "hidden parameters" affecting microscopic
processes is the subject of a rather heated discussion among scientists, wherein
no consensus has so far been reached. For example, Feynman maintained that an
explanation referring to "hidden parameters" is impossible, because nature
itself "does not know in advance" which atom will decay at which moment, or
which path a particular electron will choose at a given moment. With all due
respect to the views of the brilliant scientist Feynman, I find it hard to
accept his position. I express here my personal view of the problem.
In my view, when we assert that
all atoms are in the same situation, this is true on macroscopic level. We have
insufficient knowledge of the situation of each individual atom on the
microscopic level. The deficit of
knowledge forces us to discuss the situation in probabilistic terms. Probability
is a quantity whose cognitive value is determined by the amount of knowledge
about the situation at hand. If we knew exactly the intrinsic details of the
situation each atom is in, we could, instead of probabilities, discuss the
matter in terms of certainties. Rather
than abolish the principle of causality, the much more natural interpretation of
the described process of atoms' decay is that there are definite causes of
individual atoms decays at this or that moment of time, but we simply do not
possess sufficient knowledge of the details of the process and therefore cannot
predict which atom will decay at which moment. Our lack of knowledge does not necessarily mean the decay of this or that
particular atom occurs without a specific cause.
Actually, an analogous
situation can be observed on a macroscopic level as well. This happens, for example, in the so-called Keno game in Las
Vegas casinos. In that game, players choose a set of several numbers out of a
table containing, say, 49 numbers. The winning set is determined by a machine in
which a pile of small balls is constantly mechanically shuffled in a shaking
plastic vessel. Each ball bears a number. Balls move up and down the pile,
constantly exchanging their positions, some of them pushed up by the rest of the
balls, some others sinking deeper into the pile. Every couple of minutes, one of
the balls is pushed high enough up from the pile and rolls out of the vessel. The number on that ball becomes part of the winning set. This is a chance process in which each ball has the same probability of
rolling out at any given minute. This
situation, albeit on a macroscopic level, is rather similar to the atoms'
decay. At any moment of time, a
particular ball rolls out while the rest of the balls remain within the vessel,
waiting for their turn to get out. Why a particular ball happens to get out, but
not any other ball, despite all of them having the same probability of being
pushed out? We don't have a
sufficient knowledge of the intricate web of interactions between the balls in
the constantly shuffled pile, therefore we resort to probabilistic approach. However, if we knew precisely all the intricate details of the multiple
encounters of those fifty or so balls in the shuffling machine, we would be
able, instead of a probabilistic estimate, to predict precisely which ball will
roll out and when. The lack of detailed knowledge, while forcing a probabilistic
approach, does not mean that the observed occurrence – the choice of
particular ball through a chance procedure - did not have a definite cause
making the fallout of that particular ball at that particular moment inevitable
while for no other ball there existed a similar cause to get pushed out.
The atoms' decay occurs on a
microscopic level, and we have much less knowledge about the intrinsic
conditions within and between the decaying atoms that we have in the case of the
balls in a Keno game. This by no
means must be interpreted as an indication that the acts of decay have no
definite causes. The principle of
causality can be preserved, even accounting for the quantum-mechanical effects.
Whatever interpretation of
causality is preferred, if there is no causality, there is no science.
c) Ceteris paribus
The next question to be
discussed in relation to data as a foundation of science is the so-called
principle of ceteris paribus. This
Latin expression means "everything else equal." The idea of that principle is as follows: Imagine that in a certain
experiment the behavior of a quantity X is studied. This quantity is affected by a number of factors, referred to as factors
A, B, C, D, etc. Some of those factors may be known while some others may be
not. From a philosophical standpoint, everything in nature is
interconnected and therefore the number of factors affecting X is immensely
large, so we never can account for all of them. However, an overwhelming majority of that multitude of
factors have only a very minor effect on X and therefore, in practical terms,
most of them are of no consequence for the study at hand. Still, there is a set of factors whose effect on X cannot be
ignored if we want to gain meaningful data from our study.
The principle of ceteris
paribus is a prescription regarding how to conduct the study. Namely,
according to that principle, the process of research must be divided into a
number of independent series of experiments. In each series only one of the
factors - either A, or B, or C, etc., has to be controllably changed while the
rest of the factors must be kept unchanged. For example, if A is being changed,
B, C, and D have to be kept constant, and the behavior of X has to be recorded
for each value of A. Then another series of experiments has to be conducted
wherein only factor B is being changed and X recorded while A, C, D, etc., are
kept constant.
Of course, the application of
the described procedure is based on the assumption that factors A, B, C, etc.,
are independent of each other, so that it is possible to change at will any one
of those factors without causing all the rest of those factors to change
simultaneously.
There are two problems
inseparable from the above approach. One
is related to the errors of measurement and the other to the very core of the ceteris
paribus principle, i.e. to the above mentioned assumption of the factors'
independence.
The problem of errors stems
from the fact that no measurement can be absolutely precise and accurate ( the
difference between precision and accuracy is defined in the theory of
measurement and is beyond the scope of our discourse). In particular, even if factors A, B, C, etc,. are independent of each
other, it is impossible to guarantee that while one of them is being changed,
the rest of them remain indeed absolutely constant, because the values of B, C,
D, etc., while in principle they are supposed to remain constant, cannot be
measured with absolute precision and accuracy. Moreover, the values of factor A itself, which are supposed to be changed
in a controlled way, are never guaranteed to have precisely the desired values,
and, finally, the target of the study, X
itself cannot be measured with absolute precision and accuracy.
The problem with the principal
foundation of the ceteris paribus
methods stems from the fact that in many systems, factors A, B, C, etc., are in
principle not independent of each other. Hence, as factor A is being
changed, it may be impossible in
principle to suppress the simultaneous changes of factors B, C, D etc, which
are supposed to remain constant. Such
systems are sometimes referred to as "large" or "complex," or
"diffuse," or "poorly organized" systems and their study necessarily can
be conducted only beyond the confines of ceteris
paribus.
Philosophically speaking, all
systems are complex, so that factors A, B, C, etc., are always to a certain
extent interdependent. However, the extent of their interdependence may be
insignificant to a degree which makes the assumption of ceteris paribus reasonably substantiated.
For example, assume we study
the behavior of ice/water in the vicinity of 273 K, which is, of course, 00 C or
320 F. The factor we change is
temperature. Besides temperature,
the behavior of ice/water is affected by a multitude of other factors. Most of them are insignificant and can be ignored because they have only
a very minor effect. However, at least two factors have a pronounced effect –
pressure and the purity of the water. In this case it is relatively easy to
ensure the condition of ceteris paribus. We can keep a sample of ice in a vessel where pressure is automatically
controlled and kept at a predetermined level, regardless of changes in
temperature. Likewise, the purity
of ice can be maintained at a constant level at all values of temperature
investigated. In such a study we
find that ice containing, say, not more than 0.01% impurities, at a pressure of
about 105 Pascal, melts at
273 K.
d) Errors
We have to distinguish between
various types of errors. Sometimes
errors stem from a basically wrong approach to scientific inquiry. It may be due
to an insufficient understanding of the subject by a researcher, which results
in an improperly designed experiment. It may be due to a preconceived view or
belief, or to a strong desire on the part of a researcher to confirm his
preferred theory. The researcher
may subconsciously ignore results which are contrary to his expectations and
inadvertently choose from the set of measured data only those which jibe with
his/her expectations. Errors of these type, while by no means uncommon, should
be viewed as pathological; they produce wrong
data, which are a version of bad
data, and are one of the constituents of bad science. An example of such bad
data is the already discussed case of the alleged Bible codes.
However, even in the most
thoroughly designed experiments, errors inevitably occur despite the most
strenuous effort on the part of a researcher to avoid or at least to minimize
them. These honest errors are due to
the inevitable imperfection of experimental setups and to the equally inevitable
imperfection of the researcher's performance. Let us discuss these honest errors
in experimental studies.
Until recently, a common notion
in the theory of experiment was the distinction between systematic and random
errors. While this distinction is certainly logically meaningful and often can
be made quite rigorously, lately it has become evident that on a deeper level
the demarcation between these two types of error is rather diffuse. Moreover, the error can often be viewed as either systematic
or random depending on the formulation of the problem. For example, an error can be viewed as systematic if it
occurs systematically provided we always conduct measurements in a specific
laboratory A. However, the same
error can be viewed as random if we have a choice among many laboratories A, B,
C, etc., and choose laboratory A, to conduct the measurement on a particular
day, by chance. The deeper analysis
of that question, as well as of the distinction between precision and accuracy
of measurements, which is a proper subject of the theory of experiment, is
beyond the scope of this paper, whose subject is the general discussion of the
nature of science.
The question of experimental
errors is, though, related to the problem of causality. If data are a legitimate component of science only if
they are reproducible, the immediate question is: What are the legitimate
boundaries of reproducibility? It
is an indisputable fact that the expression "precise data," if interpreted
literally, is an oxymoron. If an
experiment is repeated many times, in each run the measured data will be to a
certain extent different. There is
no way to ensure the absolute reproducibility of data. The conventional interpretation of that fact is not that it negates the
principle of causality but rather that in each experiment there are
uncontrollable factors which change despite the most strenuous effort on the
part of the experimenter to keep all conditions of the experiment under control.
Therefore the principle of
causality is not a direct product of the experimental evidence but a
metaphysical principle borne out by the total body of science. Science is based on the assumption of causality without which it simply
would make no sense.
Therefore the inevitable errors
of measurement are something science has to live with, making, though, a very
strong effort to distill objective truth from the chaos of measured numbers. Data which are a legitimate foundation of laws and theories are therefore
the result of a complex process in which wheat is to be separated from chaff and
regularities have to be extracted from the error-laden experimental numbers thus
building a bridge from data to laws. The bridge from data to laws will be
discussed in subsequent sections.
It seems appropriate to discuss
at this juncture of the discourse two points related to the manner in which an
experiment is conducted.
One of these points is
calibration. Calibration is a very
powerful tool for ensuring the meaningfulness of an experiment, wherein the
effects of many factors, which are extraneous to the subject of the study and
whose contribution can only mask the phenomenon under study, are summarily
neutralized in one step. For
example, when Cavendish performed his famous experiment in which he, in his
words, "weighed the earth," he needed to measure the attraction between lead
balls. The force of attraction to
be measured was determined by the angle of rotation of a rod to whose ends the
lead balls were attached. The rod
hung on a string, so that the elastic forces in the string created a torque
which counterbalanced the forces of gravitational attraction. That torque was
approximately proportional to the angle of the rod's rotation. Instead of
calculating the forces which would cause a certain angle of rotation, Cavendish
simply calibrated his contraption by applying known forces to the rod and
measuring the correspondent angle of rotation. He obtained the "calibration
curve," which incorporated all known and unknown factors that could affect the
angle of rotation thus eliminating in one step – calibration – most of the
sources of error. Calibration in
various forms is routinely used in scientific experiments whenever possible,
thus substantially eliminating many sources of error.
Another point to discuss is the
difference between the discrete and the continual methods of recording
experimental data. Each of these two methods has both advantages and
shortcomings.
If a discrete recording is
employed, the target of study - X - is measured for a set of discrete values of
the controlled parameter A, while all other parameters affecting X are kept
constant. Factor A is given values
A1, A 2 ,A 3 etc,
and at each of those values of A, X is measured, yielding a set X1,
X2 , X3 , etc.
As a shortcoming of such
method, it is often asserted that there is never certainty regarding the values
of X between the measured points. However
close the values of A at which X is measured are, there are always gaps between
those values of X, and to fill those gaps the researcher has an infinitely large
number of choices. It is often
expressed as an assertion that there are infinitely many curves which can be
drawn through the same set of experimental points, regardless of how close those points are to each other. (Of
course, a more accurate assertion should be that the curves may be drawn through
the margins of error surrounding each experimental point).
While the above assertion is
certainly correct in an abstract philosophical way, it rarely has serious
practical implications, as will be discussed in the section on the bridging
hypotheses.
The mentioned shortcoming of
the discrete method of experiment seems to be eliminated if a continual method
is employed instead. In this
method, the controlled factor A (or often several controlled factors A, B, C,
etc.) are being continually changed at a certain rate, while the values of the
target of the study, X, are simultaneously and continually recorded.
An obvious advantage of this
method is the substantial saving of time and effort necessary to accumulate
information about the phenomenon under study. The wide availability of sophisticated recording equipment in the 20th
century made continual recording the favorite method of experimental work in
physics, chemistry and other fields of study.
Since the method of continual
recording generates continuous curves rather than sets of experimental points,
it may seem to eliminate the arbitrariness involved in drawing curves through
discrete experimental points. Unfortunately,
this supposed advantage of the continual method is illusory. The illusion is related to the problem of a system's response
time to a perturbation, i.e. to an abrupt change of parameters. If any parameter affecting the system's behavior is
abruptly changed, it sets in motion a process of the system's transition to a
new state, this process proceeding with a certain finite speed. If a system was in a state wherein X had a certain value X1
while factor A had the value A1, and at some moment of time t1
parameter A is changed to A2, the
system's measured property X does not instantly jump to a new value X2,
but rather undergoes a transition to the new value which takes a certain time
("relaxation period"). Therefore
the experimental curves reflecting the functional dependence of X on A will have
different shapes depending on the rate at which A is being altered by the experimental
set-up. In a discrete method of
experiment normally the time interval between consecutive measurements of X is
sufficiently large for X to complete the transition to the new equilibrium
value. In continual recording, depending on the rate of change of A, the
transition of X to the new equilibrium value is often not yet completed as A
already shifts to new values. It is well known that, for example, the loops of
magnetic hysteresis are very different depending on the rate of the magnetizing
field's change.
In connection with the question
of experimental errors, the notion of noise seems to be relevant. The term noise
entered science from information/communication theory. That branch of science studies the transmission of information (of
"signals") which is inevitably accompanied by noise in the elements of the transmission channel. The noise is
superimposed on the useful signal and has to be filtered out if the signal is to
be reliably interpreted. The term noise gradually gained a wider use,
referring to any unintended extraneous input to the experimental setup. Normally, to extract useful data from the experimental output the desired
data have to be distilled from the overall output by excising noise. However, more than once noise happened to be the source of scientific
discoveries. For example,
when, at the end of the 19th century, physicists studied the
processes in a cathode ray tube, these processes were always accompanied by
emission of a certain radiation from the walls of the tube. Some researchers did
not notice these rays, others did but viewed them just as noise and
ignored them. Then Wilhelm Conrad Roentgen, who specialized in the study of dielectrics, and was generally highly
meticulous in his research, looked at the "noise" as a phenomenon
interesting in itself. He very thoroughly studied the phenomenon which his
predecessors dismissed as just a nuisance and thus became the discoverer of that
important type of radiation which he called X-rays.
Another well known example of how a phenomenon which was viewed as
annoying noise led to an important discovery is that of the background cosmic
radiation discovered in 1965 by Arno Penzias and Robert Wilson.
Generally speaking, if an experiment yields results which could be
predicted, this adds little to the progress of science. The matter becomes really interesting if the results of an experiment
look absurd. If a researcher
encounters an absurd result, which persists despite efforts to clean up the
experiment and to eliminate possible sources of error, there is a good chance
the researcher has come across something novel and therefore interesting. The absurd outcomes may often be attributed to noise, but not
too rarely that noise carries information about a hitherto unknown effect or can
be utilized in an unconventional way for a deeper study of the phenomenon.
Let me give an example from my personal experience.
Many years ago I studied the process of adsorption of various
molecules on the surface of metallic and semiconductor electrodes. One of the methods of that study was the measurement of the so-called
differential capacity of the electric double layer on the electrode surface. A couple of my students participating in that research built a setup
enabling us to measure the differential capacity at various values of the
electric potential imposed on the electrode. After having worked with that setup for several weeks, they became
grossly frustrated by their inability to eliminate the instability of the
measured capacity. Each time they
changed the value of the electrode potential, the differential capacity started
shifting and they had to wait for many hours and sometimes days until a new,
uncertain equilibrium seemed to set in. They
viewed this unpredicted "instability" of the measured differential capacity
as annoying noise preventing them from performing the desired measurement. They complained to me about their frustrating experience and asked for
advice - how to get rid of it. I
remember that moment when a kind of a sudden light exploded in my mind and I
shouted, "Lads, it is wonderful! We got an unexpected result.< The capacity creeps – and this means that if you accurately measure the
curve of capacity versus time, this
curve will contain plenty of information about the kinetics of the adsorption
process!" I set out to develop
equations reflecting the relaxation process of the capacity. A new method of scientific study we named potentiostatic
chronofaradometry was thus born. Instead
of trying to eliminate the supposed noise and routinely measure the supposed
equilibrium values of the differential capacity, now my students concentrated on
measuring those very relaxation curves which they angrily considered to be
annoying noise. Plenty of information about the kinetics of the adsorption/desorption
process was extracted from that "noise," assisted by the equations derived
for that process.
e) Complex
systems
For over 200 years, one of the
underlying hypotheses of science, often unspoken, was that every system which is
a subject of a scientific study is "well organized," in the sense that it
was always possible to separate a few phenomena or processes of similar nature,
dependent on a limited set of important factors, in such a way that the
system's behavior could be studied by isolating factors one by one, finding
the functional dependencies between pairs of factors, and attributing to those
functional dependencies the status of laws. In the 20th century the described approach started
encountering serious difficulties. In
many systems the separation of various factors turned out to be impossible. Science encountered the ever increasing number of what is referred to as
"large" systems, or "poorly organized systems," or "diffuse
systems," etc., reflecting various aspects of such systems' behavior. Such systems cannot be legitimately studied under the ceteris
paribus approach because it is impossible to separately and controllably
vary any one parameter affecting the system without causing a simultaneous
variation of other parameters.
It should be noted that the
concept of a "large" or "poorly organized" system is not equivalent to
the concept of stochastic systems. The
latter were well known and successfully dealt with in 19th century
physics. An example of a stochastic system is a gas occupying a certain volume. It consists of an enormous number of molecules (for example, at
atmospheric pressure and room temperature 1cubic centimeter of gas contains
about 1019 molecules). The
behavior of each molecule is determined by the laws of Newton's mechanics.
However, because of the immense number of molecules, it is impossible to analyze
the behavior of gas by solving the equations of mechanical motion for each
molecule. Therefore, 19th century science came up with very
powerful statistical methods which enabled it to analyze the behavior of a gas
without a detailed analysis of the motion of each individual molecule. (There are theories according to which a system already becomes
stochastic, i.e., not treatable by studying the behavior of its individual
elements, if the number of these elements exceeds about 30.)
However, stochastic systems
such as gases behave stochastically only on a microscopic level. On a macroscopic level they can be easily treated using the ceteris
paribus approach. Macroscopic
properties of a gas, such as pressure, temperature, volume, etc., can be very
well studied by isolating and controllably changing these properties one by one
while keeping the rest of the properties constant. This is possible because the immense number of elements of such systems
does not translate into a large number of macroscopic properties. A gas containing quadrillion quadrillions of microscopic molecules, all
of them identical, macroscopically is characterized only by a few parameters,
such as pressure, temperature and volume.
On the other hand, the
"large," or "poorly organized" systems are characterized by a very large
number of macroscopic parameters, many
of which cannot be individually varied without simultaneously changing some
other parameters affecting the system.
Historically, one of the
situations whose study mightily served the realization of the ineliminable
interdependence of various parameters in a "poorly-organized" system is emission spectral analysis. A good description of the problems
encountered when trying to interpret the data obtained via emission spectral
analysis was given in a book "Teoriya Eksperimenta" (The Theory of
Experiment) by Vasily V. Nalimov (Nauka Publishers, Moscow 1971; in
Russian).
In that process, a sample of
material to be studied is placed between two electrodes into which a high
voltage is fed. The breakdown of the gap between the electrodes causes a spark
discharge wherein the temperature reaches tens of thousands of Kelvin. The
density of energy at certain locations on the electrodes reaches a huge level.
An explosive evaporation occurs, followed by an equilibrium evaporation. A cloud
of evaporated matter is created between the electrodes in which simultaneous
processes of diffusion, excitation and radiation take place. Besides,
oxidation/reduction processes occur on the electrodes' surface, and diffusion
of elements occurs in their bulk, dependent on the temperature gradient, phase
composition of electrode material, defects etc. All these processes are of a
periodic nature, on which a one-directional drift is imposed by the
electrodes' erosion. Attempts to analyze the described extremely complex
combination of phenomena by means of separating various factors one by one
turned out to be in vain. The
system is too complex and therefore its analysis forced scientists to give up
attempts to proceed using the ceteris
paribus approach.
To study the "large" or
"complex" systems, the science of the 20th century developed two
main approaches. One is the multi-dimensional statistical approach and the other
is the cybernetic (or computer-modeling) approach.
Since this article is not about
the theory of experiment but about a general discussion of the nature of
science, I will discuss the above two approaches very briefly and only in rather
general terms.
The statistical approach was to
large extent initiated by the British mathematician R. A. Fisher. It is often referred to as multi-dimensional mathematical statistics (MDMS). Essentially, MDMS is a logically substantiated formalization of such an
approach to the study of "large" systems wherein the researcher deliberately
avoids a detailed penetration into intricate mechanisms of a complex phenomenon,
resorting instead to its statistical analysis using multiple variables.
The application of MDMS
requires solving a number of problems involving the appropriate strategy of an
experiment - the proper choice of the essential parameters - since accounting
for too many parameters may make the task beyond the available intellectual and
computational resources, while accounting for too few parameters may reduce the
solution to a triviality. The
effectiveness of MDMS has been drastically improving along with the availability
of ever more powerful computers and software. Statistical models of complex systems have been successfully employed for
a multitude of problems which could not be tackled by the traditional methods of
direct measurement.
The cybernetic approach, whose
origin can be attributed to Norbert Wiener, is in some respects very different
from the statistical one, although it also has become really useful only with
the advent of powerful computers. While the method of multi-dimensional
statistics and that of a computer modeling are principally different, both are
successfully applied, sometimes to the study of the same "complex" system.
The cybernetic approach makes
no sense as long as only simple "well organized" systems are studied. In
such systems one-dimensional functional connections fully determine the
system's behavior and the problem of control (management) is moot. For example, the motions of planets are fully determined by Kepler's
laws. On the other hand, control
of the behavior of a "poorly organized," or "large" system, which is
quite challenging, can be fruitfully approached via a cybernetic model. (Various
meanings of the term model will be
discussed in the section on models in science).
One of the frontiers of modern
science is the field of artificial intelligence which exemplifies the cybernetic
approach to a "complex" system (human intellect) by using a computer model
of the latter.
Biological systems are mostly
"large" or "poorly organized" in the above formulated sense.
Another example of a
"large" system is the economy of any country.
In every country there is a
multitude of factories, farms, companies and individuals each pursuing its/his
particular economic interests, All
these elements of the economic system interact among themselves in an immensely
complicated web of transactions, changing every moment so that this enormously
complex system is in a constant flux whose trends are usually not obvious. It is impossible to account for each and all features of that immensely
complex game, not only because of its sheer size and the huge number of its
constituent elements, but also because of its unstable character. Before the available data about the state of economy have been digested
and interpreted, they have already changed. No computer, however powerful, can follow all the nuances of the economic
game. Therefore the economy cannot be in principle "scientifically managed"
as it supposedly was in the allegedly "socialist planned economy" of the
former USSR. This was one of the
reasons for the abysmal failure of the "socialist economy," which was no
more scientific than it was really socialist. The actual economic system in the former USSR was "state capitalism,"
with all the drawbacks of an extreme form of a monopolistic capitalist system
but without the advantages of a free enterprise. The attempts to sustain a
planned economy allegedly based on a scientific analysis of the resources,
demands and supplies, resulted in a seemingly controlled but actually chaotic
economy wherein stealing and cheating became the only possible means of survival
for the powerless slaves of the state.
Pretending to maintain a
scientifically planned economy, as Marxist theory required, actually resulted in
a system that was contrary to elementary basics of science.
The economy of the modern world
functions in a cyclic manner, following its own poorly understood complex
statistical laws wherein its oscillatory motions have an immense inertia, so
that attempts to steer it in this or that direction (for example, by regulating
the interest rate or changing the tax laws) have only a marginal effect on the
waves of prosperity and recession replacing each other as if by their own will.
Large (or diffuse) systems
require for their analysis specific approaches, and 20th century
science has provided these approaches in the two above mentioned ways.
Whatever method is used to
acquire data, either the traditional multi-step measurement under the ceteris
paribus principle, or the application of multi-dimensional statistics, or a
computer model, or any other approach, the acquisition of reliable data is an
ineliminable step in the scientific method. It is possible to apply the
scientific approach without strict definitions or without good hypotheses or
without good models or without good theories (although omitting any of the
listed components would seriously impair the quality of the scientific study,
often making it bad science). It is
impossible to have any science, good or bad, without reliable data. In the absence of data, the endeavor, however cleverly disguised and
eloquently offered, can only be pseudo-science.
8. Bridging hypotheses
Perhaps, the title of this section should more appropriately be
"Bridging Hypothesis," in the singular, because this hypothesis is
essentially always the same. This hypothesis postulates the objective existence
of a law. It constitutes a
"bridge" from the raw data to the postulated law.
In the case of "simple" or "well organized" systems which
can be studied under the ceteris paribus
condition, the bridging hypothesis appears in its most explicit form, while in
the case of "large" or
"poorly organized" systems it can often be hidden within the complex web of
the data themselves.
Imagine an experiment under the ceteris
paribus assumption wherein a set of values of a target quantity X has been
measured for a set of values of a parameter A, so for each value of A1,
A2, A3, etc., corresponding values X1, X2,
X3,etc., have been measured and also the margin of error ±Δ has been estimated so that every value of Xi is believed to be within
the margin of Xi ±Δ.
For the sake of example, assume that the measured values of X are
found to increase along with the increase of the corresponding values of A. Every
set of X and A, however extensive, is always still only a selected subset of all
possible values of X and A.
Reviewing the sets of measured
numbers, we try to discern a
regularity connecting A and X. To
do so we have necessarily to assume
that the set of actually measured numbers indeed reflects an objective
regularity. There is never unequivocal proof that such a regularity
indeed objectively exists, we have necessarily to postulate it before formulating a law.
More often than not such a hypothesis is implicitly present
without being explicitly spelled out. In scientific papers we, quite usually, see statements introducing a law, without mentioning the
bridging hypothesis. The
researchers reporting their results routinely assert that, for example, "as
our data show, in the interval between A1 and Ak, X
increases proportionally to A." This
statement is that of a law. In
fact, though, such statement of a law might not be legitimately made without
first postulating the very objective existence of a law connecting X to A. The bridging hypothesis according to which the observed sets of values of
X correspond to the selected sets of values of A, not as an accidental result of
an experiment conducted under a limited set of conditions but because of an
objectively functioning law, must necessarily precede, if often subconsciously,
the statement of a law.
The hypothesis in question, which is necessarily present, most
often implicitly, in the procedure of claiming a law, is never more than a
hypothesis and cannot be proven. Fortunately
it happens to be true, at least as an approximation, in the vast number of
situations, thus constituting a reliable building block of science. In fact, if
sets of As and of Xs seem to match each other according to some regularity, more
often than not such a regularity does indeed exists, although sometimes it also
Of course, the bridging hypothesis is not applied if no law is
postulated. A researcher may obtain
sets of values of the measured quantity X corresponding to a set of values of a
parameter A which may look like a display of a regularity, but for various
reasons she may reject the bridging hypothesis and avoid an attempt to spell out
a pertinent law. The reasons for that may be, for example, some firmly
established theoretical concepts making the law in question very unlikely, or
serious doubts regarding the elimination of some extraneous factors which could
distort the data thus creating the false appearance of a regularity, or any
number of other reasons.
More often, though, a scientist may be just uncertain in regard to the
choice between accepting or rejecting the bridging hypothesis rather than flatly
rejecting it. Often such uncertainty is based simply on the insufficiently
complete data. The researcher is
uncertain whether or not the available sets of data do indeed properly represent
the entire multiplicity of possible data.
If that is the case, the proper remedy is to resort to a statistical
study of the phenomenon in question. The proper tool for such a study is the part of mathematical statistics
called "hypotheses testing."
The procedure involves introducing two competing hypotheses, one called
the "null hypothesis," and the other the "alternative hypothesis." The null hypothesis is the assumption that the available set of data does
not represent a law. The alternative hypothesis is in this case a synonym for what
I called the bridging hypothesis, i.e. the assumption that the available set of
data reflects a law. As a result of
a proper statistical test, the researcher compares the likelihood of the null
hypothesis vs. the likelihood of the alternative hypothesis. The hypothesis whose likelihood is larger, is accepted, while the other
hypothesis is rejected. This choice is never assured to be ultimately correct, since
discovery of additional data may change the likelihood of the competing
hypotheses. In this sense, laws of
science are usually referred to as tentative.
Fortunately, the rigorous self-verification inherent in a proper
scientific procedure makes a vast number of laws of science work very well
despite their "tentative" character. In
the next section we will discuss the laws more in detail.
9. Laws
Whereas the bridging hypothesis discussed in the previous section is
simple and uniform in that its gist is simply in assuming that the set of
experimental data reflects a law, the specific formulation of a law itself is
quite far from being simple and uniform. While
in every scientific procedure the same bridging hypothesis is present, this in
itself contains no indication of how the particular law has to be spelled out.
Formulating a law is not a mechanical process of stating the evident
behavior of the studied quantities. It
necessarily involves an interpretive effort on the part of the scientist, who
makes choices among various alternative interpretations of the assumed
regularity which have to be extracted from arrays of numbers defined within
certain margins of error.
In the simple case of a functional dependence between two
variables, the scientist is confronted with a set of experimental points which
seem to display a certain regularity, often rather nebulous because of
inevitable experimental errors. Hence,
besides the bridging hypothesis which simply postulates the very existence of a
law, the scientist must then postulate the specific law itself.
Consider an example. In Coulomb's well known law of interaction between
point electric charges, the force of interaction is presented as being inversely
proportional to the squared distance between the charges.
The inverse proportionality of the force of electric interaction
to the square distance between point charges is, however, not a direct
conclusion from the experimental data. One
of the reasons for that is the inevitability of errors occurring in every
procedure of measurement. If the
dependence of Coulomb's force on the distance between the point charges is
measured, it is assumed that a) while the distance is being changed, the
interacting charges are indeed being kept constant, b) that no other charges
which could distort the results are anywhere close to the measuring contraption,
c) that the distance itself is measured with sufficient accuracy, etc. All of
this is, of course, just an approximation. Let us imagine that in the course of an experiment wherein Coulomb's
force was measured, the distance between the supposedly point charges was given
the following values (in whatever units of length): 1, 2, 3, 4, 5, 6, the total
of six points at which the above values were determined with an error not
exceeding, say, ±5%. This means
that when the distance was assumed to be 2, it could actually be anything
between 1.9 and about 2.1, and similarly for every other value of the distance.
Imagine further that the measured Coulomb's force (in whatever units of force)
happened to have the following set of values, averaged over many repeated
measurements: 1097, 273.87, 121.64, 68.97, 43.12 and 31.01. Repeating the measurement many times, the experimentalist evaluated the
margin of error for the force to be ± 10%. Reviewing the set of numbers for the
values of the measured force, the researcher notices that these numbers are
rather close to a set of numbers obtained by dividing the maximum force of 1097
(measured for the distance of 1) by the squared distances. Indeed, dividing 1097
by the squared values of distances 2, 3, 4, 5, and 6, one obtains the following
set: 274.25, 121.89, 68.56, 43.88, 30.47. Every number in the experimentally measured set differs from
a corresponding number in the second, calculated set, not more than by 10%,
which is within the margin of error of that experiment. Although none of the
measured numbers exactly equals the numbers calculated upon the assumption that
the force drops inversely proportional to the distance, the scientists postulates
that the difference between the measured and calculated sets is due to
experimental error and that the real law, hidden behind the measured set of
numbers, is indeed the inverse proportionality between the force and the squared
distance.
What is the foundation of that
postulate? If instead of the power
of 2, the distance in the same formula appeared with the power of, say, 2.008 or
1.9986, such a formula would describe the experimental data as well as
Coulomb's law, where the power is exactly 2. Of course, the precision and
accuracy of the measurement can be improved so that the margin of error is
substantially reduced. However, it can never be reduced to zero. Therefore, the best experimental data cannot provide more than is
inherent in them. With a much
improved technique we may be able to assert that Coulomb's force changes
inversely proportional to the distance to the power being, say, between
1.9999999987 and 2.0000000035, but we can never assert, based just on the
experimental data, that the power is exactly 2. We have to postulate
that the power is exactly 2 rather than any other number between 1.9999999987
and 2.0000000035.
Such
a postulate is often made on non-scientific grounds. It is often based on some metaphysical consideration, some philosophical
principle, or such criteria as simplicity, elegance and the like. We actually have no scientific grounds to prefer the power of 2 to, say,
the power of 2.0000986345, since both numbers equally well fit the experimental
data. We choose 2 not because the
data directly point to that choice but because this choice seems to be either
simpler (i.e., using Occam's razor), or more elegant, or more convenient, or
maybe just favored by a particular scientist for purely personal reasons.
Of course, the difference between 2 and, say, 2.0000897 has no
practical consequences, so that we can safely use the formula wherein the exact
power of 2 has been postulated, even
though we may not be absolutely confident that it is indeed the exactly correct
value.
Furthermore, even the principal form of the law – the power
function - is not logically predetermined by the data. There are many formulas
differing from the simple power function which would yield numbers within the
margin of error of the experimental data. For
example, it is always possible to construct a polynomial expression whose
coefficients are chosen in such a way as to make the numbers calculated by that
expression match the measured data within the margin of the experimental error. Hence, not only the value of 2 in the formula of Coulomb's law, but
even the form of the law itself is the result of a scientist's assumption. From
the above discussion it follows that the laws of science are necessarily
postulates. We don't know
that the distance in Coulomb's law must be squared, we postulate
it.
The
choice of a postulate is limited by two considerations. One is the requirement
not to contradict experimental data, so we may not arbitrarily assume that the
distance in the formula in question must have the power of, say, 2.75, because
it is contrary to evidence. The other requirement is not to contradict the
overall body of scientific knowledge. Otherwise
any number which is within the margin of the experimental error has, in
principle, the same right to appear in the formula of law, and the choice of 2
instead of, say, 1.99999867 is a postulate based on metaphysical grounds.
The application of some mathematical apparatus such as, for
example, least square fit, does not principally change the situation because the
output of the mathematical machine is only as good as is the input.
The above discourse may create an impression of arbitrariness of
the laws of science. Some philosophers of science think so. Are the laws |