j.-a. goulet 18
insufficient to eliminate epistemic uncertainties. When large data
sets are available, probabilistic and deterministic met hods may lead
to indistinguishable results; the opposite occurs w he n li tt l e dat a
is available. Therefore, the less we know about it, the stronger the
argument for approaching a problem using probability theory.
In t h i s chapter, a revi ew of set theory lays the foundation for
probability theory, where the central part is the concept of random
variables. Machine learning methods are built from an ensemble
of f un ct i on s organize d in a clever way. Therefore, the last part of
this chapter looks at what happens when random variables are
introduced into deterministic functions.
For specific notions related to pr ob abi l i ty theory that are outside
the scope of this chapter, the reader should refer to dedicated
textbooks such as those by Box and Tiao;
3
Ang and Tang.
4
3
Box, G. E. P. and G. C. Tiao (1992).
Bayesian inference in statistical analysis.
Wiley
4
Ang, A. H.-S. and W. H. Tang (1975).
Probability concepts in engineering plan-
ning and decision,Volume1—Basic
Principles. John Wiley
3.1 Set Theory
Set: Ensemble of events or elements.
Universe/sampling space
(
S
)
:
Ensem-
ble of all possible events.
Elementary event
(
x
)
:
Asingleevent,
x 2S.
Event
(E)
:
Ensemble of elementary events.
E ⇢S :SubsetofS
E = S :Certainevent
E = ; :Impossibleevent
E :ComplementofE
A set descri bes an ensemble of elements, also referred to as events.
An el emen tar y event
x
refers to a single event among a sampling
space (or universe) denoted by the calligraphic lett er
S
.Bydefini-
tion, a sampling space contains all the possible events,
E ✓S
.The
special case where an event i s equ al to the sampl in g space ,
E
=
S
,
is called a certain event. Th e opposite,
E
=
;
, w he re an event is
an e mp ty set, is call ed a null event.
E
refers to the complement of
a set , t hat is, al l el eme nts belonging to
S
and not t o
E
. F i gu re 3.3
illustrates these concepts using a Venn diagram.
Figure 3.3: Venn diagram representing
the sampling space
S
,anevent
E
,its
complement E,andanelementaryeventx.
(a) Union operation
(b) Intersection operation
Figure 3.4: Venn diagrams representing the
two basic operations.
Let us consider the example,
5
of t h e stat e of a structu r e foll ow-
5
This example is adapted from Armen Der
Kiureghian’s course, CE229, at University
of California, Berkeley.
ing an earthquake, which is described by a sampling space,
S = {no damage, light d amage , important damage , collapse}
= {N, L, I, C}.
In t h at context, an event
E
1
=
{
N
,
L
}
could contain the no damage
and light damage events, and another event
E
2
=
{
C
}
could contain
only the collapsed state. The complements of these events are,
respectively, E
1
= {I, C} and E
2
= {N, L, I}.
The two main operations for events, union and intersection, are
illustrated in figure 3.4. A union is analogous to the “or” operator,
where
E
1
[ E
2
holds if the event belongs to e i the r
E
1
,
E
2
, or
both. The intersection is analogous to the “and” operator, where
E
1
\ E
2
⌘ E
1
E
2
holds if the event belongs to both
E
1
and
E
2
. As
a convention, intersection has priority over union. Moreover, both
operations are commutative, as s ociative, and distributive.
Given a set of
n
events
{E
1
,E
2
, ··· ,E
n
}2S
,
E
1
,E
2
, ··· ,E
n
,