Stochastic integration and differential equations - Philip E. Protter

Author: Philip E. Protter
Tags: mathematics mathematical analysis
ISBN: 0172-4568
Year: 2004
Similar
Numerical solution of stochastic differential equations (Stochastic Modelling and Applied Probability)
Stochastic Differential Equations and Applications. Volume 1
Forward-Backward Stochastic Differential Equations and Their Applications
Asymptotic Methods in the Theory of Stochastic Differential Equations
Text
                    Stochastic Mechanics
Random Media
Signal Processing
and Image Synthesis
Mathematical Economics and Finance
Stochastic Optimization
Stochastic Control
Stochastic Models in Life Sciences
Edited by
Advisory Board
Applications of
Mathematics
Stochastic Modelling
and Applied Probability
21
B. Rozovskii
M.Yor
D. Dawson
D. Geman
G. Grimmett
I. Karatzas
R Kelly
Y. Le Jan
B. 0ksendal
E. Pardoux
G. Papanicolaou
Springer
Berlin
Heidelberg
New York
Hong Kong
London
Milan
Paris
Tokyo

Applications of Mathematics
1 Fleming/Rishel, Deterministic and Stochastic Optimal Control A975)
2 Marchuk, Methods of Numerical Mathematics 1975,2nd. ed. 1982)
3 Balakrishnan, Applied Functional Analysis A976,2nd. ed. 1981)
4 Borovkov, Stochastic Processes in Queueing Theory A976)
5 Liptser/Shiryaev, Statistics of Random Processes I: General Theory A977,2nd. ed. 2001)
6 Liptser/Shiryaev, Statistics of Random Processes II: Applications A978,2nd. ed. 2001)
7 Vorob'ev, Game Theory: Lectures for Economists and Systems Scientists A977)
8 Shiryaev, Optimal Stopping Rules A978)
9 Ibragimov/Rozanov, Gaussian Random Processes A978)
10 Wonham, Linear Multivariable Control: A Geometric Approach A979, 2nd. ed. 1985)
11 Hida, Brownian Motion A980)
12 Hestenes, Conjugate Direction Methods in Optimization A980)
13 Kallianpur, Stochastic Filtering Theory A980)
14 Krylov, Controlled Diffusion Processes A980)
15 Prabhu, Stochastic Storage Processes: Queues, Insurance Risk, and Dams A980)
16 Ibragimov/Has'minskii, Statistical Estimation: Asymptotic Theory A981)
17 Cesari, Optimization: Theory and Applications A982)
18 Elliott, Stochastic Calculus and Applications A982)
19 Marchuk/Shaidourov, Difference Methods and Their Extrapolations A983)
20 Hijab, Stabilization of Control Systems A986)
21 Protter, Stochastic Integration and Differential Equations A990,2nd. ed. 2003)
22 Benveniste/Metivier/Priouret, Adaptive Algorithms and Stochastic Approximations A990)
23 Kloeden/Platen, Numerical Solution of Stochastic Differential Equations
A992, corr. 3rd printing 1999)
24 Kushner/Dupuis, Numerical Methods for Stochastic Control Problems in Continuous
Time A992)
25 Fleming/Soner, Controlled Markov Processes and Viscosity Solutions A993)
26 Baccelli/Bremaud, Elements of Queueing Theory A994,2nd ed. 2003)
27 Winkler, Image Analysis, Random Fields and Dynamic Monte Carlo Methods
A995,2nd. ed. 2003)
28 Kalpazidou, Cycle Representations of Markov Processes A995)
29 Elliott/Aggoun/Moore, Hidden Markov Models: Estimation and Control A995)
30 Hernandez-Lerma/Lasserre, Discrete-Time Markov Control Processes A995)
31 Devroye/Gyorfi/Lugosi, A Probabilistic Theory of Pattern Recognition A996)
32 Maitra/Sudderth, Discrete Gambling and Stochastic Games A996)
33 Embrechts/Kluppelberg/Mikosch, Modelling Extremal Events for Insurance and Finance
A997. corr. 4th printing 2003)
34 Duflo, Random Iterative Models A997)
35 Kushner/Yin, Stochastic Approximation Algorithms and Applications A997)
36 Musiela/Rutkowski, Martingale Methods in Financial Modelling A997)
37 Yin, Continuous-Time Markov Chains and Applications A998)
38 Oembo/Zeitouni, Large Deviations Techniques and Applications A998)
39 Karatzas, Methods of Mathematical Finance A998)
40 Fayolle/Iasnogorodski/Malyshev, Random Walks in the Quarter-Plane A999)
41 Aven/Jensen, Stochastic Models in Reliability A999)
42 Hernandez-Lerma/Lasserre, Further Topics on Discrete-Time Markov Control Processes
A999)
43 Yong/Zhou, Stochastic Controls. Hamiltonian Systems and HJB Equations A999)
44 Serfozo, Introduction to Stochastic Networks A999)
45 Steele, Stochastic Calculus and Financial Applications B001)
46 Chen/Yao, Fundamentals of Queuing Networks: Performance, Asymptotics,
and Optimization B001)
47 Kushner, Heavy Traffic Analysis of Controlled Queueing and Communications Networks
B001)
48 Fernholz, Stochastic Portfolio Theory B002)
49 Kabanov/Pergamenshchikov, Two-Scale Stochastic Systems B003)
50 Han, Information-Spectrum Methods in Information Theory B003)
(continued after index)

Philip E. Protter
Stochastic
Integration
and Differential
Equations
Second Edition
Springer

Author
Philip E. Protter
Cornell University
School of Operations Res. and
Industrial Engineering
Rhodes Hall
14853 Ithaca, NY
USA
e-mail: pep4@c0rnell.edu
Managing Editors
B. Rozovskii M. Yor
Center for Applied Mathematical Universite de Paris VI
Sciences Laboratoire de Probabilites
University of Southern California et Modeles Aleatoires
1042 West 36th Place, 175, rue du Chevaleret
Denney Research Building 308 75013 Paris, France
Los Angeles, CA 90089, USA
Mathematics Subject Classification B000): Primary: 60H05,60H10,60H20
Secondary: 60G07,60G17,60G44,60G51
Cover pattern by courtesy of Rick Durrett (Cornell University, Ithaca)
Cataloging-in-Publication Data applied for
A catalog record for this book is available from the Library of Congress.
Bibliographic information published by Die Deutsche Bibliothek
Die Deutsche Bibliothek lists this publication in the Deutsche Nationalbibliografie;
detailed bibliographic data is available in the Internet at http://dnb.ddb.de
ISSN 0172-4568
ISBN 3-540-00313-4 Springer-Verlag Berlin Heidelberg New York
This work is subject to copyright. All rights are reserved, whether the whole or part of the
material is concerned, specifically the rights of translation, reprinting, reuse of illustrations,
recitation, broadcasting, reproduction on microfilm or in any other way, and storage in data
banks. Duplication of this publication or parts thereof is permitted only under the provisions
of the German Copyright Law of September 9, 1965, in its current version, and permission
for use must always be obtained from Springer-Verlag. Violations are liable for prosecution
under the German Copyright Law.
Springer-Verlag Berlin Heidelberg New York
a member of BertelsmannSpringer Science + Business Media GmbH
http://www.springer.de
© Springer-Verlag Berlin Heidelberg 2004
Printed in Germany
The use of general descriptive names, registered names, trademarks, etc. in this publication
does not imply, even in the absence of a specific statement, that such names are exempt from
the relevant protective laws and regulations and therefore free for general use.
Cover design: Erich Kirchner, Heidelberg
Typesetting by the author using a Springer TEX macro package
Printed on acid-free paper 41/3142db-543210

To Diane and Rachel

Preface to the Second Edition
It has been thirteen years since the first edition was published, with its subtitle
"a new approach." While the book has had some success, there are still almost
no other books that use the same approach. (See however the recent book by
K. Bichteler [15].) There are nevertheless of course other extant books, many
of them quite good, although the majority still are devoted primarily to the
case of continuous sample paths, and others treat stochastic integration as
one of many topics. Examples of alternative texts which have appeared since
the first edition of this book are: [32], [44], [87], [110], [186], [180], [208], [216],
and [226]. While the subject has not changed much, there have been new
developments, and subjects we thought unimportant in 1990 and did not
include, we now think important enough either to include or to expand in this
book.
The most obvious changes in this edition are that we have added exercises
at the end of each chapter, and we have also added Chap. VI which intro-
introduces the expansion of nitrations. However we have also completely rewritten
Chap. III. In the first edition we followed an elementary approach which was
P. A. Meyer's original approach before the methods of Doleans-Dade. In or-
order to remain friends with Freddy Delbaen, and also because we now agree
with him, we have instead used the modern approach of predictability rather
than naturality. However we benefited from the new proof of the Doob-Meyer
Theorem due to R. Bass, which ultimately uses only Doob's quadratic martin-
martingale inequality, and in passing reveals the role played by totally inaccessible
stopping times. The treatment of Girsanov's theorem now includes the case
where the two probability measures are not necessarily equivalent, and we
include the Kazamaki-Novikov theorems. We have also added a section on
compensators, with examples. In Chap. IV we have expanded our treatment
of martingale representation to include the Jacod-Yor Theorem, and this has
allowed us to use the Emery-Azema martingales as a class of examples of mar-
martingales with the martingale representation property. Also, largely because of
the Delbaen-Schachermayer theory of the fundamental theorems of mathe-
mathematical finance, we have included the topic of sigma martingales. In Chap. V

VIII Preface to the Second Edition
we added a section which includes some useful results about the solutions of
stochastic differential equations, inspired by the review of the first edition by
E. Pardoux [191]. We have also made small changes throughout the book;
for instance we have included specific examples of Levy processes and their
corresponding Levy measures, in Sect. 4 of Chap. I.
The exercises are gathered at the end of the chapters, in no particular
order. Some of the (presumed) harder problems we have designated with a
star (*), and occasionally we have used two stars (**). While of course many
of the problems are of our own creation, a significant number are theorems
or lemmas taken from research papers, or taken from other books. We do not
attempt to ascribe credit, other than listing the sources in the bibliography,
primarily because they have been gathered over the past decade and often
we don't remember from where they came. We have tried systematically to
refrain from relegating a needed lemma as an exercise; thus in that sense the
exercises are independent from the text, and (we hope) serve primarily to
illustrate the concepts and possible applications of the theorems.
Last, we have the pleasant task of thanking the numerous people who
helped with this book, either by suggesting improvements, finding typos and
mistakes, alerting me to references, or by reading chapters and making com-
comments. We wish to thank patient students both at Purdue University and
Cornell University who have been subjected to preliminary versions over the
years, and the following individuals: C. Benes, R. Cont, F. Diener, M. Di-
ener, R. Durrett, T. Fujiwara, K. Giesecke, L. Goldberg, R. Haboush, J. Ja-
cod, H. Kraft, K. Lee, J. Ma, J. Mitro, J. Rodriguez, K. Schurger, D. Sezer,
J. A. Trujillo Ferreras, R. Williams, M. Yor, and Yong Zeng. Th. Jeulin,
K. Shimbo, and Yan Zeng gave extraordinary help, and my editor C. Byrne
gives advice and has patience that is impressive. Over the last decade I have
learned much from many discussions with Darrell Duffle, Jean Jacod, Tom
Kurtz, and Denis Talay, and this no doubt is reflected in this new edition.
Finally, I wish to give a special thanks to M. Kozdron who hastened the ap-
appearance of this book through his superb help with ВД^Х, as well as his own
advice on all aspects of the book.
Ithaca, NY Philip Protter
August 2003

Preface to the First Edition
The idea of this book began with an invitation to give a course at the Third
Chilean Winter School in Probability and Statistics, at Santiago de Chile, in
July, 1984. Faced with the problem of teaching stochastic integration in only
a few weeks, I realized that the work of C. Dellacherie [42] provided an outline
for just such a pedagogic approach. I developed this into a series of lectures
(Protter [201]), using the work of K. Bichteler [14], E. Lenglart [145] and
P. Protter [202], as well as that of Dellacherie. I then taught from these lecture
notes, expanding and improving them, in courses at Purdue University, the
University of Wisconsin at Madison, and the University of Rouen in France.
I take this opportunity to thank these institutions and Professor Rolando
Rebolledo for my initial invitation to Chile.
This book assumes the reader has some knowledge of the theory of stochas-
stochastic processes, including elementary martingale theory. While we have recalled
the few necessary martingale theorems in Chap. I, we have not provided
proofs, as there are already many excellent treatments of martingale the-
theory readily available (e.g., Breiman [23], Dellacherie-Meyer [45, 46], or Ethier-
Kurtz [71]). There are several other texts on stochastic integration, all of which
adopt to some extent the usual approach and thus require the general theory.
The books of Elliott [63], Kopp [130], Metivier [158], Rogers-Williams [210]
and to a much lesser extent Letta [148] are examples. The books of McK-
ean [153], Chung-Williams [32], and Karatzas-Shreve [121] avoid the general
theory by limiting their scope to Brownian motion (McKean) and to contin-
continuous semimartingales.
Our hope is that this book will allow a rapid introduction to some of the
deepest theorems of the subject, without first having to be burdened with the
beautiful but highly technical "general theory of processes."
Many people have aided in the writing of this book, either through dis-
discussions or by reading one of the versions of the manuscript. I would like to
thank J. Azema, M. Barlow, A. Bose, M. Brown, C. Constantini, C. Dellache-
Dellacherie, D. Duffie, M. Emery, N. Falkner, E. Goggin, D. Gottlieb, A. Gut, S. He,
J. Jacod, T. Kurtz, J. de Sam Lazaro, R. Leandre, E. Lenglart, G. Letta,

X Preface to the First Edition
S. Levantal, P. A. Meyer, E. Pardoux, H. Rubin, T. Sellke, R. Stockbridge,
C. Strieker, P. Sundar, and M. Yor. I would especially like to thank J. San Mar-
Martin for his careful reading of the manuscript in several of its versions.
Svante Janson read the entire manuscript in several versions, giving me
support, encouragement, and wonderful suggestions, all of which improved
the book. He also found, and helped to correct, several errors. I am extremely
grateful to him, especially for his enthusiasm and generosity.
The National Science Foundation provided partial support throughout the
writing of this book.
I wish to thank Judy Snider for her cheerful and excellent typing of several
versions of this book.
Philip Protter

Contents
Introduction 1
I Preliminaries 3
1 Basic Definitions and Notation 3
2 Martingales 7
3 The Poisson Process and Brownian Motion 12
4 Levy Processes 19
5 Why the Usual Hypotheses? 34
6 Local Martingales 37
7 Stieltjes Integration and Change of Variables 39
8 Naive Stochastic Integration Is Impossible 43
Bibliographic Notes 44
Exercises for Chapter I 45
II Semimartingales and Stochastic Integrals 51
1 Introduction to Semimartingales 51
2 Stability Properties of Semimartingales 52
3 Elementary Examples of Semimartingales 54
4 Stochastic Integrals 56
5 Properties of Stochastic Integrals 60
6 The Quadratic Variation of a Semimartingale 66
7 Ito's Formula (Change of Variables) 78
8 Applications of Ito's Formula 84
Bibliographic Notes 92
Exercises for Chapter II 94
III Semimartingales and Decomposable Processes 101
1 Introduction 101
2 The Classification of Stopping Times 103
3 The Doob-Meyer Decompositions 105
4 Quasimartingales 116

XII Contents
5 Compensators 118
6 The Fundamental Theorem of Local Martingales 124
7 Classical Semimartingales 127
8 Girsanov's Theorem 131
9 The Bichteler-Dellacherie Theorem 143
Bibliographic Notes 147
Exercises for Chapter III 147
IV General Stochastic Integration and Local Times 153
1 Introduction 153
2 Stochastic Integration for Predictable Integrands 153
3 Martingale Representation 178
4 Martingale Duality and the Jacod-Yor Theorem on
Martingale Representation 193
5 Examples of Martingale Representation 200
6 Stochastic Integration Depending on a Parameter 205
7 Local Times 210
8 Azema's Martingale 227
9 Sigma Martingales 233
Bibliographic Notes 235
Exercises for Chapter IV 236
V Stochastic Differential Equations 243
1 Introduction 243
2 The И? Norms for Semimartingales 244
3 Existence and Uniqueness of Solutions 249
4 Stability of Stochastic Differential Equations 257
5 Fisk-Stratonovich Integrals and Differential Equations 270
6 The Markov Nature of Solutions 291
7 Flows of Stochastic Differential Equations: Continuity and
Differentiability 301
8 Flows as Diffeomorphisms: The Continuous Case 310
9 General Stochastic Exponentials and Linear Equations 321
10 Flows as Diffeomorphisms: The General Case 328
11 Eclectic Useful Results on Stochastic Differential Equations . .. 338
Bibliographic Notes 347
Exercises for Chapter V 349
VI Expansion of Filtrations 355
1 Introduction 355
2 Initial Expansions 356
3 Progressive Expansions 369
4 Time Reversal 377
Bibliographic Notes 383
Exercises for Chapter VI 384

Contents XIII
References 389
Symbol Index 403
Subject Index 407

Introduction
In this book we present a new approach to the theory of modern stochastic
integration. The novelty is that we define a semimartingale as a stochastic pro-
process which is a "good integrator" on an elementary class of processes, rather
than as a process that can be written as the sum of a local martingale and an
adapted process with paths of finite variation on compacts: This approach has
the advantage over the customary approach of not requiring a close analysis of
the structure of martingales as a prerequisite. This is a significant advantage
because such an analysis of martingales itself requires a highly technical body
of knowledge known as "the general theory of processes." Our approach has a
further advantage of giving traditionally difficult and non-intuitive theorems
(such as Strieker's Theorem) transparently simple proofs. We have tried to
capitalize on the natural advantage of our approach by systematically choos-
choosing the simplest, least technical proofs and presentations. As an example we
have used К. М. Rao's proofs of the Doob-Meyer decomposition theorems
in Chap. Ill, rather than the more abstract but less intuitive Doleans-Dade
measure approach.
In Chap. I we present preliminaries, including the Poisson process, Brown-
ian motion, and Levy processes. Naturally our treatment presents those prop-
properties of these processes that are germane to stochastic integration.
In Chap. II we define a semimartingale as a good integrator and establish
many of its properties and give examples. By restricting the class of integrands
to adapted processes having left continuous paths with right limits, we are
able to give an intuitive Riemann-type definition of the stochastic integral as
the limit of sums. This is sufficient to prove many theorems (and treat many
applications) including a change of variables formula ("Ito's formula").
Chapter III is devoted to developing a minimal amount of "general the-
theory" in order to prove the Bichteler-Dellacherie Theorem, which shows that
our "good integrator" definition of a semimartingale is equivalent to the usual
one as a process X having a decomposition X — M + A, into the sum of a
local martingale M and an adapted process A having paths of finite variation
on compacts. Nevertheless most of the theorems covered en route (Doob-

2 Introduction
Meyer, Meyer-Girsanov) are themselves key results in the theory. The core
of the whole treatment is the Doob-Meyer decomposition theorem. We have
followed the relatively recent proof due to R. Bass, which is especially simple
for the case where the martingale jumps only at totally inaccessible stopping
times, and in all cases uses no mathematical tool deeper than Doob's quadratic
martingale inequality. This allows us to avoid the detailed treatment of nat-
natural processes which was ubiquitous in the first edition, although we still use
natural processes from time to time, as they do simplify some proofs.
Using the results of Chap. Ill we extend the stochastic integral by continu-
continuity to predictable integrands in Chap. IV, thus making the stochastic integral
a Lebesgue-type integral. We use predictable integrands to develop a theory of
martingale representation. The theory we develop is an I? theory, but we also
prove that the dual of the martingale space Ц} is В MO and then prove the
Jacod-Yor Theorem on martingale representation, which in turn allows us to
present a class of examples having both jumps and martingale representation.
We also use predictable integrands to give a presentation of semimartingale
local times.
Chapter V serves as an introduction to the enormous subject of stochastic
differential equations. We present theorems on the existence and uniqueness
of solutions as well as stability results. Fisk-Stratonovich equations are pre-
presented, as well as the Markov nature of the solutions when the differentials
have Markov-type properties. The last part of the chapter is an introduction
to the theory of flows, followed by moment estimates on the solutions, and
other minor but useful results. Throughout Chap. V we have tried to achieve
a balance between maximum generality and the simplicity of the proofs.
Chapter VI provides an introduction to the theory of the expansion of ni-
nitrations (known as "grossissements de nitrations" in the French literature). We
present first a theory of initial expansions, which includes Jacod's Theorem.
Jacod's Theorem gives a sufficient condition for semimartingales to remain
semimartingales in the expanded filtration. We next present the more diffi-
difficult theory of progressive expansion, which involves expanding nitrations to
turn a random time into a stopping time, and then analyzing what happens
to the semimartingales of the first filtration when considered in the expanded
filtration. Last, we give an application of these ideas to time reversal.

Preliminaries
1 Basic Definitions and Notation
We assume as given a complete probability space (Q, T', P). In addition we are
given a filtration (Tt)o<t<oo- By a filtration we mean a family of cr-algebras
(•^t)o<t<oo that is increasing, i.e., Ts С Tt if s < t. For convenience, we will
usually write F for the filtration (Tt)o<t<oo-
Definition. A filtered complete probability space (fi,T,F, P) is said to sat-
satisfy the usual hypotheses if
(i) To contains all the P-null sets of T;
(ii) Tt = f]u>t Tu, all i, 0 < t < oo; that is, the filtration F is right continuous.
We always assume that the usual hypotheses hold.
Definition. A random variable T : п —> [0, oo] is a stopping time if the
event {T < t} € Tt, every t,0 <t <oo.
One important consequence of the right continuity of the filtration is the
following theorem.
Theorem 1. The event {T < t} € Tt, 0 < t < oo, if and only if T is a
stopping time.
Proof. Since {T < t} = f)t+e>u>t{T < UK апУ ? > 0, we have {T <
t} € f)u>tTu = Tt, so T is a stopping time. For the converse, {T < t} =
{Jt>e>olT < ^ - ?}. and {T<t-e}€ Tts, hence also in Tt. D
A stochastic process X on (fl, T, P) is a collection of R-valued or Re-
Revalued random variables (^Q)o<t<oo- The process X is said to be adapted if
Xt ? Tt (that is, is Tt measurable) for each t. We must take care to be precise
about the concept of equality of two stochastic processes.
Definition. Two stochastic processes X and Y are modifications if Xt =Yt
a.s., each t. Two processes X and Y are indistinguishable if a.s., for all t,
Xt = Yt.

4 I Preliminaries
If X and Y are modifications there exists a null set, Nt, such that if ш $. Nt,
then Xt(u) = Yt{u). The null set Nt depends on t. Since the interval [0, oo)
is uncountable the set TV = Uo<t<oo Nt could have any probability between 0
and 1, and it could even be non-measurable. If X and Y are indistinguishable,
however, then there exists one null set TV such that if w $. N, then Xt{u)) =
Yt(uj), for all t. In other words, the functions t н-> Xt(uj) and t н-» Yt{u>) are
the same for all w $ N, where P(N) = 0. The set TV is in Tt, all t, since J-q
contains all the P-null sets of T. The functions t i-> Xt(w) mapping [0, oo)
into R are called the sample paths of the stochastic process X.
Definition. A stochastic process X is said to be cadlag if it a.s. has sam-
sample paths which are right continuous, with left limits. Similarly, a stochastic
process X is said to be caglad if it a.s. has sample paths which are left
continuous, with right limits. (The nonsensical words cadlag and caglad are
acronyms from the French for continu a droite, limites a gauche and continu
a gauche, limites a droite, respectively.)
Theorem 2. Let X and Y be two stochastic processes, with X a modifica-
modification of Y. If X and Y have right continuous paths a.s., then X and Y are
indistinguishable.
Proof. Let A be the null set where the paths of X are not right continuous,
and let В be the analogous set for Y. Let Nt = {w : Xt(w) ф Yt(w)}, and
let TV = \JteqNt, where Q denotes the rationale in [0, oo). Then P{N) = 0.
Let M = A U В U N, and P(M) = 0. We have Xt{u) = Yt{u) for all t e Q,
ш $ M. If t is not rational, let tn decrease to t through Q. For w $ M,
Xtn(cu) = Ytn(u), each n, and Xt(w) = lim^ooXtn(ш) — limn-.oo^^w) =
Yt(w). Since P{M) = 0, X and Y are indistinguishable. ?
Corollary. Let X and Y be two stochastic processes which are cadlag. If X
is a modification of Y, then X and Y are indistinguishable.
Cadlag processes provide natural examples of stopping times.
Definition. Let X be a stochastic process and let Л be a Borel set in R.
Define
T(w) = M{t > 0 : Xt € A}.
Then T is called a hitting time of Л for X.
Theorem 3. Let X be an adapted cadlag stochastic process, and let Л be an
open set. Then the hitting time of Л is a stopping time.
Proof. By Theorem 1 it suffices to show that {Г < t} € Tt, 0 < t < oo. But
{T<t}= U {XseA},
s6Qn[0,t)
since Л is open and X has right continuous paths. Since {Xs € Л} = Х8(Л) е
J-s, the result follows. ?

1 Basic Definitions and Notation 5
Theorem 4. Let X be an adapted cadlag stochastic process, and let Л be a
closed set. Then the random variable
T(w) = ini{t >0:I((wN/l or Xt-(w) € A}
is a stopping time.
Proof. By Xt_(w) we mean lims_tiS<t Xs(w). Let An = {x : d(x, A) < 1/n},
where d(x, A) denotes the distance from a point x to A. Then An is an open
set and
{T<t} = {xt e Л or Xt_ e Л} и {f| (J {xs e AJ}. a
» s€Qn[0,t)
It is a very deep result that the hitting time of a Borel set is a stopping
time. We do not have need of this result.
The next theorem collects elementary facts about stopping times; we leave
the proof to the reader.
Theorem 5. Let S, T be stopping times. Then the following are stopping
times:
(i) S AT = min(S,T);
(ii) SvT = ma,x(S,T);
(in) S + T;
(iv) aS, where a > 1.
The (T-algebra Tt can be thought of as representing all (theoretically) ob-
observable events up to and including time t. We would like to have an analogous
notion of events that are observable before a random time.
Definition. Let Г be a stopping time. The stopping time <r-algebra Тт
is defined to be
{Ле7:Лп{Т<«}€7(, alH > 0}.
The previous definition is not especially intuitive. However it does well
represent "knowledge" up to time T, as the next theorem illustrates.
Theorem 6. Let T be a finite stopping time. Then Тт is the smallest a-
algebra containing all cadlag processes sampled at T. That is,
J~t = <t{Xt;X all adapted cadlag processes}.
Proof. Let G — a{XT;X all adapted cadlag processes}. Let A € Тт- Then
Xt = lyil{t>T}1 is a cadlag process, and Хт = 1л- Hence A € Q, and Тт С Q.
1 1л is the indicator function of A : 1л(ш) = I ' '
10, w f A.

6 I Preliminaries
Next let X be an adapted cadlag process. We need to show XT is Тт
measurable. Consider X(s,u) as a function from [0, oo) x fi into R. Construct
4> ¦ {T < t} ->• [0, oo) x П by ip(u) = (Т(ш),ш). Then since X is adapted and
cadlag, we have Хт = Xoip is a measurable mapping from ({T < t}, TtD{T <
t}) into (R, B), where В are the Borel sets of R. Therefore
{co:X(T(Lj),u;)eB}n{T<t}
is in Tt, and this implies Хт € Тт- Therefore Q С Тт- ?
We leave it to the reader to check that if S < T a.s., then Ts с Тт, and
the less obvious (and less important) fact that Ts ОТт = Tsat-
If X and Y are cadlag, then Xt = Yt a.s. each t implies that X and Y are
indistinguishable, as we have already noted. Since fixed times are stopping
times, obviously if Хт = Yr a.s. for each finite stopping time T, then X and
Y are indistinguishable. If X is cadlag, let AX denote the process AXt =
Xt — Xt-. Then AX is not cadlag, though it is adapted and for a.a. w,
t i—> AXt = 0 except for at most countably many t. We record here a useful
result.
Theorem 7. Let X be adapted and cadlag. If АХт1{т<оо} = 0 a-s- for each
stopping time T, then AX is indistinguishable from the zero process.
Proof. It suffices to prove the result on [0, to] for 0 < to < oo. The set {t :
|AXt| > 0} is countable a.s. since X is cadlag. Moreover
oo 1
= \J{f.\AXt\>-}
n=l
and the set {t : |AXj| > 1/n} must be finite for each n, since ?o < oo. Using
Theorem 4 we define stopping times for each n inductively as follows:
грП,1
rpn,k
Then Tn'k > Tn'k~l a.s.
{|AXt| ;
= inf{t
= mi{t
on {T1'
>0} = L
n
>0:
>Tn,
JilAJ
,fc
АХ4|
k-i.
oo}.
AXt\ > -}.
n
Moreover,
L{T".fe<oo}l >
where the right side of the equality is a countable union. The result follows. D
Corollary. Let X and Y be adapted and cadlag. If for each stopping time
T, АХт1{т<оо} = AYt1{t<oo} a-s-5 then AX and AY are indistinguishable.

2 Martingales 7
A much more general version of Theorem 7 is true, but it is a very deep
result which uses Meyer's "section theorems," and we will not have need of
it. See, for example, Dellacherie [41] or Dellacherie-Meyer [45].
A fundamental theorem of measure theory that we will need from time
to time is known as the Moiiotone Class Theorem. Actually there are several
such theorems, but the one given here is sufficient for our needs.
Definition. A monotone vector space Hona space Q is defined to be
the collection of bounded, real-valued functions /onfi satisfying the three
conditions:
(i) ti is a vector space over R;
(ii) 1q e H (i.e., constant functions are in H); and
(iii) if (/„)„>! С П, and 0 < ^ < /2 <•¦•</„<••-, and limn-.*, fn = f,
and / is bounded, then / G H.
Definition. A collection M. of real functions defined on a space fl is said to
be multiplicative if /, g e M implies that fg € M.
For a collection of real-valued functions ЛЛ defined on 0, we let cr{.M}
denote the space of functions defined on fl which are measurable with respect
to the ст-algebra on Q generated by {f~1(A}; Л ? В (Ж), f G M}.
Theorem 8 (Monotone Class Theorem). Let Ai be a multiplicative class
of bounded real-valued functions defined on a space fl, and let Л = а{ЛЛ}. If
ti is a monotone vector space containing ЛЛ, then ti contains all bounded, Л
measurable functions.
Theorem 8 is proved in Dellacherie-Meyer [45, page 14] with the additional
hypothesis that H is closed under uniform convergence. This extra hypothesis
is unnecessary, however, since every monotone vector space is closed under
uniform convergence. (See Sharpe [215, page 365].)
2 Martingales
In this section we give, mostly without proofs, only the essential results from
the theory of continuous time martingales. The reader can consult any of
a large number of texts to find excellent proofs; for example Dellacherie-
Meyer [46], or Ethier-Kurtz [71]. Also, recall that we will always assume as
given a filtered, complete probability space (fi,.F, F, P), where the filtration
F = (^t)o<t<oo is assumed to be right continuous.
Definition. A real-valued, adapted process X — (Xt)o<t<00 is called a mar-
tingale(resp. supermartingale, submartingale) with respect to the filtra-
filtration F if
(i) Xt e L^dP); that is, E{\Xt\} < oo;
(ii) if s < t, then E{Xt\Ts} = Xs, a.s. (resp. E{Xt\Ts} < Xs, resp. > Xs).

8 I Preliminaries
Note that martingales are only defined on [0, oo); that is, for finite t and not
t = oo. It is often possible to extend the definition to t = oo.
Definition. A martingale X is said to be closed by a random variable Y if
E{\Y\} < oo and Xt = E{Y\ft}, 0<t<oo.
A random variable Y closing a martingale is not necessarily unique. We
give a sufficient condition for a martingale to be closed (as well as a construc-
construction for closing it) in Theorem 12.
Theorem 9. Let X be a supermartingale. The function t i—> E{Xt) is right
continuous if and only if there exists a modification Y of X which is cadlag.
Such a modification is unique.
By uniqueness we mean up to indistinguishability. Our standing assump-
assumption that the "usual hypotheses" are satisfied is used implicitly in the state-
statement of Theorem 9. Also, note that the process Y is, of course, also a super-
supermartingale. Theorem 9 is proved using Doob's upcrossing inequalities. If X is
a martingale then 11—> E{Xt} is constant, and hence it has a right continuous
modification.
Corollary. If X = (Xt)o<t<<x> is a martingale then there exists a unique
modification Y of X which is cadlag.
Since all martingales have right continuous modifications, we will always
assume that we are taking the right continuous version, without any special
mention. Note that it follows from this corollary and Theorem 2 that a right
continuous martingale is cadlag.
Theorem 10 (Martingale Convergence Theorem). Let X be a right
continuous supermartingale, sup0<t<oo-E{|Xt|} < oo. Then the random vari-
variable Y = limt^oo^t a.s. exists, and E{\Y\} < oo. Moreover if X is
a martingale closed by a random variable Z, then Y also closes X and
A condition known as uniform integrability is sufficient for a martingale
to be closed.
Definition. A family of random variables {Ua)a^A is uniformly integrable
if r
lim sup / \Ua\dP = 0.
™^°° a J{\Ua\>n}
Theorem 11. Let {Ua)aeA be a subset of L1. The following are equivalent:
(i) (Ua)a?A is uniformly integrable.
(ii) supa€j4 .E{|[/a|} < oo, and for every e > 0 there exists 5 > 0 such that
ЛеТ, Р(Л) < 5, imply E{\UalA\} < e.
Vo<t<oo ^* denotes the smallest cr-algebra generated by (JFt), alii, 0 < t < oo.

2 Martingales 9
(in) There exists a positive, increasing, convex function G(x) defined on
[0, oo) such that lim^oo ^jp = +oo and supa E{G о \Ua\} < oo.
The assumption that G is convex is not needed for the implications (Hi) =Ф>
(ii) and (in) =Ф> (i).
Theorem 12. Let X be a right continuous martingale which is uniformly
integrable. Then Y = Нгщ-юо Xt a.s. exists, E{\Y\} < oo, and Y closes X as
a martingale.
Theorem 13. Let X be a (right continuous) martingale. Then (Xt)t>o is
uniformly integrable if and only ifY = limt^ooXt exists a.s., E{\Y\} < oo,
and (Xt)o<t<oo is a martingale, where Xoo = Y.
If X is a uniformly integrable martingale, then Xt converges to Xoo = Y in
L1 as well as almost surely. The next theorem we use only once (in the proof
of Theorem 28), but we give it here for completeness. The notation (Xn)n<0
refers to a process indexed by the non-positive integers: ... ,X-2, X~i, Xo-
Theorem 14 (Backwards Convergence Theorem). Let (Xn)n<o be a
martingale. Then linin^-oo Xn = E{X0\ Dn=-oo ^n} a-$- and in L1.
A less probabilistic interpretation of martingales uses Hilbert space theory.
Let Y e L2{U,T,P). Since Tt С Т, the spaces L2{U,Tt,P) form a family of
Hilbert subspaces of Ь2(П,Т, Р). Let WtY denote the Hilbert space projection
of rontoL2@,ft,P).
Theorem 15. Let Y € I2(Q,T, P). The process Xt - VtY is a uniformly
integrable martingale.
Proof. It suffices to show E{Y\Jrt) — VtY. The random variable E{Y\Tt}
is the unique Tt measurable r.v. such that JAYdP = fA E{Y\!Ft)dP, for
any event A e Tt. We have JA YdP = JA ^YdP + JA{Y -"•* Y)dP. But
JA(Y-^Y)dP = J lA{Y-^Y)dP. Since 1A e I2(Q,Ft,P), and (Y-**Y)
is in the orthocomplement of L2{U,TUP), we have / 1A(Y - VtY)dP = 0,
and thus by uniqueness E{Y\Tt) = VtY. Since |Г'У||ь2 < ЦУЦ^г, by part (iii)
of Theorem 11 we have that X is uniformly integrable (take G{x) = x2). ?
The next theorem is one of the most useful martingale theorems for our
purposes.
Theorem 16 (Doob's Optional Sampling Theorem). Let X be a right
continuous martingale, which is closed by a random variable X^. Let S and
T be two stopping times such that S <T a.s. Then Xs and Xt are integrable
and
Xs = E{XT\Ts] a.s.
Theorem 16 has a similar version for supermartingales.

10
I Preliminaries
Theorem 17. Let X be a right continuous supermartingale (resp. martin-
martingale), and let S and T be two bounded stopping times such that S < T a.s.
Then Xs and Хт are integrable and
Xs > E{XT\?s} a.s. {resp. =).
If Г is a stopping time, then so is t A T = mm(t, T), for each t > 0.
Definition. Let X be a stochastic process and let T be a random time. XT
is said to be the process stopped at T if Xj = XtAT-
Note that if X is adapted and cadlag and if T is a stopping time, then
is also adapted. A martingale stopped at a stopping time is still a martingale,
as the next theorem shows.
Theorem 18. Let X be a uniformly integrable right continuous martingale,
and let T be a stopping time. Then XT — (XtAT)o<t<oo is also a uniformly
integrable right continuous martingale.
Proof. XT is clearly right continuous. By Theorem 16
e Тт. Thus,
However for H e Tt we have
Therefore
= E{XT\Tt},
since Хт1{т<г} is Ft measurable. Thus XT is a uniformly integrable Tt mar-
martingale by Theorem 13. ?
Observe that the difficulty in Theorem 18 is to show that XT is a martin-
martingale for the filtration {Tt)o<t<oo- It is a trivial consequence of Theorem 16 that
XT = XtAT is a martingale for the filtration (Qt)o<t<oo given by Qt = FtAT-
Corollary. Let Y be an integrable random variable and let S, T be stopping
times. Then
= E{E{Y\TT}\Ts]
= E{Y\Fsat).

2 Martingales 11
Proof. Let Yt = E{Y\!Ft}. Then YT is a uniformly integrable martingale and
= Y$ = E{YT\FS}
= E{E{Y\TtWs}.
Interchanging the roles of T and 5 yields
Ysat = Yj? = E{YS\TT}
= E{E{Y\FsWt}-
Finally, E{Y\FSat} = YSat- ?
The next inequality is elementary, but indispensable.
Theorem 19 (Jensen's Inequality). Let <p : E —> R 6e convex, and let X
and <p(X) be integrable random variables. For any a-algebra Q,
<P°E{X\G}<E{<p(X)\G}.
Corollary 1. Let X be a martingale, and let <p be convex such that <p(Xt)
is integrable, 0 < t < oo. Then </?(X) is a submartingale. In particular, if M
is a martingale, then \M\ is a submartingale.
Corollary 2. Let X be a submartingale and let <p be convex, non-decreasing,
and such that <p(Xt)o<t<oo is integrable. Then <p(X) is also a submartingale.
We end our review of martingale theory with Doob's inequalities; the most
important is when p = 2.
Theorem 20. Let X be a positive submartingale. For all p > 1, with q con-
conjugate to p (i.e., ^ + ^ = 1), we have
| sup \Xt\\\LP<q sup \\Xt\\
t t
We let X* denote sups \XS\. Note that if M is a martingale with M^ e L2,
then \M\ is a positive submartingale, and taking p = 2 we have
This last inequality is called Doob's maximal quadratic inequality.
An elementary but useful result concerning martingales is the following.
Theorem 21. Let X = (Xt)o<t<oo be an adapted process with cadlag paths.
Suppose E{\Xt\} < oo and E{X?} = 0 for any stopping time T, finite or
not. Then X is a uniformly integrable martingale.
Proof. Let 0 < s < t < oo, and let Л € Ts. Let
\ u, if ш е A,
00, if LJ f Л.

12 I Preliminaries
Then ил are stopping times for all и > s. Moreover
f XudP = I XUAdP - f X^dP
Ja J Jq\a
= _/
Jn\
)п\л
since Е{Хил} = 0 by hypothesis, for и > s. Thus for Л е !FS and s < t,
E{XtlA} = E{XS1A} = -E{Xooln\A}, which implies E{Xt\Fs} = X., and
X is a martingale, 0 < t < oo. П
Definition. A martingale X with Xq = 0 and E{Xf} < oo for each t > 0 is
called a square integrable martingale. If E{X^a\ < oo as well, then X is
called an L2 martingale.
Clearly, any I? martingale is also a square integrable martingale. See also
Sect. 3 of Chap. IV.
3 The Poisson Process and Brownian Motion
The Poisson process and Brownian motion are the two fundamental examples
in the theory of continuous time stochastic processes. The Poisson process is
the simpler of the two, and we begin with it. We recall that we assume given
a filtered probability space (Q,^", F, P) satisfying the usual hypotheses.
Let (Tn)n>o be a strictly increasing sequence of positive random variables.
We always take To = 0 a.s. Recall that the indicator function l{t>Tn} is defined
as
_/l, if t>Tn(cj),
1{|
Definition. The process N = (NtH<t<oo denned by
with values in NU{oo} where N = {0,1,2,... } is called the counting process
associated to the sequence (Tn)n>\.
If we set T — supn Tn, then
[Г„,оо) = {N>n} = {(t,w) : Щи) > n}
as well as
[Tn,Tn+l) = {N = n}, and [r,oo) = {7V = oo}.

3 The Poisson Process and Brownian Motion 13
The random variable T is the explosion time of N. If T = oo a.s., then N is a
counting process without explosions. For T = oo, note that for 0 < s < t < oo
we have
The increment Nt — Ns counts the number of random times Tn that occur
between the fixed times s and t.
As we have defined a counting process it is not necessarily adapted to the
filtration F. Indeed, we have the following.
Theorem 22. A counting process N is adapted if and only if the associated
random variables (Tn)n>i are stopping times.
Proof. If the (Tn)n>o are stopping times (with To = 0 a.s.), then the event
{Nt = n} = {uj : Tn{uj) < t < Tn+iH} € Tu
for each n. Thus Nt e Tt and N is adapted. If N is adapted, then {Tn < t} =
{Nt > n\ € Tt, each t, and therefore Tn is a stopping time. D
Note that a counting process without explosions has right continuous paths
with left limits; hence a counting process without explosions is cadlag.
Definition. An adapted counting process TV is a Poisson process if
(i) for any s, t, 0 < s < t < oo, Nt — Ns is independent of Т3\
(ii) for any s, t, u, v, 0 < s <t < oo, 0 < и < v < oo, t — s — v — u, then the
distribution of Nt — Ns is the same as that of Nv — Nu.
Properties (i) and (ii) are known respectively as increments independent of
the past, and stationary increments.
Theorem 23. Let N be a Poisson process. Then
n\
n = 0,1,2,..., for some Л > 0. That is, Nt has the Poisson distribution with
parameter Xt. Moreover, N is continuous in probability3 and does not have
explosions.
Proof. The proof of Theorem 23 is standard and is often given in more ele-
elementary courses (cf., e.g., Qinlar [33, page 71]). We sketch it here.
Step 1. For all t > 0, P(Nt = 0) = e~Xt, for some constant Л > 0.
3 N is continuous in probability means that for t > 0, limu^t Nu = Nt where the
limit is taken in probability.

14 I Preliminaries
Since {Nt = 0} = {Ns = 0} П {Nt - Ns = 0} for 0 < s < t < oo by the
independence of the increments,
P(Nt = 0) = P(NS = 0)P(Nt - NS = 0)
= P(NS = 0)P(Nt_s = 0),
by the stationarity of the increments. Let a(t) = P(Nt = 0). We have a{t) =
a(s)a(t — s), for all 0 < s < t < oo. Since a{t) can be easily seen to be right
continuous in t, we deduce that either a{t) = 0 for all t > 0 or
a(t) = e~xt for some Л > 0.
If a(t) — 0 it would follow that Nt(w) = oo a.s. for all t which would con-
contradict that N is a counting process. Note that limu^t P(\NU — Nt\ > e) =
lim«^t P(\Nu-t\ > e) = lim^^o P(NV > e) = lim^^o 1 - e~Xv = 0; hence N
is continuous in probability.
Step 2. P(Nt > 2) is o(t). (That is, limt^0 \P(Nt > 2) = 0.)
Let C{t) = P(Nt > 2). Since the paths of N are non-decreasing, C is
also non-decreasing. One readily checks that showing lim^o jP(t) — 0 is
equivalent to showing that Ит^^оо и/3(^) = 0. Divide [0, l] into n subintervals
of equal length, and let Sn denote the number of subintervals containing at
least two arrivals. By the independence and stationarity of the increments Sn is
the sum of n i.i.d. zero-one valued random variables, and hence has a Binomial
distribution (n,p), where p = /3(^). Therefore E{Sn} = np = n/3(?).
Since N is a counting process, we know the arrival times are strictly
increasing; that is, Tn < Tn+i a.s. Since Sn < N\, if E{Ni} < oo we
can use the Dominated Convergence Theorem to conclude limn_>oon/3(^) =
limn^oo E{Sn} = 0. (That E{Ni} < oo is a consequence of Theorem 34,
established in Sect. 4).
Also note that ?{./Vi} < oo implies Ni < oo a.s. and hence there are no
explosions before time 1. This implies for fixed ui, for n sufficiently large no
subinterval has more than one arrival (otherwise there would be an explosion).
Hence, limn^oo Sn(uj) = 0 a.s.
Step 3. limt^o \P{Nt = 1} = A.
Since P{Nt = 1} = 1 - P{Nt = 0} - P{Nt > 2}, it follows that
hm lP{Nt 1} lim
t-.o t t—о
Step 4- Conclusion.
We write <p(t) = E{aNt}, for 0 < a < 1. Then for 0 < s < t < oo,
the independence and stationarity of the increments implies that <p(t + s) =
<p(t)<p(s) which in turn implies that <p(t) = e*^a). But

3 The Poisson Process and Brownian Motion 15
n=0
oo
= p(Nt = 0) + aP(Nt = 1) + ^ anP(Nt = n),
n=2
and ф(а) = '/''(О), the derivative of <p at 0. Therefore
«<» - sa ^ - a {m^1+^^^+>}
= -Л + Xa.
Therefore <p(t) = e-M+Xat, hence
00
V{t) = E a"PGVt = „) =
Equating coefficients of the two infinite series yields
7*.
n=0 n=0
for n =0,1,2,.... П
Definition. The parameter Л associated to a Poisson process by Theorem 23
is called the intensity, or arrival rate, of the process.
Corollary. A Poisson process N with intensity Л satisfies
E{Nt} = Xt,
Variance^) = Var(Art) = At.
The proof is trivial and we omit it.
There are other, equivalent definitions of the Poisson process. For example,
a counting process N without explosion can be seen to be a Poisson process
if for all s, t, 0 < s < t < oo, E{Nt} < oo and
E{Nt-Ns\Ts} = X(t-s).
Theorem 24. Let N be a Poisson process with intensity X. Then Nt — Xt and
(Nt - XtJ - Xt are martingales.
Proof. Since Xt is non-random, the process Nt — Xt has mean zero and inde-
independent increments. Therefore
E{Nt -Xt- (Ns - Xs)\Ts} = E{Nt -Xt- (Ns - As)} = 0,
for 0 < s < t < oo. The analogous statement holds for (Nt - XtJ - Xt. ?

16 I Preliminaries
Definition. Let if be a stochastic process. The natural filtration of H,
denoted F° = (^?)o<t<oo, is denned by T% = a{Hs\ s < t}. That is, F$ is the
smallest filtration that makes H adapted.
Note that natural nitrations are not assumed to contain all the P-null sets
Theorem 25. Let N be a counting process. The natural filtration of N is
right continuous.
Proof. Let E = [0, oo] and В be the Borel sets of E, and let Г be the path
space given by
Г = ( П Еа, ® Bt).
s€[0,oo) s€[0,oo)
Define the maps щ : О —> Г by
щ{ш) = s ^ NsM(lj).
Thus the range of Tit is contained in the set of functions constant after t. The
сг-algebra ff is also generated by the single function space-valued random
variable щ.
Let A be an event in Пп>1 ^t+i • Then there exists a set An € ® se[o oo) ^s
such that Л = {?rt+i. G An}. Next set Wn = {щ = 7rt+i}. For each w,
there exists an n such that s •—> Ns(lj) is constant on [t,t + ^]; therefore
Q = Un>i ^iii where TVn is an increasing sequence of events. Therefore
Л = lim(Wn П A)
П
= \im{Wn П {щ+г е Л„})
= lim(Wn П {7rt e An})
n
which implies Л G ^. We conclude Пп>1 J~t+± *- -^?) which implies they are
equal. D
We next turn our attention to the Brownian motion process. Recall that
we are assuming as given a filtered probability space (fi, T, F, P) that satisfies
the usual hypotheses.
Definition. An adapted process В = (Bt)o<t<oo taking values in Rn is called
an n-dimensional Brownian motion if
(i) for 0 < s < t < oo, Bt — Bs is independent of Ts {increments are indepen-
independent of the past);
(ii) for 0 < s < t, Bt — Bs is a Gaussian random variable with mean zero and
variance matrix (t — s)C, for a given, non-random matrix C.

3 The Poisson Process and Brownian Motion 17
The Brownian motion starts at x if P{Bq = x) = 1.
The existence of Brownian motion is proved using a path-space construc-
construction, together with Kolmogorov's Extension Theorem. It is simple to check
that a Brownian motion is a martingale as long as ?{|i?o|} < oo. Therefore
by Theorem 9 there exists a version which has right continuous paths, a.s.
Actually, more is true.
Theorem 26. Let В be a Brownian motion. Then there exists a modification
of В which has continuous paths a.s.
Theorem 26 is often proved in textbooks on probability theory (e.g.,
Breiman [23]). It can also be proved as an elementary consequence of Kol-
Kolmogorov's Lemma (Theorem 72 of Chap. IV). We will always assume that we
are using the version of Brownian motion with continuous paths. We will also
assume, unless stated otherwise, that С is the identity matrix. We then say
that a Brownian motion В with continuous paths, with С = I the identity
matrix, and with Bo = x for some x € R™, is a standard Brownian motion.
Note that for an Rn standard Brownian motion B, writing Bt = {B\, ...,B?),
0 < t < oo, then each B% is an R1 Brownian motion with continuous paths,
and the B*'s are independent.
We have already observed that a Brownian motion В with ?{|i?o|} < oo
is a martingale. Another important elementary observation is the following.
Theorem 27. Let В = (-Bt)o<t<oo be a one dimensional standard Brownian
motion with Bo = 0. Then Mt = B2 — t is a martingale.
Proof. E{Mt} = E{Bf -t}=0. Also
E{Mt - Ма\Та} = E{B2t - B2S ~{t-
and
a\F.} = BsE{Bt\fs} = B2S,
since В is a martingale with Bs, Bt € L2. Therefore
E{Mt - MS\FS} = E{B2 - 2BtBs + B2 -(t-
= E{(Bt-BsJ-(t-s)\fs}
= E{(Bt - BsJ} -(t-s)
= 0,
due to the independence of the increments from the past. D
Theorem 28. Letnn be a sequence of partitions of[a,a+t]. Suppose тгт С тгп
if m > n (that is, the sequence is a refining sequence). Suppose more-
moreover that limn^oo meshGrn) = 0. Let nnB = ^2ueVn(Bti+1 - BuJ. Then
limn_»oo "KnB = t a.s., for a standard Brownian motion B.

18 I Preliminaries
Proof. We first show convergence in mean square. We have
where Fj are independent random variables with zero means. Therefore
Е{(тгпВ - tJ} = E{(? YiJ} =
Next observe that (Bt.+1 — BtiJ/(ti+\ -1{) has the distribution of Z2, where
Z is Gaussian with mean 0 and variance 1. Therefore
E{(nnB - tJ} = E{{Z2 - IJ}
U 67Г
which tends to 0 as n tends to oo. This establishes L2 convergence (and hence
convergence in probability as well).
To obtain the a.s. convergence we use the Backwards Martingale Conver-
Convergence Theorem (Theorem 14). Define
for n — —1,-2,-3, Then it is straightforward (though notationally
messy) to show that
E{Nn\Nn.1,Nn-2,...}=Nn-1.
Therefore Nn is a martingale relative to Qn = a{Nk, к < n}, n = —1, —2,
By Theorem 14 we deduce limn^_oo Nn = linin-.oo nnB exists a.s., and since
ттпВ converges to t in L2, we must have limn^oo жпВ = t a.s. as well. D
Comments. As noted in the proofs, the proof is simple (and half as long) if we
conclude only L2 convergence (and hence convergence in probability), instead
of a.s. convergence. Also, we can avoid the use of the Backwards Martingale
Convergence Theorem (Theorem 14) in the second half of the proof if we
add the hypothesis that Yln meshGrn) < oo. The result then follows, after
having proved the L2 convergence, by using the Borel-Cantelli Lemma and
Chebsyshev's inequality. Furthermore to conclude only L2 convergence we do
not need the hypothesis that the sequence of partitions be refining.
Theorem 28 can be used to prove that the paths of Brownian motion
are of unbounded variation on compacts. It is this fact that is central to
the difficulties in defining an integral with respect to Brownian motion (and
martingales in general).

4 Levy Processes 19
Theorem 29. For almost all ш, the sample paths t >-* Bt(oj) of a standard
Brownian motion В are of unbounded variation on any interval.
Proof. Let A = [a,b] be an interval. The variation of paths of В is defined to
be
VA{w) = sup V \Bti+1 -Bti\
*€Vue*
where V are all finite partitions of [a,b]. Suppose P(VA < oo) > 0. Let nn
be a sequence of refining partitions of [a, b] with limn meshGrn) = 0. Then by
Theorem 28 on {VA < oo},
< lim sup \Bti+1-Bti\ V \Bti+1-Bt.\
< lim sup \Bt -Bti\VA
= o,
since suptie7rn \Bti+1 — Bti\ tends to 0 a.s. as meshGrn) tends to 0 by the a.s.
uniform continuity of the paths on A. Since b — a < 0 is absurd, by Theorem 27
we conclude VA = oo a.s. Since the null set can depend on the interval [a, b],
we only consider intervals with rational endpoints a, b with a < b. Such а
collection is countable, and since any interval (a,b) = U^Li[an>M with an,
bn rational, we can omit the dependence of the null set on the interval. D
We conclude this section by observing that not only are the increments of
standard Brownian motion independent, they are also stationary. Thus Brow-
Brownian motion is a Levy process (as is the Poisson process), and the theorems
of Sect. 4 apply tor it. In particular, by Theorem 31 of Sect. 4, we can con-
conclude that the completed natural filtration of standard Brownian motion is
right continuous.
4 Levy Processes
The Levy processes, which include the Poisson process and Brownian motion
as special cases, were the first class of stochastic processes to be studied in
the modern spirit (by the French mathematician Paul Levy). They still pro-
provide prototypic examples for Markov processes as well as for semimartingales.
Most of the results of this section hold for W1 -valued processes; for notational
simplicity, however, we will consider only R-valued processes.4 Once again
we recall that we are assuming given a filtered probability space (п,^,?, Р)
satisfying the usual hypotheses.
4 Rn denotes n-dimensional Euclidean space. R+ = [0, oo) denotes the non-negative
real numbers.

20 I Preliminaries
Definition. An adapted process X = (Xt)t>o with Xo = 0 a.s. is a Levy
process if
(i) X has increments independent of the past; that is, Xt — Xs is independent
of Ts, 0 < s < t < oo; and
(ii) X has stationary increments; that is, Xt — Xs has the same distribution
as Xt-S, 0 < s < t < oo; and
(iii) Xt is continuous in probability; that is, limt^s Xt = Xs , where the limit
is taken in probability.
Note that it is not necessary to involve the filtration F in the definition of
a Levy process. Here is a (less general) alternative definition; to distinguish
the two, we will call it an intrinsic Levy process.
Definition. An process X = (Xt)t>o with Xo = 0 a.s. is an intrinsic Levy-
process if
(i) X has independent increments; that is, Xt — Xs is independent of Xv — Xu
if (u,v) П (s,t) = 0; and
(ii) X has stationary increments; that is, Xt — Xs has the same distribution
as Xv — Xu if t - s = v - и > 0; and
(iii) Xt is continuous in probability.
Of course, an intrinsic Levy process is a Levy process for its minimal
(completed) filtration.
If we take the Fourier transform of each Xt we get a function f(t,u) = ft(u)
given by
ft(u) = E{eiuX<},
where /o(u) = 1, and ft+s(u) = /*(«)/„(«), and ft(u) ф 0 for every (t,u).
Using the (right) continuity in probability we conclude /t(w) = exp{-?^(u)},
for some continuous function ф(и) with ф@) = О. (Bochner's Theorem can
be used to show the converse. If ip is continuous, ф@) = О, and if for all
t > 0, ft(u) = e-4i^u) satisfies J2i,j ai&jМщ - щ) > 0, for all finite
(wi,... ,un;cti,... ,an), then there exists a Levy process corresponding to
/¦)
In particular it follows that if X is a Levy process then for each t > 0, Xt
has an infinitely divisible distribution. Inversely it can be shown that for each
infinitely divisible distribution fj, there exists a Levy process X such that fj, is
the distribution of X\.
Theorem 30. Let X be a Levy process. There exists a unique modification
Y of X which is cadlag and which is also a Levy process.
Proof. Let M" = e, /I. For each fixed и in Q, the rationale in K, the process
(M"H<t<oo is a complex-valued martingale (relative to F).
We first show that the paths of X cannot explode a.s. For any real u,
(Mt")t>o is a (complex-valued) martingale and thus for a.a. w the functions

4 Levy Processes 21
t i-> Mt"(w) and ? н-> e»«-X'«M) With ? € Q+, are the restrictions to Q+ of
cadlag functions. Let
Л = {(и, и) € U х R : eiuXt^\t € Q+,
is not the restriction of a cadlag function}.
One can check that Л is a measurable set. Furthermore, we have seen that
/ 1л(и, u)P(du)) = 0, each и e R. By Fubini's Theorem
// lA(u,u)duP(duj) = lA(u,u)P{du)du = 0,
J J— oo 7-oo >/
hence we conclude that for a.a. oj the function t н-> e%uXt^\ t G Q+ is the
restriction of a cadlag function for almost all и € R. We can now conclude
that the function t н-> Xt(w), ? € Q+, is the restriction of a cadlag function
for every such w, with the help of the lemma that follows the proof of this
theorem.
Next set Yt{w) = lims€Q+]S|t Xs(oj) for all u> in the projection onto п of
{ft x Щ \ Л and Yt — 0 on Л, all ?. Since Tt contains all the P-null sets
of J- and (J-t)o<t<oo is right continuous, Yt € Tt- Since X is continuous in
probability, P{Yt -? Xt} = 0, hence У is a modification of X. It is clear that
У is a Levy process as well. D
The next lemma was used in the proof of Theorem 30. Although it is a
pure analysis lemma, we give a proof using probability theory.
Lemma. Let xn be a sequence of real numbers such that e%UXn converges as
n tends to oo for almost all tisR. Then xn converges to a finite limit.
Proof. We will verify the following Cauchy criterion: xn converges if for any
increasing sequences nk and т^, then linifc^oo ?„,. — xmk = 0. Let U be a
random variable which has the uniform distribution on [0,1]. For any real t,
by hypothesis a.s. ettUXnk and eltUXmk converge to the same limit. Therefore,
\\m eitU(xnk-xmk) = 1 &s
fc-»oo
so that the characteristic functions converge,
lim Е{еа^Хп"-х^и} = 1,
к—>oo
for all t € R. Consequently (хПк — xmk)U converges to zero in probability,
whence lim^oo xnk - xmk = 0, as claimed. D
We will henceforth always assume that we are using the (unique) cadlag
version of any given Levy process. Levy processes provide us with examples
of filtrations that satisfy the "usual hypotheses," as the next theorem shows.

22 I Preliminaries
Theorem 31. Let X be a Levy process and let Qt = J^VAf, where
is the natural filtration of X, andAf are the P-null sets of T¦ Then (Qt)o<t<oo
is right continuous.
Proof. We must show Gt+ = Gt, where Gt+ = f)u>tGu- Note that since the
filtration G is increasing, it suffices to show that Gt — C\n>1Gt+±- Thus,
we can take countable limits and it follows that if Si,..., sn < t, "then for
For ы,..., vn > t and (ui,..., un), we give the proof for n = 1 for nota-
tional convenience. Therefore let z > v > t, and suppose given Ui and «2- We
have
0 \ = li
wit
using that M = e, ? 4. is a martingale. Combining terms the above becomes
wit
and the same martingale argument yields
= Шпе^+^^Л^И! + u2)fz-v(u2)
wit
It follows that E{eiSu>x'j\gt+} = E{eiEu>x*i\gt} for all (s1,...,sn) and
all (ui,...,un), whence E{Z\gt+} = E{Z\gt} for every bounded Z G
Vo<s<oo-^s)- Tms implies gt+ = gt except possibly for events of probabil-
probability zero. However since both cr-algebras contain Af, we conclude Gt+ = Gt for
each t > 0. D
The next theorem shows that a Levy process "renews itself" at stopping
times.

4 Levy Processes 23
Theorem 32. . Let X be a Levy process and let T be a stopping time. On
the set {T < oo} the process Y = (Yt)o<t<oo defined by Yt = Хт+t ~ Хт is a
Levy process adapted to %t = Тт+ti Y is independent of Тт and Y has the
same distribution as X.
Proof. First assume T is bounded. Let A e Тт and let (щ,..., un; to,--., *n)
be given with Uj in a countable dense set (for example the rationale Q) and
tj e R+, tj increasing with j.
Recall that M = ^^ is a martingale, where /t(wj) = E{eiu*Xt}. Then
Aexp{i 2]
3
{
by applying the Optional Sampling Theorem (Theorem 16) n times. Note that
this shows the independence of Yt = Хт+t — Хт from Тт as well as showing
that Y has independent and stationary increments and that the distribution
ofY is the same as that of X.
If T is not bounded, we let Tn = min(r, n)=TAn. The formula is valid
for Лп = A(~){T < n} when A G Тт, since then An € Ттлп- Taking limits and
using the Dominated Convergence Theorem we see that our formula holds for
unbounded T as well, for events Л = А П {T < oo}, A € Тт- This gives the
result. D
Since a standard Brownian motion is a Levy process, Theorem 32 gives
us a fortiori the strong Markov property for Brownian motion. This allows
us to establish a pretty result for Brownian motion, known as the reflection
principle. Let В = (Bt)t>o denote a standard Brownian motion, Bo = 0 a.s.,
and let St = supo<s<t5s, the maximum process of Brownian motion. Since
В is continuous, St = sup0<M<t m€q.Bm, where Q denotes the rationals; hence
St is an adapted process with non-decreasing paths.
Theorem 33 (Reflection Principle for Brownian Motion). Let В =
(Bt)t>o be standard Brownian motion (Bo = 0 a.s.) and St = supo<s<t-Bs,
the Brownian maximum process. For у > 0, z > 0,
P(Bt <z-y;St>z)= P{Bt >y + z).
Proof. Let T — inf{t > 0 : Bt — z}. Then Г is a stopping time by Theorem 4,
and P(T < oo) = 1. We next define a new process X by

24 I Preliminaries
Xt = Btl{t<T} + BBT - Bt)l{t>T}-
The process X is the Brownian motion В up to time T, and after time T it is
the Brownian motion В "reflected" about the constant level z. Since В and
—В have the same distribution, it follows from Theorem 32 that X is also a
standard Brownian motion.
Next we define
R = inf{t > 0 : Xt = z}.
Then clearly
P(R <t;Xt<z-y) = P(T<t;Bt<z- y),
since (R,X) and (T,B) have the same distribution. However we also have
that R = T identically, whence
{R < t; Xt < z - y} = {T < t; Bt > z + y}
by the construction of X. Therefore
P(T<t;Bt>z + y) = P(T<t;Bt<z-y). (*)
The left side of (*) equals
P(St >z;Bt>z + y)= P(Bt >z + y),
where the last equality is a consequence of the containment {St > z) D {Bt >
z + y}. Also the right side of (*) equals P(St > z; Bt < z - y). Combining
these yields
P(St >z;Bt<z-y) = P(Bt >z + y),
which is what was to be proved. D
We also have a reflection principle for Levy processes. See Exercises 30
and 31.
Corollary. Let В = (Bt)t>o be standard Brownian motion (Bo = 0 a.s.) and
St = supo<s<t Bs. For z > 0,
P(St >z) = 2P(Bt > z).
Proof. Take у = 0 in Theorem 33. Then
P(Bt <z;St>z)=P(Bt>z).
Adding P(Bt > z) to both sides and noting that {Bt > z} = {Bt > z}(~){St >
z} yields the result since P(Bt — z) = 0. D

4 Levy Processes 25
A Levy process is cadlag, and hence the only type of discontinuities it can
have is jump discontinuities. Letting Xt- = Ит3цХ3, the left limit at t, we
define
the jump at t. If supt |AXt| < С < oo a.s., where С is a non-random constant,
then we say that X has bounded jumps.
Our next result states that a Levy process with bounded jumps has finite
moments of all orders. This fact was used in Sect. 3 (Step 2 of the proof of
Theorem 23) to show that E{N\} < oo for a Poisson process N.
Theorem 34. Let X be a Levy process with bounded jumps. Then E{\Xt\n} <
oo for all n — 1,2,3,....
Proof. Let С be a (non-random) bound for the jumps of X. Define the stop-
stopping times
T1=ini{t:\Xt\ >C}
Tn+1 = M{t > Tn : \Xt -XTn\> C}.
Since the paths are right continuous, the stopping times (Tn)n>i form
a strictly increasing sequence. Moreover |ДХт| < С by hypothesis for any
stopping time T. Therefore sups |Xjn| < 2nC by recursion. Theorem 32 im-
implies that Tn — Tn_i is independent of Ттп-г and also that the distribution
of Tn — Tn-i is the same as that of Ti.
The above implies that
for some a, 0 < a < 1. But also
P{\Xt\ > 2nC} < P{Tn
which implies that Xt has an exponential moment and hence moments of all
orders. D
We next turn our attention to an analysis of the jumps of a Levy process.
Let Л be a Borel set in R bounded away from 0 (that is, 0 ^ Л, where Л is
the closure of A). For a Levy process X we define the random variables
T\ = inf{i > 0 : AXt e A)
T2+1 = ini{t > T2 : AXt € A}.

26 I Preliminaxies
Since X has cadlag paths and 0 ф Л, the reader can readily check that {Гд >
t} G Tt+ = Ft and therefore each Tfc is a stopping time. Moreover 0 ^ Л and
cadlag paths further imply T\ > 0 a.s. and that lirrin^oo Тд = оо a.s. We
define
0<s<t n=l
and observe that NA is a counting process without an explosion. It is straight-
straightforward to check that for 0 < s < t < oo,
NA - NA € a{Xu -Xv;s<v<u<t},
and therefore NA - NA is independent of Ta; that is, NA has independent
increments. Note further that TV/1 - NA is the number of jumps that Zu =
Xs+U — Xs has in Л, 0 < и < t — s. By the stationarity of the distributions of
X, we conclude TV/1 — NA has the same distribution as iV/!s. Therefore 7УЛ is
a counting process with stationary and independent increments. We conclude
that NA is a Poisson process. Let v(A) = E{NA} be the parameter of the
Poisson process NA (г^(Л) < oo by the proof of Theorem 34).
Theorem 35. The set function Л н-> NA(u>) defines a a-finite measure on
R \ {0} for each fixed (t,oj). The set function v(A) = E{N^} also defines a
a-finite measure опЖ\ {О}.
Proof. The set function Л <-» Na(oj) is simply a counting measure: /х(Л) =
{number of s < t: AXs(u) & Л}. It is then clear that v is also a measure. D
Definition. The measure v defined by
u{A) = E{NA} = E{
0<s<l
is called the Levy measure of the Levy process X.
We wish to investigate further the role the Levy measure plays in governing
the jumps of X. To this end we establish a preliminary result. We let Nt (u>, dx)
denote the random measure of Theorem 35. Since Nt(<j,dx) is a counting
measure, the next result is obvious.
Theorem 36. Let Л be a Borel set of R, 0 ф A, f Borel and finite on A.
Then
L
f{x)Nt{u,dx)= J2
0<s<t
Just as we showed that NA has independent and stationary increments,
we have the following consequence.

4 Levy Processes 27
Corollary. Let Л be a Borel set of К with 0 <$. A, and let / be Borel and
finite on A. Then
f f(X)Nt(;dx)
JA
is a Levy process.
For a given set A (as always, 0 g A), we denned the associated jump
process to be
jtA = J2 ьх.Ыьх,).
0<s<t
By Theorem 36 and its corollary we conclude that
JA
= I xNt(;dx).
JA
Hence, J/1 is a Levy process itself, it is defined, and JA < oo a.s., each t > 0.
Theorem 37. Given A, 0 ^ A, the process Xt — J/1 is a Levy process.
Proof. It is clear that we need only check the independence and stationarity
of the increments. But
Xt- JtA-(Xs~ JA) = Xt-Xs- Y^ A^uU(AXu)
3<U<t
which is clearly cr{Xv — Xu;s < и < v < t} measurable, and due to the
stationarity of the increments of X it has the same law as Xt^a — JA_S. D
We are now in a position to consider
0<3<t
for some constant а > 0. The advantage of doing this is that Ya then has jumps
bounded by a, and hence has finite moments of all orders (Theorem 34). We
can choose any a > 0; we arbitrarily choose a = 1. Note that
¦rrl v t(-°°.-1]U[1,oo)
= Xt- I xNt(;dx).
J\x\>l
The next theorem gives an interpretation of the Levy measure as the expected
rate at which the jumps of the Levy process fall in a given set.
Theorem 38. Let A be Borel with 0 ^ A. Let v be the Levy measure of X,
and let f\A e L2(du). Then
E{ f f(x)Nt(-,dx)} = t f f{x)u{dx)
JA JA

28 I Preliminaries
and also
E{( f f(x)Nt(-,dx) - t I f{x)v{dx)J} = t I f(xJv(dx).
JA JA JA
Proof. First let / = ?) ¦ а^1ля a simple function. Then
since Nt 3 is a Poisson process with parameter v{Aj). The first equality follows
easily.
For the second equality, let Ml = N^ —tv(Ai). The Mt* are V martingales,
all p > 1, by the proof of Theorem 34. Moreover, Е{М}} ~ 0. Suppose Л*, Aj
are disjoint. We have
к е
for any partition 0 = to < t\ < ¦ ¦ ¦ < tn = t. Using the martingale property
we have
E{MiMi} = Е(?(м;к+1 - Mik)(Mik+i - Mi)}.
к
Using the inequality \ab\ < a2 + b2, we have
E - O2-
However T,k(ML+i ~ Mtj2 ^ (N^J + "(AiJt2; therefore the sums are
dominated by an integrable random variable. Since Ml and Ml have paths of
finite variation on [0, t] it is easy to deduce that if we take a sequence (?rn)n>i
of partitions where the mesh tends to 0 we have
„I™, ? (
ttG 0<s<t
Using Lebesgue's Dominated Convergence Theorem we conclude
E{MltM3t} = E{^ АМгаАМ>} = О;
0<s<t
the expectation above is 0 because Ai and Aj are disjoint, implying that Ml
and M-3 jump at different times. The second equality is now easy to verify for
simple functions. For general /, let fn be a sequence of simple functions such
that fn^A converges to / in L2{dv), and the result follows. Q

4 Levy Processes 29
Remark. The first statement in Theorem 38 remains true if /1л е Ll(dv).
See Exercise 28.
Corollary. Let / : К —> К be bounded and vanish in a neighborhood of 0.
Then
/-OO
f(AXs)} = t f{x)u{dx).
J— OO
0<s<t
Proof We need only combine Theorem 38 with Theorem 36. G
Theorem 39. Let A\, Л2 be two disjoint Borel sets with 0 ^ Л\, О ^ Л2.
Then the two processes
0<3<t
Jt2= 53
0<s<t
are independent Levy processes.
Proof. By Theorem 36 and its corollary we have that J1 and J2 are Levy
processes. To show they are independent, we begin by forming for u, v in M,
iuj\
* " Е{е™Л}
П" _ _T 1
* " Е{е^Ц
Then Cu and Dv are both martingales, with E{C?} = E{D"} = 0. As in the
proof of Theorem 38, let 7rn: 0 = to < t\ < ¦ ¦ ¦ < tn = t be a sequence of
partitions of [0, t] with limn meshGTn) = 0. Then
»+i - Dl)}
Since Cu and Dw have paths of finite variation on compacts, it follows by
letting meshGrn) tend to 0, that
0<s<t
The expectation above equals zero because Cu and Dv jump at different times,
due to the void intersection of A\ and Л2.
We conclude that Е{С^В\) = 0, and thus

30 I Preliminaries
Е{ем1емЦ = E{eiuJ!}E{eivJ?},
which in turn implies, because of the independence and stationarity of the
increments that
{i-Jl1)+---+un(j}n-jln_i))ei{vlJ2tl+---+vn(J2ln-jln_i)),
This is enough to give independence. ?
The preceding results combine to yield the following useful theorem, which
is one of the fundamental results about Levy processes.
Theorem 40. Let X be a Levy process. Then Xt = Yt + Zt, where Y, Z are
Levy processes, Y is a martingale with bounded jumps, Yt G Lp for all p > 1
and Z has paths of finite variation on compacts.
Proof. Let Jt = X^o<s<t АХа1{\лх,\>1}- Since X has cadlag paths, for each
fixed w the function s >-> Xa(w) has only finitely many jumps bigger than
one on [0, t]. Therefore J has paths of finite variation on compacts. J is also
a Levy process by Theorem 36 and its corollary. The process W = X - J
is also a Levy process (Theorem 37), and W has jumps bounded by one. We
know therefore that, n > 1, ?"{|И^|"} exists (Theorem 34), and the stationary
increments of W implies E{Wt} = at, for a = E{Wi}. (Recall E{W0} = 0.)
We set Yt — Wt — E{W{\. Then Y has independent increments and mean 0;
it is a martingale. Setting Zt — Jt + at completes the proof. ?
While Theorem 40 is the most important result about Levy processes
from the standpoint of stochastic integration, the next two theorems provide
a better understanding of Levy processes themselves.
Theorem 41. Let X be a Levy process with jumps bounded by a. That is,
sups |AXS| < a a.s. Let Zt = Xt — E{Xt}. Then Z is a martingale and Zt =
Z? + Zf where Zc is a martingale with continuous paths, Zd is a martingale,
Zf= f x(Nt(-,dx)-tv(dx)),
J{\x\<a)
and Zc and Zd are independent Levy processes.
Proof. Z has mean zero and independent increments so it is a martingale, as
well as a Levy process. For a given set Л we define
M/1 = / xNt(-,dx) - t I xu{dx)
Ja j л
= ^ ДХ81л(ДХ8) — t j xu(dx).
0<s<t ^A

4 Levy Processes 31
For this proof we take a = 1. Let Ak = {-^ < \x\ < ?}. Then МЛк
are pairwise independent Levy processes and martingales (Theorem 39). Set
M" = ]T)fc=i мЛк ¦ Then the martingales Z - Mn and M" are independent
by an argument similar to the one in the proof of Theorem 39. Moreover
Var(Zt) = Var(Zt - Mt") + Var(Mt") where Var(X) denotes the variance of a
random variable X. Therefore Var(M") < Var(Zt) < oo for all n. We deduce
that M™ is Cauchy in L2 and hence converges in L2 as n tends to oo to a
martingale Zf, and Z — M" also converges to a martingale Zc. Using Doob's
maximal quadratic inequality (Theorem 20), we can find a subsequence con-
converging a.s., uniformly in t on compacts, which permits the conclusion that
Zc has continuous paths. The independence of Zd and Zc follows from the
independence of M" and Z — Mn, for every n. ?
Note that a consequence of the convergence of Mt" to Zf in L2 in the proof
of Theorem 41 is that the integral J,x 0)и@ - x2v{dx) is finite. Note that this
improves a bit on the conclusion in Theorem 38.
We recall that for a set A, 0 ^ A, the process TV/1 = JANt(-,dx) is a
Poisson process with parameter v(A), and thus Nf - tv{A) is a martingale.
Definition. Let N be a Poisson process with parameter Л. Then Nt — \t is
called a compensated Poisson process.
Theorem 41 can be interpreted as saying that a Levy process with bounded
jumps decomposes into the sum of a continuous martingale Levy process and
a martingale which is a mixture of compensated Poisson processes. It is not
hard to show that E{eiuZ'} = e^2/2, which implies that Zc must be a
Brownian motion. The full decomposition theorem then follows easily. We
state it here without proof (consult Bertoin [12], Bretagnolle [25], Feller [72],
or Jacod-Shiryaev [110] for a proof).
Theorem 42 (Levy Decomposition Theorem). Let X be a Levy process.
Then X has a decomposition
Xt=Bt+ x(Nt(; dx) - tu(dx))
+ tE{Xi- xNx{-,dx)}+ I xNt(-,dx)
= Bt+ f x{Nt{-,dx)-tu(dx)) + at+ V Д^
l{\x\<\)
where В is a Brownian motion; for any set A, 0 ^ A, Nf = fA Nt(-, dx) is a
Poisson process independent of B; Nf is independent of Nf if A and Г are
disjoint; Nf has parameter v{A); and i/(dx) is a measure on Ш\{0} such that
oo.
Finally we observe that Theorem 42 gives us a formula (known as the
Levy-Khintchine formula) for the Fourier transform of a Levy process.

32 I Preliminaries
Theorem 43. Let X be a Levy process with Levy measure v. Then
EieiuXt\ — e~*^u)
where
ф(и) = ^-u2 -iau+ I A - eiux)v{dx) + [ A - eiux + iux)v{dx).
Moreover given v, a2, a, the corresponding Levy process is unique in distri-
distribution.
The Levy-Khintchine formula has another, more common expression. (For
a complete discussion of the various formulae of Levy and Khintchine for the
case of infinitely divisible distributions, see Feller [72, pages 558-565].) Indeed,
in the n-dimensional case we can express the Levy-Khintchine formulas as
follows (letting (•, •) denote the standard inner product for ]Rn).
Theorem 44. Let X be a Levy process in Шп, with Xo = 0. Then there exists
a convolution semigroup of probability measures on K" such that C{Xt) — /j,t.
The characteristic function of ^ is е~*^и\ where ф is given in terms of a
positive definite n x n matrix E and a Levy measure v on W1 such that
(i) v{A) < oo if Л is Borel on Шп and bounded away from 0;
(ii) fM<1 \x\2v(dx) < oo and v{Q} = 0.
Also there exists a bounded, continuous centering function rj : WLn —> Kn
such that limj_,o ^ufe* = 0» an& a constant а €Шп such that
¦ф{и) = -i(u,a) + )г{Еи,и) - f ei(u'x) - 1 - i(u,-q{x))v{dx).
A nice proof of the above theorem can be found in Bertoin [12, pages
13-15]. Several observations concerning this theorem are in order. First, one
still has (in the n-dimensional case) the key property that for / : Rn —+ Rn,
bounded, vanishing in a neighborhood of 0, that
E{ Y, /(A*-)} = * Г f(x)v(dx).
0<s<t
Second, in the case n = 1, one often takes r)(x) = j^t, possibly at the cost of
changing the constant a. In the next section (Sect. 6) we discuss semigroups
of Markov processes. The semigroup Pt for the Levy process X has the form
Ptf(x) — J f(x + y)/J,t(dy), and one can show in fact that the "infinitesimal
generator" of the semigroup Pt is, for / e bC2(JR"),

4 Levy Processes 33
/¦
f(x + y)- f(x) -
Note further that if / is constant in a neighborhood of x, then
(X) =Jf(x + y)- f{x)u{dy).
lim Pt/(X)
We next consider some examples of Levy processes. For the examples we take
n = 1.
Example. If a Levy process X has continuous paths, then v is identically
0, and X must be Brownian motion with drift. That is, Xt = crBt + at, for
constants a, a, are the only Levy processes with continuous paths.
Example. Assume that v is a finite measure. That is, и(Шп) < со, and
that a2 = 0. Then X is a compound Poisson process with jump arrival
intensity Л = v(M.n). That is, let Nt — Yll^i l{t>Ti} be a Poisson process with
arrival intensity Л. Let Zt = ]СП=1 ^jl{t>T;} where the sequence (Uj)j>i are
i.i.d., independent of the sequence (Tj)j>i, and C{U\) = ji/. Then one easily
verifies E{eiuZt} = e-*^"\ where tp(u) = /A - eiux)v(dx).
Example. Let v(dx) = ^2<^L1 OLkepk{dx), where epk(dx) denotes the positive
point mass measure at Ck G К of size one. Assume Yl'kLx Pkak < °° and &2 =
0. Let Nk be a sequence of independent Poisson processes with parameters
afc. Then Xt = YlT=i Pk(Nt — afci) has v as its Levy measure.
Example. Let X be a Levy process which is a Gamma process. That is,
C(Xt) = Г(а,р), with /3 > 0 and a = at. More precisely, the density for Xt
is ft(x) = I!?,\xta~1e~f3xl{x>0y In this case a2 = 0, and taking a = C = 1
gives <pt(u) = E{ezuZt} — A — ro)~*. (See, for example, page 43 of Jacod-
Protter [109] or p. 73 of Bertoin [12].) Then <pt(u) = e-tln(i-<«) = е*^(«)
But ^
\ = гфх{и) = iE{eiuZl } = i f
A — ги) Jo
Integrating yields
ip(u) = / e~xdx
Jo x
which implies, by the Levy-Khintchine formula, that v{dx) = ~;e~xl{x>oydx.
Example. Our last example is that of stable processes. We begin by recall-
recalling what a stable distribution is. Extending the notion of the Central Limit
Theorem, consider the normalized sums
= t
(

34 I Preliminaries
s ZtiXi _p 0
In
We say that a random variable X has a stable law if it is the limit in
distribution as n —> oo of sums of the above form. One can then prove (see,
for example, Breiman [23, page 200]) that either X has a normal distribution,
or there exists a number a, 0 < a < 2, called the index of the stable law, and
also constants mj > 0, тг > 0, and S such that E{eluX} = e^u^ and
°° 9VT 1
One says that a stable distribution is symmetric if m\ = тг. By analogy we
define a symmetric stable process (in the case n = 1) to be a Levy process
where the Levy measure has the form v(dx) = . u\+a) dx, with 0 < a < 2,
and the constant a2 in the Levy-Khintchine formula is 0, which, of course,
implies that no Brownian component is present in the process. Note that such
stable processes have nice scaling properties. If a is the index, then the law
of (Xt)t>0 is the same as the law of {fl~^Xpt)t>Q- Consult Bertoin [12] for
many other properties of stable processes.
5 Why the Usual Hypotheses?
We defined the usual hypotheses at the very beginning of this book, and we
saw with Levy processes that the natural filtration of a Levy process, once
completed, satisfies a fortiori right continuity, and thus the usual hypotheses
hold (see Theorem 31). This phenomenon is true much more generally. Indeed
it is true for all "reasonable" strong Markov processes, as well as for all count-
counting processes (cf., Theorem 25). In this section we will show that it is true for
a very large class of Markov processes. We first begin with some definitions.
Definition 1. Let X be an Rd-valued process, with X adapted to a filtration
F = (Ft)o<t<oo- X is Markov with respect to F it Ft and cr{X3; s > t} are
conditionally independent, given Xt.
Definition 2. The process X in Definition 1 is Markov with respect to
F if for any random variable Y e ha{Xs; s > t}, E{Y\Tt} = E{Y\Xt}, a.s.,
all t > 0.
The following theorem has a straightforward proof.
Theorem 45. Definitions 1 and 2 are equivalent. Moreover X is Markov with
respect toWifand only if E{f(Xu)\Ft} = E{f(Xu)\Xt} a.s. for allu>t>0.

5 Why the Usual Hypotheses? 35
Let X be a cadlag process and let X be Markov with respect to F. Since X
is Markov, we can associate with it a Markov transition function on (Rd,B),
where В denotes the Borel sets of Rd. This is a family (Ps,t)s,teR+ of Markov
kernels on (Rd, B) such that
for each triple (s, t, u) with s < t < u. We then have the fundamental rela-
relationship
E{f(X)\F} = Ps,t(XsJ) a.s.
Note that, by using indicator functions such as 1^ = /, we have
PSyt{Xs,A) a.s.
Also note that the above implies the equality
Ps,t(Xs,A) a.s.
which, of course, is weaker. That is, the converse implication is not true in
general. The most important special case is Markov processes which are time
homogeneous. In this case we have a transition semigroup Pt given by
Pt-s = Ps,t>
for s < t.
Definition. Let Co denote continuous functions vanishing at infinity. A
Markov process is a Feller process if its transition semigroup (Pt)t>o has
the following two properties.
(a) For each / e Co and each t > 0, Ptf e Co.
(b) For each / e Co and each x e Rd, Ptf(x) -> f(x) as t -> 0.
Examples of Feller processes include Brownian motion and the Poisson
process. More generally, one can show that diffusions5 and all Levy processes
are Feller processes.
Theorem 46. Let X be a cadlag Feller process for a filtration ?. Then it is
also Markov with respect to the filtration ?+ = (.Ft+)o<t<oo-
Proof. Let t > 0 be fixed, and assume / G Co. Let Yt = f(Xt). For 0 < s < t,
let
Ys = E{f(Xt)\Fs} = ??{/(*t)|*e} = Pt-.f(Xa).
Then У is a martingale for 0 < s < t, and it is cadlag, because Ptf is
continuous, and X is cadlag. Therefore, У is a martingale with respect to
F-|- too, which means that E{f(Xt)\Jrs+} — Ys, from which we deduce that
E{f{Xt)\Ts+} = E{f{Xt)\Xa}, and we are done. D
5 This depends, of course, on how one defines a diffusion; the definition is not yet
standardized in the literature.

36 I Preliminaries
We now define P^ as a probability measure such that X is Markov with
respect to P^ and also C(X0) = \i under P^, where C(X0) denotes the distri-
distribution, or "law," of Xo. Let F^ = (^f )o<t<oo be the nitration where T% de-
denotes the P» completion of F$, J^ = a{Xs : s <t}, and F° = a{Xs : s > 0}.
Also, let .Ft =
Theorem 47. If X is Markov with respect to PM, or alternatively Markov
with respect to P^ for all /j, with \\/j,\\ = 1, then F^ and F are, respectively,
right continuous.
Proof. We prove the result for FM, from which the result for F follows. Let H
denote all h e bjF^ such that
E{h\Jf} = E{h\Jf+}, P*-a.s.
Then H Э {H = f(Xs), f e ЪВ}, which is obvious for s <t, and for s > t it
is just Theorem 46. Then H contains all H of the form H = ПГ=1 /»(-^«<)> f°r
any / G ЪВ. Using the Monotone Class Theorem, H э bJ70. By completion,
П D b^. Therefore if h G Jf+, we have ?"{/i|^} = E{h\Jf+} = h, P»-
a.s. П
The property that the natural nitration of a Feller process is right con-
continuous when completed is not restricted to Feller processes. Indeed, it can
also be shown to be true for more general classes of processes, such as Hunt
processes^. However it is simply two elementary properties which yield the
right continuity of the filtration in a general setting.
Definition. Let X be a cadlag Markov process for a nitration F, which we
take to be completed and therefore can assume to be right continuous, in view
of Theorem 47. Let T be a stopping time. We say that X verifies the strong
Markov property at T if
E{f(XT+s)l{T<oo}\TT} = PS(XTJI{T<OO} a.s.
We say that X is strong Markov if the above holds for every stopping time
T.
Suppose now that the semigroup (Pj) is Borel. That is, if / e B, then
Ptf € В as well. Suppose that for every probability measure fi on (Rd,B),
there exists a probability space (fi, G, Q) and a cadlag Markov process (lt)t>o
denned on this space with C(X0) = fi. Suppose further that every such cadlag
Markov process Y is strong Markov. Then the filtrations F^ = (^r|')o<t<oo and
F = (.Ft)o<t<oo can be shown to be right continuous in a manner analogous
to the previous proofs.
See, for example, Sharpe [215, page 219] for a definition of a Hunt process.

6 Local Martingales 37
6 Local Martingales
Recall that we assume as given a filtered probability space (fi, T, F, P) satis-
satisfying the usual hypotheses. For a process X and a stopping time T we further
recall that XT denotes the stopped process
Xt = XtAr = Xtl{t<T) + ^т1{*>т>-
Definition. An adapted, cadlag process X is a local martingale if there
exists a sequence of increasing stopping times, Tn, with Нт^-юо Тп = oo a.s.
such that XtATnl{Tn>o} IS a uniformly integrable martingale for each n. Such
a sequence (Tn) of stopping times is called a fundamental sequence.
Example. Clearly any cadlag martingale is a local martingale (take Tn = n).
Example. We give an example of a local martingale which is not a martingale.
Let (.Bt)o<t<oo be a Brownian motion in M3 with B$ = x, where x / 0.
Let u(y) — \\y\\~1, a superharmonic function on M3. As a consequence of
Ito's formula (cf., Theorems 32 and 41 of Chap. II) one can show that Xt =
u(Bt) is a positive supermartingale (indeed, it is even a uniformly integrable
supermartingale). Next let Tn = inf{t > 0 : \\Bt\\ < l/nl- Outside of the
ball of radius \/n centered at the origin the function и is harmonic. Again
by Ito's formula one can show that this implies u(BtATn) is a martingale.
Since u{Bt/KTn) is bounded by n it is uniformly integrable. On the other hand,
if Bo = x ф 0 then lim^oo E{u(Bt)} = 0 as is easily seen by a calculation,
while E{u(Bo)} = \\x\\~1. Since the expectations of a martingale are constant,
u(Bt) is not a martingale. (For more on this example, see Sect. 6 of Chap. II,
following Corollary 4 of Theorem 27, as well as Exercise 20 in Chap. II.)
The reason we multiply Х4Лт„ by 1{т„>о} is to relax the integrability
condition on Xo- This is useful, for example, in the consideration of stochastic
integral equations with a non-integrable initial condition.
Definition. A stopping time T reduces a process M if MT is a uniformly
integrable martingale.
Theorem 48. Let M, N be local martingales and let S and T be stopping
times.
(a) IfT reduces M and S <T a.s., then S reduces M.
(b) The sum M + N is also a local martingale.
(c) If S, T both reduce M, then S V T also reduces M.
(d) The processes MT, Мт1{т>о} are local martingales.
(e) Let X be a cadlag process and let Tn be a sequence of stopping times
increasing to 00 a.s. such that XTjll^pn>o} is a local martingale for each
n. Then X is a local martingale.
Proof, (a) follows from the Optional Sampling Theorem (Theorem 16) and
Theorem 13. For (b), if En)n>i, (Tn)n>i are fundamental sequences for M

38 I Preliminaries
and N, respectively, then Sn /\Tn is a fundamental sequence for M + N. For
(c), let Xt = Mt - Mo. Then XSyT = Xs + XT - XSAT is a uniformly
integrable martingale. But
which is in L1, therefore XSvT+Mol{5VT>o} = -WSvTl{svT>o} is a uniformly
integrable martingale. The proof of (d) consists of the observation that if
(Tn)n>i is a fundamental sequence for M, then it is also one for MT and
Мт1{т>о} by the Optional Sampling Theorem (Theorem 16). For (e), we
have XTnl{xn>o} = Mn is a local martingale for each n. For fixed n we know
there exists a fundamental sequence Un'k increasing to сю a.s. as к tends
to oo. For each n, choose к = k(n) such that P(Un'h^ < Tn An) < 2~n.
Then limn Un'k^ = oo a.s., and [/"•*(") л Тп reduces X for each n. We
take Rm = max(!71'kA) ЛТЬ...,f/m-fe(m) Л Тт), and each Rm reduces X
by (c), the Rm are increasing, and lim Rm = oo a.s. Therefore X is a local
martingale. ?
Corollary. Local martingales form a vector space.
We will often need to know that a reduced local martingale, MT, is in Lp
and not simply uniformly integrable.
Definition. Let X be a stochastic process. A property тг is said to hold
locally if there exists a sequence of stopping times (Tn)n>i increasing to oo
a.s. such that ХТп1{т„>о} has property тг, each n > 1.
We see from Theorem 48(e) that a process which is locally a local mar-
martingale is also a local martingale. Other examples of local properties that
arise frequently are locally bounded and locally square integrable. Theorems 49
and 50 are consequences of Theorem 48(e).
Theorem 49. Let X be a process which is locally a square integrable martin-
martingale. Then X is a local martingale.
The next theorem shows that the traditional "uniform integrability" as-
assumption in the definition of local martingale is not really necessary.
Theorem 50. Let M be adapted, cadlag and let (Tn)n>i be a sequence of
stopping times increasing to oo a.s. If МТп1{тп>о} *s a martingale for each
n, then M is a local martingale.
It is often of interest to determine when a local martingale is actually a
martingale. A simple condition involves the maximal function. Recall that
XI = sups<t \XS\ and X* = sups \XS\.
Theorem 51. Let X be a local martingale such that E{X?} < oo for every
t > 0. Then X is a martingale. If E{X*} < oo, then X is a uniformly
integrable martingale.

7 Stieltjes Integration and Change of Variables 39
Proof. Let (Tn)n>i be a fundamental sequence of stopping times for X. If
s < t, then E{Xt^Tn\^s} = Xs^Tn- The Dominated Convergence Theorem
yields Е{Хг\Т8} = Xs. If E{X*} < oo, since each \Xt\ < X*, it follows that
(Xt)t>o is uniformly integrable. ?
Note that in particular a bounded local martingale is a uniformly inte-
integrable martingale. Other sufficient conditions for a local martingale to be a
martingale are given in Corollaries 3 and 4 to Theorem 27 in Chap. II, and
Kazamaki's and Novikov's criteria (Theorems 40 and 41 of Chap. Ill) establish
the result for the important special cases of continuous local martingales.
7 Stieltjes Integration and Change of Variables
Stochastic integration with respect to semimartingales can be thought of as
an extension of path-by-path Stieltjes integration. We present here the es-
essential elementary ideas of Stieltjes integration appropriate to our interests.
We assume the reader is familiar with the Lebesgue theory of measure and
integration on M_|_.
Definition. Let A = (At)t>0 be a cadlag process. A is an increasing pro-
process if the paths of A : t —> At(u) are non-decreasing for almost all u. A is
called a finite variation process (FV) if almost all of the paths of A are of
finite variation on each compact interval of K_|_.
Let A be an increasing process. Fix an и such that t i—> At(u) is right
continuous and non-decreasing. This function induces a measure ij,a{u, ds)
on K.+ . If / is a bounded, Borel function on R+, then Jo /(s)^^, ds) is
well-defined for each t > 0. We denote this integral by Jo f(s)dAs(u). If
Fs = F(s,u) is bounded and jointly measurable, we can define, и-Ъу-и, the
integral I{t,w) = J* F(s,u)dAs(u). I is right continuous in t and jointly
measurable.
Proceeding analogously for A an FV process (except that the induced
measure (j,a(<^>, ds) can have negative measure; that is, it is a signed measure),
we can define a jointly measurable integral
I(t,w)= f F(s,uj)dAs(w)
Jo
for F bounded and jointly measurable.
Let A be an FV process. We define
Then \A\t < oo a.s., and it is an increasing process.

40 I Preliminaries
Definition. For A an FV process, the total variation process, \A\ =
(|j4|t)t>o, is the increasing process defined in (*) above.
Notation. Let A be an FV process and let F be a jointly measurable process
such that Jo F(s,u>)dAs(u>) exists and is finite for all t > 0, a.s. We let
(F-A)t(u)= I F(s,u)dAs(u)
Jo
and we write F ¦ A to denote the process F ¦ A = (F ¦ At)t>0. We will also
write /0 F.s|cL4s| for (F ¦ \A\)t. The next result is an absolute continuity result
for Stieltjes integrals.
Theorem 52. Let А, С be adapted, strictly increasing processes such that
С — A is also an increasing process. Then there exists a jointly measurable,
adapted process H (defined on @, oo)J such that 0 < H < 1 and
A^H-C
or equivalently
At= I HsdCs.
Jo
Proof. If /i and v are two Borel measures on E+ with |i«i/, then we can set
otherwise.
Defining h and к by h(t) = lirninis^ta(s,t), k(t) = limsupt^sa(s,i), then h
and к are both Borel measurable, and moreover they are each versions of the
Radon-Nikodym derivative. That is,
djj, = h dv, djj, = к dv.
To complete the proof it suffices to show that we can follow the above proce-
procedure in a (t,tj) measurable fashion. With the convention jy = 0, it suffices to
define
H(t,u) = hminf —Г7 — ^-;
V ' ; rTl,r€Q+ (C(t,u)-C(rt,w))'
such an H is clearly adapted since both A and С are. ?
Corollary. Let A be an FV process. There exists a jointly measurable,
adapted process H, — 1 < H < 1, such that
\A\ = H-A and A = H-\A\
or equivalently
|Л|t = / HsdAs and At = / Hs\dAs\.
Jo Jo

7 Stieltjes Integration and Change of Variables 41
Proof. We define A+ = \{\A\t + At) and A^ = \{\A\t-At). Then A+ and A~
are both increasing processes, and \A\— A+ and \A\ — A~ are also increasing
processes. By Theorem 52 there exist processes H+ and H~ such that Л^1" =
J* Я+|йЛв|, Л" = /o H-\dA.\. It then follows that At = A+ -A; = J*(H+ -
H~)\dAs\. Let Ht = Hf — H^ and suppose H+ and Л~ are defined as in
the proof of Theorem 52. Except for a P-null set, for a given w it is clear that
\Hs(u)\ — 1 dAs(w) almost all s. Considering H ¦ A, we have
f HsdAs= f Hsd{f Hu\dAu\)
Jo Jo Jo
= I HsHs\dAs\ = I l\dAs\ = \A\t.
Jo Jo
This completes the proof. ?
When the integrand process H has continuous paths, the Stieltjes integral
Jo HsdAs is also known as the Riemann-Stieltjes integral (for fixed u>). In
this case we can define the integral as the limit of approximating sums. Such
a result is proved in elementary textbooks on real analysis (e.g., Protter-
Morrey [195, pages 316, 317]).
Theorem 53. Let A be an FV process and let H be a jointly measurable
process such that a.s. s <—> H(s,w) is continuous. Let nn be a sequence of
finite random partitions of [0,t] with limn_,oo mesh(?rn) = 0. Then for 2\ <
- ЛТк) = HsdAs a.s.
J
rt
= H
Jo
We next prove a change of variables formula when the FV process is
continuous. Ito's formula (Theorem 32 of Chap. II) is a generalization of this
result.
Theorem 54 (Change of Variables). Let A be an FV process with con-
continuous paths, and let f be such that its derivative f exists and is continuous.
Then (f(At))t>0 is an FV process and
f(At)-f(Ao)= f f'{As)dAs.
Jo
Proof. For fixed a», the function s \—> f'{As(ui)) is continuous on [0,t] and
hence bounded. Therefore the integral /0 f'(As)dAs exists. Fix t and let тг„
be a sequence of partitions of [0, t] with 1ш1п_юо meshGrra) = 0. Then
f(At)-f(Ao)= V U(Atk+1)-f(Atk)}

42 I Preliminaries
by the Mean Value Theorem, for some Sk, tk < Sk < *fc+i- The result now
follows by taking limits and Theorem 53. ?
Comment. We will see in Chap. II that the sums
f(Atk)(Atk+1 - Atk)
**€тг„[О,*]
converge in probability to
f
Jo
f f{As_)dAs
o
for a continuous function / and an FV process A. This leads to the more
general change of variables formula, valid for any FV process A, and f € C1,
namely
-/(A,)= f f\A..)dA.+ J2 {f(As)-f(As_)~f'(As_)AAs}.
0<s<t
The next corollary explains why Theorem 54 is known as a change of
variables formula. The proof is an immediate application of Theorem 54.
Corollary. Let g be continuous, and let h(t) = Jo g(u)du. Let A be an FV
process with continuous paths. Then
h(At) - h(Ao) = / g{u)du
J Aq
= I g{As)dAa.
Jo
We conclude this section with an example.
Example. Let TV be a Poisson process with parameter A. Then Mt = Nt —
Xt, the compensated Poisson process, is a martingale, as well as an FV
process. For a bounded (say), jointly measurable process H, we have
It = I HsdMs = I Hsd(Ns - Xs) = I HsdNa - X [ Hsds
Jo Jo Jo Jo
/
Hsds
0
where (Tn)n>i are the arrival times of the Poisson process N. Now suppose
the process H is bounded, adapted, and has continuous sample paths. For
0 < s < t < oo, we then have

8 Naive Stochastic Integration Is Impossible 43
Е{1г-18\Т8} = Е{ HudMu\Fs)
= 0.
The interchange of limits can be justified by the Dominated Convergence The-
Theorem. We conclude that the integral process / is a martingale. This fact, that
the stochastic Stieltjes integral of an adapted, bounded, continuous process
with respect to a martingale is again a martingale, is true in much greater
generality. We shall treat this systematically in Chap. II.
8 Nai've Stochastic Integration Is Impossible
In Sect. 7 we saw that for an FV process A, and a continuous integrand
process H, we could express the integral /0 HsdAs as the limit of sums (The-
(Theorem 53). The Brownian motion process, B, however, has paths of infinite
variation on compacts. In this section we demonstrate, with the aid of the
Banach-Steinhaus Theorem7, some of the difficulties that are inherent in try-
trying to extend the notion of Stieltjes integration to processes that have paths of
unbounded variation, such as Brownian motion. For the reader's convenience
we recall the Banach-Steinhaus Theorem.
Theorem 55. Let X be a Banach space and let Y be a normed linear space.
Let {Ta} be a family of bounded linear operators from X into Y. If for each
x e X the set {Tax} is bounded, then the set {||Ta||} is bounded.
Let us put aside stochastic processes for the moment. Let x(t) be a right
continuous function on [0,1], and let nn be a refining sequence of dyadic
rational partitions of [0,1] with lim^^oo meshGrra) = 0. We ask the question
"What conditions on x are needed so that the sums
converge to a finite limit as n —> oo for all continuous functions h?" From
Theorem 53 of Sect. 7 we know that x of finite variation is sufficient. However,
it is also necessary.
Theorem 56. If the sums Sn of (*) converge to a limit for every continuous
function h then x is of finite variation.
7 The Banach-Steinhaus Theorem is also known as the Principle of Uniform Bound-
edness.

44 I Preliminaries
Proof. Let X be the Banach space of continuous functions equipped with the
supremum norm. Let Y be M, equipped with absolute value as the norm. For
heX, let
For each fixed n it is simple to construct an h in X such that h(tk) =
sign{x(tfe+i) - x(tk)}, and ||/i|| = 1. For such an h we have
Therefore
iitji > V
each n, and supra ||ГП|| > total variation of x. On the other hand for each
h € X we have 1ш1п_юо ТП(Л) exists and therefore supra ||Tn(ft)|| < oo. The
Banach-Steinhaus Theorem then implies that supra ||Tn|| < oo, hence the total
variation of x is finite. D
Returning to stochastic processes, we might hope to circumvent the lim-
limitations imposed by Theorem 56 by appealing to convergence in probability.
That is, if X(s,u) is right continuous (or even continuous), can we have the
sums
J2 Hth(Xth+1 - Xth) (¦*)
converging to a limit in probability for every continuous process HI Unfortu-
Unfortunately the answer is that X must still have paths of finite variation, a.s. The
reason is that one can make the procedure used in the proof of Theorem 56
measurable in u, and hence a subsequence of the sums in (**) can be made
to converge a.s. to +oo on the set where X is not of finite variation. If this
set has positive probability, the sums cannot converge in probability.
The preceding discussion makes it appear impossible to develop a coherent
notion of a stochastic integral Jo HsdXs when X is a process with paths of
infinite variation on compacts; for example a Brownian motion. Nevertheless
this is precisely what we will do in Chap. II.
Bibliographic Notes
The basic definitions and notation presented here have become fundamental
to the modern study of stochastic processes, and they can be found many
places, such as Dellacherie-Meyer [45], Doob [55], and Jacod-Shiryaev [110].
Theorem 3 is true in much greater generality. For example the hitting time of
a Borel set is a stopping time. This result is very difficult, and proofs can be
found in Dellacherie [41] or [42].

Exercises for Chapter I 45
The resume of martingale theory consists of standard theorems. The reader
does not need results from martingale theory beyond what is presented here.
Those proofs not given can be found in many places, for example Breiman [23],
Dellacherie-Meyer [46], or Ethier-Kurtz [71].
The Poisson process and Brownian motion are the two most important
stochastic processes for the theory of stochastic integration. Our treatment
of the Poisson process follows Qinlar [33]. Theorem 25 is in Bremaud [24],
and is due to J. de Sam Lazaro. The facts about Brownian motion needed for
the theory of stochastic integration are the only ones presented here. A good
source for more detailed information on Brownian motion is Revuz-Yor [208],
or Hida [88].
Levy processes (processes with stationary and independent increments) are
a crucial source of examples for the theory of semimartingales and stochastic
integrals. Indeed in large part the theory is abstracted from the properties of
these processes. There do not seem to be many presentations of Levy processes
concerned with their properties which are relevant to stochastic integration
beyond that of Jacod-Shiryaev [110]. However, one can consult the books of
Bertoin [12], Rogers-Williams [210], and Revuz-Yor [208]. Our approach is
inspired by Bretagnolle [25]. Local martingales were first proposed by K. I to
and S. Watanabe [102] in order to generalize the Doob-Meyer decomposition.
Standard Stieltjes integration applied to finite variation stochastic processes
was not well known before the fundamental work of Meyer [171]. Finally, the
idea of using the Banach-Steinhaus Theorem to show that naive stochastic
integration is impossible is due to Meyer [178].
Exercises for Chapter I
For all of these exercises, we assume as given a filtered probability space
(fi, T, F, P) satisfying the usual hypotheses.
Exercise 1. Let S, T be stopping times, S <T a.s. Show Ts С Тт-
Exercise 2. Give an example where S, T are stopping times, S < T, but
T — S is not a stopping time.
Exercise 3. Let (Tn)n>i be a sequence of stopping times. Show that supra Tn,
infnTn, \imsupn_,ooTn, and liminf^-^Tn are all stopping times.
Exercise 4. Suppose (Tn)n>i is a sequence of stopping times decreasing
to a random time T. Show that T is a stopping time, and moreover that
Гт = Г\пГтп.
Exercise 5. Let p > 1 and let Mn be a sequence of continuous martingales
(that is, martingales whose paths are continuous, a.s.) with МД, e Lp, each
n. Suppose M? -» X in Lp and let Mt = E{X\Ft}-
(a) Show Mt G Lp, all t > 0.

46 Exercises for Chapter I
(b) Show M is a continuous martingale.
Exercise 6. Let TV = (Nt)t>0 be Poisson with intensity A. Suppose Xt is an
integer. Show that E{\Nt - Xt\} =
Exercise 7. Let TV = (TVt)t>o be Poisson with intensity A. Show that TV is
continuous in L2 (and hence in probability), but of course TV does not have
continuous sample paths.
Exercise 8. Let Bt = (??/,??42,.В43) be three dimensional Brownian motion
(that is, the Bl are i.i.d. one dimensional Brownian motions, 1 < г < 3). Let
та = supt{||Bt|| < a}. The time та is known as a last exit time. Show that
if a > 0, then та is not a stopping time for the natural filtration of B.
Exercise 9. Let GVtJ)i>o be a sequence of i.i.d. Poisson processes with pa-
parameter A = 1. Let Ml = \{Щ - t), and let M = (?~i M')t>0.
(a) Show that M is well-defined (i.e., that the series converges) in the I?
sense.
*(b) Show that for any t > 0, Yls<t &MS = oo a.s., where AMS — Ms - Ms_
is the jump of M at time s.
Exercise 10. Let (iVt*)t>o and (LJt)t>obe two independent sequences of i.i.d.
Poisson processes of parameters A = 1. Let Ml = \{Nl - L\).
(a) Show that M is a martingale and that M changes only by jumps.
*(b) Show that ]Ts<t |ДМа| = oo a.s., and t > 0.
Exercise 11. Show that a compound Poisson process is a Levy process.
Exercise 12. Let Z be a compound Poisson process, with i5{|[/i|} < сю.
Show that Zt — E{Ux}Xt is a martingale.
Exercise 13. Let Z = (Zt)t>o be a Levy process which is a martingale, but
Z has no Brownian component (that is, the constant a1 — 0 in the Levy-
Khintchine formula), and a finite Levy measure v. Let A = и(Ж). Show that
Z is a compensated compound Poisson process with arrival intensity A and
i.i.d. jumps with distribution /i = ji/.
*Exercise 14. Let 0 < T\ < T^ < ¦ ¦ ¦ < Tn < ¦ ¦ ¦ be any increasing sequence
of finite-valued stopping times, increasing to oo. Let Nt = SSi 1{*>T;}> and
let (Ui)i>i be independent, and also the (Ui)i>i are independent of the pro-
process TV. l3how that if supt E{\Ui\} < oo, all i, and E{Ui) = 0, all i, then
Zt = Y^i Ui\{t>Ti) ls a martingale. (Note: We are not assuming that TV
is a Poisson process here. Furthermore, the random variables (f*)i>i, while
independent, are not necessarily identically distributed.)
Exercise 15. Let Z be a Levy process which is a martingale, but with no
Brownian component and with a Levy measure of the form

Exercises for Chapter I 47
oo
v(dx) = y^akepk{dx),
fe=i
where e@k (dx) denotes point mass at /?? G R of size one, with Xjfcli /?fcafc < °°-
Show that Z is of the form
fe=i
where (Nf)t>0 is an independent sequence of Poisson processes with param-
parameters («fc)fc>i. Verify that Z is an I? martingale.
Exercise 16. Let В = (Bt)o<t<i be one dimensional Brownian motion on
[0,1]. Let Wt = Bi_t - B\, 0 < t < 1. Show that W has the same distribution
as does B, and that W is a Brownian motion for 0 < t < 1 for its natural
filtration.
Exercise 17. Let X be any process with cadlag paths. Set e > 0.
(a) Show that on [0, t] there are only a finite number of jumps larger than e.
That is, if Nt = ?s<t 1{\axb\>c}, where AXS =XS- Xs-, then P(Nt <
oo) = 1, each t > 0.
(b) Conclude that X can have only a countable number of jumps on [0,t].
Exercise 18. Let Z be a Levy process, and let e > 0. Let AZS = Zs — Zs~,
and set Jt? = X^<t ^^s^{\azs\>e}- Show that Z - J? and J? are independent
Levy processes.
Exercise 19. Let X be an adapted process with continuous paths. Show that
X is locally bounded.
*Exercise 20. Give an example of an adapted process X with Xo = 0 that
has cadlag paths which are not locally bounded.
Exercise 21. (This exercise shows that 2 is the best constant in Doob's
quadratic martingale inequality.) Let (Г2, J-, P) be Lebesgue measure on @,1)
with the Borel sets. Define the filtration
Tt = {A&T \ (t,l) cA, or Л с @,*]}, 0<*< 1,
and Tt—T for t > 1. Intuitively, with this filtration we know all about what
is going on during @, t], but nothing about what is happening during (t, 1).
(a) Show that for Y ? L1, the martingale Mt = Е{У\^} is given by
() 0<co<t,
Mt(u) = "
/ Y(s)ds, t < и < 1.
1 ~ * Jt

48 Exercises for Chapter I
(b) Let Y{uj) = A - u>)-a for some a, 0 < a < 1/2. Show that Y <E L2 and
also if Mt = E{Y\Ft), then
(c) Show supt Mt = jz^Y, и-Ъу-со, and deduce
Note that Iima_>1/2A - a) = 2.
Exercise 22. (This exercise gives an example of a martingale with each path
bounded but which nevertheless is not locally bounded.) Let (fi,.F,F, P) be
as in Exercise 21. Let Y(u>) = ^, and Mt = E{Y\Tt}.
(a) Show that
(b) Let Г be a stopping time, Г not identically 0. Show that T(w) > со on
some w-interval @, e) with e > 0.
(c) Use (b) to show E{M%.} > /Q? ^ds = oo, and deduce that M cannot be
locally an L2 martingale.
(d) Conclude that M has bounded paths but that Мт1{т>о) is unbounded
for every stopping time Г not identically 0.
*Exercise 23. Find an example of a local martingale M and finite stop-
stopping times S, T with S < T a.s. such that Е{\МТ\\Р3) < oo, but Ms ф
E{MT\Ts} a.s. {Hint: Begin by showing that the equality would imply that
every positive local martingale is a martingale.)
Exercise 24. (This exercise helps to clarify the difference between stopping
time (T-algebras of the two forms Gt- and Gt-) Let Z be a Levy process,
and let Gt — (?{Zs;s < i} V J\f, where N are all the null sets in the given
complete probability space [Q.,J-,P). For a stopping time Г define Gt- =
a{A П (t < Г); A ? Gt}; at time 0, set Go- = Go- For a process X, let
XT~ = Xtl{t<T] + XT-l{t>T}, and take X0- = 0.
(a) Show St- = cj{T, ZT~}vAf.
(b) Show Gt = v{T,Zt}vN.
(c) Show Gt = °{Gt-,Zt} if T is a.s. finite.
(d) Conclude that for a bounded time Г if Zy = Z?- a.s., then С/т = Gt--
(Note that if G = (?t)t>o is the completed natural filtration of a Brownian
motion, then Gt = Gt- for all bounded stopping times.)

Exercises for Chapter I 49
Exercise 25. Let Z be a Levy process (or any process which is continuous
in probability). Show that the probability Z jumps at a given fixed time t is
zero.
Exercise 26. Use Exercises 24 and 25 to show that with the assumptions of
Exercise 24, for any fixed time t, Qt- = Qt, giving a continuity property of
the filtration for fixed times (since by Theorem 31 we always have Qt+ — Qt),
but of course not for stopping times.
Exercise 27. Let S, T be two stopping times with S < T. Show Ts- С
Тт-- Moreover if (Tn)n>i is any increasing sequence of stopping times with
limn_,oo Tn = Г, show that
Ft- = \j Ртп--
n
Exercise 28. Prove the first equality of Theorem 38 when /1л е Ll{dv).
**Exercise 29. Show that if Z is a Levy process and a local martingale,
then Z is a martingale. That is, all Levy process local martingales are actually
martingales.
Exercise 30 (reflection principle for Levy processes). Let Z be a sym-
symmetric Levy process with Zo — 0. That is, Z and —Z have the same dis-
distribution. Let St = sups<t Zs. Show that
P(St >z-Zt<z-y)< P(Zt >z + y).
Exercise 31. Let Z be a symmetric Levy process with Zq — 0 as in Exer-
Exercise 30. Show that
P(St >z)< 2P{Zt > z).

II
Semimartingales and Stochastic Integrals
1 Introduction to Semimartingales
The purpose of the theory of stochastic integration is to give a reasonable
meaning to the idea of a differential to as wide a class of stochastic processes
as possible. We saw in Sect. 8 of Chap. I that using Stieltjes integration on a
path-by-path basis excludes such fundamental processes as Brownian motion,
and martingales in general. Markov processes also in general have paths of
unbounded variation and are similarly excluded. Therefore we must find an
approach more general than Stieltjes integration.
We will define stochastic integrals first as a limit of sums. A priori this
seems hopeless, since even by restricting our integrands to continuous pro-
processes we saw as a consequence of Theorem 56 of Chap. I that the differential
must be of finite variation on compacts. However an analysis of the proof of
Theorem 56 offers some hope. In order to construct a function h such that
h(tk) = sign(x(ifc+i) — x(tk)), we need to be able to "see" the trajectory of x
on (tk,tk+i]- The idea of K. Ito was to restrict the integrands to those that
could not see into the future increments, namely adapted processes.
The foregoing considerations lead us to define the stochastic processes that
will serve as differentials as those that are "good integrators" on an appropri-
appropriate class of adapted processes. We will, as discussed in Chap. I, assume that
we are given a filtered, complete probability space (fi,.F,F, P) satisfying the
usual hypotheses.
Definition. A process H is said to be simple predictable if H has a rep-
representation
Ht =
where 0 = T\ < ¦ ¦ ¦ < Tn+i < oo is a finite sequence of stopping times,
Hi G TTi with \Ht\ < oo a.s., 0 < i < n. The collection of simple predictable
processes is denoted S.

52 II Semimartingales and Stochastic Integrals
Note that we can take T\ = To = 0 in the above definition, so there is
no "gap" between To and T\. We can topologize S by uniform convergence in
(t,co), and we denote S endowed with this topology by Su. We also write L°
for the space of finite-valued random variables topologized by convergence in
probability.
Let X be a stochastic process. An operator, Ix, induced by X should
have two fundamental properties to earn the name "integral." The operator
Ix should be linear, and it should satisfy some version of the Bounded Con-
Convergence Theorem. A particularly weak form of the Bounded Convergence
Theorem is that the uniform convergence of processes Hn to H implies only
the convergence in probability of Ix{Hn) to Ix(H).
Inspired by the above considerations, for a given process X we define a
linear mapping Ix '¦ S —> L° by letting
n
Ix{H) = ЩХо + 2_^ Hi(Xri+1 — Xti),
2=1
where H e S has the representation
Ht = ffol{o} '
Since this definition is a path-by-path definition for the step functions
H(u>), it does not depend on the choice of the representation of H in S.
Definition. A process X is a total semimartingale if X is cadlag, adapted,
and Ix '¦ Su —> L° is continuous.
Recall that for a process X and a stopping time Г, the notation XT denotes
the process (XtAT)t>o-
Definition. A process X is called a semimartingale if, for each t G [0, oo),
X1 is a total semimartingale.
With our definition of a semimartingale, the second fundamental property
we want (bounded convergence) holds. We postpone consideration of examples
of semimartingales to Sect. 3.
2 Stability Properties of Semimartingales
We state a sequence of theorems giving some of the stability results which are
particularly simple.
Theorem 1. The set of (total) semimartingales is a vector space.
Proof. This is immediate from the definition. D

2 Stability Properties of Semimartingales 53
Theorem 2. If Q is a probability and absolutely continuous with respect to
P, then every (total) P semimartingale X is a (total) Q semimartingale.
Proof Convergence in P-probability implies convergence in Q-probability.
Thus the theorem follows from the definition of X. D
Theorem 3. Let (Pk)k>i be a sequence of probabilities such that X is a Pk
semimartingale for each к. Let R = ^ZfeLi ^кРк> where Xk > 0, each k, and
Sfet=i -^fc = 1- Then X is a semimartingale under R as well.
Proof. Suppose Hn e S converges uniformly to H G S. Since X is a Pk
semimartingale for all Pk, Ix{Hn) converges to Ix(H) in probability for every
Pk- This then implies Ix(Hn) converges to Ix(H) under R. О
Theorem 4 (Strieker's Theorem). Let X be a semimartingale for the
filtration F. Let G be a subfiltration of F, such that X is adapted to the G
filtration. Then X is a G semimartingale.
Proof. For a filtration И, let S(H) denote the simple predictable processes for
the filtration Ш = (Tit)t>o- In this case we have S(G) is contained in S(F).
The theorem is then an immediate consequence of the definition. G
Theorem 4 shows that we can always shrink a filtration and preserve the
property of being a semimartingale (as long as the process X is still adapted),
since we are shrinking as well the possible integrands; this, in effect, makes
it "easier" for the process X to be a semimartingale. Expanding the filtra-
filtration, therefore, should be—and is—a much more delicate issue. Expansion
of nitrations is considered in much greater detail in Chap. VI. We present
here an elementary but useful result. Recall that we are given a filtered space
{?l,J-, F, P) satisfying the usual hypotheses.
Theorem 5 (Jacod's Countable Expansion). Let A be a collection of
events in T such that if Aa, Ap e A then Aa П A$ = 0, (а ф /3). Let Tit be
the filtration generated by Ft and A. Then every (F, P) semimartingale is an
(H, Р) semimartingale also.
Proof. Let An G A. If P{An) = 0, then An and An are in J-q by hypothesis.
We assume, therefore, that P{An) > 0. Note that there can be at most a
countable number of An e A such that P(An) > 0. If Л = \Jn>1 An is the
union of all An € A with P(An) > 0, we can also add Ac to A without loss
of generality. Thus we can assume that A is a countable partition of Q with
P(An) > 0 for every An G A. Define a new probability Qn by Qn{-) = P(-\An),
for An fixed. Then Qn <C P, and X is a (F, Qn) semimartingale by Theorem 2.
If we enlarge the filtration F by all the T measurable events that have Qn-
probability 0 or 1, we get a larger filtration Jn = (J")t>o, and X is a (J™, Qn)
semimartingale. Since Qn(Am) = 0 or 1 for m ф n, we have Tt С Ht С J™,
for t > 0, and for all n. By Strieker's Theorem (Theorem 4) we conclude that
X is an (И, Qn) semimartingale. Finally, we have dP = J2n>i P(An)dQn,

54 II Semimartingales and Stochastic Integrals
where X is an (H, Qn) semimartingale for each n. Therefore by Theorem 3
we conclude X is an (H, P) semimartingale. D
Corollary. Let Л be a finite collection of events in F, and let H = (Ht)t>o
be the filtration generated by Tt and A. Then every (F, P) semimartingale is
an (H, P) semimartingale also.
Proof. Since A is finite, one can always find a (finite) partition П of п such
that Ht= f(V П. The corollary then follows by Theorem 5. ?
Note that if В = (Bt)t>o is a Brownian motion for a filtration {J~t)t>o,
by Theorem 5 we are able to add, in a certain manner, an infinite number of
"future" events to the filtration and В will no longer be a martingale, but it
will stay a semimartingale. This has interesting implications in finance theory
(the theory of continuous trading). See for example Duffie-Huang [60].
The corollary of the next theorem states that being a semimartingale is a
"local" property; that is, a local semimartingale is a semimartingale. We get
a stronger result by stopping at Tn— rather than at Tn in the next theorem.
A process X is stopped at Г— if Xj~ = Xfl{o<t<T} + ^T-l{t>T}> where
Xo- = 0.
Theorem 6. Let X be a cadlag, adapted process. Let (Tn) be a sequence of
positive r.v. increasing to oo a.s., and let (Xn) be a sequence of semimartin-
semimartingales such that for each n, XTn~ = (Xn)Tn~. Then X is a semimartingale.
Proof. We wish to show X* is a total semimartingale, each t > 0. Define
Rn = Tnl{Tn<t} + °°l{Tn>t}- Then
P{\Ix*{H)\ >c}< P{\I{xn)t(H)\ >c} + P(Rn < oo).
But P(Rn < oo) = P(Tn < t), and since Tn increases to oo a.s., P(Tn < t) —> 0
as n —> oo. Thus if Hk tends to 0 in Su, given e > 0, we choose n so that
P(Rn < oo) < e/2, and then choose к so large that P{\I(X"y(Hk)\ > с} <
e/2. Thus, for к large enough, P{\IXt(Hk)\ > c} < e. ?
Corollary. Let X be a process. If there exists a sequence (Tn) of stopping
times increasing to oo a.s., such that XTn (or XTnl{Tn>0}) is a semimartin-
semimartingale, each n, then X is also a semimartingale.
3 Elementary Examples of Semimartingales
The elementary properties of semimartingales established in Sect. 2 will allow
us to see that many common processes are semimartingales. For example, the
Poisson process, Brownian motion, and more generally all Levy processes are
semimartingales.

3 Elementary Examples of Semimartingales 55
Theorem 7. Each adapted process with cadlag paths of finite variation on
compacts (of finite total variation) is a semimartingale (a total semimartin-
semimartingale).
Proof. It suffices to observe that \IX{H)\ < \\H\\U /0°° \dXs\, where /0°° \dX3\
denotes the Lebesgue-Stieltjes total variation and ||#||u — sup(t№) \H(t,w)\.
П
Theorem 8. Each L2 martingale with cadlag paths is a semimartingale.
Proof Let X be an I? martingale with Xo = 0, and let H G S. Using Doob's
Optional Sampling Theorem and the L2 orthogonality of the increments of
L2 martingales, it suffices to observe that
E{{lX{n)) ) = Ь{{2^Нг(Лт1+1 -XTi)) f = b{2_^HiixTi+1 ~ XTi) }
г=0 г=0
n n
У (A^-.j —X r?i) / — ll^llu-^l/ ( T' — Ту/
i-0 i=0
" J<||ff||^{^}. D
Corollary 1. Each cadlag, locally square integrable local martingale is a
semimartingale.
Proof. Apply Theorem 8 together with the corollary to Theorem 6. ?
Corollary 2. A local martingale with continuous paths is a semimartingale.
Proof. Apply Corollary 1 together with Theorem 51 in Chap. I. G
Corollary 3. The Wiener process (that is, Brownian motion) is a semimartin-
semimartingale.
Proof. The Wiener process Bt is a martingale with continuous paths if So is
integrable. It is always a continuous local martingale. ?
Definition. We will say an adapted process X with cadlag paths is decom-
decomposable if it can be decomposed Xt = Xo + Mt + At, where Mo = Л о = О,
M is a locally square integrable martingale, and A is cadlag, adapted, with
paths of finite variation on compacts.
Theorem 9. A decomposable process is a semimartingale.
Proof. Let Xt — Xo + Mt + At be a decomposition of X. Then M is a semi-
semimartingale by Corollary 1 of Theorem 8, and Л is a semimartingale by The-
Theorem 7. Since semimartingales form a vector space (Theorem 1) we have the
result. D
Corollary. A Levy process is a semimartingale.

56 II Semimaxtingales and Stochastic Integrals
Proof. By Theorem 40 of Chap. I we know that a Levy process is decompos-
decomposable. Theorem 9 then gives the result. ?
Since Levy processes are prototypic strong Markov processes, one may well
wonder if all R"-valued strong Markov processes are semimartingales. Simple
1 /я
examples, such as Xt = Bt , where В is standard Brownian motion, show
this is not the case (while this example is simple, the proof that X is not a
semimartingale is not elementary1). However if one is willing to "regularize"
the Markov process by a transformation of the space (in the case of this
example using the "scale function" S{x) = x3), "most reasonable" strong
Markov processes are semimartingales. Indeed, Dynkin's formula, which
states that if / is in the domain of the infinitesimal generator G of the strong
Markov process Z, then the process
Mt = f(Zt) - f(Z0) -
is well-defined and is a local martingale, hints strongly that if the domain of
G is rich enough, the process Z is a semimartingale. In this regard see Sect. 7
of Cinlar, Jacod, Protter, and Sharpe [34].
4 Stochastic Integrals
In Sect. 1 we defined semimartingales as adapted, cadlag processes that acted
as "good integrators" on the simple predictable processes. We now wish to
enlarge the space of processes we can consider as integrands. In Chap. IV we
will consider a large class of processes, namely those that are "predictably
measurable" and have appropriate finiteness properties. Here, however, by
keeping our space of integrands small—yet large enough to be interesting—
we can keep the theory free of technical problems, as well as intuitive.
A particularly nice class of processes for our purposes is the class of
adapted processes with left continuous paths that have right limits (the French
acronym would be caglad).
Definition. We let D denote the space of adapted processes with cadlag
paths, L denote the space of adapted processes with caglad paths (left con-
continuous with right limits) and bL denote processes in L with bounded paths.
We have previously considered Su, the space of simple, predictable pro-
processes endowed with the topology of uniform convergence; and L°, the space
of finite-valued random variables topologized by convergence in probability.
We need to consider a third type of convergence.
1 See Theorem 71 of Chap. IV which proves a similar assertion for Xt = |St|°
0 < a < 1/2.

4 Stochastic Integrals 57
Definition. A sequence of processes {Hn)n>i converges to a process H uni-
uniformly on compacts in probability (abbreviated ucp) if, for each t > 0,
sup0<s<t |ff™ — Hs\ converges to 0 in probability.
We write H* = supo<s<t \HS\. Then if Yn G В we have Yn -> Г in ucp
if (Fra - У)* converges to 0 in probability for each t > 0. We write Ducp,
LUcp> Sucp to denote the respective spaces endowed with the ucp topology.
We observe that Ducp is a metrizable space; indeed, a compatible metric is
given, for X, Y e Ю>, by
n=l
The above metric space Bucp is complete. For a semimartingale X and a
process H 6 S, we have defined the appropriate notion Ix(H) of a stochastic
integral. The next result is key to extending this definition.
Theorem 10. The space S is dense in L under the ucp topology.
Proof. Let Y 6 L. Let Rn = inf{t: \Yt\ > n}. Then Rn is a stopping time and
yn _ YRnl{Rn>oy are in bL and converge to Y in ucp. Thus bL is dense in
L. Without loss we now assume Y 6 bL. Define Z by Zt = limu_»t Yu. Then
u>t
Z 6 D. For ? > 0, define
T?+1 = inf{t :t>T^ and \Zt - ZT*\ > e}.
Since Z is cadlag, the T^ are stopping times increasing to oo a.s. as n increases.
Let Z? = VV Zt'Iit' tc ь for each e > 0. Then Ze are bounded and con-
verge uniformly to Z as ? tends to 0. Let Ce = lol{o} + Sn^r«l(Te,Te J-
Then Us 6 bL and the preceding implies U? converges uniformly on compacts
Finally, define
*yn,e ~y -i _j_
i=l
and this can be made arbitrarily close to У 6 bL by taking e small enough
and n large enough. D
We defined a semimartingale X as a process that induced a continuous
operator Ix from Su into L°. This Ix maps processes into random variables
(note that X^ is a random variable). We next define an operator (the stochas-
stochastic integral operator), induced by X, that will map processes into processes
(note that XTi is a process).

58 II Semimaxtingales and Stochastic Integrals
Definition. For H 6 S and X a cadlag process, define the (linear) mapping
Jx : S -> В by
JX(H) = H0X0 +
г=1
for H in S with the representation
n
H = tfol{o} +^ЯД(ТгХ+1],
#» G -^ and 0 = To < Ti < ¦ ¦ ¦ < Tn+1 < oo stopping times.
Definition. For HeS and X an adapted cadlag process, we call Jx {H) the
stochastic integral of H with respect to X.
We use interchangeably three notations for the stochastic integral:
Jx{H)= fHsdXs=H-X.
Observe that Jx(H)t = Ix*(H). Indeed, Ix plays the role of a definite inte-
integral. For H e S, IX(H) = /0°° HsdXs.
Theorem 11. Let X be a semimartingale. Then the mapping Jx '¦ Sucp —>
Ducp is continuous.
Proof. Since we are only dealing with convergence on compact sets, without
loss of generality we take X to be a total semimartingale. First suppose Hk
in S tends to 0 uniformly and is uniformly bounded. We will show Jx(Hk)
tends to 0 ucp. Let 5 > 0 be given and define stopping times Tk by
Tk =ini{t: \(Hk-X)t\ >5}.
Then fffel[0>Tfc] 6 S and tends to 0 uniformly as к tends to oo. Thus for every
t,
P{(Hk ¦ X)* >S}< P{\Hk ¦ XTkAt\ > 5}
= P{\(Hkl[OtTkyX)t\>6}
= P{\Ix(H%,TkAt])\>6}
which tends to 0 by the definition of total semimartingale.
We have just shown that Jx '¦ Su —» Ducp is continuous. We now use this
to show Jx : Sucp —> Ducp is continuous. Suppose Hk goes to 0 ucp. Let
S > 0, e > 0, t > 0. We have seen that there exists rj such that \\H\\U < r)
implies P(Jx(H)*t > S) < §. Let Rk = inf{s : \Hk\ > 77}, and set Hk =
Hkl[0,Rk]l{Rk>0}. Then ff'eS and \\Hk\\u < rj by left continuity. Since
Rk > t implies (Hk ¦ X)*t = (Hk ¦ X)*, we have

4 Stochastic Integrals 59
P((Hk ¦ Xft >5)< P((Hk ¦ X)* >S) + P(Rk < t)
< e
if к is large enough, since lim/t^oo P((Hk)l > rj) = 0. ?
We have seen that when X is a semimartingale, the integration operator
Jx is continuous on SUcp, and also that Sucp is dense in Lucp. Hence we are
able to extend the linear integration operator Jx from S to L by continuity,
since Ducp is a complete metric space.
Definition. Let X be a semimartingale. The continuous linear mapping Jx '¦
LUCp —> ^>ucp obtained as the extension of Jx : S —> Ю> is called the stochastic
integral.
The preceding definition is rich enough for us to give an immediate example
of a surprising stochastic integral. First recall that if a process (At)t>o has
continuous paths of finite variation with Ao — 0, then the Riemann-Stieltjes
integral of /0 AsdAs yields the formula (see Theorem 54 of Chap. I)
/
Jo
¦t I
AsdAs = ~At.
о 2
Let us now consider a standard Brownian motion В = (Bt)t>o with Bq = 0.
The process В does not have paths of finite variation on compacts, but it is
a semimartingale. Let (тг„) be a refining sequence of partitions of [0, oo) with
limn^oo meshGrn) - 0. Let Btn = 52гк€ъп B*k 1(tk,tk+i]- Then Bn € L for each
n. Moreover, Bn converges to В in ucp. Fix t > 0 and assume that t is a
partition point of each тг„. Then
jB{Bn)t = J2
tk <i
and
JB(B)t= lim JB(Bn)t = lim
{\(Btk+l + Btk)(Btk+1 - Btk)
tk?irn,tk<t
-\(Btk+1-Btk)(Btk+1-Btk)}
where we have used telescoping sums. The last term on the right, however,
converges a.s. to t, by Theorem 28 of Chap. I. We therefore conclude that

60 II Semimartingales and Stochastic Integrals
i:
BsdBs = -B* - -t, (**)
a formula distinctly different from the Riemann-Stieltjes formula (*). The
change of variables formula (Theorem 32) presented in Sect. 7, will give a
deeper understanding of the difference in formulas (*) and (**).
5 Properties of Stochastic Integrals
Throughout this paragraph X will denote a semimartingale and H will denote
an element of L. Recall that the stochastic integral defined in Sect. 4 will be
denoted by the three notations Jx(H) = H ¦ X = J HsdXs. Evaluating these
processes at t, we have
= / HsdXs = f
Jo J[o,t
HXt= [ HsdXs = / HsdXs.
Ml
To exclude 0 in the integral we write
/ HsdXs = / HsdXs.
o+ J(o,t]
The integral /0°° HsdXs is defined to be limt-юо /0* HsdXs when the limit
exists. Note that f* HsdXs = H0X0 + f*+HsdXs. For a process Y e Ю>,
we recall that AYt = Yt - Y"t_, the jump at t. Also since Yo- = 0 we have
AY0 = Yo. Recall further that for a process Z and stopping time T, we let
ZT denote the stopped process {Zj) = ZthT- We will establish in this section
several elementary properties of the stochastic integral. These properties will
help us to understand the integral and to examine examples at the end of
the section. Recall that two processes Y and Z are indistinguishable if
P{oj : t >—> Xt{u>) and t >—> Yt(oj) are the same functions} = 1.
Theorem 12. Let T be a stopping time. Then (H ¦ X)T = i/l[Or] • X =
н-{хт).
Theorem 13. The jump process A(H-X)S is indistinguishable from Hs (AXS).
Proofs. Both properties are clear when ffeS, and they follow when H e L
by passing to the limit. Indeed, we take convergence in ucp for each t, then
a.s. convergence by taking a subsequence, and then we choose the rationale
(which of course is countable and dense, so the union of null sets is still a null
set), and then use the path regularity to get indistinguishability. ?
Let Q denote another probability law, and let Hq ¦ X denote the stochastic
integral of H with respect to X computed under the law Q.
Theorem 14. Let Q <?C P. Then Hq ¦ X is Q indistinguishable from HP ¦ X.

5 Properties of Stochastic Integrals 61
Proof. Note that by Theorem 2, X is a Q semimartingale. The theorem is clear
if H 6 S, and it follows for H G L by passage to the limit in the ucp topology,
since convergence in P-probability implies convergence in Q-probability. ?
Theorem 15. Let Pk be a sequence of probabilities such that X is a Pk
semimartingale for each к. Let R = YlT=i ^кРк where Xk > 0, each к, and
Efeli Afc = !• Then HR-X = HPk ¦ X, Pk-a.s., for all к such that Xk > 0.
Proof. If Afc > 0 then Pk <€. R, and the result follows by Theorem 14. Note
that by Theorem 3 we know that X is an R semimartingale. ?
Corollary. Let P and Q be any probabilities and suppose X is a semimartin-
semimartingale relative to both P and Q. Then there exists a process H ¦ X which is a
version of both Hp ¦ X and Hq ¦ X.
Proof. Let R = (P + Q)/2. Then Hr-X is such a process by Theorem 15. ?
Theorem 16. Let G = (Qt)t>o be another filtration such that H is in both
L(G) and L(F), and such that X is also a G semimartingale. Then Hq • X =
H?-X.
Proof. L(G) denotes left continuous processes adapted to the filtration G. As
in the proof of Theorem 10, we can construct a sequence of processes Hn
converging to H where the construction of the Hn depends only on H. Thus
Hn 6 S(G) П S(F) and converges to H in ucp. Since the result is clear for S,
the full result follows by passing to the limit. ?
Remark. While Theorem 16 is a simple result in this context, it is far from
simple if the integrands are predictably measurable processes, rather than
processes in L. See the comment following Theorem 33 of Chap. IV.
The next two theorems are especially interesting because they show—at
least for integrands in L—that the stochastic integral agrees with the path-
by-path Lebesgue-Stieltjes integral, whenever it is possible to do so.
Theorem 17. If the semimartingale X has paths of finite variation on com-
compacts, thenH-X is indistinguishable from the Lebesgue-Stieltjes integral, com-
computed path-by-path.
Proof. The result is evident for H 6 S. Let Hn e S converge to H in ucp.
Then there exists a subsequence n^ such that limnfc_)OO(i/™'c — HI = 0 a.s.,
and the result follows by interchanging limits, justified by the uniform a.s.
convergence. D
Theorem 18. Let X, X be two semimartingales, and let H, H G L. Let
A = {lj : H.(w) = Я.(ы) andX.(tj) = X.(w)}, andletB = {u : t н-> Xt(u)
is of finite variation on compacts}. Then H ¦ X = H ¦ X on A, and H ¦ X is
equal to a path-by-path Lebesgue-Stieltjes integral on B.

62 II Semimartingales and Stochastic Integrals
Proof. Without loss of generality we assume P(A) > 0. Define a new prob-
probability law Q by Q{A) = Р{Л\А). Then under Q we have that Я and Я as
well as X and X are indistinguishable. Thus Hq ¦ X = Hq • X, and hence
Я • X = Я • X P-a.s. on A by Theorem 14, since Q <C P.
As for the second assertion, if В = Q the result is merely Theorem 17.
Define R by Д(Л) = Р(Л\В), assuming without loss that P(B) > 0. Then
Д<Р and В = Q, i?-a.s. Hence Яд ¦ X equals the Lebesgue-Stieljes integral
-R-a.s. by Theorem 17, and the result follows by Theorem 14. D
The preceding theorem and following corollary are known as the local
behavior of the integral.
Corollary. With the notation of Theorem 18, let 5, T be two stopping times
with S <T. Define
С = {cj : Ht(w) = Ht(w); Xt(w) = Xt(w); S(w) < t < T(cj)}
D = {u> : t н-> Xt(oj) is of finite variation on S(oj) < t < T(oj)}.
Then H ¦ XT - H ¦ Xs =H -XT -H -Xs on С and H ¦ XT - H ¦ Xs equals
a path-by-path Lebesgue-Stieltjes integral on D.
Proof. Let Yt=Xt- XtAS. Then Я • Y = H ¦ X - H ¦ Xs, and Y does not
change the set [0, S], which is evident, or which—alternatively—can be viewed
as an easy consequence of Theorem 18. One now applies Theorem 18 to YT
to obtain the result. D
Theorem 19 (Associativity). The stochastic integral process Y = Я • X is
itself a semimartingale, and for G ? L we have
GY = G(H-X) = (GH) ¦ X.
Proof. Suppose we know Y = H-X is a semimartingale. Then G-Y = Jy(G).
If G, H are in S, then it is clear that Jy(G) = J\(GH). The associativity
then extends to L by continuity.
It remains to show that Y = H ¦ X is a semimartingale. Let (Я") be in S
converging in ucp to Я. Then Hn ¦ X converges to Я ¦ X in ucp. Thus there
exists a subsequence (nk) such that НПк ¦ X converges a.s. to Я • X.
Let G 6 S and let Ynk = НПк-Х, Y = HX. The Ynk are semimartingales
converging pointwise to the process Y. For G 6 S, Jy(G) is defined for any
process Y; so we have
JY(G)=G-Y= lim G-Ynk= lim G • (НПк ¦ X)
= lim (GHnk)-X
nk~*oo
which equals \imnk^O0Jx{GHnk) = Jx(GH), since X is a semimartingale.
Therefore JY(G) = JX(GH) for GeS.

5 Properties of Stochastic Integrals 63
Let Gn converge to G in Su. Then GnH converges to GH in Lucp, and since
X is a semimartingale, hin^^oo Jy(G") = Нт^^оо Jx(GnH) = J\{GH) =
Jy{G). This implies Yl is a total semimartingale, and so Y = H ¦ X is a
semimartingale. ?
Theorem 19 shows that the property of being a semimartingale is preserved
by stochastic integration . Also by Theorem 17 if the semimartingale X is an
FV process, then the stochastic integral agrees with the Lebesgue-Stieltjes
integral, and by the theory of Lebesgue-Stieltjes integration we are able to
conclude the stochastic integral is an FV process also. That is, the property
of being an FV process is preserved by stochastic integration for integrands in
L.2
One may well ask if other properties are preserved by stochastic integra-
integration; in particular, are the stochastic integrals of martingales and local mar-
martingales still martingales and local martingales? Local martingales are indeed
preserved by stochastic integration, but we are not yet able easily to prove it.
Instead we show that locally square integrable local martingales are preserved
by stochastic integration for integrands in L.
Theorem 20. Let X be a locally square integrable local martingale, and let
Яб1. Then the stochastic integral H ¦ X is also a locally square integrable
local martingale.
Proof. We have seen that a locally square integrable local martingale is a
semimartingale (Corollary 1 of Theorem 8), so we can formulate H ¦ X. With-
Without loss of generality, assume Xo = 0. Also, if Tk increases to oo a.s. and
(H -X)Tk is a locally square integrable local martingale for each k, it is simple
to check that H ¦ X itself is one. Thus without loss we assume X is a square
integrable martingale. By stopping H, we may further assume H is bounded,
by ?. Let Hn^G S be such that Hn converges to H \n ucp. We can then modify
Hn, call it Hn, such that Hn is bounded by ix Hn 6 S, and Hn converges
uniformly to H in probability on [0,4]. Since Hn 6 bS, one can check that
Hn • X is a martingale. Moreover
¦ XJt} - ^
and hence (Hn • X)t are uniformly bounded in L2 and thus uniformly inte-
integrable. Passing to the limit then shows both that H ¦ X is a martingale and
that it is square integrable. ?
See Exercise 43 in Chap. IV which shows this is not true in general.

64 II Semimartingales and Stochastic Integrals
In Theorem 29 of Chap. Ill we show the more general result that if M is
a local martingale and ff e L, then H ¦ M is again a local martingale.
A classical result from the theory of Lebesgue measure and integration
(on R) is that a bounded, measurable function / mapping an interval [0,6]
to R is Riemann integrable if and only if the set of discontinuities of / has
Lebesgue measure zero (e.g., Kingman and Taylor [127, page 129]). Therefore
we cannot hope to express the stochastic integral as a limit of sums unless
the integrands have reasonably smooth sample paths. The spaces Ю> and L
consist of processes which jump at most countably often. As we will see in
Theorem 21, this is smooth enough.
Definition. Let a denote a finite sequence of finite stopping times:
0 = To < Ti < ¦ ¦ ¦ < Tfe < 00.
The sequence a is called a random partition. A sequence of random parti-
partitions <7„,
on : To™ < T? < • ¦ • < T?n
is said to tend to the identity if
(i) HmrasupfeT^ = 00 a.s.; and
(ii) \\an\\ = supfe \Tg+1 - T?\ converges to 0 a.s.
Let У be a process and let a be a random partition. We define the process
Y sampled at a to be
Ya =
It is easy to check that
YfdX. = Y0X0 + Y^YTi{XT^ - XT'),
!•¦
for any semimartingale X, any process Y in S, Ю>, or L.
Theorem 21. Let X be a semimartingale, and let Y be a process in D or in L.
Let (an) be a sequence of random partitions tending to the identity. Then the
processes /0*+ Y°ndXs = J2i Yr?(Xt i+1 —Xt*) tend to the stochastic integral
(Y-) • X in ucp.
Proof (The notation У_ means the process whose value at s is given by
(Y-)s — limu_)SU<syu; also, (YL)o = 0, by convention.) We prove the the-
theorem for the case where Y is cadlag, the other case being analogous. Also
without loss of generality we can take Xo = 0. Y cadlag implies F_ e L. Let
Yk 6 S such that Yk converges to Y_ (ucp). We have

5 Properties of Stochastic Integrals 65
f(Y_ - Y"")adXa = f(Y_ - Yk)sdXs + f(Yk - {Yk)°")sdXs
+ J((?{)"»-Ya»)adX.
where Yk denotes the cadlag version of Yk. The first term on the right side
equals J\ (Y- -Yk), and since Jx is continuous in Lucp and since У_ — Yk —> 0,
we have f(Y- —Yk)sdXs —» 0 (ucp). The same reasoning applies to the third
term, for fixed n, as к —> oo. Indeed, the convergence to 0 of {(y?)"n - Y"n)
as к —> оо is uniform in n.
It remains to consider the middle term on the right side above. Since the Yk
are simple predictable we can write the stochastic integrals in closed form, and
since X is right continuous the integrals (for fixed (к, ш)) J(Yk - (Y+)"nKdX3
tend to 0 as n —> oo. Thus one merely chooses к so large that the first and
third terms are small, and then for fixed k, the middle term can be made
small for large enough n. ?
We consider here another example. We have already seen at the end of
Sect. 4 that if Б is a standard Wiener process, then
i:
showing that the semimartingale calculus does not formally generalize the
Riemann-Stieltjes calculus.
We have seen as well that the stochastic integral agrees with the Lebesgue-
Stieltjes integral when possible (Theorems 17 and 18) and that the stochastic
integral also preserves the martingale property (Theorem 20; at least for lo-
locally square integrable local martingales). The following example shows that
the restriction of integrands to L is not as innocent as it may seem if we want
to have both of these properties.
Example. Let Mt = Nt — Xt, a compensated Poisson process (and hence a
martingale with Mt 6 Lp for all t > 0 and all p > 1). Let (Ti)i>\ be the jump
times of M. Let Ht = l[0)Tl)(t). Then Я G D. The Lebesgue-Stieltjes integral
is
/ HsdMs = f HsdNs -X f Hsds
Jo Jo Jo
oo л
= ^tfTil{t>Ti}-A/
ds
Jo
= -A(iATi).

66 II Semimartingales and Stochastic Integrals
This process is not a martingale. We conclude that the space of integrands
cannot be expanded even to Ю>, in general, and preserve the structure of the
theory already established.3
6 The Quadratic Variation of a Semimartingale
The quadratic variation process of a semimartingale, also known as the bracket
process, is a simple object that nevertheless plays a fundamental role.
Definition. Let X, Y be semimartingales. The quadratic variation pro-
process of X, denoted [X, X] = ([X,X]t)t>o, is defined by
[X,X] =X2 -2 f X-dX
(recall that Xo_ = 0). The quadratic covariation of X, Y, also called the
bracket process of X, Y, is defined by
[X,Y] = XY- f X-dY - /Y_
dX.
It is clear that the operation (X, Y) —> [X, Y] is bilinear and symmetric.
We therefore have a polarization identity
[X,Y] = \({X + YX + Y] - [X,X] - [Y,Y]).
The next theorem gives some elementary properties of [X, X]. (X is assumed
to be a given semimartingale throughout this section).
Theorem 22. The quadratic variation process of X is a cadlag, increasing,
adapted process. Moreover it satisfies the following.
(i) [X,X}0 = X2 and A[X,X] = (AXJ.
(ii) If an is a sequence of random partitions tending to the identity, then
with convergence in ucp, where an is the sequence 0 = T$ < T" < • • • <
T" < • • • < T? and where T" are stopping times.
(Hi) If T is any stopping time, then [XT,X] = [X,XT] = [XT,XT] =
[X,Xf.
3 Ruth Williams has commented to us that this example would be more convincing
if M were itself a semimartingale, H bounded and in D, and J* HsdMs were not
a semimartingale. Such a construction is carried out in [1].

6 The Quadratic Variation of a Semimartingale 67
Proof. X is cadlag, adapted, and so also is / X-dX by its definition; thus
[X, X] is cadlag, adapted as well. Recall the property of the stochastic integral:
. Then
(AXJS = (Xs - Xs_J = X2S - 2XSXS_ + X2
= X2S - X2_ + 2XS_(XS_ - X.
from which part (i) follows.
For part (ii), by replacing X with X = X — Xo, we may assume Xq — 0.
Let Rn — supjT". Then Rn < oo a.s. and limni?n — oo a.s., and thus by
telescoping series
converges ucp to X2. Moreover, the series J^ Ij»(ITi+i —XT") converges in
ucp to f X^dX by Theorem 21, since X is cadlag. Since b2 - a2 — 2a(b - a) =
(b-aJ, and since XTr(XT^ -XT") = XT"(XT^ -XT"), we can combine
the two series convergences above to obtain the result. Finally, note that if
s < t, then the approximating sums in part (ii) include more terms (all non-
negative), so it is clear that [X, X] is non-decreasing. (Note that, a priori, one
only has [X,X]S < [X,X]t a.s., with the null set depending on s and t; it is
the property that [X, X] has cadlag paths that allows one to eliminate the
dependence of the null set on s and t.) Part (iii) is a simple consequence of
part (ii). D
An immediate consequence of Theorem 22 is the observation that if В is a
Brownian motion, then [B, B]t = t, since in Theorem 28 of Chap. I we showed
the a.s. convergence of sums of the form in part (ii) of Theorem 22 when the
partitions are refining.
Another consequence of Theorem 22 is that if X is a semimartingale with
continuous paths of finite variation, then [X, X] is the constant process equal
to Xq. To see this one need only observe that
^ -XT"J< sup |XT"+l - XT" | ? |XT"+l - XT" |
where V is the total variation. Therefore the sums tend to 0 as \\crn\\ —> 0.
Theorem 22 has several more consequences which we state as corollaries.
Corollary 1. The bracket process [X, Y] of two semimartingales has paths
of finite variation on compacts, and it is also a semimartingale.

68 II Semimartingales and Stochastic Integrals
Proof. By the polarization identity [X, Y] is the difference of two increas-
increasing processes, hence its paths are of finite variation. Moreover, the paths are
clearly cadlag, and the process is adapted. Hence by Theorem 7 it is a semi-
martingale. D
Corollary 2 (Integration by Parts). Let X, Y be two semimartingales.
Then XY is a semimartingale and
XY = IX_dY+ f Y-dX + [X,Y].
Proof. The formula follows trivially from the definition of [X,Y]. That XY
is a semimartingale follows from the formula, Theorem 19, and Corollary 1
above. D
In the integration by parts formula above, we have (X-)o = (Y-)o = 0.
Hence evaluating at 0 yields
X0Y0 = (Х_)оУо + (K-)o*o + [X, Y}0.
Since [X, Y]o = AXqAYq = X0Yq, the formula is valid. Without the conven-
convention that (X_)o = 0, we could have written the formula
XtYt= f Xs^dYs+ f
Corollary 3. All semimartingales on a given filtered probability space form
an algebra.
Proof. Since semimartingales form a vector space, Corollary 2 shows they
form an algebra. D
A theorem analogous to Theorem 22 holds for [X,Y] as well as [X, X]. It
can be proved analogously to Theorem 22, or more simply by polarization.
We omit the proof.
Theorem 23. Let X and Y be two semimartingales. Then the bracket process
[X, Y] satisfies the following.
(i) [X, Y]o = X0Y0 and A[X, Y] = AX AY.
(ii) If an is a sequence of random partitions tending to the identity, then
[X,Y] ^
where convergence is in ucp, and where an is the sequence 0 = Tft < T" <
• • • < IT < ¦ ¦ ¦ < T?n, with T? stopping times,
(in) If T is any stopping time, then{XT,Y] = [X,YT] = [XT,YT] = [X,Y)T.

6 The Quadratic Variation of a Semimartingale 69
We next record a real analysis theorem from the Lebesgue-Stieltjes theory
of integration. It can be proved via the Monotone Class Theorem.
Theorem 24. Let a, C, 7 be functions mapping [0,00) to M with a(Q) =
/3@) = 7@) = 0. Suppose a, C, 7 are all right continuous, a is of finite
variation, and C and 7 are each increasing. Suppose further that for all s, t
with s < t, we have
dau\<([ dCu)
s Js
Then for any measurable functions f', g we have
\fg\\da\<{f f2dC
s Js
In particular, the measure da is absolutely continuous with respect to both dC
and dj.
Note that \da\ denotes the total variation measure corresponding to the
measure da, the Lebesgue-Stieltjes signed measure induced by a. We use this
theorem to prove an important inequality concerning the quadratic variation
and bracket processes.
Theorem 25 (Kunita-Watanabe Inequality). Let X and Y be two semi-
martingales, and let H and К be two measurable processes. Then one has
a.s.
/¦OO ЛОО ЛОО
/ \Hs\\Ks\\d[X,Y}s\<( H2d[X,X}s)H K2sd[Y,Y}s)K
Jo Jo Jo
Proof. By Theorem 24 we only need to show that there exists a null set N,
such that for ш ? N, and (s, t) with s < t, we have
d[X,X]u)?( d[Y,Y]u)?. (*)
J s
Let N be the null set such that ifw<?N, then 0 < /g d[X + rY,X + rY}u, for
every r, s, t; s < t, with r, s, t all rational numbers. Then
0 < [X + rY, X + rY]t - [X + rY, X + rY]s
= r\[Y, Y]t - [Y, Y}s) + 2r([X, Y)t - [X, Y]s) + ([X, X}t - [X, X}3).
The right side being positive for all rational r, it must be positive for all real
r by continuity. Thus the discriminant of this quadratic equation in r must be
non-negative, which gives us exactly the inequality (*). Since we have, then,
the inequality for all rational (s,t), it must hold for all real (s,t), by the right
continuity of the paths of the processes. ?

70 II Semimartingales and Stochastic Integrals
Corollary. Let X and Y be two semimartingales, and let H and К be two
measurable processes. Then
лОО лОО лОО
E{ [Hs[[Ks[\d[X,Y]s\}<\[{ Н2АХ,Х)а)Ц\ь,\\{1 K2td[Y,Y]t)l\\L4
Jo Jo Jo
Proof. Apply Holder's inequality to the Kunita-Watanabe inequality of The-
Theorem 25. D
Since Theorem 25 and its corollary are path-by-path Lebesgue-Stieltjes
results, we do not have to assume that the integrand processes H and К be
adapted.
Since the process [X, X] is non-decreasing with right continuous paths, and
since A[X,X]t = (AXtJ for all t > 0 (with the convention that Xo_ = 0),
we can decompose [X, X] path-by-path into its continuous part and its pure
jump part.
Definition. For a semimartingale X, the process [X, X]c denotes the path-
by-path continuous part of [X, X].
We can then write
[X,X]t = [X,X}$ +X2o+
0<s<t
= [x,x}c + J2 (AxsJ-
0<s<t
Observe that [X, X]q = 0. Analogously, [X, Y]c denotes the path-by-path
continuous part of [X, Y].
Comment. In Chap. IV, Sect. 7, we briefly discuss the continuous local mar-
martingale part of a semimartingale X satisfying an auxiliary hypothesis known
as Hypothesis A. This (unique) continuous local martingale part of X is de-
denoted Xе. It can be shown that a unique continuous local martingale part, Xе,
exists for every semimartingale X.4 (If X is an FV process, then Xе = 0, as
we shall see in Chap. III.) It is always true that [Xе, Xе] = [X,X)C. Although
we will have no need of this result in this book, this notation is often used in
the literature.
We remark also that the conditional quadratic variation of a semimartin-
semimartingale X, denoted (X, X), is defined in Chap. Ill, Sect. 4. It is also true
that {Xе, Xе) = [Xе, Xе] = [X,X]C, and these notations are used inter-
interchangeably in the literature. // X is already continuous and Xq = 0, then
{X,X) = [X,X] = [X,Xf.
4 See, for example, Dellacherie-Meyer [46, page 355]. See also in this regard Exer-
Exercise 6 of Chap. IV which proves the result for L2 martingales.

6 The Quadratic Variation of a Semimartingale 71
Definition. A semimartingale X will be called quadratic pure jump if
[X, X]c = 0.
If X is quadratic pure jump, then [X,X]t = X$ + J2o<s<t(AXsJ- The
Poisson process N is an obvious example of a quadratic pure jump semi-
semimartingale. Prom the definition or immediately from Theorem 22, we see that
[N, N]t = Nf. More generally it can be shown that if X is a Levy process with
a Levy decomposition Xt = Bt + Yt as in Theorem 42 of Chap. I, where В is
a Brownian motion and
Yt= f x{Nt{-,dx)-tv{dx))
J{\*\<1}
0<s<t
(using the notation of Theorem 42 of Chap. I), then У is a quadratic pure
jump semimartingale. Last note that the trivial continuous process Xt = t is
quadratic pure jump since [X, X}^ = [X, X]t = 0.
The next theorem gives a simple criterion for a process X to be a quadratic
pure jump semimartingale.
Theorem 26. If X is adapted, cadlag, with paths of finite variation on com-
compacts, then X is a quadratic pure jump semimartingale.
Proof. Without loss of generality we assume Xo = 0. We have already seen
that such an X is a semimartingale (Theorem 7), and that the stochastic inte-
integral with respect to X is nothing more than a path-by-path Lebesgue-Stieltjes
integral (Theorem 17). The integration by parts formula for Lebesgue-Stieltjes
differentials applied to X times itself yields X2 = f X-dX + JXdX, com-
computed path-by-path. The semimartingale integration by parts formula (Corol-
(Corollary 2 of Theorem 22), on the other hand, yields X2 = 2 f X^dX + [X,X].
Moreover
XdX = /(X_ + AX)dX = I X_dX + f AXdX,
and
rt
f X-dXs + f AXsdXs = f
Jo Jo Jo
s<t
Equating the two formulas we deduce [X,X]t = J2s<t(^XsJ, yielding the
theorem. D
Note in particular that if X is adapted with continuous paths of finite
variation, then [X,X]t = X$, all t > 0.
Theorem 27. Let X be a local martingale with continuous paths that are
not everywhere constant. Then [X,X] is not the constant process Xq, and
X2 — [X, X] is a continuous local martingale. Moreover if [X, X]t = 0 for all
t then Xt=0 for all t.

72 II Semimartingales and Stochastic Integrals
Proof. Note that a continuous local martingale is a semimartingale (Corol-
(Corollary 2 of Theorem 8). We have X2 - [X,X] = 2jX_dX, and by the mar-
martingale preservation property (Theorem 20) we have that 2 f X-dX is a local
martingale. Moreover Д2/X-dX — 2(X-)(AX), and since X is continu-
continuous, AX — 0, and thus 2 JX-dX is a continuous local martingale, hence
locally square integrable. Thus X2 — [X, X] is a locally square integrable local
martingale.
By stopping, we can suppose X is a square integrable martingale. As-
Assume further Xo = 0. Next assume that [X, X] actually were constant. Then
[X,X]t = [X,X]0 = X§ = 0, for all t. Since X2 - [X,X] is a local mar-
martingale, we conclude X2 is a non-negative local martingale, with Xq = 0.
Thus X2 = 0, all t. This is a contradiction. If Xo is not identically 0, we set
Xt = Xt — Xo and the result follows. ?
The following corollary is of fundamental importance in the theory of
martingales.
Corollary 1. Let X be a continuous local martingale, and S < T < oo be
stopping times. If X has paths of finite variation on the stochastic interval
(S, T), then X is constant on [S, T]. Moreover if [X, X] is constant on [S, T] П
[0, oo), then X is constant there too.
Proof. M = XT — Xs is also a continuous local martingale, and M has
finite variation on compacts. Moreover [M, M] = \XT — Xs, XT — Xs] =
[X, X]T — [X, X]s, and therefore [M, M] is constant everywhere, hence by
Theorem 27, M must be constant everywhere; thus X is constant on [S, T). D
Observe that if t > 0 is arbitrary and we take 5 = 0 and T = t in Corol-
Corollary 1, then we can conclude that a continuous local martingale with paths
of finite variations on compacts is a.s. constant. While non-trivial continu-
continuous local martingales must therefore always have paths of infinite variation
on compacts, they are not the only such local martingales. For example, the
Levy process martingale Zd of Theorem 41 of Chap. I will have paths of infi-
infinite variation if the Levy measure v has infinite mass in a neighborhood of the
origin. Another example is Azema's martingale, which is presented in detail
in Sect. 8 of Chap. IV.
Corollary 2. Let X and Y be two locally square integrable local martin-
martingales. Then [X, Y] is the unique adapted cadlag process A with paths of finite
variation on compacts satisfying the two properties:
(i) XY — A is a local martingale; and
(ii) AA = AX AY, Ao = X0Y0.
Proof Integration by parts yields
XY= f X_dY+ fY-dX + [X,Y],

6 The Quadratic Variation of a Semimartingale 73
but the martingale preservation property tells us that both stochastic integrals
are local martingales. Thus XY — [X, Y] is a local martingale. Property (ii)
is simply an application of Theorem 23. Thus it remains to show uniqueness.
Suppose А, В both satisfy properties (i) and (ii). Then A — В = {XY —
B) — (XY — A), the difference of two local martingales which is again a local
martingale. Moreover,
A(A -B) = AA-AB = AX AY - AX AY = 0.
Thus A — В is a continuous local martingale, Aq — Bq = 0, and it has paths
of finite variation on compacts. Corollary 1 yields At — Bt — Aq + Bq = 0 and
we have uniqueness. D
Corollary 2 can be useful in determining the process [X, Y]. For example
if X and Y are locally square integrable martingales without common jumps
such that XY is a martingale, then [X, Y] = XoYo. One can also easily verify
(as a consequence of Theorem 23 and Corollaries 1 and 2 above, for example)
that if X is a continuous square integrable martingale and У is a square
integrable martingale with paths of finite variation on compacts, then [X, Y] =
XoYo, and hence XY is a martingale. (An example would be X a Brownian
motion and Y a compensated Poisson process.) Corollary 2 is true as well for
X, Y local martingales, however we need Theorem 29 of Chap. Ill to prove
the general result.
In Chap. Ill we show that any local martingale is a semimartingale (see the
corollary of Theorem 26 of Chap. Ill), and therefore if M is a local martingale
its quadratic variation [M, M]t always exists and is finite a.s. for every t > 0.
We use this fact in the next corollary.
Corollary 3. Let M be a local martingale. Then M is a martingale with
E{M?} < oo, all t > 0, if and only if E{[M,M}t} < oo, all t > 0. If
E{[M,M]t] < oo, then E{M?} = E{[M,M}t}.
Proof. First assume that M is a martingale with E{M?} < oo for all t > 0.
Then M is clearly a locally square integrable martingale. Let Nt = Mt2 -
[M, M]t = 2 JQ Ms-dMs, which is a locally square integrable local martingale
by Theorem 20. Let (Tn)n>i be stopping times increasing to oo a.s. such
that Nfn is a square integrable martingale. Then E{N^n} = E{Nq} = 0, all
t > 0. Therefore
Doob's maximal quadratic inequality gives E{(M^J} < 4Е{М?} < oo.
Therefore by the Dominated Convergence Theorem
E{M?}= lim E{M?ATn}
n—*oo
= lim E{[M,M}tATn}
n—>-oo
= E{[M,M]t},

74 II Semimartingales and Stochastic Integrals
where the last result is by the Monotone Convergence Theorem. In particular
we have that E{[M, M]t} < oo.
For the converse, we now assume E{[M,M]t} < oo, all t > 0. Define
stopping times by
T" = inf{* >0: \Mt\ >n}An.
Then T" increase to oo a.s. Furthermore (MT )* < n + |ДМ<гп| < n +
[M,M]n' , which is in L2. By Theorem 51 of Chap. I, MT" is a uniformly
integrable martingale for each n. Also we have that
for all t > 0. Therefore MT satisfies the hypotheses of the first half of this
theorem, and E{(MfnJ} = E{[MTn,MTn]t}. Using Doob's inequality we
have
E{(Mt\TnJ} < AE{{Mff} = T\T\
<4E{[M,M}t}.
The Monotone Convergence Theorem next gives
E{(Mt*J}= lim E{(Mt\TnJ}
n—*oo
<4E{[M,M]t} < oo.
Therefore, again by Theorem 51 of Chap. I, we conclude that M is a martin-
martingale. The preceding gives E{M?} < oo. D
For emphasis we state as another corollary a special case of Corollary 3.
Corollary 4. If M is a local martingale and E{[M, M\oo} < oo, then M is a
square integrable martingale (that is supt E{M?} = E{M%O\ < oo). Moreover
E{M?} = E{[M, M]t} for all t, 0 < t < oo.
Example. Before continuing we consider again an example of a local martin-
martingale that exhibits many of the surprising pathologies of local martingales. Let
В be a standard Brownian motion in M3 with Bo = A,1,1). Let Mt = Ц-Е^Ц,
where \\x\\ is standard Euclidean norm in K.3. (We previously considered this
example in Sect. 6 of Chap. I.) As noted in Chap. I, the process M is a contin-
continuous local martingale; hence it is a locally square integrable local martingale.
Moreover E{M2} < oo for all t. However instead of i i—> E{M2} being an
increasing function as it would if M were a martingale, lim^oo E{M2} = 0.
Moreover E{[M,M}t} > E{[M,M}0} = 1 since [M,M]t is increasing. There-
Therefore we cannot have E{M2} = E{[M,M]t} for all t. Indeed, by Corollary 3
and the preceding we see that we must have E{[M, M]t} = oo for all t > 0. In
conclusion, M = \\B\\~1 is a continuous local martingale with E{M2} < oo
for all t which is both not a true martingale and for which E{M2} < oo while
E{[M,M]t} = oo for all t > 0. (Also refer to Exercise 20 at the end of this
chapter.)

6 The Quadratic Variation of a Semimartingale 75
Corollary 5. Let X be a continuous local martingale. Then X and [X, X]
have the same intervals of constancy a.s.
Proof. Let r be a positive rational, and define
Tr = inf{* > r : Xt ф Xr}.
Then M = XTr — XT is a local martingale which is constant. Hence [M, M] =
[X, X]Tr - [X, X]r is also constant. Since this is true for any rational r a.s.,
any interval of constancy of X is also one of [X, X].
Since X is continuous, by stopping we can assume without loss of generality
that X is a bounded martingale (and hence square integrable). For every
positive, rational r we define
Sr = mf{t>r:[X,X}t>[X,X}r}.
Then
E{(XSr - Xrf} = E{Xl) - E{X2}
by Doob's Optional Sampling Theorem. Moreover
E{X2Sr} - E{X2} = E{[X,X}Sr - [X,X}r} = 0,
by Corollary 3. Therefore E{(XSr - XrJ} = 0, and XSr = Xr a.s. Moreover
this implies Xq = Xsq a.s. on {Sq = Sr} for each pair of rationale (r, q), and
therefore we deduce that any interval of constancy of [X, X] is also one of
X. О
Note that the continuity of the local martingale X is essential in Corol-
Corollary 5. Indeed, let Nt be a Poisson process, and let Mt—Nt— t. Then M is a
martingale and [M, M]t = Nt; clearly M has no intervals of constancy while
N is constant except for jumps.
Theorem 28. Let X be a quadratic pure jump semimartingale. Then for any
semimartingale Y we have
[X,Y}t = X0Y0+ ? AXSAYS.
0<s<t
Proof The Kunita-Watanabe inequality (Theorem 25) tells us d[X, Y]s is a.s.
absolutely continuous with respect to d[X, X] (path-by-path). Thus [X, X]c =
0 implies [X, Y]c = 0, and hence [X, Y] is the sum of its jumps, and the result
follows by Theorem 23. D
Theorem 29. Let X and Y be two semimartingales, and let H, К € L. Then
[H-X,K-Y]t= f HsKsd[X,Y]s
Jo
and, in particular,

76 II Semimartingales and Stochastic Integrals
[H-X,H-X]t= H2sd[X,X}s.
Jo
Proof. First assume (without loss of generality) that Xq = Yq = 0. It suffices
to establish the following result
[H-X,Y]t= [ Hsd[X,Y]s, (*)
Jo
and then apply it again, by the symmetry of the form [-,-], and by the asso-
associativity of the stochastic integral (Theorem 19).
First suppose H is the indicator of a stochastic interval. That is, H =
l[o,T]i where T is a stopping time. Establishing (*) is equivalent in this case
to snowing [XT, Y] = [X, Y]T, a result that is an obvious consequence of
Theorem 23, which approximates [X, Y] by sums.
Next suppose H = UI^st], where S, T are stopping times, S <T a.s.,
and U e Fs. Then jHsdXs'= U(XT - Xs), and by Theorem 23
= U{[XT,Y}-[XS,Y]}
= U{[X,Y}T - [X,Y]3} = jHsd[X,Y}s.
The result now follows for H € S by linearity. Finally, suppose H € L and
let Hn be a sequence in S converging in ucp to H. Let Zn = Hn-X, Z = H-X.
We know Zn, Z are all semimartingales. We have fH^d[X,Y]s = [Zn,Y],
since Hn € S, and using integration by parts
[Zn,Y] - YZn - JY..dZn - I Zn_dY
= YZn
By the definition of the stochastic integral, we know Zn —> Z in мер, and
since Hn —> H (ucp), letting n —> oo we have
lim [Zn,Y] = YZ- fy-HdX- f Z-dY
n^oo1 J J
fy^dZ- IZ^dY
= YZ-
again by integration by parts. Since limn^oo f H™d[X, Y]s = f Hsd[X,Y]s,
we have [Z, Y] = [H -X,Y}= J Hsd[X, Y)s, and the proof is complete. D
Example. Let Bt be a standard Wiener process with Bq = 0, (i.e., Brownian
motion). B% — t is a continuous martingale by Theorem 27 of Chap. I. Let
H € L be such that E{f* H^ds} < oo, each t > 0. By Theorem 28 of Chap. I

6 The Quadratic Variation of a Semimartingale 77
we have [B,B]t = t, hence [H ¦ B,H ¦ B]t = J*H^ds. By the martingale
preservation property, J HsdBs is also a continuous local martingale, with
(# • B)o = 0. By Corollary 3 to Theorem 27
f
Jo
HsdBsJ} = E{\H B,H- B}t}
o
It was this last equality,
E{(fHsdBsf} = E{fH2sds),
Jo Jo
that was crucial in K. Ito's original treatment of a stochastic integral.
Theorem 30. Let H be a cadlag, adapted process, and let X, Y be two
semimartingales. Let an be a sequence of random partitions tending to the
identity. Then
-YT")
converges in ucp to JHs^d[X,Y]s (ff0- = 0). Here an = @ < To™ < Tf <
¦¦ <Тгп <¦¦¦ <T?J.
Proof. By the definition of quadratic variation, [X, Y] = ХУ-Х_ -Y-Y--X,
where X- Y denotes the process (/0 Xs-dYs)t>o. By the associativity of the
stochastic integral (Theorem 19)
tf_ ¦ [X,Y] = Я_ • (XY) - tf_ • (X_ -Y)-H-' (У_ • X)
= tf_ • (XY) - (tf_X_) ¦ Y - (Я_У_) • X
= tf_ • (XY) - (HX)- ¦ Y - (HY)- ¦ X.
By Theorem 21 the above is the limit of
_ XT")
- YT"). D

78 II Semimartingales and Stochastic Integrals
7 Ito's Formula (Change of Variables)
Let Л be a process with continuous paths of finite variation on compacts.
If H € L we know by Theorem 17 that the stochastic integral H ¦ A agrees
a.s. with the path-by-path Lebesgue-Stieltjes integral JHsdAs. In Sect. 7 of
Chap. I (Theorem 54) we proved the change of variables formula for / ? C1,
namely
f
f(At)-f(A0)= ff'{As
Jo
)dAs
We also saw at the end of Sect. 4 of this chapter that for a standard Wiener
process В with Bo = 0,
/
Jo
* 1 1
BsdBs = -В2 - -t. (**)
Taking f(x) = x2/2, the above formula is equivalent to
f(Bt) - /(Bo) = f f'(Bs)dBs + \ f f"(Bs)ds,
Jo z Jo
which does not agree with the Lebesgue-Stieltjes change of variables formula
(*). In this section we will state and prove a change of variables formula valid
for all semimartingales.
We first mention, however, that the change of variables formula for con-
continuous Stieltjes integrals given in Theorem 54 of Chap. I has an extension
to right continuous processes of finite variation on compacts. We state this
result as a theorem but we do not prove it here because it is merely a special
case of Theorem 32, thanks to Theorem 26.
Theorem 31 (Change of Variables). Let V be an FV process with right
continuous paths, and let f be such that f exists and is continuous. Then
(f(Vt))t>0 is an FV process and
f(Vt)
- f(V0) = f !'{V.-)dVB + ? {/(Ve) - /(Ve_) - f'(Vs.)AVs}.
J°+ 0<s<t
0<s<t
Recall that the notation JQ+ = f,Q f, denotes the integral over the half
open interval @, t]. We wish to establish a formula analogous to the above,
but for the stochastic integral; that is, when the process is a semimartingale.
The formula is different in this case, as we can see by comparing equation (*)
with equation (**); we must add an extra term!
Theorem 32 (Ito's Formula). Let X be a semimartingale and let f be
a C2 real function. Then f(X) is again a semimartingale, and the following
formula holds:

7 Ito's Formula (Change of Variables) 79
f(Xt) - f(X0) = /* f(Xs.)dXs + \ f f"(Xs
Jo+ z Jo+
0+ ^ ./0 +
{f{Xs)-f{Xa-)-f'{Xe-)AXs}.
0<s<t
Proof Note that the jump part of the stochastic integral J f"(Xs-)d[X, X]s is
given by J2s<t f"(Xs-)(AXsJ, and this is a convergent series. By adding and
subtracting 1/2 of this series, we can rewrite Ito's formula in the equivalent
form
f(Xt)-f(X0)= f f'{Xs_)dXs + \ f f"(Xs_)d[X,X]s
+ ? {f(Xs)~ f(Xs.)- f'{Xs-)AXs-\f"{Xs-){AXsJ}
0<s<t
which is perhaps less obviously a generalization of the "classical" case, but
notationally simpler to prove. The proof rests, of course, on Taylor's Theorem
which says
fiv) - fix) = f(x)(y -x) + \f"{x){y - xJ + R(x,y)
where \R(x,y)\ < r(\y — x\)(y — xJ, such that r : R+ —> R+ is an increasing
function with limujo f(u) = 0, which is valid for f ? C2 defined on a compact
set. We now prove separately the continuous case and the general case.
Proof for the continuous case. We first restrict our attention to a continuous
semimartingale X, since the proof is less complicated but nevertheless gives
the basic idea. Without loss of generality we can take Xo = 0. Define stopping
times
Rm = inf{i : \Xt\ > m}.
Then the stopped process XRm is bounded by m, and if Ito's formula is valid
for XRm for each ra, it is valid for X as well. Therefore we assume that X
takes its values in a compact set. We fix a t > 0, and let an be a refining
sequence of random partitions5 of [0, t] tending to the identity [an = @ =
To" < T2n < ¦ • • < T? = t)}. Then
f(Xt) - /(Xo) = E {/ (X^+1) " / (Х?Г)} (* * *)
5 Note that it would suffice for this proof to restrict attention to deterministic
partitions.

80 II Semimartingales and Stochastic Integrals
The first sum converges in probability to the stochastic integral JQ f'(Xs-)dXs
by Theorem 21; the second sum converges to \ /0 f"(Xs)d[X, X]s in probabil-
probability by Theorem 30. It remains to consider the third sum ^Tli R (Xyn, X^v- J •
But this sum is majorized, in absolute value, by
supr(|XTn+i - XTr\){^2(XTr+i -XTrJ},
T+i
and since J2i(Xr?+1 — ХтрJ converges in probability to [X,X]t (Theo-
(Theorem 22), the last term will tend to 0 if limn^oosupir(\X'X'r' t — Xtp\) = 0.
However s i—» Xs(w) is a continuous function on [0,t], each fixed ш, and
hence uniformly continuous. Since limn^ooSup, \T™+1 — T"| = 0 by hypoth-
hypothesis, we have the result. Thus, in the continuous case, f(Xt) — f(Xo) =
/0* f'(Xs-)dXs + \ /0* f"{Xs-)d[X, Xs], for each t, a.s. The continuity of the
paths then permits us to remove the dependence of the null set on (, giving
the complete result in the continuous case.
Proof for the general case. X is now given as a right continuous semimartin-
gale. Once again we have a representation as in (***), but we need a closer
analysis. For any t > 0 we have ^20<s<t(AXsJ < [X,X]t < oo a.s., hence
^0<s<((AIsJ is convergent. Given e > 0 and t > 0, let A = A{e,t) be
a set of jump times of X that has a.s. a finite number of times s, and let
В = B(e,t) be such that Х^ев(А^J < ?2> where A and В are disjoint and
Аи В exhaust the jump times of X on @, t]. We write
f(Xt) - f(X0) =
= E {/(%.) - явд}+E {
i,A i,B
where ?\>A denotes ?\ 1{Ап(тр,щг№}- Then
{ }
j,A s€A
and by Taylor's formula
- XTrJ}

7 Ito's Formula (Change of Variables) 81
As in the continuous case, the first two sums on the right side above converge
J*+f'(Xs-)dXs and \ /0*+
ous case, the fist to sus o
respectively to J*+f'(Xs-)dXs and \ /0*+ f"(Xs^)d[X,X}s. The third sum
converges to
]T !f'{X3.)AX3
Assume temporarily that \XS\ < k, some constant k, all s < t. Then /" is
uniformly continuous, and using the right continuity of X we have
,ХТп+1) < r(e+)[X,X]t,
where r(e+) is Iimsup5|?r(E). Next let e tend to 0. Then r(e+)[X,X]t tends
to 0, and
seA(e,t)
\f(Xs) - f(Xt.) - f(Xs_)AXs - У"(
tends to the series in (* * *), provided this series is absolutely convergent.
Let Vk = inf{t > 0 : \Xt\ > к}, with Xo = 0. By first establishing (***) for
X\\pyk), which is a semimartingale since it is the product of two semimartin-
gales (Corollary 2 of Theorem 22), it suffices to consider semimartingales
taking their values in intervals of the form [—k, k]. For / restricted to [—fc, k]
we have \f(y) - f(x) - (y - x)f'(x)\ < C(y - xf. Then
\f(Xs)-f(Xs_)-f'(Xs_)AXs\<C
0<s<t 0<s<t
< <2 < K[X,X)t < oo a.s.
Thus the series is absolutely convergent and this completes the proof. D
Corollary (Ito's Formula). Let X be a continuous semimartingale and let
/ be a C2 real function. Then f(X) is again a semimartingale and the following
formula holds:
f(Xt) - /(Xo) = f f'(Xs)dXs + \ f f"(Xs)d[X,X]s.
Jo+ z Jo+
Theorem 32 has a multi-dimensional analog. We omit the proof.
Theorem 33. Let X = (X1,..., Xn) be an n-tuple of semimartingales, and
let f : Rn —> M. have continuous second order partial derivatives. Then f(X)
is a semimartingale and the following formula holds:

82 II Semimartingales and Stochastic Integrals
f(Xt) - f(X0)
0<s<t i=l '
The stochastic integral calculus, as revealed by Theorems 32 and 33, is
different from the classical Lebesgue-Stieltjes calculus. By restricting the class
of integrands to semimartingales made left continuous (instead of L), one can
define a stochastic integral that obeys the traditional rules of the Lebesgue-
Stieltjes calculus.
Definition. Let X, Y be semimartingales. Define the Fisk-Stratonovich
integral of Y with respect to X, denoted /0 Vs_ о dXs, by
[ Ув_ о dXs = / Ys.dXs + \[Y,X)ct.
Jo Jo l
The Fisk-Stratonovich integral is often referred to as simply the Stratono-
vich integral. The notation "o" is called Ito's circle. Note that we have
defined the Fisk-Stratonovich integral in terms of the semimartingale integral.
With some work one can slightly enlarge the domain of the definition and we
do so in Sect. 5 of Chap. V. In particular, Theorem 34 below is proved with
the weaker hypothesis that / G C2 (Theorem 20 of Chap. V). We will write
the F-S integral as an abbreviation for the Fisk-Stratonovich integral.
Theorem 34. Let X be a semimartingale and let f be C3. Then
f(Xt)-f(X0) = f f'(Xs.)odXs+ J2 if(Xs) - f(Xs-) - f'(Xs.)AXs}.
J0+ ns.<+
0<s<t
Proof. Note that /' is C2, so that f'{X) is a semimartingale by Theorem 32
and in the domain of the F-S integral. By Theorem 32 and the definition, it
suffices to establish \\f'(X),X}c = \ \l f"(XsJ)d{X,X\cs. However
f'(Xt)-f'(X0)= f f"(Xs.)dXs + l f
0+ ^ ./0+
У] {/'(*.) - f'(Xs-) - f"(Xs..)AXs}.
0<s<t
Thus
[f'{X),X}c = [f"(X-) -X,X}C + [i/<3>(X_) • [X,X},X}C.
The first term on the right side above is /„* f"{Xs.)d[X,X]cs by Theorem 29;
the second term can easily be seen, as a consequence of Theorem 22 and the

7 Ito's Formula (Change of Variables) 83
fact that [X, X] has paths of finite variation, to be ?0<s<t f{3){Xs-){^Xsf.
That is, zero, and the theorem is proved. D
Note that if X is a semimartingale with continuous paths, then Theorem 34
reduces to the classical Riemann-Stieltjes formula f(Xt)—f(Xo) = /0 f'(Xs)o
dXs. This is, of course, the main attraction of the Fisk-Stratonovich integral.
Corollary (Integration by Parts). Let X, Y be semimartingales, with at
least one of X and Y continuous. Then
XtYt - X0Y0 = f Xs- о dYs + [ Fs_ о dXs.
J Jo
0Y0 = f
Jo
Proof. The standard integration by parts formula is
XtYt=
o
However [X, Y}t = [X, Y}$ + X0Y0 if one of X or Y is continuous. Thus adding
\\X, Y]^ to each integral on the right side yields the result. ?
One can extend the stochastic calculus to complex-valued semimartingales.
Theorem 35. Let X, Y be continuous semimartingales, let Zt = Xt + iYt,
and let f be analytic. Then
f(Zt)=f(Z0)+ f f'{Zs)dZs + \f f"(Zs)d[Z,Z]s.
Jo+ z Jo+
Proof. Using Ito's formula for the real and imaginary parts of / yields
Since / is analytic by the Cauchy-Riemann equations ff = /' and |^ =
if. Differentiating again gives 0 = % = /" and ?^ = if" and 0 =
—/". The result now follows by collecting terms and observing that dZs =
dXs + idYs and [Z, Z] = [X + iY, X +iY} = [X, X] + 2i[X, Y] - [Y, Y]. О
Theorem 35 also has a version for Zt = Xt+iYt, with both X and Y cadlag
semimartingales. The proof is almost the same as the proof of Theorem 35.

84 II Semimartingales and Stochastic Integrals
Theorem 36. Let X, Y be cadlag semimartingales, let Zt = Xt + iYt, and
let f be analytic. Then
+ f f(Zs.)dZs + l [ f\ZsJ)d\Z,Z\\
Observe that for a complex-valued semimartingale the process [Z, Z] is in
general a complex-valued process. For many applications, it is more appropri-
appropriate to use the non-negative increasing process
[Z,Z} = {X,X} + {Y,Y]
to play the role of the quadratic variation.
8 Applications of Ito's Formula
As an application of the change of variables formula, we investigate a simple,
yet important and non-trivial, stochastic differential equation. We treat it, of
course, in integral form.
Theorem 37. Let X be a semimartingale, Xq = 0. Then there exists a
(unique) semimartingale Z that satisfies the equation Zt = 1 + JQ Zs_dXs. Z
is given by
Zt=exj>(xt-hx,X]t\ П (
^ ' 0<s<t
where the infinite product converges.
Proof. We will not prove the uniqueness here since it is a trivial consequence of
the general theory to be established in Chap. V. (For example, see Theorem 7
of Chap. V.) Note that the formula for Zt is equivalent to the formula
Zt = exp \xt - 1[Х,Х]Л ТТA + AXs)exp{-AXs}.
Since Xt — \[X,X]t is a semimartingale, and ex is C2, we need only show
that Ils<t(l + AXs)exp{-AXs} is cadlag, adapted, and of finite variation.
It will then be a semimartingale too, and thus Z will be a semimartingale.
The product is clearly cadlag, adapted; it thus suffices to show the product
converges and is of finite variation.
Since X has cadlag paths, there are only a finite number of s such that
|AXsj > 1/2 on each compact interval (fixed w). Thus it suffices to show

8 Applications of Ito's Formula 85
Vt = П A
0<s<t
converges and is of finite variation. Let Us = AXs1{|axs|<i/2}- Then we have
log Vt — X^s<t{'°e(^ +^*) ~~ ^*}> which is an absolutely convergent series a.s.,
since So<s«(^*J — Р^-^Ь < °° a-s-' because | log(l + x) - x\ < x2 when
|ж| < 1/2. Thus log(Vt) is a process with paths of finite variation, and hence
so also is expjlogVj} = Vj.
To show that Z is a solution, we set Kt = Xt — | [X, X]%, and let f(x, y) =
yex. Then Z* = f(Kt, St), where 5t = По<*<*A + Ax*) exp{-AXs}. By the
change of variables formula we have
Zt-1= f Zs_dKs+ f eK°-dSs + \ f Zs-d[K,K]cs (*)
./0+ Jo+ ^ ^o+
0<s<t
t 1 ft Л
= / Zs-dXs-\ [ Zs_d[X,X]cs+ f eK^
Jo+ *¦ Jo+ Jo+
+ \ f Za-d[X,X)ce+ Y. (Zs-Zs--Z)
2 J°+ 0<s<t
since [K, S]c — [S, S]c = 0. Note that S is a pure jump process. Hence
/0*+ eK-dSs = J20<s<teK>-ASs. Also Zs = Ze_(l + AXS), and ZS_
AX, so the last sum on the right side of equation (*) becomes
-eK-
0<s<t 0<s<t
Thus equation (*) simplifies due to cancellation to Zt — 1 = Jo Zs-dXs, and
we have the result. ?
Definition. For a semimartingale X, Xo = 0, the stochastic exponential
of X, written ?(X), is the (unique) semimartingale Z that is a solution of
The stochastic exponential is also known as the Doleans-Dade expo-
exponential. Theorem 37 gives a general formula for ?{X). This formula simplifies
considerably when X is continuous. Indeed, let X be a continuous semimartin-
semimartingale with Xo = 0. Then
?{X)t=exp{Xt-±[X,X]t}.
An important special case is when the semimartingale X is a multiple Л of a
standard Brownian motion В = (Bt)t>0. Since AB has no jumps we have

86 II Semimartingales and Stochastic Integrals
?(XB)t = exp{ABt - y[B,B]t} = exp{ABt - y*
Moreover, since ?(XB)t = 1 + XjQ?(XB)s-dBs we see that ?(XB)t =
eXBt--2-t js a continuous martingale. The process ?{XB) is sometimes referred
to as geometric Brownian motion. Note that the previous theorem gives us
?{X) in closed form. We also have the following pretty result.
Theorem 38. Let X and Y he two semimartingales with Xq = Yq = 0. Then
?{X)?(Y) = ?{X + Y + [X,Y]).
Proof. Let Ut = ?{X)t and Vt = ?(Y)t. Then the integration by parts formula
gives that UtVt - 1 = /0* Us_dVs + /0*+ Vs_dUs + [U, V)t. Since U and V are
exponentials, this is equivalent to
/ U8-V.-dYa+ [ Us.Vs.dXs+ [ Us-V
o+ Jo+ Jo+
Letting Wt = UtVt, we deduce that Wt = 1 + /0* Ws-d(X + Y+ [X,Y])S so
0*
that W = ?{X +Y+[X,Y\), which was to be shown. ?
Corollary. Let X be a continuous semimartingale, Xo = 0. Then ?(X)~1 =
Proof. By Theorem 38,
?{X)?(-X + [X, X}) = ?{X + {-X + [X, X}) + [X, -X}),
since [—X, [X, X]] = 0. However ?@) = 1, and we are done. D
In Sect. 9 of Chap. V we consider general linear equations. In particular,
we obtain an explicit formula for the solution of the equation
./o
Xt=Ht+ XsdZs,
Jo
where Z is a continuous semimartingale. We also consider more general in-
inverses of stochastic exponentials. See, for example, Theorem 63 of Chap. V.
Another application of the change of variables theorem (and indeed of
the stochastic exponential) is a proof of Levy's characterization of Brownian
motion in terms of its quadratic variation.
Theorem 39 (Levy's Theorem). A stochastic process X — (^t)t>o *s a
standard Brownian motion if and only if it is a continuous local martingale
with [X,X]t = t.

8 Applications of Ito's Formula 87
Proof. We have already observed that a Brownian motion В is a continuous
local martingale and that [B,B]t = t (see the remark following Theorem 22).
Thus it remains to show sufficiency. Fix tieR and set F(x, t) — ехр{шх +
Ц^-t}. Let Zt = F(Xt,t) = exp{iuXt + (^-) t}. Since F e C2 we can apply
Ito's formula (Theorem 33) to obtain
/"* и2 Гг и2 /"*
/ ZsdXs + \ / Zads-\ / Zsd{X,X}s
Jo l Jo l Jo
i:
= 1 +ra / ZsdX
su,^i.s,
which is the exponential equation. Since X is a continuous local martingale,
we now have that Z is also one (complex-valued, of course) by the martingale
preservation property. Moreover stopping Z at a fixed time to, Z*°, we have
that Zta is bounded and hence a martingale. It then follows for 0 < s < t that
E{exp{iu(Xt - X3)}\F.} = exp |~(t -
Since this holds for any и G M we conclude that Xt - Xs is independent of
Ts and that it is normally distributed with mean zero and variance (t — s).
Therefore X is a Brownian motion. D
Observe that if M and N are two continuous martingales such that MN
is a martingale, then [M, N] — 0 by Corollary 2 of Theorem 27. Therefore if
Bt = (Bl,... ,-B") is an n-dimensional standard Brownian motion, B\B\ is
a martingale for i ^ j, and we have that
Theorem 39 then has a multi-dimensional version, which has an equally simple
proof.
Theorem 40 (Levy's Theorem: Multi-dimensional Version). LefX. =
(X1,... ,Xn) be continuous local martingales such that
Then X is a standard n-dimensional Brownian motion.
As another application of Ito's formula, we exhibit the relationship between
harmonic and subharmonic functions and martingales.
Theorem 41. Let X = (X1,... ,Xn) be an n-dimensional continuous local
martingale with values in an open subset D o/M™. Suppose that [Xl,Xi] = 0
if i Ф 3i and [Хг,Хг] = A, 1 < г < п. Let и : D —> R be harmonic (resp.
subharmonic). Then u(X) is a local martingale (resp. submartingale).

88 II Semimartingales and Stochastic Integrals
Proof. By Ito's formula (Theorem 33) we have
u(Xt) - u(Xq) = f Vw(Xs) • dXs + i f Au(Xs)dA
Jo+ * Jo+
f f
o+ * Jo+
where the "dot" denotes Euclidean inner product, V denotes the gradient,
and Д denotes the Laplacian. If и is harmonic (subharmonic), then Аи =
0(Au > 0) and the result follows. D
If В is a standard n-dimensional Brownian motion, then В satisfies the
hypotheses of Theorem 41 with the process At = t. That u(Bt) is a submartin-
submartingale (resp. supermartingale) when и is subharmonic (resp. superharmonic) is
the motivation for the terminology submartingale and supermartingale. The
relationship between stochastic calculus and potential theory suggested by
Theorem 41 has proven fruitful (see, for example, Doob [56]).
Levy's characterization of Brownian motion (Theorem 39) allows us to
prove a useful change of time result.
Theorem 42. LetM = (Mt)t>o be a continuous local martingale with Mo — 0
and such that Нгщ-юо \M, M]t = oo a.s. Let
Ts =inf{i>0: [M,M]t>s}.
Define Qs = Ттв and Bs = Мтв. Then (Bs,Qs)s>0 is a standard Brownian
motion. Moreover ([M,M]t)t>o are stopping times for (Qs)s>o and
Mt = B[M,M]t a-s- 0 < i < oo.
That is, M can he represented as a time change of a Brownian motion.
Proof. The (Ts)s>o are stopping times by Theorem 3 of Chap. I. Each Ts
is finite a.s. by the hypothesis that limt_,.oo [M, M]t = oo a.s. Therefore the
<t-fields Qs = Ft, are well-defined. The filtration {Qs)s>o need not be right
continuous, but one can take 7is = Gs+ = ^тв+ to obtain one. Note further
that {[M,M]t < s} = {Ts > t}, hence ([M,M]t)t>o are stopping times for
the filtration Q = (Gs)s>o-
By Corollary 3 of Theorem 27 we have E{MTs} = E{[M, M\Ts} = s < oo,
since [M, M]tb = s identically because [M, M] is continuous. Thus the time
changed process is square integrable. Moreover
E{BU - Bs\gs} = E{MTu - MT. \Ft,} = 0
by the Optional Sampling Theorem. Also
e{bI- в1\да) = e{(bu- B,J\g.}
= E{{MTu -MTaf\FTa}
= E{{M, M)Tu - [M,M}t, \Ft,}
= и — s.

8 Applications of Ito's Formula 89
Therefore .B2 — s is a martingale, whence [B, B]s = s provided В has contin-
continuous paths, by Corollary 2 to Theorem 27.
We want to show that Bs = Mts has continuous paths. However by Corol-
Corollary 5 of Theorem 27 almost surely all intervals of constancy of [M, M\ are also
intervals of constancy of M. It follows easily that В is continuous. It remains to
show that Mt = В[М<щ±. Since Bs = Mts, we have that В[М<щг = Mt[MiMU,
a.s. Since (Ts)s>o is the right continuous inverse of [M,M], we have that
T[m,m\i — *> with equality holding if and only if t is a point of right in-
increase of [M,M]. (If (Ts)s>0 were continuous, then we would always have
that T[M,M]t = t.) However TjM>M]t > t implies that t >-> [M,M]t is constant
on the interval (?, TjMiM]J; thus by Corollary 5 of Theorem 27 we conclude
M is constant on (?,TjMjM]t). Therefore B[M,M]t = Мт[м M]t = Mt a.s., and
we are done. D
Another application of the change of variables formula is the determination
of the distribution of Levy's stochastic area process. Let Bt = (Xt,Yt) be an
M2-valued Brownian motion with (Xq,Yq) = @,0). Then during the times s
to s + ds the chord from the origin to В sweeps out a triangular region of area
^RsdNs, where
R3 = y/X* + У82
and
dNs = -^-dXs + ф-dY,.
Therefore the integral At = /0 RsdNs = /0 (—YsdXs+XsdYs) is equal to twice
the area swept out from time 0 until time t. Paul Levy found the characteristic
function of At and therefore determined its distribution. Theorem 43 is known
as Levy's stochastic area formula.
Theorem 43. Let Bt = (Xt,Yt) be an M?-valued Brownian motion, Bo =
@,0),u6R. Let At = /0* XsdYs - /0* YsdXs. Then
E{eluAt} = \—, 0<i<oo,-oo<u<oo.
cosh(ui)
Proof. Let a(t), C(t) be C1 functions, and set
Then
dVt — iudAf —(X? + Y?)dt
- a{t){XtdXt + YtdYt + dt} + C'(t)dt
= (-iuYt - a(t)Xt)dXt + (iuXt - a{t)Yt)dYt
- l-dt{a'{t)X2t + a'(t)Yt2 + 2a(t) -

90 II Semimartingales and Stochastic Integrals
Next observe that, from the above calculation,
d[V, V]t = (-iuYt - a{t)XtJdt + (iuXt - a(t)YtJdt
= (a(tJ - u2)(X2 + Y2)dt.
Using the change of variables formula and the preceding calculations
dev'=ev<(dVt+±d[V,V}t)
= eVt(-iuYt - a(t)Xt)dXt + eVt(iuXt - a(t)Yt)dYt
+ -eVtdt{(a(tJ - u2 - a'(t))(X2 + Y2) + 2C'(t) - 2a(t)}.
2*
Therefore eVt is a local martingale provided
a'{t) = a{tf - u2
13'(t) = a(t).
Next we fix t0 > 0 and solve the above ordinary differential equations with
a(t0) = /3(t0) = 0. The solution is
a(t) = и tanh(u(i0 - t))
/3(t) = -lQgcoeh(u(i0-t)),
where tanh and cosh are hyperbolic tangent and hyperbolic cosine, respec-
respectively. Note that for 0 < t < t0,
\eVt\ = exp{—a(t)
Thus eVt, 0 < t < to is bounded and is therefore a true martingale, not just a
local martingale, by Theorem 51 of Chap. I. Therefore,
However, Vt0 = iuAt0 since a(to) = /3(io) = 0. As Ao — Xq — Yq = 0, it
follows that Vo = —log cosh(ui0)- We conclude that
E{eiuAt°} = exp{-log cosh(ui0)}
1
cosh(uio)'
and the proof is complete. ?
There are of course other proofs of Levy's stochastic area formula (e.g.,
Yor [246], or Levy [150]). As a corollary to Theorem 43 we obtain the density
for the distribution of At.

8 Applications of Ito's Formula 91
Corollary. Let Bt = (Xt, Yt) be an M2-valued Brownian motion, Bo = @, 0),
and set At = /0 XsdYs — JQ YadXs. Then the density function for the distri-
distribution of At is
2t coshGrx/2i)'
-00 < X < 00.
Proof. By Theorem 43, the Fourier transform (or characteristic function) of
At is E{eiuAt} = cosh'(Mt). Thus we need only to calculate ? / ±^du. The
z0 = Щ_,
Next we
given by
as shown
1 1O Ul tilt
we have
integrate
* 1U11
1 J
Res(/, z0) -
along the closed
Cr- \
in Fig. 1
2/ = 0,
X — Г,
7Г
У=1'
x= -r,
below.
X
c$
= —r
coshBt) • O1
РЫ
Q'(zo)
curve Cr
-r <
0 <y
r > x
IT
net
e-KT/2t
it
traversed
x < r,
IT
1
> -r,
>o,
a;
у
= r
^;uoll^^>^ над a, puic at
counter clockwise, and
cr
c2
cr
cr
Fig. 1. The closed curve of integration Cr = C^ + C^ + C* + C*.
Therefore Jc f(z)dz — 2ni Res(/, Zq). Along C% the integral is
— / —— du =
Jr cosh(ui)
l_r cosh(ui)
The integrands on C2 and Cf are dominated by 2e~r, and therefore

92 II Semimartingales and Stochastic Integrals
lim / f(z)dz = A + era/t) /
•^°° Jcr J-
= 2m Res(/,
.-«
lim / f(z)dz = A + e™/*) / .. ..du
J J cosh(ui)
-2тг-
Finally we can conclude
2ttJ_
g — ШХ J
A11
Loo cosh(ui) 2тг
2t cosh(^f)'
П
The stochastic area process A shares some of the properties of Brownian
motion, as is seen by recalling that At = /0 RsdNs, where TV is a Brownian
motion by Levy's Theorem (Theorem 39), and N and R are independent (this
must be proven, of course). For example A satisfies a reflection principle. If
one changes the sign of the increments of A after a stopping time, the process
obtained thereby has the same distribution as that of A. One can use this
fact to show, for example, that if St = supo<s<tAs, then St has the same
distribution as \At\, for t > 0.
Bibliographic Notes
The definition of semimartingale and the treatment of stochastic integration
as a Riemann-type limit of sums is in essence new. It has its origins in the
fundamental theorem of Bichteler [13, 14], and Dellacherie [42]. The peda-
pedagogic approach used here was first suggested by Meyer [176], and it was then
outlined by Dellacherie [42]. Dellacherie's outline was further expanded by
Lenglart [145] and Protter [201, 202]. These ideas were also present in the
work of M. Metivier and J. Pellaumail, and to a lesser extent Kussmaul, al-
although this was not appreciated at the time by us. See for example [160], [162],
and [138]. A similar idea was developed by Letta [148]. A recent treatment is
in Bichteler [15].
We will not attempt to give a comprehensive history of stochastic inte-
integration here, but rather just a sketch. The important early work was that of
Wiener [228, 229], and then, of course, Ito [93, 94, 95, 97, 98]. However, it
has recently been discovered, by B. Bru and M. Yor, that the young French
mathematician W. Doeblin6 had essentially established Ito's formula in early
6 Wolfgang Doeblin (Doblin) was born in 1915 in Berlin, and after acquiring French
citizenship in 1936 was known as Vincent Dobbin. He was killed in June 1940.

Bibliographic Notes 93
1940, but was killed later that year in action in the Second World War, and
thus his results were lost until recently [26, 27]. Doob stressed the martingale
nature of the Ito integral in his book [55] and proposed a general martingale
integral. Doob's proposed development depended on a decomposition theo-
theorem (the Doob-Meyer decomposition, Theorem 8 of Chap. Ill) which did not
yet exist. Meyer proved this decomposition theorem in [163, 164], and com-
commented that a theory of stochastic integration was now possible. This was
begun by Courrege [36], and extended by Kunita and Watanabe [134], who
revealed an elegant structure of square integrable martingales and established
a general change of variables formula. Meyer [166, 167, 168, 169] extended
Kunita and Watanabe's work, realizing that the restriction of integrands to
predictable processes is essential. He also extended the integrals to local mar-
martingales, which had been introduced earlier by Ito and Watanabe [102]. Up
to this point, stochastic integration was tied indirectly to Markov processes,
by the assumption that the underlying filtration of cr-algebras be "quasi-left
continuous." This hypothesis was removed by Doleans-Dade and Meyer [53],
thereby making stochastic integration a purely martingale theory. It was also
in this article that semimartingales were first proposed in the form we refer
to as classical semimartingales in Chap. III.
A different theory of stochastic integration was developed independently
by McShane [154, 155], which was close in spirit to the approach given here.
However it was technically complicated and not very general. It was shown in
Protter [200] (building on the work of Pop-Stojanovic [194]) that the theory
of McShane could for practical purposes be viewed as a special case of the
semimartingale theory.
The subject of stochastic integration essentially lay dormant for six years
until Meyer [171] published a seminal "course" on stochastic integration. It
was here that the importance of semimartingales was made clear, but it was
not until the late 1970's that the theorem of Bichteler [13, 14], and Del-
lacherie [42] gave an a posteriori justification of semimartingales. The seem-
seemingly ad hoc definition of a semimartingale as a process having a decomposi-
decomposition into the sum of a local martingale and an FV process was shown to be
the most general reasonable stochastic differential possible. (See also Kuss-
maul [138] in this regard, and the bibliographic notes in Chap. III.)
Most of the results of this chapter can be found in Meyer [171], though they
are proven for classical semimartingales and hence of necessity the proofs are
much more complicated. Theorem 4 (Strieker's Theorem) is (of course) due to
Strieker [218]; see also Meyer [173]. Theorem 5 is due to Meyer [175]. There
are many other methods of expanding a filtration and still preserving the
semimartingale property. For further details, see Chap. VI on expansion of
filtrations.
Theorem 14 is originally due to Lenglart [142]. Theorem 16 is known to
be true only in the case of integrands in L. The local behavior of the integral
(Theorems 17 and 18) is due to Meyer [171] (see also McShane [154]). The a.s.

94 Exercises for Chapter II
Kunita-Watanabe inequality, Theorem 25, is due to Meyer [171], while the ex-
expected version (the corollary to Theorem 25) is due to Kunita-Watanabe [134].
That continuous martingales have paths of infinite variation or are con-
constant a.s. was first published by Fisk [73] (Corollary 1 of Theorem 27). The
proof given here of Corollary 4 of Theorem 27 (that a continuous local martin-
martingale X and its quadratic variation [X, X] have the same intervals of constancy)
is due to Maisonneuve [151]. The proof of Ito's formula (Theorem 32) is by
now classic; however we benefited from Follmer's presentation of it [75]. A
popular alternative proof which basically bootstraps up form the formula for
integration by parts, can be found (for example) on pages 57-58 of J. Jacod
and A. N. Shiryaev [110].
The Fisk-Stratonovich integral was developed independently by Fisk [73]
and Stratonovich [217], and it was extended to general semimartingales by
Meyer [171]. Theorem 35 is inspired by the work of Getoor and Sharpe [215].
The stochastic exponential of Theorem 37 is due to Doleans-Dade [49]. It
has become extraordinarily important. See, for example, Jacod-Shiryaev [110].
The pretty formula of Theorem 38 is due to Yor [237]. Exponentials have of
course a long history in analysis. For an insightful discussion of exponentials
see Gill-Johansen [82]. That every continuous local martingale is the time
change of a Brownian motion is originally due to Dubins-Schwarz [59] and
Dambis [37]. The proof of Levy's stochastic area formula (Theorem 43) is
new and is due to Janson. See Janson-Wichura [113] for related results. The
original result is in Levy [150], and another proof can be found in Yor [246].
Exercises for Chapter II
Exercise 1. Let В be standard Brownian motion and let / be a function
mapping M —> M which is continuous except for one point where there is a
jump discontinuity. Show that Xt = f(Bt) cannot be a semimartingale.
Exercise 2. Let Q « P (Q and P are both probability measures and Q
is absolutely continuous with respect to P). Show that if Xn —> X in im-
improbability, then Xn —> X in Q-probability.
Exercise 3. Let (X, Y) be standard two dimensional Brownian motion. (That
is, X and Y are independent one dimensional standard Brownian motions.)
Let 0 < a < 1. Set
Bt = aXt + \/l - a2Yt.
Show that В is a standard one dimensional Brownian motion. Compute [X, B]
and [Y,B].
Exercise 4. Let / : M+ —> K+ be non-decreasing and continuous. Show that
there exists a continuous martingale M such that [M, M]t = /(?)¦
Exercise 5. Let В be standard Brownian motion and let H € L be such that
Ht\ = 1, all t. Let Mt = Jo HsdBs. Show that M is also a Brownian motion.

Exercises for Chapter II 95
Exercise 6. Give an example of semimartingales Xn, X, Y such that
limn-HXjX™ = Xt a.s., all t, but limn_>oo[Xn,F]i ф [X,Y\t a.s., for some
t.
Exercise 7. Let Y, Z be semimartingales, and let Hn, H e L. Suppose
Нт„^оо Я" = Я in ucp. Let X" = / H?dYs and X = / HsdYs. Show that
UmT,_>oo[A-n,Z]t = [X,Z]t in ucp, all i.
Exercise 8. Suppose / and all fn are C2, and that fn and /^ converge
uniformly on compacts to / and /', respectively. Let X, Y be continuous
semimartingales and show that Шпп_>оо[/п(.Х'),У] = [f(X),Y].
Exercise 9. Let M be a continuous local martingale and let A be a continu-
continuous, adapted process with paths of finite variation on compact time sets, with
Ao = 0. Let X = M + A. Show that [X,X] = [M, M) a.s.
Exercise 10. Let X be a semimartingale and let Q be another probability
law, with Q « P. Let [X, X]p denote the quadratic variation of X considered
as a P semimartingale. Show that \X,X}Q = [X,X]P, Q-a.s.
Exercise 11. Let В be standard Brownian motion, and let T = inf{i > 0 :
Bt = lorBt = -2}.
(a) Show that T is a stopping time.
(b) Let M = BtAr and let N = —M. Show that M and N are continuous
martingales and that [M,M]t = [N,N]t = t AT. Show that nevertheless
M and N have different distributions.
Exercise 12. Let X be a semimartinagle such that Ylo<s<t l^-^-sl < °° a-s-'
each t > 0. Let f e C2 and show that J2o<s<t \&f(Xs)\ < oo a.s., each t > 0
as well.
Exercise 13. Let X be a semimartingale and suppose Hn and H are in L.
Further suppose that Hn -> H in ucp. Show that f H?dXs -+ / HsdXs in
ucp.
* Exercise 14. Let X be a semimartingale, and let A be a non-negative,
continuous, increasing process with А^, = oo a.s. Suppose Нт^-юо /0 A +
As)~ldXs exists and is finite a.s. Show that lim^oo ^ = 0 a.s.
Exercise 15. Let M be a continuous local martingale and assume [M, M}^, =
oo a.s. Show that limj-K» [д/мЬ = ^ a's'
Exercise 16. Let В be a standard Brownian motion and let H be adapted
and continuous. Show that for fixed i,
lim -= 5- / HsdBs = Hu
h-*0 Bt+h — Bt Jt
with convergence in probability.

96 Exercises for Chapter II
Exercise 17. Let Y be a continuous, adapted process with Yq = О and with
Y constant on [l,oo). Let
t
^ Ysdsl{t>x).
n
Show that Xn is a semimartingale, each n, but that linin^oo Xn = Y need
not be a semimartingale.
Exercise 18. (Continuation of Exercise 17.) Let В be a standard Brownian
motion and let Xn = n Jt_x Bsdsl^t>x}- Solve
t = s s ' 0 = '
and show that Нт„_юо X™ = Bt a.s., each t, but that linin^oo Zn ф Z, where
Z solves the equation dZt — ZtdBt.
Exercise 19. (Related to Exercises 17 and 18.) Let A? = ? sin(ni), 0 < t <
ж/2. Show that An is a semimartingale for each n, and that /0T \dA™\ = 1,
each n (this means the total variation process, path-by-path, of A), but that
lim sup \At\ =0.
*Exercise 20. (This exercise refers to an example briefly considered in Sect. 6
of Chap. I on local martingles, and again in Sect. 6 of Chap. II.) Let В be
a standard Brownian motion in M3, and let и : К3 \ {0} —> R be given by
u(y) — \\y\\~1 ¦ Assume Bq = x a.s., where x ф 0, show the following.
(a) Mt = u(Bt) is a local martingale.
(b) limt^ooEx{u(Bt))=0.
(c) u{Bt) e L2, each t > 0.
(d) supt>0E*(u(BtJ)<oo.
Conclude that u(B) is not a martingale.
Exercise 21. Let А, С be two non-decreasing cadlag processes with Aq =
Co = 0. Assume that both A^ = lim^oo At and C^ = Цщ^а, Ct are finite.
(a) Show that ^ooCoo = /0°°(^oo - A,)dC, + ^(C» - Cs-)dAs.
(b) Deduce from part (a) the general formula
/•OO POO
= / AsdCs + / Cs_dAs.
Jo Jo
Note that the above formulae are not symmetric. See also Theorem 3 in
Chap. VI.

Exercises for Chapter II 97
Exercise 22. Let A be non-decreasing and continuous, and assume A^ =
lim^oo At is finite, Ao — 0. Show that for integer p > 0,
ЛОО />0O ЛОО
(Aoo)"=p\ / dASl / dAS2-- dASp.
Jo Jsi Jsp-i
*Exercise 23 (expansion of nitrations). Let Б be a standard Brownian
motion on a filtered probability space (O, J7, F, P), and define a new filtration
G by S° = Tt V a{Bx) and Qt = f]u>t G°.
(a) Show that E{Bt - BS\QS} = |5f (Bi - Bs) for 0 < s < t < 1.
(b) Show that
' Вл-Вч ,
I
Jo
1-
where /? is a G-Brownian motion. (Note in particular that the process
(Bt - Jo Bilf"ds)o<t<i is a G martingale on [0,1).)
(c) Show that
f1 Bi-B8l,
-\ds < oo a.s.
/
./o
1-s
and conclude that В is a G semimartingale.
This is a special case of Theorem 3 in Chap. VI.
Exercise 24. With the notation of Exercise 23, let Xt = Bt -?Bi, 0 < t < 1.
(a) Show that
) f
Jo
f ^ps
Jo *¦ ~ s
(b) Deduce from (a) that X satisfies the stochastic differential equation
Xt=b+
Уо 1 ~s
(X is called a Brownian bridge from 0 to 0.)
*Exercise 25. Let {Bt,Tt)t>o be a standard Brownian motion. Let a,b > 0
and T = inf{t > 0 : Bt e (-a,bH}.
(a) Show that
is a local martingale,
(b) Show that

98 Exercises for Chapter II
(c) For 0 < 9 < -^, show that XT is a positive supermartingale, and deduce
(d) Use (c) to show that Xj, = sups<r \XS\ G L1, and conclude that X is a
martingale.
(e) Conclude that
Exercise 26. Let {Bt,Tt)t>o be standard Brownian motion, and let T =
infji > 0 : Bt e (-d,dH}. Let M = BT. Show that
(a) if d < f, then E{exp{±[M,M]T}} < oo; but
(b) if d = f, then Е{ещ>{\[М,M]T}} = oo. (#m?; Use Exercise 25.)
Exercise 27. Let {Bt,Tt)t>o be standard Brownian motion, and let Xt =
e"at(Xo+<J /0 easdBs). Show that X is a solution to the stochastic differential
equation
dXt — -aXtdt + adBt.
Exercise 28. Let В be a standard Brownian motion and let ?(B) denote the
stochastic exponential of B. Show that lim^oo ?(B)t = 0 a.s.
Exercise 29. Let X be a semimartingale. Show that
1 = -dX + d[X,X]
[?(Xy ?(X)
Exercise 30. Let Bbea standard Brownian motion.
(a) Show that M is a local martingale, where
Mt = exp{Bt - -t}.
(b) Calculate [M,M]t, and show that M is a martingale.
(c) Calculate E{eBt}.
The next eight problems involve a topic known as changes of time. For
these problems, let (Q,,J-,?,P) satisfy the usual hypotheses. A change of
time R = (Rt)t>o 1S a family of stopping times such that for every wef!, the
function R.(w) is non-decreasing, right continuous, Rt < oo a.s., and Ro — 0.
Let Qt = Tnt. Change of time is discussed further in Sect. 3 of Chap. IV.
Exercise 31. Show that G = (?t)t>o satisfies the usual hypotheses.
Exercise 32. Show that if M is an F uniformly integrable martingale and
Mt := Mnt, then MisaG martingale.

Exercises for Chapter II 99
Exercise 33. If M is an F (right continuous) local martingale, show that M
is a G semimartingale.
*Exercise 34. Construct an example where M is an F local martingale, but
M is not a G local martingale. (Hint: Let (Хп)пе® be an adapted process. It
is a local martingale if and only if \Xn\dP is a сг-finite measure on Tn-i-, and
E{Xn\Tn-\} = Xn-i, each n > 1. Find Xn where \Xn\dP is not сг-finite on
Tn-i, any n, and let Rn = 2n.)
*Exercise 35. Let R be a time change, with s »-> Rs continuous, strictly
increasing, Ro = 0, and Rt < oo, each t > 0. Show that for a continuous
semimartingale X,
rRt rt
/ HsdXs = / HRudXRu
Jo Jo
for bounded Яб1.
*Exercise 36. Let R and X be as in Exercise 35. No longer assume that
Rt < oo a.s., each t > 0, but instead assume that X is a finite variation
process. Let At — infjs > 0 : Rs > t}.
(a) Show that R strictly increasing implies that A is continuous.
(b) Show that R continuous implies that A is strictly increasing.
(c) Show that for general R, RAt > t, and if R is strictly increasing and
continuous then R^t = t.
(d) Show that for bounded ffeLwe have
Rt rt rtAAoo
HsdXs = / l{R.<O0)HR.dXR. = / HR.dXR..
o Jo Jo
(e) Show that for bounded H e L we have
- Г
'Jo
See in this regard Lebesgue 's change of time formula, given in Theorem 45 of
Chap. IV.
*Exercise 37. Let R be a change of time and let G be the filtration given by
Qt = TRt. Let At = infjs > 0 : Rs > t}. Show that A = (At)t>o is a change of
time for the filtration G. Show also that iit—*Rt is continuous a.s., Ro = 0,
and Roo = oo, then RAt = t a.s., t > 0.
*Exercise 38. Let A, G, be as in Exercise 37 and suppose that RAt = t a.s.,
t > 0. Show that QM С Tt, each t>0.
*Exercise 39. A function is Holder continuous of order a if \f(x) — f(y)\ <
K\x—y\a. Show that the paths of a standard Brownian motion are a.s. nowhere
locally Holder continuous of order a for any a > 1/2. (Hint: Use the fact that

ш
Semimartingales and Decomposable Processes
1 Introduction
In Chap. II we defined a semimartingale as a good integrator and we developed
a theory of stochastic integration for integrands in L, the space of adapted
processes with left continuous, right-limited paths. Such a space of integrands
suffices to establish a change of variables formula (or "Ito's formula"), and it
also suffices for many applications, such as the study of stochastic differential
equations. Nevertheless the space L is not general enough for the consider-
consideration of such important topics as local times and martingale representation
theorems. We need a space of integrands analogous to measurable functions
in the theory of Lebesgue integration. Thus defining an integral as a limit of
sums—which requires a degree of smoothness on the sample paths—is inade-
inadequate. In this chapter we lay the groundwork necessary for an extension of our
space of integrands, and the stochastic integral is then extended in Chap. IV.
Historically the stochastic integral was first proposed for Brownian motion,
then for continuous martingales, then for square integrable martingales, and
finally for processes which can be written as the sum of a locally square
integrable local martingale and an adapted, cadlag processes with paths of
finite variation on compacts; that is, a decomposable process. Later Doleans-
Dade and Meyer [53] showed that the local square integrability hypothesis
could be removed, which led to the traditional definition of a semimartingale
(what we call a classical semimartingale). More formally, let us recall two
definitions from Chaps. I and II and then define classical semimartingales.
Definition. An adapted, cadlag process Л is a finite variation process
(FV) if almost surely the paths of A are of finite variation on each compact
interval of [0, сю). We write /0°° \dAs\ or \A\qo for the random variable which
is the total variation of the paths of A.
Definition. An adapted, cadlag process X is decomposable if there exist
processes N, A such that

102 III Semimartingales and Decomposable Processes
Xt = Xo + Nt + At
with iVo = A) = 0, N a locally square integrable local martingale, and A an
FV process.
Definition. An adapted, cadlag process У is a classical semimartingale
if there exist processes N, В with No — -Bo = 0 such that
Yt = Yo + Nt + Bt
where N is a local martingale and В is an FV process.
Clearly an FV process is decomposable, and both FV processes and de-
decomposable processes are semimartingales (Theorems 7 and 9 of Chap. II).
The goal of this chapter is to show that a process X is a classical semimartin-
semimartingale if and only if it is a semimartingale. To do this we have to develop a small
amount of "the general theory of processes." The key result is Theorem 25
which states that any local martingale M can be written
M = N + A
where N is a local martingale with bounded jumps (and hence locally square
integrable), and A is an FV process. An immediate consequence is that a
classical semimartingale is decomposable and hence a semimartingale by The-
Theorem 9 of Chap. II. The theorem of Bichteler and Dellacherie (Theorem 43)
gives the converse: a semimartingale is decomposable.
We summarize the results of this chapter, that are important to our treat-
treatment, in Theorems 1 and 2 which follow.
Theorem 1. Let X be an adapted, cadlag process. The following are equiva-
equivalent:
(i) X is a semimartingale;
(ii) X is decomposable;
(Hi) given /3 > 0, there exist M, A with Mq = Aq = 0, M a local martingale
with jumps bounded by C, A an FV process, such that Xt = Xo + Mt + At;
(iv) X is a classical semimartingale.
Definition. The predictable cr-algebra V on M+ x О is the smallest cr-
algebra making all processes in L measurable. We also let V (resp. bV) denote
the processes (resp. bounded processes) that are predictably measurable.
The next definition is not used in this chapter, except in the Exercises,
but it is natural to include it with the definition of the predictable cr-algebra.
Definition. The optional cr-algebra О on R+ x О is the smallest cr-algebra
making all cadlag, adapted processes measurable. We also let О (resp. ЪО)
denote the processes (resp. bounded processes) that are optional.

2 The Classification of Stopping Times 103
Theorem 2. Let X be a semimartingale. If X has a decomposition Xt =
Xo + Mt + At with M a local martingale and A a predictably measurable FV
process, Mo = Aq = 0, then such a decomposition is unique.
In Theorem 1, clearly (ii) or (iii) each imply (iv), and (iii) implies (ii),
and (ii) implies (i). That (iv) implies (iii) is an immediate consequence of
the Fundamental Theorem of Local Martingales (Theorem 25). While Theo-
Theorem 25 (and Theorems 3 and 22) is quite deep, nevertheless the heart of Theo-
Theorem 1 is the implication (i) implies (ii), essentially the theorem of K. Bichteler
and C. Dellacherie, which itself uses the Doob-Meyer decomposition theorem,
Rao's Theorem on quasimartingales, and the Girsanov-Meyer Theorem on
changes of probability laws. Theorem 2 is essentially Theorem 30.
We have tried to present this succession of deep theorems in the most
direct and elementary manner possible. In the first edition we were of the
opinion that Meyer's original use of natural processes was simpler than the
now universally accepted use of predictability. However, since the first edition,
R. Bass has published an elementary proof of the key Doob-Meyer decompo-
decomposition theorem which makes such an approach truly obsolete. We are pleased
to use Bass' approach here; see [11].
2 The Classification of Stopping Times
We begin by denning three types of stopping times. The important ones are
predictable times and totally inaccessible times.
Definition. A stopping time T is predictable if there exists a sequence of
stopping times (Tn)n>! such that Tn is increasing, Tn < T on {T > 0}, all n,
and limn^oo Tn = T a.s. Such a sequence (Tn) is said to announce T.
If X is a continuous, adapted process with Xo = 0, and T = ini{t : \Xt\ >
c}, for some с > 0, then T is predictable. Indeed, the sequence Tn = ini{t :
\Xt\ >c—^}Anisan announcing sequence. Fixed times are also predictable.
Definition. A stopping time T is accessible if there exists a sequence
(Tk)k>i of predictable times such that
oo
: Tk(u>) = T(w) < oo}) = P{T < oo).
Such a sequence (Tk)k>i is said to envelop T.
Any stopping time that takes on a countable number of values is clearly ac-
accessible. The first jump time of a Poisson process is not an accessible stopping
time (indeed, any jump time of a Levy process is not accessible).
Definition. A stopping time T is totally inaccessible if for every pre-
predictable stopping time S,

104 III Semimaxtingales and Decomposable Processes
P{w : Т{ш) = S{uj) < 00} = 0.
Let T be a stopping time and Л е Тт- We define
if шеЛ,
It is simple to check that since Л е Тт, Т\ is a stopping time. Note further
that T = хпт(Тл,ТЛо) =ТЛЛ ТЛс.
A simple but useful concept is that of the graph of a stopping time.
Definition. Let Г be a stopping time. The graph of the stopping time
T is the subset of R+ x О given by {(t,u:) : 0 < t = T(w) < 00}; the graph
of T is denoted by [T].
Theorem 3. Let T be a stopping time. There exist disjoint events А, В such
that A Li В = {Т < сю} a.s., Та is accessible and Тв is totally inaccessible,
and T = Та Л Тв a.s. Such a decomposition is a.s. unique.
Proof. If T is totally inaccessible there is nothing to show. So without loss of
generality we assume it is not. We proceed with an inductive construction:
Let R±=T and take
c*i = sup{PE = Ri < 00) : Sis predictable}.
Choose Si predictable such that P(Si = R\ < 00) > ^- and set Vx = S±.
Define R2 = Ri^y^Ri}- F°r the inductive step let
a, = sup{PE = Ri < сю) : S is predictable}
If a, — 0 we stop the induction. Otherwise, choose Si predictable such that
P(Si = Ri < сю) > ^ and set Vi = Si^s. ^oes not equai any of vjti<j<i-i}-
(
^ q y j
Let Ri+i = Ri^Vi^Ri}- The graphs of the Vi are disjoint by construction (the
finite valued parts, of course). Moreover the sets {Ri+i Ф Ri, Ri < сю} are
disjoint. Thus E ^ < T,p(Ri+i Ф Ri, Ri < 00) < 1, which implies that
ai —> 0. The Ri form a non-decreasing sequence, so set U = lim^oo Ri.
If U is not totally inaccessible, there exists a predictable time W such that
P(W — U) > 0 and hence P(W — U) > c^ for some i. This contradicts how
we chose Si at step i of the induction. Therefore U is totally inaccessible, and
В = {U = T < сю}. ?
A beautiful application of Theorem 3 is Meyer's Theorem on the jumps
of Markov processes, a special case of which we give here without proof. (We
use the notation established in Chap. I, Sect. 5 and write FM = (.T^tKt^oo-)
Theorem 4 (Meyer's Theorem). LetX be a (strong) Markov Feller process
for the probability PM, where the distribution of Xq is given by fx, and with its
natural completed filtration FM. Let T be a stopping time with P^(T > 0) — 1.
Let A = {lj : Xt{oj) ф ХТ-(ш) and T{uj) < сю}. Then Т = ТлЛТЛс, where
T/i is totally inaccessible and Тдс is predictable.

3 The Doob-Меуег Decompositions 105
A consequence of Meyer's Theorem is that the jump times of a Poisson
process (or more generally a Levy process) are all totally inaccessible.
We will need a small refinement of the concept of a stopping time cr-algebra.
This is particularly important when the stopping time is predictable.
Definition. Let Г be a stopping time. The ст-algebra Тт- is the smallest
cr-algebra containing To and all sets of the form А Л {t < T}, t > 0 and
ieft.
Observe that Тт- С TT, and also the stopping time T is Тт- measurable.
We also have the following elementary but intuitive result. We leave the proof
to the reader.
Theorem 5. Let T be a predictable stopping time and let (Tn)n>i be an
announcing sequence for T. Then Тт- = c{Un>i ^тп} = Vn>i ^тп ¦
3 The Doob-Meyer Decompositions
We begin with a definition. Let N denote the natural numbers.
Definition. An adapted, cadlag process X is a potential if it is a non-
negative supermartingale such that lim^oo E{Xt} = 0. A process {Хп)пе^
is also called a potential if it is a nonnegative supermartingale for N and
limn^oo E{Xn} = 0.
Theorem 6 (Doob Decomposition). A potential (Хп)пе® has a decom-
decomposition Xn = Mn — An, where An+\ > An a.s., A$ — 0, An e J-n-i, and
Mn = ^{^ool^n}. Such a decomposition is unique-
Proof. Let Mo = Xo and Ao = 0. Define Mx = Mo + {X± - ?{Xi|JF0}), and
Ai = Xo - E{X1\Tq). Define Mn, An inductively as follows:
Mn = Mn-! + {Xn -
An = An_! + (Xn_! - E{Xn\Tn-i}).
Note that E{An] = E{X0} - E{Xn} < E{X0} < oo, as is easily checked by
induction. It is then simple to check that Mn and An so defined satisfy the
hypotheses.
Next suppose Xn = Nn — Bn is another such representation. Then Mn —
Nn = An - Bn and in particular Mx - N± = Ax - Вг ef0- Thus Mx - Nx =
E{M± - Nil^o} = Mo - No = Xo - Xo = 0, hence Mx = Nx. Continuing
inductively shows Mn = Nn, all п. П
We wish to extend Theorem 6 to continuous time supermartingales. Note
the unusual measurability condition that An e J-n-i which of course is
stronger than simply being adapted. The continuous time analog is that the
process A be natural or what turns out to be equivalent, predictably measur-
measurable.

106 III Semimartingales and Decomposable Processes
Throughout this paragraph we assume given a filtered probability space
(fi, J-, F, P) satisfying the usual hypotheses. Before we state the first decom-
decomposition theorem in continuous time, we establish a simple lemma.
Lemma. Let (Yn)n>i be a sequence of random variables converging to 0 in
L2. Then 8uptE{Yn\ft} -* 0 in 1?.
Proof. Let M" = E{Yn\Tt} which is of course a martingale for each fixed n.
Using Doob's maximal quadratic inequality, I?{supt(M"J} < 4jB{(M^,J} =
4?{Yn2} -> 0. D
Definition. We will say a cadlag supermartingale Z with Z$ — 0 is of
Class D if the collection {Zt ¦ T a finite valued stopping time} is uniformly
integrable.
The name "Class D" was given by P. A. Meyer in 1963. Presumably he
expected it to come to be known as "Doob Class" at some point, but it has
stayed Class D for 40 years now, so we see no point in changing it. (There are
no Classes A, B, and C.) We now come to our first version of the Doob-Meyer
decomposition theorem. It is this theorem that gives the fundamental basis
for the theory of stochastic integration.
Theorem 7 (Doob-Meyer Decomposition: Case of Totally Inacces-
Inaccessible Jumps). Let Z be a cadlag supermartingale with Z$ = 0 of Class D,
and such that all jumps of Z occur at totally inaccessible stopping times. Then
there exists a unique, increasing, continuous, adapted process A with Aq = 0
such that Mt = Zt + At is a uniformly integrable martingale.
We first give the proof of uniqueness which is easy. For existence, we will
first establish three lemmas.
Proof of uniqueness. Let Z — M - A and Z = TV — С be two decompositions
of Z. Subtraction yields M - N — A - C, which implies that M — TV is a
continuous martingale with paths of finite variation. We know however by the
Corollary of Theorem 27 of Chap. II that M —TV is then a constant martingale
which implies Mt - Nt = Mo - TV0 = 0 - 0 = 0 for alH. Thus M — TV and
A = C. D
Lemma 1. Let F be a discrete time filtration and let С be a non-decreasing
process with Co = 0, and С к G J-k-i- Suppose there exists a constant TV > 0
such that E{Coo - Ck\fk} < TV a.s. for all k. Then E{C^} < 2TV2.
Proof of Lemma 1. First observe -EjCoo} = E{E{Coo - С0\Т0}} < TV. Let-
Letting Cfe = Cfc+i — Cfe > 0, we obtain by rearranging terms:
k>0 k>0
Thus it follows that

3 The Doob-Meyer Decompositions 107
- Ck\Tk}ck} < 2NE{^ck} < 2NE{C00}
fe>0 fe>0
< 2N2. П
Choose and fix a constant v G Z+ and let Dn = {k2~n : 0 < k2~n < v).
Lemma 2. Let T be a totally inaccessible stopping time. For 5 > 0, let
RE) = suPt<,, P(t < T < t + 6\Ft). Then RF) -> 0 in probability as 6 -> 0.
Proof of Lemma 2. Let о > 0 and Sn(S) = inft{i e Dn : P(t < T < t +
5\Tt) > а} Аи. First we assume Sn(S) is less than T. Since Sn(E) is countably
valued, it is accessible, and since T is totally inaccessible, P(SnE) = T) = 0.
Suppose Г с {Т <t}, and also Г е Tt. Then
?{?{l{t<T<t+5}|jrt}lr} = E{l{t<T<t+s]lr} = 0.
Suppose now P(T < Sn(S)) > 0. Then for some t e Dn, P(T < t, SnF) =
t) > 0. Let Г = {T < t, Sn(S) = t}. Then from the definition of SnF) we
obtain
E{l(t<T<t+s)\ft}lr >alr, (*)
a contradiction. Thus we conclude SnF) < T a.s.
Next we define a stopping time S as follows. Let S(S) = infn Sn(S) and
S = suPn S(?). Fix n, so that S(i) = inffe Sfc(?). Thus on {5 = T}, S(±) <
S. Hence since S = supn5(^), we have S is accessible on {S — T}, which
implies that T is accessible on {S = T}, which in turn implies P(S = T) = 0.
Consequently E{1^s=t}\^s-} — 0 a.s.
Suppose now that the result is not true. That is suppose there exists an
e > 0 such that P(R(S) > a) > e, for all 6 tending to 0. Let /3 > 0 and S </3.
For n sufficiently large
p(E{^{sn(s)<T<sn(S)+s}\^snE)} > a) > e.
We also have P(T — SF)+6) = 0, since if not SF)+6-^ could announce part
of T which would contradict that T is totally inaccessible. Thus P({Sn{S) <
T < Sn(S)+S} A {S(S) <T< SF)+6}) -> 0 as n -л оо where the symbol Д
denotes the symmetric difference set operation. Since the symmetric difference
tends to 0, if P(Sn(S) < T < Sn(S) + 6\Т3п{8) > a) > e for any 6 > 0, then
we must also have
since otherwise were it to tend to 0 along with the symmetric difference tend-
tending to 0, then we would have (*) —> 0, a contradiction. Thus we have P(SE) <
T < S(S) + ftfsiS)) > a on a set Л, with P(A) > e. From the definition of S
and working on the set Л, this implies P(E{l{S<T<s+p}\^s-} > a) > e. Let
C —> 0 to obtain P(T = S\Fs-) > a, and we have a contradiction. ?

108 III Semimartingales and Decomposable Processes
Lemma 3. Let the hypotheses of Theorem 7 hold and also assume \Z\ < N
where N > 0 is a constant, and further that the paths of Z are constant after
a stopping time v. Let W(S) - snpt<u<t+s E{Zt - Zu\Ft}. Then W{8) -> 0
in L2 as 5 -> 0.
Proof of Lemma 3. Since \Z\ < N we have |W(<5)| < 2N. Thus it suffices
to show W(S) —* 0 in probability. Let e, a > 0 and b = a^fe. Define
and W\S) = suPt<u<t+, \E{Zb - z\\Tt\V with W+(S) and W~(S) defined
analogously. Then
sup \Zb-Zbs\\Tt],
which by Doob's inequality implies
P(Wb(S) > a) < \E{ sup \Zb-Zb\2}.
Я r<«<r+«
However since Zb is cadlag and ]AZb| < b, Fatou's Lemma implies
limsup \e{ sup \Zb - Zb\2} <%=e.
s->o a" r<s<r+s a
Also, again using that Z is cadlag, for each fixed w the number of jumps larger
than b is finite. Define inductively
Ti = inf{i > 0 : AZt > b}, Ti+1 = inf{i > Tt : AZt > b}.
Since \Z\ < N be hypothesis, \AZTi\ < 2N. Choose к such that P(Tk < v) <
e. Then
P(W+(S) >a)< P(Tk <
г=1
By the previous lemma we know that R(S) —> 0 in L2, so by taking E small
enough, we get the above expression less than 2e. The reasoning for W~ is
analogous. We achieve W{5) < WbE)+ W+F) +W~F), which gives W{5) ->
0 in L2. D
We return to the proof of Theorem 7.
Proof of existence. First suppose the jumps of Z are bounded by a constant
с Let TN = inf{* > 0 : |Zt| > ЛГ - с} Л ЛГ, and Z? = ZMTjv. Then |Ztw| <

3 The Doob-Meyer Decompositions 109
\ZtL | + с < N, and ZN is constant after T^. Thus we are now reduced to the
case considered by the Lemma 2.
Fix n and let F? = T к . Define
All the a? > 0 since Z is a supermartingale, and also a? s F^-x- Let A%!
Z^=i aj • Then Lk = Z_fc_ + Л^ is an Tfc discrete time martingale. Define
We want to show that ?{supt |Bf — BJ™|2} —> 0 as n, m —> oo. Suppose m>n.
Since Bm and Bn are constant on intervals of the form (k2~m, (k + lJ"m],
the sup of the difference occurs at some k2~m.
Fix t and let и = inf{s : s € Dn and s > t}. Observe that
AT\?T} = ^{^t - Zoolft} is bounded by 2N. Also
= E{E{ZU -
which is bounded by 2N. This implies
with the right side being nonnegative and bounded by WB~n), and of course
also by E{W{2~n)\Tt}. Using Lemmas 1 and 2 we get that ?{supt \Bf -
¦S*"|2} -+ 0 as n, m —> oo. Therefore the sequence B™ is a Cauchy sequence
and we denote its limits by At.
Next we show that the process A has continuous paths. We do this by
analyzing the jumps of the approximating processes Bn. Note that for t =
k2~n,
which are bounded by WB"n). But then supt |AB"| -> 0 in L2. Thus there
exists a subsequence (rij) such that sup4 |AB1 —> 0 a.s., and hence the limit
Л is continuous.
It remains to show that Zt +At is a uniformly integrable martingale. Both
are square integrable, so we only need concern ourselves with the martingale
property. Let s,t e Dn, s<t, and Л e Ts. Then E{(Zt + At)lA} = E{(ZS +
А3Iл}- The result follows by taking limits of Zt+B™.
At the beginning of the proof we made the simplifying assumption that
Z had bounded jumps. We now treat the general case. Choose N and let
T = inf{* > 0 : \Zt\ > N} A N. Then ZT (that is, Z stopped at the time T)
has at most one jump bigger than 2N. By localization and the uniqueness of
the decomposition, it suffices to prove the result for this case. Thus without

110 III Semimartingales and Decomposable Processes
loss of generality we assume Z has at most one jump greater than or equal
to 2N in absolute value, that it occurs at the time T, and that Z is constant
after the time T. We let
s<t
s<t
Since — Z+ and Z~ both have decreasing paths, they are both supermartin-
gales. Suppose we can show the theorem is true for — Z+ and Z~. Let — Z+ —
M+ — A+ and Z~ = M~ — A~ be the two decompositions, with A+ and A~
both continuous. Then Z = Z + (-Z+ + A+) ~(Z~ +A~) = Z + M+-M~ is
a supermartingale with jumps bounded by 2N. Let Z = M - A be the unique
decomposition of Z which we now know exists. Then Z = Z + M+ — M~,
and therefore Z = (M + M+ — M~) — A is the desired (unique) Doob-Meyer
decomposition.
Thus it remains to show that — Z+ and Z~ both have Doob-Meyer de-
decompositions. First observe that \AZT\ < \%т-\ + \Zt\ < N + \ZT\ e L1, and
hence ^{|AZr|} < oo. Choose a,e > 0. Next choose R > N so large that
.E{|Ai?r|l{|A.zT|>fl}} — ?a- The cases for when the jump is positive and when
the jump is negative being exactly the same except for a minus sign, we only
treat the case where the jump is negative. Let Z^ = AZTl{t>T}l{-AZT>R}i
and Zf = Zt - Zf. We define Bn~,BnR,Bnd analogously to the way we
defined Bn in equation (*) above.
We first show 5"~ converges uniformly in t in probability. We have:
-йГ~1>а) (**)
t
< P(sup \Bf - B?d\ > ?) + P(sup |BfR| > ^) + P(sup \B?R\ > ^)
The second term on the right side of (**) above is less than 3e:
P(sup\B?R\ > ?) = P(B^ > \) < *E{B?}
t о о а
< ?-E{\ZR\} < ^-E{\AZT\l{lAZTl>R}}
< 3e.
The third term on the right side of (**) is shown to be less than 3e similarly.
Since \ZR\ is bounded by R, the first term on the right side of (**) can be
made arbitrarily small by taking m and n large enough, analogous to what
we did at the beginning of the proof for Bn.
Thus as we did before in this proof, we can conclude that B"+ converges
uniformly in t in probability as n —> oo, and we denote its limit process by Af.

3 The Doob-Meyer Decompositions 111
We prove continuity of A exactly as before. Finally by taking a subsequence
tij such that Bnj+ converges almost surely, we get
+ E{ lim B?+} +
{
Tlj
by Fatou's Lemma. From this it easily follows that Z+ + A+ is a uniformly
integrable martingale, and the proof is complete. ?
While Theorem 7 is sufficient for most applications, the restriction to su-
permartingales having jumps only at totally inaccessible stopping times can
be insufficient for some needs. When we move to the general case we no
longer have that A is continuous in general (however see Exercises 24 and 26
for supplemental hypotheses that assure that the increasing process A of the
Doob-Meyer decomposition is in fact continuous). Without the continuity we
lose the uniqueness of the decomposition, since there exist many martingales
of finite variation (for example, the compensated Poisson process) that we can
add to the martingale term and subtract from the finite variation term of a
given decomposition, to obtain a second, new decomposition. Instead of the
continuity of A we add the condition that A be predictably measurable.
Theorem 8 (Doob-Meyer Decomposition: General Case). Let Z be a
cadlag supermartingale with Zo = 0 of Class D. Then there exists a unique,
increasing, predictable process A with Aq = 0 such that Mt = Zt + At is a
uniformly integrable martingale.
Before proving Theorem 8 let us introduce the concept of a natural process.
We introduce two definitions and prove the important properties of natural
processes in the next three theorems.
Definition. An FV process A with Aq = 0 is of integrable variation if the
expected total variation is finite: E{fo°° \dAs\} < oo. A shorthand notation for
this is .Ed^loo} < oo. An FV process A is of locally integrable variation
if there exists a sequence of stopping times (T")n>i increasing to oo a.s. such
that E{Jq" \dAs\} = E{\A\Tn} < oo, for each n.
Definition. Let A be an (adapted) FV process, Aq = 0, of integrable varia-
variation. Then Л is a natural process if
for all bounded martingales M.
Here is the key theorem about natural processes. This use of natural pro-
processes was Meyer's original insight that allowed him to prove Doob's conjec-
conjecture, which became the Doob-Meyer decomposition theorem.
Theorem 9. Let A be an FV process, Aq = 0, and Е{\А\Ж} < oo. Then A
is natural if and only if

112 III Semimartingales and Decomposable Processes
/•OO
( f
Me-dAt\ =
/0
for any bounded martingale M.
Proof. By integration by parts we have
/•OO /-00
/ Ms-dAs = MqoAx, - MqAq — / As_dMs — [M, A]oo-
Jo Jo
Then Mo A) = 0 and letting Nt = /0 As-dMs, we know that N is a local
martingale (Theorem 20 of Chap. II). However using integration by parts we
see that JSfiV^} < oo, hence N is a true martingale (Theorem 51 of Chap. I).
Therefore E {/0°° As_dMs} = ElN^,} - E{N0} =0, since N is a martingale,
so that the equality holds if and only if E{\M, Ajoo} = 0. ?
Theorem 10. Let A be an FV process of integrable variation which is natural.
If A is a martingale then A is identically zero.
Proof. Let T be a finite stopping time and let H be any bounded, nonnegative
rp
martingale. Then E{JQ Hs^dAs} = 0, as is easily seen by approximating
sums and the Dominated Convergence Theorem, since /0 |cL4,,| e L1 and
E{AT} = 0. Using the naturality of A, E{HTAT} = E{f* Hs_dAs} = 0,
and letting Ht = Е{1^т>о^\Тг} then shows that Р(Ат > 0) = 0. Since
E{AT} = 0, we conclude AT = 0 a.s., hence A = 0. ?
Theorem 11. Let A be an FV process of integrable variation with AQ = 0.
// A is predictable, then A is natural.
The proof of this theorem is quite intuitive and natural provided we accept
a result from Chap. IV. (We do not need this theorem to prove the theorem
we are using from Chap. IV.)
Proof. Let M be a bounded martingale. First assume A is bounded. Then
the stochastic integral fQ AsdMs exists, and it is a martingale by Theo-
Theorem 29 in Chap. IV combined with, for example, Corollary 3 of Theorem 27
of Chap. II. Therefore E{f?° AsdMs} = E{A0M0} = 0, since Ao = 0. How-
However -Б{/0°° As_dMs} = E{Ao_Mo} = 0 as well, since A0- = 0. Further,
/»oo /»oo /»oo
/ AsdMs- Aa-dMa = (As-As_)dMs
Jo Jo Jo
= f
AAsdMs
]T AASAMS
0<«<oo

3 The Doob-Meyer Decompositions 113
since Л is a quadratic pure jump semimartingale. Therefore
/.OO ;-OO
E{[A,M]oo} = E{ AsdMs- As_dMs} = 0.
Jo Jo
Since M was an arbitrary bounded martingale, A is natural by definition.
Finally we remove the assumption that A is bounded. Let An = nA(AV(-n)).
Then An is bounded and still predictable, hence it is natural. For a bounded
martingale M
E{[M,AU) = limSflM,^"],»} = О,
n
by the Dominated Convergence Theorem. Therefore A is natural. ?
The next theorem is now obvious and should be called a Corollary at best.
Because of its importance, however, we give it the status of a theorem.
Theorem 12. Let M be a local martingale with paths of finite variation on
compact time sets. If M is predictably measurable, then M is constant. That
is, Mt = Mo for all t, almost surely.
Proof. This theorem is a combination of Theorems 10 and 11. ?
Proof of Theorem 8 (Doob-Meyer: General Case). We begin with the unique-
uniqueness. Suppose Z = M - A and Z = N - С are two decompositions. By sub-
subtraction we have M — N = А — С is a martingale with paths of finite variation
which is predictable. By Theorem 12 we have that M — N is constant and
since Mo — No = 0, it is identically zero. This gives the uniqueness.
The existence of the decomposition is harder. We begin by defining stop-
stopping times Tnj, where Tnj is the j-th time \AZt\ is in the half-open interval
bounded by 2~™ and 2~(n~l\ where -oo < n < oo. We decompose Tnj
into its accessible and totally inaccessible parts. Since we can cover the ac-
accessible part with a sequence of predictable times, we can thus separate each
Tnj into a totally inaccessible time and a sequence of predictable times, with
disjoint (stopping time) graphs. Therefore, by renumbering, we can obtain a
sequence of stopping times Ej)j>i with disjoint graphs, each one of which is
either predictable or totally inaccessible, which exhaust the jumps of Z (in
the sense that the jump times of Z are contained in the union of the graphs
of the Si) and are such that for each i there exists a constant 6, such that
bi < \AZSi\ <2bi.
We define Zo(t) = Zt and inductively define Zi+\{t) = Zi{t) and Ai{t) =
0 if Si is totally inaccessible; whereas Ai(t) = — ElAZsil^Si-^iSi^} and
then Zi+i(t) = Zi(t) + Ai(t) in the case where Si is predictable (note that
AZSi G Ll).
We will show that: each At is increasing; Z± is a supermartingale for each
i; and -E{]>Zj=i A,-(oo)} < C, for each i, where С is a constant not depending
on i. We will prove these three properties by induction. We begin by showing
that each At is increasing. This is trivial if Si is totally inaccessible. Let then

114 III Semimartingales and Decomposable Processes
Si be a predictable time. Let S™ be an announcing sequence for Sj. Since each
Zi is a supermartingale, and using the Martingale Convergence Theorem, we
have EiAZ^SJlfs,-} = \\m.nE{AZi{Si)\Ts?}- Now fix n:
and thus Ai is increasing. To see that Zi+\ is a supermartingale, it will suffice
to show that whenever XJ\ and U2 are stopping times with Ui < U2 then
E{Zi+i(Ui)} > E{Zi+i(U2)}. Again letting SJ1 be an announcing sequence
for Si we have
E{Zi(U!)} - E{Zi(U2)} = E{Zi(U{)} - EiZidUi V S?) Л U2)}
V Я?) л U2)} - E{Zi((U! V SO Л U2)}
V SO Л U2)} - E{Zi(U2)}
where each of the summands on the right side above are nonnegative. Then
let n -> 00 to get E{Zi(Ui)} - E{Zi(U2)} > 0. Finally we want to show that
E{Y^l-i Aj(oo)} < C. It is easy to check (see Exercise 1 for example) that
the minimum of predictable stopping times is predictable, hence we can order
the predictable times so that Si < S2 < ¦ ¦ ¦ on the set where they are all
finite. Let Ai be the collection of all j such that j < г and such that the Sj
are all predictable. Then
AZSj} = lira ]T E{ZS) - ZSj}
Ai
+E{ZSt -Z00}+ E{Z0 - ZSk})
= E{Z0-Zoo},
which is bounded by a constant not depending on i.
To complete the proof, we note that because the processes Ai are in-
increasing in t and the expectation of their sum is bounded independently of
г, we have that X)»=i-^*@ converges uniformly in t a.s., as h —> oo, and
we call the limit Aoo(t). Fatou's Lemma gives us that Aoo(t) is integrable.
Each Ai(t) is easily seen to be predictable, and hence A^ is the almost sure
limit of predictable measurable processes, so it too is predictable. Next set
Z(x>(t) = Zt + Aoo(t) = ]haiZi(t). Since each Z+ is a supermartingale, cou-
coupled with the uniform convergence in t of the partial sums of the A+ pro-
processes, we get that Дх>(?) is a supermartingale. Moreover since each Zi is
cadlag, again using the uniform convergence of the partial sums we obtain

3 The Doob-Meyer Decompositions 115
that Zqo is also cadlag. Since the partial sums are uniformly bounded in ex-
expectation, Zqo (t) is of Class D. Finally by our construction of the stopping
times Si, -E{AZoo(T)|:Fr_} = 0 for all predictable times T. We can then
show as before that WF) —> 0 in probability as 6 —> 0 for Z^, and we ob-
obtain Zooty) = Mt - A(t). The process At — A»(?) + A(t) is then the desired
increasing predictable process. ?
The next theorem can also be considered a Doob-Meyer decomposition
theorem. It exchanges the uniform integrability for a weakening of the con-
conclusion that M be a martingale to that of M being a local martingale.
Theorem 13 (Doob-Meyer Decomposition: Case Without Class D).
Let Z be a cadlag supermartingale. Then Z has a decomposition Z — Z§ +
M — A where M is a local martingale and A is an increasing process which is
predictable, and Mq = Aq =0. Such a decomposition is unique. Moreover if
limt^oo E{Zt} > —oo, then ElA^} < oo.
Proof. First consider uniqueness. Let Z = Zq + M — A and Z = Zq + N — С
be two decompositions. Then M — N = A — С by subtraction. Hence A — С
is a local martingale. Let MT" be a uniformly integrable martingale. Then
E{ZQ - ZtAT»} = E{-Mf} + E{AT} = E{Af}
and therefore E{Aj"} < E{Z0 - Zt}, using Theorem 17 of Chap. I. Letting
n tend to oo yields E{At} < E{Zq - Zt}. Thus A is integrable on [0, to], each
to, as is С. Therefore A — С is of locally integrable variation, predictable,
and a local martingale. Since Aq — Cq = 0, by localization and Theorems 10
and 11, A — С = 0. That is, A = C, and hence M = N as well and we have
uniqueness.
Next we turn to existence. Let Tm = mi{t : \Zt\ > m} A m. Then Tm in-
increase to oo a.s. and since they are bounded stopping times Zt™ € L1 each m
(Theorem 17 of Chap. I). Moreover the stopped process ZTm is dominated by
the integrable random variable max(|Zrm|, m). Hence if X = Z7"™ for fixed m,
then H = {Xt',T a stopping time} is uniformly integrable. Let us implicitly
stop Z at the stopping time Tm and for n > 0 define Ytn = Zt - E{Zn\Tt},
with Ytn = l^"n = 0 when t > n. Then Yn is a positive supermartingale
of Class D and hence Ytn = Yon + Mt" - A?. Letting iVtn = E{Zn\Ft}, a
martingale, we have on [0, n] that
Zt=Z0+M?+N?-A?.
To conclude, therefore, it suffices to show that A™ — A™ on [0, n], for m > n.
This is a consequence of the uniqueness already established. The uniqueness
also allows us to remove the assumption that Z is stopped at the time Tm.
Finally, note that since E{At} < E{Z0 — Zt}, and since A is increasing,
we have by the Monotone Convergence Theorem

116 III Semimartingales and Decomposable Processes
EiA^} = lim E{At} < lim E{Z0 - Zt}
t—ЮО t—>OO
which is finite if lim^oo E{Zt} > —oo. П
4 Quasimartingales
Let X be a cadlag, adapted process defined on [0,оо].х
Definition. A finite tuple of points r = (t0, ti,..., tn+i) such that 0 = t0 <
ti < • • • < tn+i = oo is a partition of [0, oo].
Definition. Suppose that т is a partition of [0, oo] and that Xti G L1, each
U e т. Define
i—0
The variation of X along r is defined to be
Varr(X) - E{C(X,t)}.
The variation of X is defined to be
Var(X) =supVarr(X),
r
where the supremum is taken over all such partitions.
Definition. An adapted, cadlag process X is a quasimartingale on [0, oo]
if J5{|Xt|} < oo, for each t, and if Var(X) < oo.
Before stating the next theorem we recall the following notational conven-
convention. If X is a random variable, then
X+ =(,),
X~ =-min(X,0).
Also recall that by convention if X is defined only on [0, oo), we set Xoo = 0.
Theorem 14. Let X be a process indexed by [0, oo). Then X is a quasi-
quasimartingale if and only if X has a decomposition X = Y — Z where Y and Z
are each positive right continuous supermartingales.
1 It is convenient when discussing quasimartingales to include oo in the index set,
thus making it homeomorphic to [0, t] for 0 < t < oo. If a process X is defined
only on [0, oo) we extend it to [0, oo] by setting Хж = 0.

4 Quasimartingales 117
Proof. For given s > 0, let XXs) denote the set of finite subdivisions of [s, oo].
For each r G XXs )> se*
У; = Д{С(Х,т)+|.Гв} and ZJ = E{C{X,t)-\Fs)
where C(X, r)+ denotes X^e-r^i^i ~ -^ti+il^i}") and analogously for
C(.X", r)~. Also let -< denote the ordering of set containment. Suppose а, т G
XXs) with a < т. We claim Y/ < YST a.s. To see this let a = (t0,.. .,tn). It
suffices to consider what happens upon adding a subdivision point t before
?o> after tn, or between ti and ?i+i. The first two situations being clear, let us
consider the third. Set
A = E{Xti - Xt\Fu}; В = E{Xt - Xti+1\Ft};
С = E{Xti - Xti+1\fti};
then С = A + E{B\Fti}, hence
C+ < A+ + E{B\ft.}+
by Jensen's inequality. Therefore
and we conclude Y° < YJ. Since E{Y^} is bounded by VarpsT), taking limits
in L1 along the directed ordered set XXs) we define
Ys = limYsT,
and we can define Zs analogously. Taking a subdivision with to = s and
tn+i = oo, we see YJ - ZTS = SjC" - C~\TS} = Xs, and we deduce that
Ys — Zs = Xs. Moreover if s < t it is easily checked that Ys > E{Yt\Ts} and
Zs > E{Zt\Ts}. Define the right continuous processes Yt = Yt+, Zt = Zt+,
with the right limits taken through the rationals. Then Y and Z are positive
supermartingales and Ys — Zs = Xs.
For the converse, suppose X = Y — Z, where Y and Z are each positive
supermartingales. Then for a partition r of [0, t]
UET
E{YU - Yti+1\fu}} + E{J2 E{Zt
i} - E{Yu+l} + E{ZU} -
= E{Y0} + E{Z0} - (E{Yt} + E{Zt}).
Thus X is a quasimartingale on [0,t], each t > 0. ?

118 III Semimartingales and Decomposable Processes
Theorem 15 (Rao's Theorem). A quasimartingale X has a unique decom-
decomposition X = M + A, where M is a local martingale and A is a predictable
process with paths of locally integrable variation and Aq = 0.
Proof. This theorem is a combination of Theorems 13 and 14. ?
5 Compensators
Let Л be a process of locally integrable variation, hence a fortiori an FV
process. A is then locally a quasimartingale, and hence by Rao's Theorem
(Theorem 15), there exists a unique decomposition
A = M + A
where Л is a predictable FV process. In other words, there exists a unique,
predictable FV process A such that A — A is a local martingale.
Definition. Let A be an FV process with Ao = 0, with locally integrable
total variation. The unique FV predictable process A such that A - A is a
local martingale is called the compensator of A.
The most common examples are when A is an increasing process locally
of integrable variation. When A is an increasing process it is of course a
submartingale, and thus by the Doob-Meyer Theorem we know that its com-
compensator A is also increasing. We also make the obvious observation that
E{At} = E{At} for gi\t,O<t< oo.
Theorem 16. Let A be an increasing process of integrable variation, and let
H eh be such that E{f* HsdAs} < oo. Then,
E{ HsdAs} = E{ HsdAs}.
Jo Jo
Proof. Since A - A is a martingale, so also is / Hsd(As — As), and hence it
has constant expectation equal to 0. ?
In Chap. IV we develop stochastic integration for integrands which are
predictable, and Theorem 16 extends with H G L replaced with H predictable.
One of the simplest examples to consider as an illustration is that of the
Poisson process N = (Nt)t>o with parameter A. Recall that Nt — \t is a
martingale. Since the process At = \t is continuous and obviously adapted,
it is predictable (natural). Therefore Nt — Xt, t > 0. A natural extension of
the Poisson process case is that of counting processes without explosions. We
begin however with a counting process that has only one jump. Let r\ be the
counting process щ = 1{*>t}j where r is a nonnegative random variable. Let
F be the minimal filtration making r a stopping time.

5 Compensators 119
Theorem 17. Let P{r = 0) = 0 and Р(т > t) > 0, each t > 0. Then the F
compensator A off], where r\t = 1{*>т}> *s given fey
where F is the cumulative distribution function of т. If т has a diffuse
distribution (that is, if F is continuous), then A is continuous and At =
-F(t At)).
Before proving Theorem 17 we formalize an elementary result as a lemma.
Lemma. Let (п, Т, P) be a complete probability space. In addition, suppose
r is a positive T measurable random variable and J^° — а{т A t}, where
J*> = J*>° v Af, N are the null sets of T, and Tt = П«>г^5- Then F s°
constructed is the smallest filtration making r a stopping time. Let
Y 6 LX{T). Then
E{Y\Tt) = E{Y\r}l{t>_T} + Mr>p^
Proof. By the hypotheses on r the cr-algebra J^ is equal to the Borel u-
algebra on [0,t] together with the indivisible atom (t, oo). Observe that T^ =
<т{т Аи; и < t}, and the result follows easily. ?
Proof of Theorem 17. Fix t0 > 0 and let ?rn be a sequence of partitions of
[0,t0] with limn-™meshGrn) = 0. Define A? = ^ E{r]u+1 - т]и\Ти} for
0 < t < t0. Then
it X -n 1 , E{1{r>ti}Vti+1}л
by the lemma preceding this proof. The first term on the right above
is Vu+11{u>t} = l{ti+i>i-}l{ti>T} = !{4i>T}- Furthermore, 1{т>ц}Г]и+1 =
1{*i+i>-r} = 1{*i<r<4i+i}- Hence,
Therefore,
1 -тЛЪЛ = 1{^>г} + 1+^ три- \ Ъ
1 r УН)
These simple calculations yield

120 III Semimartingales and Decomposable Processes
+1 - *j*j = E F{ti+-
which are Riemann sums converging to /0 T(l — F(u—))~1dF(u). The re-
remaining claims of the theorem are obvious. D
Corollary 1. If F in Theorem 17 is absolutely continuous, then so also is A.
Corollary 2. If r in Theorem 17 has an exponential distribution, then At =
т At.
Proof. Since in this case F is continuous and a distribution function, we can
represent it as F(x) = 1 - е~ф{-х). We have that At = -ln(l - F(t A t))
and equivalently we can write At = ф(т At). In the case of this corollary, the
function ф(х) = x, and the result follows. ?
Remark. Theorem 17 gives an explicit formula for the compensator of a
process rjt = l{t>T} when one uses the minimal filtration making r a stopping
time. A perhaps more interesting question is the following: suppose that one
has a non-trivial filtration F satisfying the usual hypotheses, and a strictly
positive random variable L that is not a stopping time. Can one expand the
filtration to make a larger filtration G that renders laG stopping time, and
then calculate the G compensator of /ij = l{t>L}? This has a positive answer
with the right hypotheses. To see how to do this, let Xt = 1[o,l)W> which is
not an F adapted process. Let Z denote the optional projection of X onto F.
We prove in Chap. VI that Z is a supermartingale. Let Z = M—A be its Doob-
Meyer decomposition with A predictable. (Note that Mt = E{A00\Jrt}-) Using
the techniques presented in Chap. VI, one can show that if L is "the end of an
F predictable set," then the G compensator of/it = 1{*>L} is A-u which is an F
predictable process! By the end of a predictable set, we mean that there exists
an F predictable set Лс[0,оо]хП such that L(u>) = sup{t < oo : (t,w) е Л}.
In Chap. VI we discuss times that are the ends of optional sets, known as
"honest times," and this is of course less restrictive than being the end of a
predictable set, since any predictable set is also optional.2 See [115] for this
result described here and related ones.
Theorem 17 treats the case of a point process with one jump. A special
case of interest is counting processes. We have the following.
Theorem 18. Let N be a counting process without explosions, adapted to a
filtration G satisfying the usual hypotheses. Then the compensator of N, call
it A, always exists.
Proof Since N has non-decreasing paths, to ensure the existence of a com-
compensator we need only to show that N is locally of integrable variation. Let
Tn be the time of the n-th jump of N. Since N has no explosions, the times
See Exercise 4.

5 Compensators 121
Tn increase to oo a.s. Moreover \Nt^xn\ — ^tATn < n and thus N is locally of
bounded variation, hence certainly locally of integrable variation. ?
Let Nt = Yli>\ l{*>Ti} be a counting process without explosions and let
Si = Ti — Tj_i be its interarrival times, where we take To = 0. Let J^ =
<?{NS; s < tj, and let Tt = J*> V Af, the filtration completed by the .F-null
sets, as usual. (Note that this filtration is already right continuous.) Let us
define the cumulative distribution functions for the interarrival times slightly
informally as follows (note that we are using the unusual format of P(S > x)
rather than the customary P(S < x)):
> t)
Fi(slt..., *_i; t) = P(Si > t\S! = slt..., S^ = st-i)
where the Sj are in [0, oo], j > 1. (If one of the Sj takes the value +oo, then
the corresponding cumulative distribution function is concentrated at +oo.)
Define
i-i; t) =
f* -1
/ — rdFi(si,...,Si-i; s).
Jo *i{si,---,Si-i; s-)
We now have the tools to describe the compensator of N for its minimal
completed filtration. We omit the proof, since it is simply a more notationally
cumbersome version of the proof of Theorem 17.
Theorem 19. Let N be a counting process without explosions and let F be
its minimal completed filtration. Then the compensator A of N is given by
At = ^i(Si) + 02(Si; S2) + ¦¦¦+ &_i(Si,..., 5,_2; Si-i)
+ 0i(Si,-- • ,Si-i; t -T^
on the event {Ti <t< Ti+1}.
Corollary. Let N be a counting process without explosions and independent
interarrival times. Then the functions <j>i defined in equation (*) have the
simplified form
&(si,..., Si_i; t) =
and the compensator A is given by
-1
+фм - Ti)}i{Ti<t<Ti+1}
Example (hazard rates and censored data). In applications, an intuitive
interpretation often used is that of hazard rates. Let Г be a positive random

122 III Semimartingales and Decomposable Processes
variable with a continuous density for its distribution function F, given by /.
The hazard rate Л is defined, when it exists, to be
X(t) = lim -P(t <T <t + h\T>t).
The intuition is that this is the probability the event will happen in the next
infinitesimal unit of time, given that it has not yet happened. Another way
of viewing this is that P(t < T < t + h\T > t) = Xh + o(h). By Theorem 17
we have that the compensator of Nt = l{t>T] is At = /q \(s)ds. Or in
language closer to stochastic integration,
Л/\Т
Nt —I X(s)ds is a local martingale.
Jo
In medical trials and other applications, one is faced with situations where
one is studying random arrivals, but some of the data cannot be seen (for
example when patients disappear unexpectedly from clinical trials). This is
known as arrivals with censored data. A simple example is as follows. Let T
be a random time with a density and let U be another time, with an arbitrary
distribution. T is considered to be the arrival time and U is the censoring
time. Let X = Г Л U, Nt = l{t>x}l{c/>T}, and JV/7 = l{t>x}l{T>c/} with
J?t = a{Nu,N^; и <t}. Let Л be the hazard rate function for Т. Л is known
as the net hazard rate. We define A#, known as the crude hazard rate, by
X* = lim \-P(t <T<t + h\T>t,U>t)
h—>0 h
when the limit exists. We then have that the compensator of N is given by
At = /0 l{X>u}^(u)du = /o X#{u)du = /0 X#(u)du, or equivalently
rt
Nt — I l{x>u}^(u)du is a local martingale.
Jo
If we impose the condition that Л = А#, which can be intuitively interpreted
as
P(s < T < s + ds\T >s) = P(s<T<s + ds\T >s,U> s),
then we have the satisfying result that
r*
Nt — I l{x>u}A(u)<iu is a local martingale,
./o
As we saw in Chap. II, processes of fundamental importance to the the-
theory of stochastic integration are the quadratic variation processes [X, X] =
([X,X]t)t>o, where X is a semimartingale.
Definition. Let X be a semimartingale such that its quadratic variation
process [X, X] is locally integrable. Then the conditional quadratic vari-
variation of X, denoted (X, X) = ({X,X)t)t>o, exists and it is defined to be the
compensator of [X,X]. That is (X,X) = \X^X).

5 Compensators 123
If X is a continuous semimartingale then [X, X] is also continuous and
hence already predictable; thus [X, X] = (X, X) when X is continuous. In
particular for a standard Brownian motion B, [B, B)t = {B,B)t = t, all
t > 0. The conditional quadratic variation is also known in the literature by
its notation. It is sometimes called the sharp bracket, the angle bracket, or the
oblique bracket. It has properties analogous to that of the quadratic variation
processes. For example, if X and Y are two semimartingales such that (X, X),
(Y,Y), and (X + Y,X + Y) all exist, then (X,Y) exists and can be defined
by polarization
(X,Y) = ±((X + Y,X + Y) - (X,X) - (
However {X, Y) can be defined independently as the compensator of [X, Y]
provided of course that [X, Y] is locally of integrable variation. In other
words, there exist stopping times (Tn)n>i increasing to oo a.s. such that
E{JQ \d[X,Y]s\} < oo for each n. Also, (X,X) is a non-decreasing pro-
process by the preceding discussion, since [X, X] is non-decreasing. The condi-
conditional quadratic variation is inconvenient since unlike the quadratic variation
it doesn't always exist. Moreover while [X,X], [X,Y], and [Y,Y] all remain
invariant with a change to an equivalent probability measure, the sharp brack-
brackets in general change with a change to an equivalent probability measure and
may even no longer exist. Although the angle bracket is ubiquitous in the
literature it is sometimes unnecessary as one can often use the quadratic vari-
variation instead, and indeed whenever possible we use the quadratic variation
rather than the conditional quadratic variation (X, X) of a semimartingale X
in this book. Nevertheless the process (X, X) occurs naturally in extensions
of Girsanov's theorem for example, and it has become indispensable in many
areas of advanced analysis in the theory of stochastic processes.
We end this section with several useful observations that we formalize as
theorems and a corollary. Note that the second theorem below is a refinement
of the first (and see also Exercise 25).
Theorem 20. Let A be an increasing process of locally integrable variation,
all of whose jumps occur at totally inaccessible stopping times. Then its com-
compensator A is continuous.
Theorem 21. Let A be an increasing process of locally integrable variation,
and let T be a jump time of A which is totally inaccessible. Then its compen-
compensator A is continuous at T.
Proof. Both theorems are simple consequences of Theorem 7. D
Corollary. Let A be an increasing predictable process of locally integrable
variation and let Г be a stopping time. If Р(А? ф Ат~) > 0 then Г is not
totally inaccessible.

124 III Semimartingales and Decomposable Processes
Proof. Suppose T were totally inaccessible. Let A be the compensator of A.
Then A is continuous at Г. But since A is already predictable, A — A, and
we have a contradiction by Theorem 21. D
Theorem 22. Let T be a totally inaccessible stopping time. There exists a
martingale M with paths of finite variation and with exactly one jump, of size
one, occurring at time T (that is, Mf ф Мт- on {T < oo}j.
Proof Define
Ut — l{t>T}-
Then U is an increasing, bounded process of integrable variation, and we
let A — U be the compensator of U. A is continuous by Theorem 20, and
M = U — A is the required martingale. ?
6 The Fundamental Theorem of Local Martingales
We begin with two preliminary results.
Theorem 23 (Le Jan's Theorem). Let T be a stopping time and let H be
an integrable random variable such that Е{Н\Тт-} = 0 on {T < oo}. Then
the right continuous martingale Ht = E{H\Tt) is zero on [0, T) = {(t,ui) :
0<t< T(lj)}.
Proof. Since the martingale (Ht)t>o is right continuous it suffices to show that
Я41{4<Т} = 0 almost surely for all t. Let Л G Tt. Then An{t <T} belongs
both to Tt- and also to Tt- Hence
0 = E{HlAn{t<T}} = E{HtlAn{t<T}}.
Since Л e Tt is arbitrary, this implies that Я41{4<Т} = 0 a.s. If H is Tt
measurable then H\{T<t} G Tt, and E{H\Tt} = H on {T < t} and the
theorem is proved. ?
Theorem 24. Let T be a predictable stopping time and let A be increasing,
predictable, and locally of integrable variation. Then At and ААт are each
Tt- measurable.
Proof. Let Sn be a sequence of stopping times announcing T. Then Asn G
Tsn С Тт- for each n. Since At- = Шип-юс Asn we have that At- G
Tt-. To show that At G Tt- it suffices to show E{ATH} — 0 for every
bounded random variable H such that Е{Н\Тт-} = 0. Let H be such a
random variable and let Ht = E{H\Tt}, the right continuous martingale with
terminal value H. Then Ht{w) = 0 for 0 < t < T{uj) by Le Jan's Theorem
(Theorem 23). This implies that the left continuous version of Я, Ht~, equals
0 on [0, Г]. Then since the hypotheses imply that A is natural, we have
/•OO
E{HAT} = E{ Hs_l{s<T}dAs} + E{[H,A}T}= 0 + 0 = 0. D
Jo

6 The Fundamental Theorem of Local Martingales 125
Theorem 25 (Fundamental Theorem of Local Martingales). Let M
be a local martingale and let /3 > 0. Then there exist local martingales N,
D such that D is an FV process, the jumps of N are bounded by 2/3, and
M = N + D.
Proof. We first set
ct= Yl \Ams\1{\am3\>/3},
0<s<t
and we want to show С is locally integrable. By stopping we can assume
without loss that M is a uniformly integrable martingale and that Mo = 0.
For each w, Ct{u>) is a finite sum. Define
Rn - in({t :Ct>n or \Mt\ > n).
Then \AMRn\ < \MRn\ + |МН„_| < \MRn\ + n. Moreover CRn < C7Hn_ +
|ДМдп| < |Мн„| + 2n, which is in L1. Since Rn increases to oo, we have С is
locally integrable. By stopping we further assume that E{Coo} < oo.
Next we define
0<s<4
and since С is assumed integrable we deduce that Л is a quasimartingale. By
Theorems 13 and 14 we know that A decomposes as follows:
where L is a local martingale and B1, B2 are non-decreasing, nonnegative,
predictable processes, such that ^{В^} < oo, ^{В^,} < oo. Let
At = B\ - B2t.
Then \At — At\ < Cqo +-Втс + -^то 6 L1, and hence the martingales A — A and
N = M-(A-A)
are uniformly integrable. As usual, we write A for the compensator of A. Then
D = A-A.
It remains only to show that thejjumps of N are bounded by 2/3. To this
end, let T be a stopping time. If AAT = 0 a.s., then
ANT = AMT - AAT
This implies |AiV^| < /3, and we are done by Theorem 7 of Chap. I.
We therefore assume if Л = {\AAT\ > 0}, then P(A) > 0. Let R = Тл.
By Theorem 3 we can decompose R as R — Ra Л Rb, where Ra is accessible
and Rb is totally inaccessible. Then P(Rb < oo) = 0 by Theorem 7, whence

126 III Semimartingales and Decomposable Processes
R = Ra- Let (Tfc)fc>i be a sequence of predictable times enveloping R. It
will suffice to show \ANxk\ < 2/3, for each Tk- Thus without loss of generality
we can take R — T to be predictable. By convention, we set ANT = 0 on
{T = oo}. Since we know that A Ax S Ft- by Theorem 24, we have
ANT = ANT - E{ANT\TT-}
= A(M - A)T + AAT - E{A(M - A)T\TT-} - E{AAT\TT-}
= A(M - A)T - E{A(M - A)T\TT-}.
Since |A(M - A)T\ < /3, the result follows. D
If the jumps of a local martingale M with Mo = 0 are bounded by a
constant /3, then M itself is locally bounded. Let Tn = in?{t : \Mt\ > n}.
Then \Mt/KTn | < n + C. Therefore M is a fortiori locally square integrable.
We thus have a corollary.
Corollary. A local martingale is decomposable.
Of course if all local martingales were locally square integrable, they would
then be trivially decomposable. The next example shows that there are mar-
martingales that are not locally square integrable (a more complex example is
published in Doleans-Dade [50]).
Example. Let (п, J7, P) be complete probability space and let X be a random
variable such that X e L1, but X $. L2. Define the filtration
t>l,
where Tt=J^vM', with N all the P-null sets of T. Let Mt = E{X\Tt}, the
right continuous version. Then M is not a locally square integrable martingale.
The next example shows another way in which local martingales differ from
martingales. A local martingale need not remain a local martingale under a
shrinkage of the filtration. They do, however, remain semimartingales and
thus they still have an interpretation as a differential.
Example. Let У be a symmetric random variable with a continuous distri-
distribution and such that .E{|Y|} = oo. Let Xt = У1{4>1}, and define
* \a{Y}, t>l,
where G = (Qt)t>o is the completed filtration. Define stopping times Tn by
rn = f 0, if \Y\ > n,
I oo, otherwise.

7 Classical Semimartingales 127
Then Tn reduce X and show that it is a local martingale. However X is not
a local martingale relative to its completed minimal filtration. Note that X is
still a semimartingale however.
The full power of Theorem 25 will become apparent in Sect. 7.
7 Classical Semimartingales
We have seen that a decomposable process is a semimartingale (Theorem 9
of Chap. II). We can now show that a classical semimartingale is indeed a
semimartingale as well.
Theorem 26. A classical semimartingale is a semimartingale.
Proof. Let X be a classical semimartingale. Then Xt = Mt + At where M is
a local martingale and A is an FV process. The process A is a semimartin-
semimartingale by Theorem 7 of Chap. II, and M is decomposable by the corollary of
Theorem 25, hence also a semimartingale (Theorem 9 of Chap. II). Since semi-
semimartingales form a vector space (Theorem 1 of Chap. II) we conclude X is a
semimartingale. D
Corollary. A cadlag local martingale is a semimartingale.
Proof. A local martingale is a classical semimartingale. ?
Theorem 27. A cadlag quasimartingale is a semimartingale.
Proof. By Theorem 15 a quasimartingale is a classical semimartingale. Hence
it is a semimartingale by Theorem 26. ?
Theorem 28. A cadlag supermartingale is a semimartingale.
Proof. Since a local semimartingale is a semimartingale (corollary to The-
Theorem 6 of Chap. II), it suffices to show that for a supermartingale X, the
stopped process X1 is a semimartingale. However for a partition т of [0,t],
= E{X0} - E{Xt}.
Therefore X1 is a quasimartingale, hence a semimartingale by Theorem 27. D
Corollary. A submartingale is a semimartingale.

128 III Semimartingales and Decomposable Processes
We saw in Chap. II that if X is a locally square integrable local martingale
and H ? L, then the stochastic integral H ¦ X is also a locally square inte-
integrable local martingale (Theorem 20 of Chap. II). Because of the corollary of
Theorem 25 we can now improve this result.
Theorem 29. Let M be a local martingale and let H G L. Then the stochastic
integral H ¦ M is again a local martingale.
Proof. A local martingale is a semimartingale by the corollary of Theorem 25
and Theorem 9 of Chap. II; thus H ¦ M is defined. By the Fundamental
Theorem of Local Martingales (Theorem 25) for /3 > 0 we can write M =
N + A where N, A are local martingales, the jumps of N are bounded by /3,
and A has paths of finite variation on compacts. Since N has bounded jumps,
by stopping we can assume N is bounded. Define T by
T = inf{t>0: / \dAs\ >m].
Jo
Then ?{/0*AT \dAs\} < m +/3 + E{\AMT\} < oo, and thus by stopping A can
0
be assumed to be of integrable variation. Also by replacing H by }
for an appropriate stopping time S we can assume without loss of generality
that H is bounded, since H is left continuous. We also assume without loss
that Mo — ЛГо = Ao = 0. We know H • N is a local martingale by Theorem 20
of Chap. II, thus we need show only that H ¦ A is a local martingale.
Let on be a sequence of random partitions of [0, t] tending to the identity.
Then Y2 Ht™ (At" - AT") tends to (H ¦ A)t in ucp, where on is the sequence
0 = Tq4 < Tf < • • • < T? < ¦ ¦ ¦. Let (nk) be a subsequence such that the
sums converge uniformly a.s. on [0,t]. Then
E { Г HudAu|Ts\ = E j lim VHTn
Uo J [ Пк ^
р1 - A^")\TS
= lim E \ У Нт«к (Лр1 - A?" )\TS
by Lebesgue's Dominated Convergence Theorem. Since the last limit above
equals (H ¦ A)s, we conclude that H ¦ A is indeed a local martingale. ?
A note of caution is in order here. Theorem 29 does not extend completely
to processes that are not in L but are only predictably measurable, as we will
see in Emery's example of a stochastic integral behaving badly on page 176
in Chap. IV.
Let X be a classical semimartingale, and let Xt = Xq + Mt + At be a
decomposition where Mq = Aq = 0, M is a local martingale, and A is an FV

7 Classical Semimartingales 129
process. Then if the space {?l,T, {Tt)t>o,P) supports a Poisson process N,
we can write
Xt=X0 + {Mt + Nt-t} + {At -Nt+t]
as another decomposition of X. In other words, the decomposition of a classi-
classical semimartingale need not be unique. This problem can often be solved by
choosing a certain canonical decomposition which is unique.
Definition. Let X be a semimartingale. If X has a decomposition Xt =
Xo + Mt + At with Mo = Ao = 0, M a local martingale, A an FV process,
and with A predictable, then X is said to be a special semimartingale.
To simplify notation we henceforth assume Xo = 0.
Theorem 30. If X is a special semimartingale, then its decomposition X =
M + A with A predictable is unique.
Proof. Let X = N + В be another such decomposition. Then M — N = B — A,
hence В — A is an FV process which is a local martingale. Moreover, В — A
is predictable, and hence constant by Theorem 12. Since Bo — Ao = 0, we
conclude В = A. О
Definition. If X is a special semimartingale, then the unique decomposition
X = M + A with Mo = Xo and Ao = 0 and A predictable is called the
canonical decomposition.
Theorem 15 shows that any quasimartingale is special. A useful sufficient
condition for a semimartingale X to be special is that X be a classical semi-
semimartingale, or equivalently decomposable, and also have bounded jumps.
Theorem 31. Let X be a classical semimartingale with bounded jumps. Then
X is a special semimartingale.
Proof. Let Xt = Xo + Mt + At be a decomposition of X with Mo = Aq = 0,
M a local martingale, and A an FV process. By Theorem 25 we can then also
write
Xt - Xo + Nt + Bt
where N is a local martingale with bounded jumps and В is an FV process.
Since X and TV each have bounded jumps, so also does B. Consequently, it is
locally a quasimartingale and therefore decomposes
B = L + B
where L is a local martingale and Б is a predictable FV process (Theorem 15).
Therefore
Xt = Xo + {Nt + Lt} + Bt
is the canonical decomposition of X and hence X is special. ?

130 III Semimaxtingales and Decomposable Processes
Corollary. Let X be a classical semimartingale with continuous paths. Then
X is special and in its canonical decomposition
the local martingale M and the FV process A have continuous paths.
Proof. X is continuous hence trivially has bounded jumps, so it is special by
Theorem 31. Since X is continuous we must have
AMt = -AAT
for any stopping time T (ААт — 0 by convention on {T = oo}). Suppose
A jumps at a stopping time T. By Theorem 3, Г = Гд Л Гв, where TA
is accessible and Tg is totally inaccessible. By Theorem 21 it follows that
P(\AAtb\ > 0) = 0. Hence without loss of generality we can assume T is
accessible. It then suffices to consider T predictable since countably many
predictable times cover the stopping time T. Let Sn be a sequence of stopping
times announcing T. Since A is predictable, we know by Theorem 24 that
ААт is Tt- measurable. Therefore AMt is also Тт- measurable. Stop M
so that it is a uniformly integrable martingale. Then
AMT = E{AMT\TT-}
= 0,
and M, and hence A, are continuous, using Theorem 7 of Chap. I. ?
Theorem 31 can be strengthened, as the next two theorems show. The
criteria given in these theorems are quite useful.
Theorem 32. Let X be a semimartingale. X is special if and only if the
process Jt = sups<t |AXe| is locally integrable.
Theorem 33. Let X be a semimartingale. X is special if and only if the
process X* = sups<t I-Xsl is locally integrable.
Before proving the theorems, we need a preliminary result, which is inter-
interesting in its own right.
Theorem 34. Let M be a local martingale and let Mt* = sups<t \MS\. Then
the increasing process M* is locally integrable.
Proof. Without loss of generality assume Mq = 0. Let Tn be a sequence
of stopping times increasing to oo such that MTn is a uniformly integrable
martingale for each n. Since we can replace Tn with TnAn if necessary, without
loss of generality we can assume that the stopping times Tn are each bounded.
Set Sn = TnAinft{|M4| > ra}. Since MTn is uniformly integrable, and Sn < Tn,
we have that Msn is integrable. But we also have that M?n < n V \Msn | which
is in L1. Since Sn increases to oo a.s., the proof is complete. ?

8 Girsanov's Theorem 131
Proof of Theorems 32 and 33. Since X is special the process A of its canonical
decomposition X = M + A is of locally integrable variation. Writing Xt* <
Mt* + /0 | dAs |, and since Mt* is locally integrable by Theorem 34 and since A is
of locally integrable variation, we have that X* is locally integrable. Further,
note that |AXt| = \Xt — Xt_ | < 2|Xt* |, hence Jt < X* and we have that Jt is
also locally integrable.
For the converse, it will suffice to show that there exists a decomposition
X = M + A where A is of locally integrable variation, since then we can take
the compensator of A and obtain a canonical decomposition. To this end note
that \AAS\ < \AMS\ + \AXS\ < 2M* + Js, which is locally integrable by the
hypotheses of Theorem 32 together with Theorem 34. Since Jt < Xf we have
that | A As | is locally integrable by the hypotheses of Theorem 33 together with
Theorem 34 as well. To complete the proof, let Tn = inf{f > 0 : Jo \dAs\ > n},
and note that /0 n |cL4s| = /0 n~ |cL4s| + |АЛТ„| < n + sups<Tn |A.AS|, which
is in L1 as soon as sups<Tn \AAS\ is. However this last term is locally in L1 as
we have seen, and the proof is complete with one more sequence of stopping
times Rn putting sups<fln | A^4S| in L1 and then constructing a new, single
sequence by taking minimums from the two sequences. ?
A class of examples of special semimartingales is Levy processes with
bounded jumps. By Theorem 31 we need only show that a Levy process Z
with bounded jumps is a classical semimartingale. This is merely Theorem 40
of Chap. I.
For examples of semimartingales which are not special, we need only to
construct semimartingales with finite variation terms that are not locally
integrable. For example, let X be a compound Poisson process with non-
integrable, nonnegative jumps. Or, more generally, let X be a Levy process
with Levy measure v such that /x xv(dx) = oo and/or /^ |a;|^(rfa;) = oo.
8 Girsanov's Theorem
Let X be a semimartingale on a space (il, J7, F, P) satisfying the usual hy-
hypotheses. We saw in Chap. II that if Q is another probability law on (?l,F),
and Q«P, then X is a Q semimartingale as well. If X is a classical semi-
semimartingale (or equivalently, if X is decomposable) and has a decomposition
X = M + A, M a local martingale and A an FV process, then it is often
useful to be able to calculate the analogous decomposition X — N + B, if it
exists, under Q. This is rather tricky unless we make a simplifying assumption
which usually holds in practice: that both Q < P and ?«Q.
Definition. Two probability laws P, Q on (fl, J7) are said to be equivalent
if P < Q and Q < P. (Recall that P«Q denotes that P is absolutely
continuous with respect to Q.) We write Q ~ P to denote equivalence.

132 III Semimartingales and Decomposable Processes
If Q -C P, then there exists a random variable Z in Lx(dP) such that
^ = Z and Ep{Z} = 1, where Ep denotes expectation with respect to the
law P. We let
be the right continuous version. Then Z is a uniformly integrable martingale
and hence a semimartingale (by the corollary of Theorem 26). Note that if Q
3p ) •
We begin with a simple lemma, the easy proof of which we leave as an
exercise for the reader.
Lemma. Let Q ~ P, and Zt = Ep < ^[p\J~t \- An adapted, cadlag process M
is a Q local martingale if and only if MZ is a P local martingale.
Proof. See Exercise 20. ?
Theorem 35 (Girsanov-Meyer Theorem). Let P and Q be equivalent.
Let X be a classical semimartingale under P with decomposition X — M + A.
Then X is also a classical semimartingale under Q and has a decomposition
X = L + C, where
/•* 1
Lt = Mt- / —d[Z,M]s
Jo Zs
is a Q local martingale, and С = X — L is a Q FV process.
Proof. Recall that by Theorem 2 of Chap. II it is trivial that X is a Q semi-
semimartingale. We need to show it is a classical semimartingale, with the above
decomposition being valid.
Since M and Z are P local martingales, they are semimartingales (corol-
(corollary of Theorem 26) and
Z-dM+ I M-dZ
is a local martingale as well (Theorem 29). Using integration by parts we have
ZM - [Z, M}= f Z_dM+ I M-dZ (*)
is also a P local martingale. Since Z is a version of Ep{^\Tt}, we have ;|-
is a cadlag version of Eq{^\Tt}\ therefore ^ is a Q semimartingale. Since
P <C Q, it is also a P semimartingale. Multiplying equation (*) by -| we have
(**)
Note that if we were to multiply the right side of (**) by Z we would obtain
a P local martingale. We conclude by the lemma preceding this theorem that

8 Girsanov's Theorem 133
M — (j^)[Z,M] is a Q local martingale. We next use integration by parts
(under Q):
j-d[z,M}
Let N — J[Z, M]-d (^). Since ^ is a Q local martingale, so also is N (The-
(Theorem 29). The above becomes:
+ Nt + [[Z,M], \\t
^j [Z,M]t = J* ~d[Z,
= f J-d[Z,
O As
Adding our two Q local martingales yields
N + M - ( \ ] [Z, M] = N + M - f -d[Z, M]-N
\ZJ J Z
= M-
which, being the sum of two Q local martingales, is itself one. This establishes
the theorem. ?
The Girsanov-Meyer Theorem has another version using the sharp bracket.
It transforms a P local martingale into a Q special semimartingale and gives its
canonical decomposition. However an extra integrability hypothesis is needed
for the theorem to be true. Note that in the predictable version the integrand
-%- is replaced by its left continuous version -^—.
Theorem 36 (Girsanov-Meyer Theorem: Predictable Version). Let X
be a P local martingale with Xq — 0. Let Q be another probability equivalent
to P and let Zt = E{-jp\J-t}. If (X,Z) exists for the P probability, then the
canonical Q decomposition of X is
Xt = (Xt- j ~d(X,Z)s)+ j
Jo "S- Jo
Proof. Using the same ideas as in the proof of Theorem 35, we want to find
a predictable finite variation term A such that Y = Z(X —A) is a P local
martingale. Using integration by parts yields
ZtXt = [ Zs_dXs + f Xs_dZs + [Z,X]
Jo Jo
= Nt + [Z,X}t-(X,Z)t + {X,Z)t
= Mt + (X,Z)t

134 III Semimartingales and Decomposable Processes
where both N = /„* Zs_dXs + /0*Xs-dZs and M = N + [Z,X] - {X, Z) are
P local martingales. Therefore ZX — {X, Z) is a P local martingale.
Let A = (X,Z), which of course is a predictable FV process. Moreover
we know that A does not jump at totally inaccessible times, and that we
can cover its jumps with predictable times. This implies that A is locally
bounded. Let Tn = ini{t > 0 : \At\ > n}, and let ATn~ be denned by
ATn~ = J l[o,Tn)(s)<?Aa. From now on we assume A = ATn~. Then A is still
an FV process and is still predictable, |Л| < n, and the stopping times Tn
increase to oo a.s. Integration by parts gives us also
1 /"* 1 /"*
-At = / ~-dAa+ /
At J0 ^s- JO
i]..
¦ If we show that [A, ^j is also a Q local martingale, we will be done. It suffices
to show that Eq{[A, ^]t} = 0 for every stopping time T. Since A is FV and
predictable, by stopping at T— if necessary, we can assume the jumps of A are
bounded. And since ^ is a Q local martingale, its supremum process is locally
integrable (Theorem 34). This is enough to give us that [A, j^] is Q locally
of integrable variation. By stopping we assume it is integrable. We then have
that A is Q natural, so if T is a stopping time, we replace A with AT and we
still have a natural process, and thus we have Eq{[A, ^}t} = 0. Since T was
arbitrary, this implies that [A, \\ is a martingale. Of course, the process is
implicitly stopped at several stopping times, so what we have actually shown
is that [A,^] is a Q local martingale.
We now have that
1 /"* 1
—-At = Q-local martingale + / dAs
Zt Jo Zs-
which in turn gives us, recalling that A = {X, Z),
t t
1 /¦* 1
— — Mt + Q-local martingale + / ——d(X, Z)s.
Zt Jo As-
Since Z(jjM) = M = P local martingale, we have that j^M is also a Q local
martingale. Thus finally Xt — /0* -^d(X, Z)s is a Q local martingale, and the
proof is done. ?
Let us next consider the interesting case where Q is absolutely continuous
with respect to P, but the two probability measures are not equivalent. (That
is, we no longer assume that P is also absolutely continuous with respect to
Q.) We begin with a simple result, and we will assume J-o = {0,^} a.s.
Theorem 37. Let X be a P local martingale with Xo = 0. Let Q be another
probability absolutely continuous with respect to P, and let Zt = ^

8 Girsanov's Theorem 135
Assume that (X,Z) exists for P. Then At = JQ -^d(X,Z)s exists a.s. for
the probability Q, and Xt — Jo -^^d(X, Z)s is a Q local martingale.
Proof. Zo = E{Z} = 1, so if we let Rn - ini{t > 0 : Zt < 1/n}, then
Rn increase to oo, Q-a.s., and the process -?— is bounded on [0, Rn]. By
Theorem 36, XRn - f* ^-d{X, Z)fn is a Q local martingale, each n. Since a
local, local martingale is a local martingale, we axe done. ?
We now turn to the general case, where we no longer assume the existence
of (X, Z), calculated with P. (As before, we take X to be a P local martingale.)
We begin by denning a key stopping time: R = inf{f > 0 : Zt = 0, Zt_ > 0}.
Note that Q(R < oo) = 0, but it is entirely possible that P{R < oo) > 0. We
further define Ut — Д^д1{*>д}- Then U is an FV process, and moreover U
is locally integrable (dP). Let Tn increase to oo and be such that XTn is a
uniformly integrable martingale. Then
|Д^я = 1^Т„ЛД - X(TnAR)-\ < |^Т„ЛД| + \X(TnAR)-\ ? L ¦
Thus U has a compensator U, and of course U is predictable and U — U is a
P local martingale.
Theorem 38 (Lenglart-Girsanov Theorem). Let X be a P local martin-
martingale with Xq = 0. Let Q be a probability absolutely continuous with respect
to P, and let Zt = EP{^\Tt}, R = ini{t > 0 : Zt = 0, Zt_ > 0}, and
Ut = kXRl{t>R}. Then
is a Q local martingale.
Proof. Let Rn = inf{t > 0 : Zt < A}. (Recall that Zo = 1, and also note
that it is possible that Rn = R.) Then both XRn and ZRn are P local
martingales. Also note that ARn = /0* -ф^1{2нп>0^[Хк«,ZR«)S, URn, and
уд„ = XRn _ ^Д„ + [/я. аге ац p-well-defined. We can define
/* 1
At = / -^1{Z,:
JO ^s
on [0,R), since d[XRn,ZRn]s does not charge (R,oo), and —h:^szRn>o] = ®
at R. Thus we need only to show YRn is a Q local martingale for each fixed
n, which is the same as showing that ZRnYRn is a P local martingale. Let us
assume all these processes are stopped at Rn to simplify notation. We have
ZY = ZX -ZA + ZU.

136 III Semimartingales and Decomposable Processes
Hence,
d{ZX) =Z^dX + X-dZ + d[Z, X]
= local martingale + d[Z, X]
At Л У\ Л АУ _l 7 А Л
U\s\?j I — s\ LiZj -}- ZjUSI.
= local martingale + ZdA
= local martingale + l^z>Oyd[X, Z]
d{ZU) = Z-dU + UdZ
= local martingale + Z^dU
= local martingale + Z^dU
where the last equality uses that U—U is a local martingale (dP). Summarizing
we have
ZY = ZX -ZA + ZU
= local martingale + \Z, X] — local martingale
+ / ~L{z>o}d[X,Z\) + local martingale + / Z_dU
which we want to be a local martingale under dP. This will certainly be the
case if
d[Z,X] - l{Z>oyd[X,Z] + Z_dU = 0. (*)
However (*) equals
But AZr = Zr — Zr- = 0 — Zr- = —Zr-, and this implies that equation (*)
is indeed zero, and thus the Lenglart-Girsanov Theorem holds. ?
Corollary. Let X be a continuous local martingale under P. Let Q be abso-
absolutely continuous with respect to P. Then (X,Z) = [Z,X] = \Z,X]C exists,
and
Xt - f -l-d[Z,X}cs = Xt - I asd[X,X]s
JO ^s- JO
which is a Q local martingale.
Proof. By the Kunita-Watanabe inequality we have
which shows that it is absolutely continuous with respect to d[X, X]s a.s.,
whence the result. ?

8 Girsanov's Theorem 137
We remark that if Z is the solution of a stochastic exponential equation
of the form dZs = Zs-HsdXs (which it often is), then as = Hs.
Example. A problem that arises often in mathematical finance theory is
that one has a semimartingale S = M + A denned on a filtered probability
space (O,.F,F, P) satisfying the usual hypotheses, and one wants to find an
equivalent probability measure Q such that under Q the semimartingale X is
a local martingale, or better, a martingale. In essence this amounts to finding
a probability measure that "removes the drift." To be concrete, let us suppose
S is the solution of a stochastic differential equation3
dSs = h(s, Ss)dBs + b(s; Sr;r < s)ds,
where В is a standard Wiener process (Brownian motion) under P. Let us
postulate the existence of a Q and let Z = ^ and Zt = E{Z\Tt}, which is
clearly a cadlag martingale. By Girsanov's Theorem
f h(s,Ss)dBs- f ^rd[Z, f h(r,Sr)dBr]s
J0 Jo "s Jo
is a Q local martingale. We want to find the martingale Z. In Chap. IV
we will study martingale representation and show in particular that every
local martingale on a Brownian space is a stochastic integral with respect to
Brownian motion. Thus we can write Zt = 1 + JQ JsdBs for some predictable
process J. If we assume Z is well behaved enough to define Hs = -^f-, then
we have Zt = 1 + /0 HsZsdBs, which gives us a linear stochastic differential
equation to solve for Z. Thus if we let Nt = jQ HsdBs, we get that Zt =
?(N)t.4 It remains to determine H. We do this by observing from our previous
Girsanov calculation that
f h(s,Ss)dBs- f ^rZsHsh(s,Ss)ds= f h(s,Ss)dBs- f Hsh(s,Ss)ds
Jo JO 6S Jq Jq
is a Q local martingale. We then choose Hs = —¦ ' r'—r=—-, which yields
h(s,Ss)
St= I h(s,Ss)dBs+ I b{s;Sr;r<s)ds
Jo Jo
[*" b(s' S ¦ r < s)
is a local martingale under Q. Letting Mt = Bt+ I ' /' ~—-ds denote
Jo 4S> ss)
this Q local martingale, we get that [M, M]t — [B, B]t = t, and by Levy's
Theorem M is a Q-Brownian motion. Finally, under Q we have that S satisfies
the stochastic differential equation
dSt = h(t,St)dMt.
3 Stochastic differential equations are introduced and studied in some detail in
Chap. V.
4 The stochastic exponential E is defined on page 85.

138 III Semimartingales and Decomposable Processes
There is one problem with the preceding example: we do not know a priori
whether our solution Z is the Radon-Nikodym density for simply a measure
Q, or whether Q is an actual bona fide probability measure. This is a constant
problem. Put more formally, we wish to address this problem:
Let M be a local martingale. When is ? (M) a martingale?
The only known general conditions that solve this problem are Kazamaki's
criterion and Novikov's criterion.5 Moreover, these criteria apply only to lo-
local martingales with continuous paths. Novikov's is a little less powerful than
Kazamaki's, but it is much easier to check in practice. Since Novikov's crite-
criterion follows easily from Kazamaki's, we present both criteria here. Note that
if M is a continuous local martingale, then of course ?(M) is also a continuous
local martingale. Even if, however, it is a uniformly integrable local martin-
martingale, it still need not be a martingale; we need a stronger condition. As an
example, one can take u(x) = \\x\\~1 and Nt = u(Bt) where В is standard
three dimensional Brownian motion. Then N is a uniformly integrable local
martingale but not a martingale.
Nevertheless, whenever M is a continuous local martingale, then ?{M) is a
positive supermartingale, as is the case with any nonnegative local martingale.
Since ?(M)q = 1, and since ?(M) is a positive supermartingale, we have
E{?(M)t} < 1, all t. (See the lemma below.) It is easy to see that Z is a
true martingale if one also has E{?(M)t} = 1, for all t. We begin with some
preliminary results.
Lemma. Let M be a continuous local martingale with Mo = 0. Then
E{?(M)t} < 1 for all t > 0.
Proof. Recall that ?(M)o = 1. Since M is a local martingale, ?(M) is a
nonnegative local martingale. Let Tn be a sequence of stopping times reducing
?{M). Then E{?(M)tATn} = 1, and using Fatou's Lemma,
E{?(M)t} = Е{Итт{?(М)глТп} < liminiE{?(M)tATn} = 1. ?
Theorem 39. Let M be a continuous local martingale. Then
Proof.
which implies that
5 There also exist partial results when the local martingale is no longer continuous
but only has cadlag paths. See Exercise 14 of Chap. V for an example of these
results.

8 Girsanov's Theorem 139
and this together with the Cauchy-Schwarz inequality and the fact that
E{?(M)t} < 1 gives the result. D
Lemma. Let M be a continuous local martingale. Let 1 < p < oo, ^ +1 = 1.
Taking the supremum below over all bounded stopping times, assume that
s\rpE\eK2^-l!MT\ < oo.
т
Then ?(M) is an Lq bounded martingale.
Proof. Let 1 < p < oo and r = y-_1 ¦ Then s = ^~— and ^ + -j — 1. Also we
note that (g — y/lj:)s — 2i /-'-n which we use in the last equality of the proof.
We have
?{M) =
We now apply Holder's inequality for a stopping time S:
E{?(M)%} = E
Recalling that E{?(.y/qrM)s} < 1, we have the result. D
Theorem 40 (Kazamaki's Criterion). Let M be a continuous local mar-
martingale. Suppose supT Е{е^Мт)у < oo, where the supremum is taken over all
bounded stopping times. Then ?{M) is a uniformly integrable martingale.
Proof. Let 0 < a < 1, and p > 1 be such that tjp_u < ?• Our hypothesis
combined with the preceding lemma imply that ?(aM) is an LQ bounded
martingale, where - + - = 1, which in turn implies it is a uniformly integrable
martingale. However
?(aM) = e°-M
= ?(M)a2 ea{1~a)M,
and using Holder's inequality with a~2 and A - a2) yields (where the 1 on
the left side comes from the uniform integrability):

140 III Semimartingales and Decomposable Processes
Now let a increase to 1 and the second term on the right side of the last
inequality above converges to 1 since 2a(l — a) —> 0. Thus 1 < E{?(M)OO},
and since we know that it is always true that 1 > ?{?(M)OO}, we are done. ?
As a corollary we get the very useful Novikov's criterion. Because of its
importance, we call it a theorem.
Theorem 41 (Novikov's Criterion). Let M be a continuous local martin-
martingale, and suppose that
Then ?{M) is a uniformly integrable martingale.
Proof. By Theorem 39 we have Е{е?Мт} < Е{е^м'м^т}ъ, and we need only
to apply Kazamaki's criterion (Theorem 40). ?
We remark that it can be shown that 1/2 is the best possible constant
in Novikov's criterion, even though Kazamaki's criterion is slightly stronger.
Note that in the case of the example treated earlier, we have [N, N]t = JQ H^ds
where H = ^'Лгт''. Clearly whether or not \ [N, N]t has a finite exponential
moment will depend on our choice of b and of h. A standard simple solution
is to choose b and h so that b/h is bounded.
Partial results exist that give an analogue of Novikov's criterion in the
general (cadlag) case. See for example Exercise 14 of Chap. V, and the
sources [156] and [157].
Example. This example is taken from statistical communication theory. Sup-
Suppose we are receiving a signal corrupted by noise, and we wish to determine
if there is indeed a signal, or if we are just receiving noise (e.g., we could be
searching for signs of intelligent life in our galaxy with a radio telescope). Let
x(t) be the received signal, ?(i) the noise, and s(t) the actual (transmitted)
signal. Then
A frequent assumption is that the noise is "white." A white noise is usually
described as a second order wide sense stationary process with a constant
spectral density function (that is, E{?t} = 0, and V(r) = E{XtXt+T} = 5TS0,
where ST is the Dirac delta function at r, and So is the constant spectral
density; one then has S(v) = /^ el2nuTVг(т)йт). Such a process does not exist
in a rigorous mathematical sense. Indeed it can be interpreted as the derivative
of the Wiener process, in a generalized function sense. (See Arnold [2, page
53], for example.) This suggests that we consider the integrated version of our
signal-noise equation:
[
Jo
Xt= [ s(u)du + Wt
J

8 Girsanov's Theorem 141
where IF is a standard Wiener process and where St is thought of as
"/0 x(s)ds," the cumulative received signal.
The key step in our analysis is the following consequence of the Girsanov-
Meyer Theorem.
Theorem 42. Let W be a standard Brownian motion on (fi,.F,F, P), and
let H G L be bounded. Let
I*
Hsds + Wt
and define Q by ^ = exp{/0 -HsdWs — \j0 H2ds}, for some T > 0. Then
under Q, X is a standard Brownian motion for 0 < t < T.
Proof. Let ZT = exp{/0T -HsdWs - \ JQT H2ds}. Then if Zt = E{ZT\Tt} we
know by Theorem 37 of Chap. II that Z satisfies the equation
Zt = l- f Zs-HsdWs.
Jo
By the Girsanov-Meyer Theorem (Theorem 35), we know that
,t 1
Nt = Wt- / — d[Z,W]s
Jo ^s
is a Q local martingale. However
[Z, W]t = \-Z_H ¦ W, W\t = f -ZsHsd[W, W]s
Jo
f*
= — I ZsHsds,
Jo
since [W, W]t = t for Brownian motion. Therefore
Г —
Jo %s
pt
= Wt+ Hsds
Jo
hence X is a Q local martingale. Since (Jo Hsds)t>o is a continuous FV pro-
process we have that [X, X]t = [W, W]t = t, and by Levy's Theorem (Theorem 39
of Chap. II) we conclude that X is a standard Brownian motion. D
Corollary. Let W^bea standard Brownian motion and H G L be bounded.
Then the law of
Xt= / Hsds + Wt,
Jo
0 < t < T < oo, is equivalent to Wiener measure.

142 III Semimartingales and Decomposable Processes
Proof. Let C[0,T] be the space of continuous functions on [0, T\ with values
in M (such a space is called a path space). If W = (Wt)o<t<T is a standard
Brownian motion, it induces a measure fiy/ on C[0, T):
fiw(A) = P{w : t » Wt{w) e A}.
Let fix be the analogous measure induced by X. Then by Theorem 42 we
have fix ~ f*w and further we have
= exp ( Г -HsdWs - \ [H>ds\ . D
[J?2 J J
Remark. We have not tried for maximum generality here. For example the
hypothesis that H be bounded can be weakened. It is also desirable to weaken
the restriction that H G L. Indeed we only needed that hypothesis to be able
to form the integral /0 HsdWs. This is one example to indicate why we need
a space of integrands more general than L.
We are now in a position to consider the problem posed earlier: is there
a signal corrupted by noise, or is there just noise (that is, does s(t) = 0 a.e.,
a.s.)? In terms of hypothesis testing, let Ho denote the null hypothesis, Hx
the alternative. We have:
Ho : XT = WT
HX:XT= f Hsds + WT-
Jo
We then have
= exp I / -HsdWs -- H*ds ,
'o z Jo J
by the preceding corollary. This leads to a likelihood ratio test:
if -r-^-(w) < A, reject Ho,
dfix
if -t^-(oj) > A, fail to reject Ho,
dfix
where the threshold level A is chosen so that the fixed Type I error is achieved.
To indicate another use of the Girsanov-Meyer Theorem let us consider
stochastic differential equations. Since stochastic differential equations6 are
treated systematically in Chap. V we are free here to restrict our attention to
a simple but illustrative situation. Let W be a standard Brownian motion on a
space {?l,J-,F,P) satisfying the usual hypotheses. Let fi(w,s,x) be functions
satisfying (г = 1,2):
6 Stochastic "differential" equations have meaning only if they are interpreted as
stochastic integral equations.

9 The Bichteler-Dellacherie Theorem 143
(i) \fi(w,s,x) - fi(u,s,y)\ < K\x-y\ for fixed (w,s);
(ii) /;(-,s,:r)e.Fs for fixed (s,:r);
(iii) fi(w,-,x) is left continuous with right limits for fixed (w,x).
By a Picard-type iteration procedure one can show there exists a unique so-
solution (with continuous paths) of
Xt = X0+ f f1(-,s,Xs)dWs + f f2{.,s,Xs)ds. (***)
Jo Jo
The Girsanov-Meyer Theorem allows us to establish the existence of solutions
of analogous equations where the Lipschitz hypothesis on the "drift coeffi-
coefficient" /2 is removed. Indeed if X is the solution of (***), let 7 be any
bounded, measurable function such that -y(w,s,Xs) e L. Define
g(w,s,x) = f2(u>,s,x) +f1(w,s,x)'y(s,u,x).
We will see that we can find a solution of
= Y0+ [ M-,a,Ya)dBa+ f g(-,s,Ys
Jo Jo
)ds
provided we choose a new Brownian motion В appropriately.
We define a new probability law Q by
dQ ( fT 1 fT \
-^=exp / ~f{s,Xa)dWa-- / -y(s,XsJds .
«^ \Jo г Jo )
By Theorem 42 we have that
Bt = Wt- [ i(s,Xs)ds
Jo
is a standard Brownian motion under Q. We then have that the solution X
of (* * *) also satisfies
Xt=X0+ f h(;S,Xs)dBs+ f (h + fl<y)(;8,Xa)d8
Jo Jo
= X0+ [ fi(;8,Xa)dBa+ f g(;8,Xa)d8,
Jo Jo
which is a solution of a stochastic differential equation driven by a Brownian
motion, under the law Q.
9 The Bichteler-Dellacherie Theorem
In Sect. 7 we saw that a classical semimartingale is a semimartingale. In this
section we will show the converse.

144 III Semimartingales and Decomposable Processes
Theorem 43 (Bichteler-Dellacherie Theorem). An adapted, cadlag pro-
process X is a semimartingale if and only if it is a classical semimartingale. That
is, X is a semimartingale if and only if it can be written X = M + A, where
M is a local martingale and A is an FV process.
Proof. The sufficiency is exactly Theorem 26. We therefore establish here only
the necessity.
Since X is cadlag, the process Jt = Ylo<s<t AXsl{\AXa\>i} has paths of
finite variation on compacts; hence it is an FV process. Let Y — X — J.
Then Y has bounded jumps. If we show that X is a classical semimartingale
on [O,io], some io, then Y = X — J is one as well; moreover У is special
by Theorem 31. Let Y = Yo + M + A be the canonical decomposition of Y.
Suppose ii > t0 and X is also shown to be a classical semimartingale on [0, ti].
Then let Y = Yo + N + В be the canonical decomposition of Y on [0, ti]. We
have that, for t < to,
M -N = B~~A~
by subtraction. Thus В — A is a predictable, FV process which is a local
martingale; hence by Theorem 12 we have В — A. Thus if we take tn increasing
to oo and show that X is a classical semimartingale on [0, tn], each n, we have
that X is a classical semimartingale on [0, oo).
We now choose u0 > 0, and by the above it suffices to show X is a classical
semimartingale on [0, щ]. Thus it is no loss to assume X is a total semimartin-
semimartingale on [0, uo] ¦ We will show that X is a quasimartingale, under an equivalent
probability Q. Rao's Theorem (Theorem 15) shows that X is a classical semi-
semimartingale under Q, and the Girsanov-Meyer Theorem (Theorem 35) then
shows that X is a classical semimartingale under P.
Let us take H e S of the special form:
n-l
^t = 2j^»1№,Ti+i] (*)
i=0
where 0 = To < T\ < ¦ ¦ ¦ < Tn-\ < Tn = u0. In this case the mapping Ix is
given by
IX{H) = (H ¦ X)uo = H0(XTl -Xo) + --- + tfn_!(XU0 -Xr,.,).
The mapping Ix : Su —> L° is continuous, where L° is endowed with the
topology of convergence in probability, by the hypothesis that X is a total
semimartingale. Let
В = {H e S : H has a representation (*) and \H\ <1}.
Let /3 = Ix{B), the image of В under Ix- It will now suffice to find a
probability Q equivalent to P such that Xt e ^(dQ), 0 < t < u0 and
such that supuep Eq(U) = с < oo. The reason this suffices is that if we
take, for a given 0 — to < t\ < ¦ ¦ ¦ < tn = uq, the random variables

9 The Bichteler-Dellacherie Theorem 145
Ho = sign(EQ{Xtl - X0\F0}), #! = sign(?Q{Xt2 - Xu |.Ftl}),..., we have
that for this H G B,
T? ( т 1 TT\\ Z?flZ?fV v If II IP fv V 'XT Л\Л
^Q^xy^i)) = &Q\\&Q\Ato - Atl|y-Oj| + • • • + \?iQ\Atnl — Лгп А„_1/|/-
Since this partition r was arbitrary, we have Var(X) — suprVarr(X) <
suPuep ^q(^) = c < °o> and so X is a Q quasimartingale.
Lemma 1. lim sup P(\Y\ > c) = 0.
c—ooye/3
Proof of Lemma 1. Suppose linic^ooSupyg^Pdyi > c) > 0. Then there ex-
exists a sequence с„ tending to oo, Yn G C, and a > 0 such that Р(|У^| > d) > a,
all n. This is equivalent to
Since Yn G C, there exists tf" G В such that /х(Яп) = Yn. Then Ix{±-Hn) =
-1-1х{Нп) = -^У„ 6 /3, if с„ > 1. But ^Я" tends to 0 uniformly a.s. which
implies that Ix{^-Hn) = — Yn tends to 0 in probability. This contradicts
(*)• " " ?
Lemma 2. There exists a law Q equivalent to P such that Xt G Lx{dQ),
0<t<u0.
Proof of Lemma 2. Let Y = sup0<t<uo\Xt\. Since X has cadlag paths,
Y < oo a.s. Moreover if D is a countable dense subset of [0,щ], then
У = supteZ) |Xt|. Hence У is a random variable. Let Am — {m < У < m+1},
and set Z = Em=o^mW' Then Z is bounded, strictly positive, and
YZ 6 L^dP). Define Q by setting $ = ^^yZ. Then ?д{|Х4|} <
= Ep{YZ}/EP{Z} < oo. Hence EQ{|Xt|} < oo, 0 < t < u0. П
Observe that C С L1(dQ) for the law Q constructed in Lemma 2. Lemma 3
below implies that X is an R quasimartingale for R ~ Q ~ P. Hence by
Rao's Theorem (Theorem 15) it is a classical R semimartingale, and by the
Girsanov-Meyer Theorem (Theorem 35) it is a classical semimartingale for
the equivalent law P as well. Thus Lemma 3 below will complete the proof of
Theorem 43. We follow Yan [233].
Lemma 3. Let C be a convex subset of L1(dQ), 0 € /3, that is bounded in
probability. That is, for any e > 0 there exists а с > 0 such that <3(? > c) < e,
for any С G C. Then there exists a probability R equivalent to Q, with a
bounded density, such that supue/3 Eu(U) < oo.
Proof of Lemma 3 and end of proof of Theorem 43. To begin, note that the
hypotheses imply that C С L1(dR). What we must show is that there ex-
exists a bounded random variable Z, such that P(Z > 0) = 1, and such that
suPce/3 eq(z0 <oo.

146 III Semimartingales and Decomposable Processes
Let ief such that Q(A) > 0. Then there exists a constant d such that
Q(( > d) < Q(A)/2, for all ( e /3, by assumption. Using this constant d,
let с = 2d, and we have that 0 < c\A ф /3, and moreover if B+ denotes all
bounded, positive r.v., then clA is not in the Lx{dQ) closure oif3—B+, denoted
C-B+. That is, clA ? /3 - B+. Since the dual of L1 is L°°, and /3 - B+ is
convex, by a version of the Hahn-Banach Theorem (see, e.g., Treves [223, page
190]) there exists a bounded random variable Y such that
sup EQ{Y(C-v)}<cEQ{YlA}. (*)
CB
Replacing ry by al{y<o} and letting a tend to oo shows that Y > 0 a.s., since
otherwise the expectation on the left side above would get arbitrarily large.
Next suppose r\ = 0. Then the inequality above gives
supEQ{Y(} < cEq{YIa} < +oo.
Now set H = {Y G B+ : supCe/3 Eq{YQ < oo}. Since 0 G B+, we know H
is not empty. Let Л = {all sets of the form {Z = 0},Z 6 H}. We wish to
show that there exists a Z € 7i such that Q(Z — 0) — inf^e^ Q(A). Suppose,
then, that Zn is a sequence of elements of 7i. Let Cn — sup^e/3 E{Zn?} and
dn = ||ZnlU°°- (Since 0 G /3, we have с„ > 0). Choose Ь„ such that ХЖС« <
oo and J2bndn < oo, and set Z = ^2bnZn. Then clearly Z G W. Moreover,
{Z = 0} = f]n{Zn = 0}. Thus Л is stable under countable intersections, and
so there exists a Z such that Q(Z = 0) = inf^e^ Q(A).
We now wish to show Z > 0 a.s. Suppose not. That is, suppose Q(Z =
0) > 0. Let У satisfy (*) (we have seen that there exists such a Y and that
it hence is in Ti). Further we take for our set A in (*) the set A = {Z = 0},
for which we are assuming Q(A) > 0. Since 0 G /3 and 0 G B+, we have from
Lemma 2 that
} = E{Yl{z=0}}.
Since each of Y and Z are in 7i, their sum is in 7i as well. But then the above
implies
Q{Y + Z = 0} = Q{Z = 0} - Q({Z = 0} П {Y > 0}) < Q(Z = 0).
This, then, is a contradiction, since Q(Z = 0) is minimal for Z G 7i. Therefore
we conclude Z > 0 a.s., and since Z G B+, it is bounded as well, and Lemma 3
is proved; thus also, Theorem 43 is proved. D
We state again, for emphasis, that Theorems 26 and 43 together allow
us to conclude that semimartingales (as we have defined them) and classical
semimartingales are the same.

Exercises for Chapter III 147
Bibliographic Notes
The material of Chap. Ill comprises a large part of the core of the "general
theory of processes" as presented, for example in Dellacherie [41], or alterna-
alternatively Dellacherie and Meyer [45, 46, 44]. We have tried once again to keep the
proofs non-technical, but instead of relying on the concept of a natural process
as we did in the first edition, we have used the approach of R. Bass [11], which
uses the P. A. Meyer classification of stopping times and Doob's quadratic in-
inequality to prove the Doob-Meyer Theorem (Theorem 13), the key result of
the whole theory.
The Doob decomposition is from Doob [55], and the Doob-Meyer decom-
decomposition (Theorem 13) is originally due to Meyer [163, 164]. The theory of
quasimartingales was developed by Fisk [73], Orey [187], К. М. Rao [207],
Strieker [218], and Metivier-Pellaumail [159].
The treatment of compensators is new to this edition. The simple example
of the compensator of a process with one jump of size one (Theorem 17), dates
back to 1970 with the now classic paper of Dellacherie [40]. The case of many
jumps (Theorem 19) is due to C. S. Chou and P. A. Meyer [30], and can be
found in many texts on point processes, such as [24] or [139]. The example of
hazard rates comes from Fleming and Harrington [74].
The Fundamental Theorem of Local Martingales is due to J. A. Yan and
appears in an article of Meyer [172]; it was also proved independently by
Doleans-Dade [51]. Le Jan's Theorem is from [112].
The notion of special semimartingales and canonical decompositions is due
to Meyer [171]; see also Yoeurp [234]. The Girsanov-Meyer theorems (The-
(Theorems 35 and 36) trace their origin to the 1954 work of Maruyama [152],
followed by Girsanov [83], who considered the Brownian case only. The two
versions presented here are due to Meyer [171], and the cases where the two
measures are not equivalent (Theorems 37 and 38) are due to Lenglart [142].
The example from finance theory (starting on page 137) is inspired by [203].
Kazamaki's criterion was published originally in 1977 [124]; see also [125],
whereas Novikov's condition dates to 1972 [184].
The Bichteler-Dellacherie Theorem (Theorem 43) is due independently to
Bichteler [13, 14] and Dellacherie [42]. It was proved in the late 1970's, but
the first time it appeared in print was when J. Jacod included it in his 1979
tome [103]. Many people have made contributions to this theorem, which had
at least some of its origins in the works of Metivier-Pellaumail, Mokobodzki,
Nikishin, Letta, and Lenglart. Our treatment was inspired by Meyer [176] and
by Yan [233].
Exercises for Chapter III
Exercise 1. Show that the maximum and the minimum of a finite number
of predictable stopping times is still a predictable stopping time.

148 Exercises for Chapter III
Exercise 2. Let S be a totally inaccessible stopping time, and let T = S + 1.
Show that Г is a predictable stopping time.
Exercise 3. Let S, T be predictable stopping times. Let Л = {S = T}. Show
that Sa. is a predictable stopping time.
Exercise 4. Let (Q,T, F, P) be a filtered probability space satisfying the
usual hypotheses. Show that the predictable cr-algebra (on K+ x Q) is con-
contained in the optional cr-algebra. (Hint: Show that a cadlag, adapted process
can be approximated by processes in L.)
Exercise 5. Let S, T be stopping times with S < T. Show that (S, T] =
{{t,u) : S(u) < t < T(u)} is a predictable set. Show further that [S,T)
and (S,T) are optional sets. Last, show that if T is predictable, then (S, T)
is a predictable set, and if both S and T are predictable, then \S,T) is a
predictable set.
Exercise 6. Let Г be a predictable stopping time, and let (Sln)n>i be a
sequence of stopping times announcing T. Show that Тт- — Vn Fsn-
Exercise 7. A filtration F is called quasi left continuous if for every pre-
predictable stopping time T one has Тт = -Яг--7 Show that if F is a quasi left
continuous filtration, then whenever one has a non-decreasing sequence of
stopping times (Sn)n>\, with ИгПп^оо Sn = T, it follows that Тт = Vn ^~sn-
(Note: One can have a quasi left continuous filtration such that there exists
a stopping time T with Тт- ф J~t-)
Exercise 8. Show that the natural completed filtration of a Levy process is
a quasi left continuous filtration.
*Exercise 9. Let X be a semimartingale with E{[X, X]^} < oo and suppose
X has a decomposition X = Xq -f M + A, with A predictable. Show that
E{[A, A]oo} < ?{[X,X]oo}. (Hint: Recall that if T is a predictable stopping
time, then
ДЛт1{т<оо} =?{ДХг1{т<оо}|Яг-})
and also that the set {(t,u>) : AAt(u>) Ф 0} is a countable disjoint union of
sets of the form {(t,uj) : Т»(ш) = <}, where the stopping times (Tj)j>j are
predictable.)
Exercise 10. Let X and Y be semimartingales with the processes [X, X] and
[Y, Y] locally integrable. Prove the following "sharp bracket" version of the
Kunita-Watanabe inequality, for jointly measurable processes H, K:
f \HsKs\\d(X,Y)s\ < ( I H2sd(X,X)s)^( f K2sd(Y,Y)s)i
Jo Jo Jo
a.s.
(Warning: Recall that (X,Y) can exist as the compensator of [X,Y] even
when (X,X) and (Y,Y) do not exist.)
7 A quasi left continuous filtration is formally defined in Chap. IV on page 189.

Exercises for Chapter III 149
Exercise 11. Show that if A is a predictable finite variation process, then it
is of locally integrable variation. That is, show that there exists a sequence of
stopping times (Tn)n>i such that E{J0 " |cL4s|} < oo.
Exercise 12. Let TV be a counting process with its minimal completed fil-
filtration, with independent interarrival times. Show that the compensator A of
TV has absolutely continuous paths if and only if the cumulative distribution
functions of the interarrival times are absolutely continuous.
Exercise 13. Let T be an exponential random variable and Nt = l{t>T}-
Let To = cr{T} and suppose the filtration F is constant: Ft — Ро for all
t > 0. Show that the compensator A of TV is At = 1{<>т}- (This illustrates
the importance of the filtration when calculating the compensator.)
Exercise 14. Let TV be a compound Poisson process, Nt = ^Zj>i Uil{t>Ti}
where the times Tj are the arrival times of a standard Poisson process with
parameter A and the ?/, are i.i.d. and independent of the arrival times. Suppose
JS{|?/j|} < °° and E{Ui} — ц. Show that the compensator A of TV for the
natural filtration is given by At = A//i.
Exercise 15. Show that if T is exponential with parameter A, its hazard rate
is constant and equal to A. Show also that if R is Weibull with parameters
{a, 13), then its hazard rate is \(t) = aft"?*-1.
Exercise 16. Let T be exponential with parameter A and have joint distri-
distribution with U given by P(T > t,U > s) = exp{-At - /xs - 6ts} for t > 0,
s > 0, where A, //, and в are all positive constants and also в < A//. Show that
the crude hazard rate of (T, U) is given by A* (t) = A + 0t.
*Exercise 17. Let M be a martingale on a filtered space where the filtra-
filtration is quasi left continuous. Show that (M,M) is continuous. (Hint: See the
discussion on quasi left continuous filtrations on page 189 of Chap. IV.)
Exercise 18. Let X be a semimartingale such that the process Dt =
sups<t|AX,| is locally integrable. Show that every decomposition of X,
X = Xo + M + C, has that the process С is of locally integrable variation.
Exercise 19. Suppose that X is locally special, that is, there exists a
sequence (Tn)n>i of stopping times increasing to oo a.s. such that XTn =
Mn + An, with An a predictable finite variation process, for each n. Show
that X is special. (That is, о semimartingale which is locally special is also
special.)
Exercise 20. Prove the Lemma on page 132. Let Q ~ P, and Zt =
Ep I jp \Ft \ ¦ Show that an adapted, cadlag process M is a Q local martingale
if and only if MZ is a P local martingale.
*Exercise 21. Let F с G be filtrations satisfying the usual hypotheses, and
let X be a G quasimartingale. Suppose that Yj = E{Xt\J-t}, and that Y can
be taken cadlag (this can be proved to be true, although it is a little hard to

150 Exercises for Chapter III
prove). Show that Y is also a quasimartingale for the filtration F. (Hint: Use
Rao's Theorem.)
*Exercise 22. Suppose that A is a predictable finite variation process with
?{[A, A\1} < oo, and that M is a bounded martingale. Show that [A, M] is
a uniformly integrable martingale.
Exercise 23. Let Г be a strictly positive random variable, and let Ft =
a{T Л s; s < t}. Show that (J-t)t>o is the smallest filtration making T a
stopping time.
Exercise 24. Let Z be a cadlag supermartingale of Class D with Zq = 0 and
suppose for all predictable stopping times T one has ?^{А^т|Яг-} = О, a.s.
Show that if Z = M — A is the unique Doob-Meyer decomposition of Z, then
A has continuous paths almost surely.
Exercise 25. Let A be an increasing process of integrable variation, and let
Г be a predictable jump time of A such that Е{ААт\Тт-} = О. Then its
compensator A is continuous at T. (This exercise complements Theorem 21.)
*Exercise 26. A supermartingale Z is said to be regular if whenever a
sequence of stopping times (Tn)n>i increases to T, then limn_,00.E{iy7'Ti} =
E{Zt}- Let Z be a cadlag supermartingale of Class D with Doob-Meyer
decomposition Z = M — A. Show that A is continuous if and only if Z is
regular.
*Exercise 27 (approximation of the compensator by Laplacians). Let
Z be a cadlag positive supermartingale of Class D with \hat^ooE{Zt} — 0.
(Such a supermartingale is called a potential.) Let Z — M — A be its Doob-
Meyer decomposition and assume further that A is continuous. Define
At=\j {Zs - E{Zs+h\fs})ds
Show that for any stopping time T, lim/j_»o AT = At with convergence in L1.
*Exercise 28. Let Z be a cadlag positive supermartingale of Class D with
lim^oo E{Zt} = 0. Let Z = M — A be its Doob-Meyer decomposition. Let Ah
be as given in Exercise 27. Show that for any stopping time T, lim/i_,o AT —
At, but in this case the convergence is weak for L1; that is, the convergence
is in the topology cr(Ll,L°°).8
Exercise 29 (discrete Laplacian approximations). Let Z be a cadlag
positive supermartingale of Class D with lim^oo E{Zt} = 0. Let Z = M - A
be its Doob-Meyer decomposition and assume further that A is continuous.
Define
Xn converges to X in ^(Z/1,!/00) if Xn, X are in Ll and for any a.s. bounded
random variable Y, E(XnY) ->• E(XY).

Exercises for Chapter III 151
k=0
Show that limn^oo A1^ = Аж with convergence in L1.
Exercise 30. Use Meyer's Theorem (Theorem 4) to show that if X is a strong
(Markov) Feller process for its natural completed filtration FM, and if X has
continuous paths, then the filtration F^ has no totally inaccessible stopping
times. (This implies that the natural filtration of Brownian motion does not
have any totally inaccessible stopping times.)
*Exercise 31. Let (Q,^, F, P) be the standard Brownian space. Show that
the optional cr-algebra and the predictable cr-algebra coincide. (Hint: Use
Meyer's Theorem (Theorem 4) and Exercise 30.)
* Exercise 32. Let (fl,.F,F, P) be a filtered probability space satisfying
the usual hypotheses. Let X be a (not necessarily adapted) cadlag stochas-
stochastic process such that for A > 0, E{fQ°° e~xt\Xt\dt} < oo. Let R\(Xt) =
E{J0°° e~AsXt+sds|^rt}, the right continuous version. Show that
f*
Mx(t) = Rx{Xt) - RX(XO) + X,- XRx(Xs)ds
Jo
is an F martingale.
*Exercise 33 (Knight's compensator calculation method). Let X be
a cadlag semimartingale. In the framework of Exercise 32 suppose the limits
below exist both pathwise a.s. and are in L1, and are of finite variation in
finite time intervals:
At = lim A / (Xu - XRxXu)du, 0 < t.
A-*°° Jo
Show that X is a special semimartingale, and A is the predictable term in its
semimartingale decomposition.

IV
General Stochastic Integration and Local
Times
1 Introduction
We defined a semimartingale as a "good integrator" in Chap. II, and this led
naturally to defining the stochastic integral as a limit of sums. To express an
integral as a limit of sums requires some path smoothness of the integrands and
we limited our attention to processes in L, the space of adapted processes with
paths that are left continuous and have right limits. The space L is sufficient
to prove Ito's formula, the Girsanov-Meyer Theorem, and it also suffices in
some applications such as stochastic differential equations. But other uses,
such as martingale representation theory or local times, require a larger space
of integrands.
In this chapter we define stochastic integration for predictable processes.
Our extension from Chap. II is very roughly analogous to how the Lebesgue
integral extends the Riemann integral. We first define stochastic integration
for bounded, predictable processes and a subclass of semimartingales known
as H2. We then extend the definition to arbitrary semimartingales and to
locally bounded predictable integrands.
We also treat the issue of when a stochastic integral with respect to a
martingale or a local martingale is still a local martingale, which is not always
the case. In this respect we treat the subject of sigma martingales, which has
recently been shown to be important for the theory of mathematical finance.
2 Stochastic Integration for Predictable Integrands
In this section, we will weaken the restriction that an integrand H must be
in L. We will show our definition of stochastic integrals can be extended to a
class of predictably measurable integrands.
Throughout this section X will denote a semimartingale such that Xq = 0.
This is a convenience involving no loss of generality. If Y is any semimartin-
semimartingale we can set Yt = Yt — Yq, and if we have defined stochastic integrals for

154 IV General Stochastic Integration and Local Times
semimartingales that are zero at 0, we can next define
HsdYs + H0Y0.
f HsdYs= f\
Jo Jo
When Yo ф 0, recall that we write Jo HsdYs to denote integration on @, t],
and JQ HsdYs denotes integration on the closed interval [0,t].
We recall for convenience the definition of the predictable cr-algebra, al-
already defined in Chap. III.
Definition. The predictable cr-algebra V on K+ x Q is the smallest cr-
algebra making all processes in L measurable. That is, V — a{H : H G L}.
We let hV denote bounded processes that are V measurable.
Let X = M + A be a decomposition of a semimartingale X, with Xo —
Mo = Ao = 0. Here M is a local martingale and A is an FV process (such
a decomposition exists by the Bichteler-Dellacherie Theorem (Theorem 43
of Chap. Ill))- We will first consider special semimartingales. Recall that a
semimartingale X is called special if it has a decomposition
X = N + A
where TV is a local martingale and Л is a predictable FV process. This decom-
decomposition is unique by Theorem 30 in Chap. Ill and it is called the canonical
decomposition.
Definition. Let X be a special semimartingale with canonical decomposition
X = TV + A~. The 7i2 norm of X is defined to be
Jo
\dAs
The space of semimartingales 7i2 consists of all special semimartingales
with finite H2 norm.
In Chap. V we define an equivalent norm which we denote || • ||Я2.
Theorem 1. The space ofH2 semimartingales is a Banach space.
Proof. The space is clearly a normed linear space and it is easy to check
that || • ||W2 is a norm (recall that E{NX\ = E{[N,N]00}, and therefore
2
\\X||^2 = 0 implies that E{NOO} = 0 which implies, since N is a martingale,
that TV = 0). _
To show completeness we treat the terms TV and A separately. Consider
first TV. Since ^{TV^} = ||[TV,TV]to2|||2, it suffices to show that the space of
L2 martingales is complete. However an L2 martingale M can be identified
with Moo G L2, and thus the space is complete since L2 is complete.
Next suppose (An) is a Cauchy sequence of predictable FV processes in
|| • ||2 where \\A\\P = \\ /0°° |cL4s|||LP, p > 1. To show (An) converges it suffices

2 Stochastic Integration for Predictable Integrands 155
to show a subsequence converges. Therefore without loss of generality we can
assume ^2n ||A™||2 < oo.
Then Y An converges in || • ||j to a limit A. Moreover
lim У / \dA™\ =0
n>m
in L1 and is dominated in L2 by 5Zn Jo°° l^"l- Therefore J] An converges
to the limit A in || • Ц2 as well, and there is a subsequence converging almost
surely. To see that the limit A is predictable, note that since each term in
the sequence {An)n>\ is predictable, the limit A is the limit of predictably
measurable processes and hence also predictable. ?
For convenience we recall here the definition of L.
Definition. L (resp. bL) denotes the space of adapted processes with caglad1
(resp. bounded, caglad) paths.
We first establish a useful technical lemma.
Lemma. Let A be a predictable FV process, and let H be in L such that
E{jo°° \Hs\\dAs\} < 00. Then the FV process (/„* HadAs)t>0 is also pre-
predictable.
Proof. We need only to write the integral JQ HsdAs as the limit of Riemann
sums, each one of which is predictable, and which converge in ucp to Jo HsdAs,
showing that it too is predictable. ?
The results that follow will enable us to extend the class of stochastic
integrands from bL to bP, with X G 7i2 (and Xq — 0). First we observe that
if Я jE_bL_and X e H2, then the stochastic integral Я • X_ e W2._Also if
X = N + A is the canonical decomposition of X, then Я - N + H ¦ A is the
canonical decomposition of Я • X by the preceding lemma. Moreover,
\\H-X\\H2 = ||( ГH2d{N,N]s)i\\L2 + || Г \Hs\\dAs
Jo Jo
№¦
The key idea in extending our integral is to notice that [N,N] and A are
FV processes, and therefore oj-by-a; the integrals /0 H2(u>)d[N, N]s(u>) and
/0* |#s||cL4e| make sense for any Я G hV and not just Я G L.
Definition. Let X G H2 with X = N + A its canonical decomposition, and
let H, J e hV. We define dx(H, J) by
dx(H,J) = ||( Г(н8 - JS)MWW1/2IU* + II Г \н8 - J8\\dA8\\\La.
Jo Jo
1 "caglad" is the French acronym for left continuous with right limits.

156 IV General Stochastic Integration and Local Times
Theorem 2. For X G H2 the space bL is dense in bP under dx(-, ¦)•
Proof. We use the Monotone Class Theorem. Define
Л = {H G bP : for any e > 0, there exists J G bL such that dx(#, •/) <
Trivially Л contains bL. If Я™ G Л and Я™ increases to Я with H bounded,
then Я G bP, and by the Dominated Convergence Theorem if S > 0 then for
some N(S), n > iV(<J) implies dx(H,Hn) < S. Since each Я™ G Л, we choose
no > JV(($) and there exists J G bL such that dx(J, Hn°) < 8. Therefore given
e > 0, by taking S = e/2 we can find J G bL such that dx (J, H) < e, and
therefore ЯбЛ An application of the Monotone Class Theorem yields the
result. D
Theorem 3. Let X G H2 and Hn G bL such that Hn is Cauchy under dX-
Then Hn -X is Cauchy in H2.
Proof. Since \\Hn ¦ X -Hm ¦ X\\H2 = dx(Hn,Hm), the theorem is immediate.
?
Theorem 4. Let X G П2 and H G bP. Suppose Hn G bL and Jm G bL
are two sequences such that limn dx(Hn,H) = limm dx(Jm, H) = 0. Then
Hn ¦ X and Jm ¦ X tend to the same limit in 7i2.
Proof. Let Y = limn^oo Hn ¦ X and Z = limm-xx, Jm ¦ X, where the limits
are taken in Ti2. For e > 0, by taking n and m large enough we have
\\Y - Z\\H2 < \\Y - Hn ¦ X\\H2 + \\Hn -X-Jm- X\\H2 + \\Jm ¦ X - Z\\H2
<2e+\\Hn-X-Jm-X\\H2
<2e + dx(Hn,Jm)
<2e + dx(Hn,H) + dx(H,Jm)
<4?,
and the result follows. ?
We are now in a position to define the stochastic integral for H G bV (and
X G 7i2).
Definition. Let X be a semimartingale in H2 and let Я G bP. Let Hn G bL
be such that limn^oo dx(Hn, H) = 0. The stochastic integral H ¦ X is the
(unique) semimartingale Y G H2 such that Нт^^Я™ • X = Y in 7i2. We
write H-X = {J* HsdXs)t>0.
We have defined our stochastic integral for predictable integrands and
semimartingales in H2 as limits of our (previously defined) stochastic integrals.
In order to investigate the properties of this more general integral, we need to
have approximations converging uniformly. The next theorem and its corollary
give us this.

2 Stochastic Integration for Predictable Integrands 157
Theorem 5. Let X be a semimartingale in 7i2. Then
E{(Snp\Xt\J}<S\\X\\2n2.
t
Proof. For a process H, let H* = supt \Ht\. Let X = N + Abe the canonical
decomposition of X. Then
/»oo
X* <N*+ / \dls\.
Jo
Doob's maximal quadratic inequality (Theorem 20 of Chap. I) yields
and using (a + bJ < 2a2 + 2b2 we have
E{(X*J} < 2E{(N*J} + 2E{( / \dAs\J}
Jo
POO
<8E{[N,N]oo} + 2\\ \dAs\\\h
Jo
<8\\Х\\2Ю. П
Corollairy. Let (Xn) be a sequence of semimartingales converging to X in
7i2. Then there exists a subsequence (n^) such that limnk^oo(Xnk — X)* = 0
a.s.
Proof. By Theorem 5 we know that (Xn - X)* = supt \X™ — Xt\ converges
to 0 in L2. Therefore there exists a subsequence converging a.s. ?
We next investigate some of the properties of this generalized stochastic
integral. Almost all of the properties established in Chap. II (Sect. 5) still
hold.2
Theorem 6. Let X,Y etf and H, К е hV. Then
and
Proof. One need only check that it is possible to take a sequence Hn e bL
that approximates H in both dx and dy. ?
Theorem 7. Let T be a stopping time. Then (H ¦ X)T — .fflrox] " X =
Т
2 Indeed, it is an open question whether or not Theorem 16 of Chap. II extends to
integrands in bV. See the discussion at the end of this section.

158 IV General Stochastic Integration and Local Times
Proof. Note that 1[0,т] e bL, so #1[о,т] e bP. Also, XT is clearly still in W2.
Since we know this result is true for H e bL (Theorem 12 of Chap. II), the
result follows by uniform approximation, using the corollary of Theorem 5. ?
Theorem 8. The jump process (A(H ¦ X)s)s>o is indistinguishable from
(Hs(AXs))s>0-
Proof. Recall that for a process J, AJt — Jt — Jt-, the jump of J at time
t. (Note that H ¦ X and X are cadlag semimartingales, so Theorem 8 makes
sense.) By Theorem 13 of Chap. II we know the result is true for Я е bL.
Let H e hV, and let Hn e bL such that lim^oo d* (#n, #) = 0. By the
corollary of Theorem 5, there exists a subsequence (nk) such that
lim (Hnk -X-H-Xy =0 a.s.
This implies that, considered as processes,
lim А(НПк -Х)=А(Н-Х),
outside of an evanescent set.3 Since each Hnk e bL, we have A(H"k ¦ X) =
H"k(AX), outside of another evanescent set. Combining these, we have
lim Hn"(AX)l{AX^0} = lim A(Hn" ¦ X)l{Axm
= A(H ¦
and therefore
In particular, the above implies that lim^^oo H"k (ш) exists for all (t,w) in
{AX ф 0}, a.s. We next form
А = {ш: there exists t>0 such that lim Н"к{ш)фНг{ш) and АХг(ш)ф0\.
Bl-tOO
Suppose P(A) > 0. Then
dx(Hnk,H)>\\lA{f {H™k-HsJd{ ]T (ANUJ)}\\L2
^° 0<u<s . .
,00 (*)
+ ||1л{/ \H?*-He\d( Yl
0<u<
and if AXS ф 0, then |ДЖ,| + \AAS\ > 0. The left side of (*) tends to 0
as nk -» со, and the right side of (*) does not. Therefore P(A) — 0, and we
conclude A(H ¦ X) = limnfc^oo HnkAX = HAX. D
3 A set Л С R+ x П is evanescent if 1л is a process that is indistinguishable from
the zero process.

2 Stochastic Integration for Predictable Integrands 159
Corollary. Let X e 7i2, H e bV, and T a finite stopping time. Then
Proof. By Theorem 8, (H ¦ X)T~ = (H ¦ X)T - НтАХт1{г>т}- On the other
hand, XT~ = XT - AXrl{t>T}- Let At = AXTl{t>T}- By the bilinearity
(Theorem 6), H ¦ (XT~) = H ¦ (XT) -HA. Since Я • (XT) = (H ¦ X)T by
Theorem 7, and НА — НтАХт1{г>т], the result follows. G
The next three theorems all involve the same simple proofs. The result is
known to be true for processes in bL; let (Hn) € bL approximate H € hV in
dx(•,•)> and by the corollary of Theorem 5 let n^ be a subsequence such that
lim (НПк -Х-Н-ХУ=О a.s.
Then use the uniform convergence to obtain the desired result. We state these
theorems, therefore, without proofs.
Theorem 9. Let X e 7i2 have paths of finite variation on compacts, and
H e hV. Then H ¦ X agrees with a path-by-path Lebesgue-Stieltjes integral.
Theorem 10 (Associativity). Let X e H2 and H,K ? bV. Then К ¦ X 6
H2 and H-{K-X) = (HK) ¦ X.
Theorem 11. Let X € 7i2 be a (square integrable) martingale, and H € hV.
Then H ¦ X is a square integrable martingale.
Theorem 12. Let X,Y etf and H,K e bV. Then
[H-X,K-Y]t= j HsKsd{X,Y}s, (t > 0),
and in particular
[H ¦ X,H ¦ X]t = [ H2d[X,X}3, (t > 0).
Jo
Proof. As in the proof of Theorem 29 of Chap. II, it suffices to show
[H-X,Y]t= fHsd[X,Y]s.
Jo
Let (Яп) G bL such that lim^oo dx(Hn, H) = 0. Let Tm = inf{? > 0 : \Yt\ >
m}. Then (T) are stopping times increasing to со a.s. and |yj"* | < m.4 Since
it suffices to show the result holds on [0, Tm), each m, we can assume without
loss of generality that У_ is in bL. Moreover, the Dominated Convergence
Theorem gives limn^oodx(HnY-,HY-) = 0. By Theorem 29 of Chap. II, we
have
Recall that Y_ denotes the left continuous version of Y.

160 IV General Stochastic Integration and Local Times
[Hn-X,Y]t = f H?d[X,Y]a (t>0),
Jo
and again by dominated convergence
lim [Hn -X,Y} = [ Hsd[X,Y}s (t > 0).
It remains only to show lim^ooftf" • X,Y] = [H ¦ X,Y]. Let Zn = Hn ¦ X,
and let nk be a subsequence such that limnk^oo(Z"k — Z)* = 0 a.s., where
Z — H ¦ X (by the corollary to Theorem 5). Integration by parts yields
[Zn\Y] = ZnkY - (У_) • Znk - [Zn_k)-Y
= ZnkY - (Y_Hnk) ¦ X - {Z"Lk) ¦ У,
where we have used associativity (Theorem 10). We take limits so that
lim [Znk,Y] = ZY -Y- ¦ (H ¦ X) - Z- -Y
= ZY - У_ ¦ (Z) - Z_ • У
= {Z,Y} = [H-X,Y]. а
At this point the reader may wonder how to calculate in practice a canon-
canonical decomposition of a semimartingale X in order to verify that X € H2.
Fortunately Theorem 13 will show that H2 is merely a mathematical conve-
convenience.
Lemma. Let A be an FV process with Ao — 0 and /0°° |cL4s| el2. Then
A e П2. Moreover \\A\\H2 < 6|| /0°° |di4,|||L2.
Proof. If we can prove the result for A increasing then the general result will
follow by decomposing A = A+ — A~. Therefore we assume without loss of
generality that A is increasing. Hence as we noted in Sect. 5 of Chap. Ill, the
compensator Л of Л is also increasing and E{Aoo} = ^{Лоо} < oo.
Let M be a martingale bounded by a constant k. Since A - A is a local
martingale, Corollary 2 to Theorem 27 of Chap. II shows that
L = M{A -A)-[M,A- A]
is a local martingale. Moreover
sup \Lt\ < к(Аоо + Лоо) +2к^ \&(А ~ А)*\
S
< ЗЛКД» + 4») eL1.
Therefore L is a uniformly integrable martingale (Theorem 51 of Chap. I) and
Е{ЬЖ} = E{L0} = 0. Hence

2 Stochastic Integration for Predictable Integrands 161
A - Л)оо} = E{[M,A- i]oc}
= E{[M,A]0O}-E{[M,A]0O}
= E{[M,A}00],
because A is natural. By the Kunita-Watanabe inequality (the corollary to
Theorem 25 of Chap. II)
E{\[M,AU\} <
к
where the second inequality uses 2ab < a2 +b2. However
E^M^U) = E{Ml)
(Corollary 4 of Theorem 27 of Chap. II) and also [А, А}^ < А2^ a.s. Therefore
Since M is an arbitrary bounded martingale we are free to choose
Moo = (A - А)о
and we obtain
and using the Monotone Convergence Theorem we conclude
??{(Л-Л&}<Я{^}.
Consequently
E{Al} < 2E{Al} + 2E{(A - 1I} < ЩА^} < oo,
and A — A is a square integrable martingale, and
\\A\\w = \\(A- lUlU» + \\Aooh' < ЗЦЛооЦ^
for A increasing. ?
Remarks. The constant 6 can be improved to l+\/8 < 4 by not decomposing
A into A+ and A~. This lemma can also be proved using the Burkholder-
Gundy inequalities (see Meyer [171, page 347]).

162 IV General Stochastic Integration and Local Times
In Chap. V we use an alternative norm for semimartingales which we
denote || • ||я_р, 1 < p < со. The preceding lemma shows that the norms
|| ¦ ||И2 and || • \\gi are equivalent.
The restrictions of integrands to ЪТ> and semimartingales to H2 are
mathematically convenient but not necessary. A standard method of re-
relaxing such hypothesis is to consider cases where they hold locally. Recall
from Sect. 6 of Chap. I that a property тг is said to hold locally for
a process X if there exists a sequence of stopping times (Tn)n>o such that
0 = T° < T1 < T2 < ¦ ¦ ¦ < Tn < ¦ ¦ ¦ and lim^oo Tn = со a7s., and such
that Хг1{Г»>0} has property тг for each n. Since we are assuming our semi-
semimartingales X satisfy Xo = 0, we could as well require only that XT" has
property тг for each n. A related condition is that a property hold prelocally.
Definition. A property тг is said to hold prelocally for a process X with
Xo = 0 if there exists a sequence of stopping times (Tn)n>i increasing to со
a.s. such that XT"~ has property тг for each n > 1.
Recall that XT~ = Xtl{o<t<T} +^T-l{t>T}- The next theorem shows
that the restriction of semimartingales to H2 is not really a restriction at all.
Theorem 13. Let X be a semimartingale, Xq — 0. Then X is prelocally in
7i2. That is, there exists a non-decreasing sequence of stopping times {Tn),
ooT™ = со a.s., such that XT"~ e H2 for each n.
Proof. Recall that XT"~ = Xtl[OtTn) + ^т"-1[Т",оо)- By the Bichteler-
Dellacherie Theorem (Theorem 43 of Chap. Ill) we can write X = M + A,
where M is a local martingale and A is an FV process. By Theorem 25 of
Chap. Ill we can further take M to have bounded jumps. Let C be the bound
for the jumps of M. We define
Tn = inf{t > 0 : [M,M]t > n or / \dA3\ > n}
Jo
and let Y = XT"~. Then Y has bounded jumps and hence it is a special
semimartingale (Theorem 31 of Chap. III). Moreover
Y = XTn~ = MT" + ATn~ - (ДМГI[Т,]00),
C,
where L = MT" and С = Ат"~ - (ДМт»I[т«)Оо)- Then [L, L] < n + 01, so
L is a martingale in Ti2 (Corollary 4 to Theorem 27 of Chap. II), and also
/•OO
/ \dCs\ <n+ |ДМт«| < 2n + C,
Jo
hence С e H2 by the lemma. Therefore XT"~ = L + С € V2. ?

2 Stochastic Integration for Predictable Integrands 163
We are now in a position to define the stochastic integral for an arbi-
arbitrary semimartingale, as well as for predictable processes which need not be
bounded.
Let X be a semimartingale in H2. To define a stochastic integral for pre-
predictable processes H which are not necessarily bounded (written Я е ?), we
approximate them with Hn e bV.
Definition. Let X e H2 with canonical decomposition X = N + A. We say
Я € V is (H2,X) integrable if
/*oo /*
E{ H2d[N,N}s} + E{(
Jo Jo
< CO.
Theorem 14. Let X be a semimartingale and let H e V be (Ti2,X) inte-
integrable. Let H" = Я1{|я|<п} € ЬР. Then H" ¦ X is a Cauchy sequence in
H2.
Proof. Since H" e bV, each n, the stochastic integrals H" ¦ X are defined.
Note also that итп^юЯ" = H and that \Hn\ < \H\, each n. Then
\\Hn-X-Hm-X\\H2 =dx(Hn,Hm)
/•OO /*OO
= ||( / {Щ - H™Jd\N,-N\sYi2\\L. +1| / |#; - H?\\dAe\\\L,,
Jo Jo
and the result follows by two applications of the Dominated Convergence
Theorem. ?
Definition. Let X be a semimartingale in H2, and let H e V be (H2,X)
integrable. The stochastic integral Я • X is defined to be limn^oo Hn ¦ X,
with convergence in H2, where Я™ = #
Note that Я • X in the preceding definition exists by Theorem 14. We
can "localize" the above theorem by allowing both more general Я е V and
arbitrary semimartingales with the next definition.
Definition. Let X be a semimartingale and И е V. The stochastic integral
И X is said to exist if there exists a sequence of stopping times T" increasing
to со a.s. such that XT"~ e H2, each n > 1, and such that H is (W2,XT"~)
integrable for each n. In this case we say H is X integrable, written Я е
L(X), and we define the stochastic integral by
H ¦ X = H ¦ (Хтп~), on[0,Tn),
each n.
Note that if m > n then

164 IV General Stochastic Integration and Local Times
where Hk = Hl{\H\<k}, by the corollary of Theorem 8. Hence taking limits we
have H ¦(XT"l^)T"~ = H-(XT"~), and the stochastic integral is well-defined
for Я € L{X). Moreover let R* be another sequence of stopping times such
that XRl~ e H2 and such that Я is (H2,XRe~) integrable, for each I. Again
using the corollary of Theorem 8 combined with taking limits we see that
on [0, Rl Л T"), each ? > 1 and n > 1. Thus in this sense the definition of
the stochastic integral does not depend on the particular sequence of stopping
times.
If Я e hV (i.e., Я is bounded), then Я € L(X) for all semimartingales
X, since every semimartingale is prelocally in Ti2 by Theorem 13.
Definition. A process Я is said to be locally bounded if there exists a
sequence of stopping times (Sm)m>i increasing to со a.s. such that for each
m > 1, (HtAS™l{sm>o})t>o is bounded.
Note that any process in L is locally bounded. The next example is suffi-
sufficiently important that we state it as a theorem.
Theorem 15. Let X be a semimartingale and let H &V be locally bounded.
Then H e L(X). That is, the stochastic integral H ¦ X exists.
Proof. Let (Sm)m>i, (Tn)n>i be two sequences of stopping times, each in-
increasing to со a.s., such that Hsml{sm>o} is bounded for each m, and
XT"- E П2 for each n. Define Rn = minE",T"). Then Я = Ял{Лп>0}
on @,Rn) and hence it is bounded there. Since XR"~ charges only @,Rn),
we have that H is (H2,XRn~) integrable for each n > 1. Therefore using the
sequence Rn which increases to oo a.s., we are done. ?
We now turn our attention to the properties of this more general integral.
Many of the properties are simple extensions of earlier theorems and we omit
their proofs. Note that trivially the stochastic integral Я • X, for H e L(X),
is also a semimartingale.
Theorem 16. Let X be a semimartingale and let H,j? L{X). Then aH +
PJ E L{X) and {aH + pj) ¦ X = aH ¦ X + pj ¦ X. That is, L{X) is a linear
space.
Proof. Let (Rm) and (Tn) be sequences of stopping times such that Я is
(H2,XRm~) integrable, each m, and J is (H2,XT"~) integrable, each n. Tak-
Taking Sn = Rn ЛГ, it is easy to check that aH + pj is (H2,XS"~) integrable
for each n. ?
Theorem 17. Let X,Y be semimartingales and suppose H e L{X) and
H E L(Y). Then H e L(X + Y) and H ¦ (X + Y) = H ¦ X + H Y.

2 Stochastic Integration for Predictable Integrands 165
Theorem 18. Let X be a semimartingale and Я 6 L{X). The jump process
(Д(Я ¦ X)s)s>o is indistinguishable from (Hs(AXs))s>o.
Theorem 19. LetT be a stopping time, X a semimartingale, and H € L(X).
Then
(Я ¦ Xf = Я1[0>Т] • X = Я ¦ (XT).
Moreover, letting со— equal со, we have moreover
(Н-Х)Т-=Н-(ХТ~).
Theorem 20. . Let X be a semimartingale with paths of finite variation on
compacts. Let Я € L(X) be such that the Stieltjes integral Jo \Hs\\dXs\ exists
a.s., each t > 0. Then the stochastic integral H ¦ X agrees with a path-by-path
Stieltjes integral.
Theorem 21 (Associativity). Let X be a semimartingale with К б L(X).
Then H e L(K ¦ X) if and only if HK e L(X), in which case H ¦ (K ¦ X) =
{HK) ¦ X.
Theorem 22. Let X,Y be semimartingales and let H e L(X), К е L(Y).
Then
[H-X,K-Y}t= f HsKsd[X, Y]s (t > 0).
Jo
Note that in Theorem 22 since Я • X and Я • Y are semimartingales, the
quadratic covariation exists and the content of the theorem is the formula.
Indeed, Theorem 22 gives a necessary condition for Я to be in L(X), namely
that /0* H^d[X, X]s exists and is finite for all t > 0. The next theorem (The-
(Theorem 23) is a special case of Theorem 25, but we include it because of the
simplicity of its proof.
Theorem 23. Let X be a semimartingale, let H € L(X), and suppose Q is
another probability with Q -C P. If Hq ¦ X exists, it is Q indistinguishable
from Hp ¦ X.
Proof. Hq ¦ X denotes the stochastic integral computed under Q. By Theo-
Theorem 14 of Chap. II, we know that Hq ¦ X = Hp ¦ X for Я е L, and therefore
if X e H2 for both P and Q, they are equal for Я G hV by the corollary
of Theorem 5. Let (Re)e>i, (Tn)n>i be two sequences of stopping times in-
increasing to со a.s. such that Я is {7i2,XR'~) integrable under Q, and Я is
(П2,ХТ"~) integrable under P, each ? and n. Let Sm = RmATm, so that Я
is (H2, Xsm~) integrable under both P and Q. Then Я ¦ X = lim^oo Я™ • X
on [0,Sm) in both dx{P) and dx{Q), where Hn = Я1{|Я|<п} е hV. Since
H'p ¦ X — Hq ¦ X, each n, the limits are also equal. D
Much more than Theorem 23 is indeed true, as we will see in Theorem 25,
which contains Theorem 23 as a special case. We need several preliminary
results.

166 IV General Stochastic Integration and Local Times
Lemma. Let X e Ti2 and X = N + A be its canonical decomposition. Then
E{[N,NU} <E{{X,XU}.
Proof. First observe that
[X,X} = [N,N} + 2{N,A] + iA,A\.
It suffices to show E{[N, А}^} = 0, since then
?{[ЛГ,ЛПоо} = E{[X,XU - p, Л]»},
and the result follows since [A, A}oo > 0. Note that
??{|[ЛГ,Л]оо|} < (^[Ж^]оо}I/2(Я{[ЗДоо}I/2 < oo,
by the Kunita-Watanabe inequalities. Also -Е{[М,Л]оо} = О for all bounded
martingales because A is natural. Since bounded martingales are dense in
the space of L2 martingales, there exists a sequence (Mn)n>i of bounded
martingales such that lim^oo E{[Mn -77, Mn-Щ^} = О. Again using the
Kunita-Watanabe inequalities we have
E{\[N - Mn Ди} < (E{{N - Mn,N - МП}ОО}I'2{Е{\А,А~}ОО}I'2
and therefore lim^oo ^{[M™^]^} = .Е{[77Д]оо}. Since E{[Mn, A}^} = 0,
each n, it follows that ?{[iV, Л]^} = 0. П
Note that in the preceding proof we established the useful equality
E{[X,X]t} = E{[~N,TV}t} + E_\\A^A\t} for a semimartingale X e H2 with
canonical decomposition X = N + A.
Theorem 24. For X a semimartingale in Ti2,
\\X\\n*< sup \\(H ¦ Xy^L* +2\\[X,X}V2\\L* <5\\X\\n2.
|Я|1
Proof. By Theorem 5 for |tf | < 1
\\(H ¦ X)*J\Li < V8\\H ¦ X\\H* < V8\\X\\Hi.
Since
2\\[X,X]1J12\\li <2||[M,M]V2||l2+2|| Г \dAe\\\L,
Jo
where X — M + A is the canonical decomposition of X, we have the right
inequality.
For the left inequality we have \\{М,М}^2\\^ < ||pf,X]^2||L2 by the
lemma preceding this theorem. Moreover if \H\ < 1, then

2 Stochastic Integration for Predictable Integrands 167
\\(H-A)t\\La<\\(H-X)t\\L2
<\\{h ¦ xy^Wv
Next take Я — щ; this exists as a predictable process since A is predictable,
and therefore A predictable implies that |A|, the total variation process, is also
predictable. Consequently we have
/¦O
| /
Jo
. a
We present as a corollary to Theorem 24, the equivalence of the two
pseudonorms sup^^ \\(H ¦ -X")Soll-c2 an<^ ll^llw2- We will not have need of
this corollary in this book,5 but it is a pretty and useful result nevertheless.
It is originally due to Yor [241], and it is actually true for all p, 1 < p < oo.
(See also Dellacherie-Meyer [46, pages 303-305].)
Corollary. For a semimartingale X (with Xo = 0),
and in particular sup^^j ||(# • X)Io\\l2 < oo if and only if ||X||W2 < oo.
Proof. By Theorem 5 if \\X\\ns < oo and |#| < 1 we have
• X\\H2 < VS\\X\\H2.
Thus we need to show only the left inequality. By Theorem 24 it will suffice
to show that
\\[X,X}H2\\L2< sup
||
for a semimartingale X with Xo = 0.
To this end fix a * > 0 and let 0 = To < Тг < ¦ ¦ ¦ < Tn = t be a random
partition of [0,t]. Choose ei,... ,en non-random and each equal to 1 or —1
and let H = ^"=1 ?»1(Т;_1,т<]- Then Я is a simple predictable process and
Let a — sup^^j ||(Я • X)^o\\l2. We then have
5 However the corollary does give some insight into the relationship between The-
Theorems 12 and 14 in Chap. V.

168 IV General Stochastic Integration and Local Times
a2 > E{(H
2 > E{(H ¦ X)l} = V
If we next average over all sequences E\,... ,en taking values in the space
{±1}", we deduce
a2
Next let am = {27™} be a sequence of random partitions of [0,t] tending to
the identity. Then Х^(-^т™ —XTr^ J converges in probability to [X,X]t. Let
{mk} be a subsequence so that 5Zi(^Tmfc ~ XT™kJ converges to [X, X]t a.s.
Finally by Fatou's Lemma we have
E{[X,X]t} < lirnmf E{J2(XT^k - Хт^J} < а2.
г
Letting t tend to oo we conclude that
E{[X,XU}<a2= sup
||
It remains to show that if sup^j^j \\(H ¦ -X')?o||/,2 < oo, then X s H2. We
will show the contrapositive. If X ? H2, then sup^^ \\(H ¦ ^)^o||l2 = oo.
Indeed, let Tn be stopping times increasing to oo such that XT"~ is in Ti2
for each n (c?, Theorem 13). Then
\\Хтп-\\П2<3 sup
Letting n tend to oo gives the result. ?
Before proving Theorem 25 we need two technical lemmas.
Lemma 1. Let A be a non-negative increasing FV process and let Z be a
positive uniformly integrable martingale. Let T be a stopping time such that
A = AT~ (that is, A^ = At-) and let к be a constant such that Z < к on
[0,T). Then
Z^} <
Proof. Since A0- — Zo- = 0, by integration by parts
AtZt = I As_dZs + I Zs-dAs + [A,Z]t
Jo Jo
f As_dZs + f
Jo Jo
= I As_dZs+ f ZsdAs
J jU/Ag

2 Stochastic Integration for Predictable Integrands 169
where the second integral in the preceding is a path-by-path Stieltjes integral.
Let Rn be stopping times increasing to oo a.s. that reduce the local martingale
(Jo As-dZs)t>o- Since dAs charges only [О, Г) we have
ft rt
kdA,
/ ZsdAs <
Jo Jo
for every t > 0. Therefore
E{(AZ)Rn} = E{ / As.dZs} + E{ / ZsdAs}
Jo Jo
The result follows by Fatou's Lemma. D
Lemma 2. Let X be a semimartingale with Xo — 0, let Q be another prob-
probability with Q -C P, and let Zt = Ep{^\!Ft\- If Г is a stopping time such
that Zt < к on [0, Г) for a constant k, then
\\XT-\\H2{Q) <5Vk\\XT-\\H2{P).
Proof. By Theorem 24 we have
\\XT-\\n2(Q) < sup EQ{((H ¦ XT-)*J}i +2EQ{[XT-,XT-\i}
< sup EP I -j^-((H ¦ XT~)*
sup EP{((H ¦ XT-)*)-
where we have used Lemma 1 on both terms to obtain the last inequality
above. The result follows by the right inequality of Theorem 24. ?
Note in particular that an important consequence of Lemma 2 is that if
Q < P with ^ bounded, then X <s H2(P) implies that X <s H2(Q) as well,
with the estimate ||X||^2(qj < 5-\/fc||X||ft2(p), where к is the bound for -jp.
Note further that this result (without the estimate) is obvious if one uses the
equivalent pseudonorm given by the corollary to Theorem 24, since
sup EQ{((H ¦ X)^J} = smp EP{^(
<k sup ЕрЩН-Х)^J},
where again к is the bound for ^.

170 IV General Stochastic Integration and Local Times
Theorem 25. Let X be a semimartingale and H s L(X). If Q -С P, then
H e L(X) under Q as well, and Hq ¦ X = HP ¦ X, Q-a.s.
Proof. Let Tn be a sequence of stopping times increasing to oo a.s. such that
H is (Хтп-,П2) integrable under P, each n > 1. Let Zt = EP{^jj;\Ft}, the
cadlag version. Define Sn = M{t > 0 : \Zt\ > n} and set Rn = Sn A Tn.
Then XRn~ € H2(P)nH2(Q) by Lemma 2, and Я is (H2,XRn-) integrable
under P. We need to show Я is (W2,Xfi"~) integrable under Q, which will
in turn imply that Я e -Ь(^) under Q. Let Хл"~ = N + С be the canonical
decomposition under Q. Let Hm = Я1{|я|<т}- Then
| |tf Л
= ||tfm • XR"-\\H2{Q) <
and then by monotone convergence we see that H is (H2,XRn~) integrable
under Q. Thus H <s L(X) under Q, and it follows that HQ ¦ X = HP ¦ X,
<2-a.s. О
Theorem 25 can be used to extend Theorem 20 in a way analogous to the
extension of Theorem 17 by Theorem 18 in Chap. II.
Theorem 26. Let X, X be two semimartingales, and let H 6 L(X), H e
L(X). Let A = {w : Н.(ш) = tf.(w) and Х.(ш) = X(w)}, and let В = {w :
11—> Xt(oj) is of finite variation on compacts}. Then H¦ X = H-X on A, and
H ¦ X is equal to a path-by-path Lebesgue-Stieltjes integral on B.
Proof. Without loss of generality assume P(A) > 0. Define Q by Q(A) =
P(A\A). Then Q -C P and therefore H e L(X), H 6 L(X) under_Q as well as
under P by Theorem 25. However under Q the processes H and H as well as
X and X are indistinguishable. Thus Hq X = HqX and hence ЙХ = HX
P-a.s. on A by Theorem 25, since Q «P.
The second assertion has an analogous proof (see the proof of Theorem 18
of Chap. II). ?
Note that one can use stopping times to localize the result of Theorem 26.
The proof of the following corollary is analogous to the proof of the corollary
of Theorem 18 of Chap. II.
Corollary. With the notation of Theorem 26, let S, T be two stopping times
with S < T. Define
С = {a; : Щш) = Ht{w); Xt(uj) = Xt(u>); S(u) < t < Т(ш)},
D = {a> : t н-> Xt(tjS) is of finite variation on S(u>) < t < Т(ш)}.
Then Я • XT - H ¦ Xs = H ¦ X~T - H ¦ ~XS on С and Я • XT - H ¦ Xs equals
a path-by-path Lebesgue-Stieltjes integral on D.

2 Stochastic Integration for Predictable Integrands 171
Theorem 27. Let Pk be a sequence of probabilities such that X is a Pk
semimartingale for each k. Let R = Y^k=i -^fc-ffc where Afc > 0, each k, and
J2kLiXk = 1. Let H e L(X) under R. Then Я e L(X) under Pk and
Hr • X = Hpk ¦ X, Pk-a.s., for all к such that Xk > 0.
Proof If Afc > 0 then Pk < R. Moreover since Рк(Л) < -^R(A), it follows
that Я <s L(X) under Pk. The result then follows by Theorem 25. D
We now turn to the relationship of stochastic integration to martingales
and local martingales. In Theorem 11 we saw that if M is a square integrable
martingale and Я e bV, then Я ¦ M is also a square integrable martingale.
When M is locally square integrable we have a simple sufficient condition for
H to be in L(M).
Lemma. Let M be a square integrable martingale and let Я e V be such
that E{f?° H2d[M, M]s} < oo. Then Я ¦ M is a square integrable martingale.
Proof. If Яk 6 bV, then Hk ¦ M is a square integrable martingale by The-
Theorem 11. Taking Hk = Hl{\H\<ky, and since Я is (H2,M) integrable, by
Theorem 14 Hk ¦ M converges in H2 to H ¦ M which is hence a square inte-
integrable martingale. ?
Theorem 28. Let M be a locally square integrable local martingale, and let
H & V. The stochastic integral H ¦ M exists (i.e., H e L(M)) and is a locally
square integrable local martingale if there exists a sequence of stopping times
(Tn)n>i increasing to oo a.s. such that E{JQ H2d[M,M]s} < oo.
Proof. We assume that M is a square integrable martingale stopped at the
time Tn. The result follows by applying the lemma. D
Theorem 29. Let M be a local martingale, and let H sV be locally bounded.
Then the stochastic integral H ¦ M is a local martingale.
Proof. By stopping we may, as in the proof of Theorem 29 of Chap. Ill, assume
that H is bounded, M is uniformly integrable, and that M = N + A where
N is a bounded martingale and A is of integrable variation. We know that
there exists Rk increasing to oo a.s. such that MR sH2, and since H 6 bV
there exist processes H? s bL such that \\He ¦ MRk~ - H ¦ MRk~\\H2 tends
to zero. In particular, He ¦ MR ~ tends to H ¦ MR ~ in ucp. Therefore we
can take Hl such that Hl ¦ MRk~ tends to Я • M in ucp, with He <s bL.
Finally without loss of generality we assume He ¦ M converges to Я • M in
ucp. Since He ¦ M = He ¦ N + He ¦ A and He ¦ N converges to H ¦ N in ucp,
we deduce He ¦ A converges to Я • A in ucp as well. Let 0 < s < t and assume
Y 6 bFs. Therefore, since A is of integrable total variation, and since He ¦ A
is a martingale for H^ e bL (see Theorem 29 of Chap. Ill), we have

172 IV General Stochastic Integration and Local Times
E{Y f HudAu} = E{Y f lim H*udAu} = E{Y lim f HidAu}
J5+ J S-\- УО° УО° J S-\-
rt
= lim E{Y / HeudAu}
e-*°° Js+
= 0,
where we have used Lebesgue's Dominated Convergence Theorem both for
the Stieltjes integral w-by-w (taking a subsequence if necessary to have a.s.
convergence) and for the expectation. We conclude that (f* HsdAs)t>0 is a
martingale, hence H ¦ M = H • N + H ¦ A is also a martingale under the
assumptions made; therefore it is a local martingale. D
In the proof of Theorem 29 the hypothesis that H ? V was locally bounded
was used to imply that if M — N + A with N having locally bounded jumps
and A an FV local martingale, then the two processes
л
\ H2d[N,N}s and
Jo
are locally integrable. Thus one could weaken the hypothesis that H is locally
bounded, but it would lead to an awkward statement. The general result, that
M a local martingale and H s L(M) implies that H ¦ M is a local martingale,
is not truel
See Emery's example, which precedes Theorem 34 for a stochastic integral
with respect to an Ti2 martingale which is not even a local martingale! Emery's
counterexample has lead to the development of what are now known as sigma
martingales, a class of processes which are not local martingales, but can be
thought to be "morally local martingales." This is treated in Section 9 of this
chapter.
Corollary. Let M be a local martingale, Mo = 0, and let Г be a predictable
stopping time. Then MT~ is a local martingale.
Proof. The notation MT~ means
M?~ = Mtl{t<Ty + MT-l{t>T},
where Mo_ = Mo = 0. Let (Sn)n>i be a sequence of stopping times announc-
announcing T. On {T > 0},
and since l[o,s»] is a left continuous, adapted process it is predictable; hence
1[о,т) is predictable. But /0 l{0,T)(s)dMs = MtT~ by Theorem 18, and hence
it is a local martingale by Theorem 29. D

2 Stochastic Integration for Predictable Integrands 173
Note that if M is a local martingale and T is an arbitrary stopping time,
it is not true in general that MT~ is still a local martingale.
Since a continuous local martingale is locally square integrable, the theory
is particularly nice.
Theorem 30. Let M be a continuous local martingale and let H 6 V be such
that /0 H2d[M, M]s < oo a.s., each t>0. Then the stochastic integral H ¦ M
exists (i.e., Я e L(M)) and it is a continuous local martingale.
Proof. Since M is continuous, we can take
Rk = mi{t>0: \Mt\ >k}.
Then \MtARk\ < к and therefore M is locally bounded, hence locally square
integrable. Also M continuous implies [M, M] is continuous, whence if
= inf{t > 0 : / H2d[M,M]s > к},
Jo
we see that (/0* H2d[M, M]s)t>0 is also locally bounded. Then H-M is a locally
square integrable local martingale by Theorem 28. The stochastic integral H ¦
M is continuous because Д(Я-М) = H(AM) and AM = 0 by hypothesis. D
In the classical case where the continuous local martingale M equals В, а
standard Brownian motion, Theorem 30 yields that if H S V and /0 H^ds <
oo a.s., each t > 0, then the stochastic integral (Я • Bt)t>0 = (Jo* HsdBs)t>0
exists, since [B,B]t = t.
Corollary. Let X be a continuous semimartingale with (unique) decomposi-
decomposition X = M + A. Let Я е V be such that
/ H2sd[M,M]s+ \Hs\\dAs\ <oo a.s.
Jo Jo
each t > 0. Then the stochastic integral (Я • X)t = /0 HsdXs exists and it is
continuous.
Proof. By the corollary of Theorem 31 of Chap. Ill, we know that M and A
have continuous paths. The integral Я • M exists by Theorem 30. Since Я • A
exists as a Stieltjes integral, it is easy to check that Я s L(A), since A is
continuous, and the result follows from Theorem 20. ?
In the preceding corollary the semimartingale X is continuous, hence
[X, X] = [M, M] and the hypothesis can be written equivalently as
/ H?d[X,X].+ I \Hs\\dAs
o Jo
< oo a.s.
Jo ~ Jo
each t > 0.
We end our treatment of martingales with a special case that yields a
particularly simple condition for Я to be in L(M).

174 IV General Stochastic Integration and Local Times
Theorem 31. Let M be a local martingale with jumps bounded by a constant
P. Let H eV be such that f* H2d{M,M]s < oo a.s., t > 0, and E{H%} < oo
for any bounded stopping time T. Then the stochastic integral (Jo HsdMs)t>o
exists and it is a local martingale.
Proof Let Rn = inf{t > 0 : fQH2sd[M,M]s > n}, and let Tn = mm(Rn,n).
Then T™ are bounded stopping times increasing to oo a.s. Note that
E{ f
./о
2
H2d[M,M]s}<n-
< n + C2Е{Щп} < oo,
and the result follows from Theorem 28. ?
The next theorem is, of course, an especially important theorem, the Dom-
Dominated Convergence Theorem for stochastic integrals.
Theorem 32 (Dominated Convergence Theorem). Let X be a semi-
martingale, and let Hm e V be a sequence converging a.s. to a limit H. If
there exists a process G 6 L(X) such that \Hm\ < G, all m, then Hm, H are
in L(X) and Hm ¦ X converges to H ¦ X in ucp.
Proof. First note that if \J\ < G with J e V, then J e L(X). Indeed, let
(T")n>i increase to oo a.s. such that G is (H2,XTn~) integrable for each n.
Then clearly
E{ / J2d[N,N]s} + E{( / |Js||cL4s|J}
Jo Jo
/•OO POO
<E{ G2d[N,N]s} + E{( |Gs||rfls|J}<oo,
Jo Jo
and thus J is (H2,XT"~) integrable for each n. (Here N + A is the canonical
decomposition of XT"~.)
To show convergence in ucp, it suffices to show uniform convergence in
probability on intervals of the form [0, to] for to fixed. Let e > 0 be given, and
choose n such that P(Tn < t0) < e, where Хт"~ E H2 and G is (W2,XT"~)
integrable. Let XT"~ = N + A, the canonical decomposition. Then
E{snp\Hm ¦ X1- - H ¦ XTn~\2}
t<t0
< 2EISUP \(Hm - H) ¦ N\2} + 2E{( / \H™ - Hs\\dAs\J}.
t<t0 J0
The second term tends to zero by Lebesgue's Dominated Convergence Theo-
Theorem. Since \Hm -H\< 2G, the integral (Hm - H) ¦ ~N is a square integrable
martingale (by the lemma preceding Theorem 28). Therefore using Doob's
maximal quadratic inequality, we have

2 Stochastic Integration for Predictable Integrands 175
?{sup \{Hm - H) ¦ АГ|2} < AE{\{Hm - H) ¦ Nto\2}
t<t0
rto
= 4E{[°(H™-HsJd{N,N}s},
Jo
and again this tends to zero by the Dominated Convergence Theorem. Since
convergence in L2 implies convergence in probability, we conclude for 8 > 0,
limsupP{sup \Hm Xt-H -Xt\>6}
m—юо t<t0
< limsupP{sup \((Hm - H) ¦ XTn~)t\ > 6} + P(Tn < t0)
m->oo t<t0
and since e is arbitrary, the limit is zero. ?
We use the Dominated Convergence Theorem to prove a seemingly in-
innocuous result. Generalizations, however, are delicate as we indicate following
the proof.
Theorem 33. Let F = (^)«>о and G — (Gt)t>o be two filtrations satisfying
the usual hypotheses and suppose Tt С Qt, each t > 0, and that X is a
semimartingale for both F and G. Let H be locally bounded and predictable for
F. Then the stochastic integrals Hr ¦ X and HG ¦ X both exist, and they are
equal.6
Proof. It is trivial that H is locally bounded and predictable for (Gt)t>o as
well. By stopping, we can assume without loss of generality that H is bounded.
Let
H = {all bounded, T predictable Я such that H? ¦ X = HG ¦ X}.
Then H is clearly a monotone vector space, and H contains the multiplicative
class bL by Theorem 16 of Chap. II. Thus using Theorem 32 and the Monotone
Class Theorem we are done. D
It is surprising that the assumption that H be locally bounded is impor-
important. Indeed, Jeulin [114, pages 46, 47] has exhibited an example which shows
that Theorem 33 is false in general.
Theorem 33 is not an exact generalization of Theorem 16 of Chap. II.
Indeed, suppose F and G are two arbitrary filtrations such that X is a semi-
semimartingale for both F and G, and H is bounded and predictable for both of
them. If It = Tt П Qt, then X is still an (Xt)t>o semimartingale by Strieker's
Theorem, but it is not true in general that H is (lt)t>o predictable. It is an
open question as to whether or not Hf ¦ X = Hg ¦ X in this situation. For a
partial result, see Zheng [248].
¦ X and Hg ¦ X denote the stochastic integrals computed with the filtrations
)г>о and (Gt)t>o, respectively.

176 IV General Stochastic Integration and Local Times
Example (Emery's example of a stochastic integral behaving badly).
The following simple example is due to M. Emery, and it has given rise to the
study of sigma martingales, whose need in mathematical finance has become
apparent. Let X = (Xt)t>o be a stochastic process given by the following
description. Let T be an exponential random variable with parameter Л = 1,
let U be an independent random variable such that P{U = 1} = P{U =
-1} = 1/2, and set X = Ul{t>Ty. Then X together with its minimal filtration
satisfying the usual hypotheses is a martingale in Ti2. That is, X is a stopped
compound Poisson process with mean zero and is an L2 martingale. Let Ht =
f l{t>o}- Therefore H is a deterministic integrand, continuous on @,oo), and
hence predictable. Consequently the path-by-path Lebesgue-Stieltjes integral
Zt = /o HsdXs exists a.s. However H ¦ X is not locally in W for any p > 1.
(However since it is still a semimartingale7, it is prelocally in H2.) Moreover,
even though X is an L2 martingale and Я is a predictable integrand, the
stochastic integral H ¦ X is not a local martingale because -EJIZsl} = оо for
every stopping time S such that P(S > 0) > 0.
The next theorem is useful, since it allows one to work in the convenient
space Ti2 through a change of measure. It is due originally to Bichteler and
Dellacherie, and the proof here is due to Lenglart. We remark that the use of
the exponent 2 is not important, and that the theorem is true for H? for any
p> 1.
Theorem 34. Let X be a semimartingale on a filtered complete probability
space (fi, T, F, P) satisfying the usual hypotheses. Then there exists a proba-
probability Q which is equivalent to P such that under Q, X is a semimartingale
in Ti2. Moreover, ^ can be taken to be bounded.
Before we begin the proof we establish a useful lemma.
Lemma. Let (fi, J7, P) be a complete probability space and let Xn be a se-
sequence of a.s. finite valued random variables. There exists another probability
Q, equivalent to P and with a bounded density, such that every Xn is in
L2(dQ).
Proof. Assume without loss that each of the Xn is positive. For a single ran-
random variable X we take Ak = {k < X < к +1} and Y = J2k>i 2~к1лк- Then
Y is bounded and
For the general case, choose constants an such that P(Xn > an) < 2~". The
Borel-Cantelli Lemma implies that a.s. Xn < an for all n sufficiently large.
Next choose constants cn such that ^n>i c«an < °°- Let Yn be the bounded
7 H ¦ X is still a semimartingale since it is of finite variation a.s. on compact time
sets.

2 Stochastic Integration for Predictable Integrands 177
density chosen for Xn individually as done in the first part of the proof, and
take Y = J2n>i спУп so that we have the result. ?
Proof of Theorem 34- Using the lemma we make a first change of measure
making all of the random variables [X, X]n integrable. Recall that [X, X] is
invariant under a change to an equivalent probability measure. By abusing
notation, we will still denote this new measure by P. This implies that if
Jt = sups<t |ДХЯ|, then J2 < [X, X]t and thus J is locally in I?. Hence it
is also a fortiori locally in L1, and thus X is special. We write its canonical
decomposition X = M + A, with Xo = Ao. We now make a second change
to an equivalent law, this time called Q, again using the lemma, such that
Jq \dAs\ 6 L2(dQ) for each n. Then X is special under Q as well, since each
[X, X]n remains in L1 for Q. Let X = N + C denote its Q decomposition. We
apply Girsanov's Theorem to get
I
-z—d(M, Z)a
0 ^s-
where Zt = Ep{Z\Tt\ with Z = ^p. For convenience we write this as Ct =
At + Vt. We now need to establish that ?q{(/0* \dv*\?} = EQ{\v\t} < °°-
Note that we have Eq{V?} < oo, since Vt = Mt - At, and At ? L2(dQ). It
is easy to check that Ep{[M,M]n} < 4Ep{[X,X]n} < oo by an argument
similar to the proof of the lemma preceding Theorem 24. Finally L2(dP) С
L2(dQ). Since V is predictable, there exists a predictable process H taking
values in { — 1,1} such that Jo \dVs\ = JQ HsdVs. Setting 5 = H ¦ X, we have
[S,S] = [X, X], and the random variables [S,S]n are then integrable (dP). S
has a decomposition 5 = L + D where L = H -M and D = H -A, hence under
the law Q we have L = H-N+H-V, always using that the stochastic integral is
invariant under a change to an equivalent probability measure. We deduce that
Eq{(!o HsdVsJ} < oo. Since EQ{(J* HsdVsJ} = EQ{(J* \dVs\J}, coupled
with \H ¦ A\t = \A\t, the result follows. ?
Corollary (Lenglart's Inclusion Theorem). Let (?l,J-,W,P) satisfy the
usual hypotheses, and suppose Q is an equivalent probability measure with a
bounded density with respect to P. Then H2(P) С H2(Q). That is, U2 with
respect to P is contained in the space of H2 semimartingales with respect to
Q.
Proof. Let X be an H2 semimartingale with respect to P. As we saw in
the proof of Theorem 34, since Q has a bounded density, we obtain that
[X, X] is in Ll{dQ), and hence X is special for Q. We then obtain a canonical
decomposition of X and show both terms are in 7i2 analogously to how we
did it in the proof of Theorem 34. D

178 IV General Stochastic Integration and Local Times
3 Martingale Representation
In this section we will be concerned with martingales, rather than semimartin-
gales. The question of martingale representation is the following. Given a col-
collection Л of martingales (or local martingales), when can all martingales (or
all local martingales) be represented as stochastic integrals with respect to
processes in A? This question is surprisingly important in applications, and
it is particularly interesting (in finance theory, for example) when Л consists
of just one element.
Throughout this section we assume as given an underlying complete, fil-
filtered probability space (fi, J-, (J-t)t>o, P) satisfying the usual hypotheses. We
begin by considering only I? martingales. Later we indicate how to extend
these results to locally square integrable local martingales.
Definition. The space M2 of I? martingales is all martingales M such
that supt E{M?} < oo, and Mo = 0 a.s. Notice that if M e M2, then
lim^oo^jM2} = E{M^} < oo, and Mt = E{Moo\J:t}. Thus each MeM2
can be identified with its terminal value M^. We can endow M2 with a norm
and also with an inner product
(M,N)=E{MOONOO},
for M,N ? M2. It is evident that M2 is a Hilbert space and that its dual
space is also M2.
If E{M2} < oo for each t, we call M a square integrable martingale.
If in addition supt E{M2} = E{M%O) < oo, then we call M an L2 martingale.
The next definition is a key idea in the theory of martingale representation.
It differs slightly from the customary definition because we are assuming all
martingales are zero at time t = 0. If we did not have this hypothesis, we
would have to add the condition that for any event Л 6 To, any martingale
M in the subspace F, then M\a ? F.
Definition. A closed subspace F of M2 is called a stable subspace if it is
stable under stopping (that is, if M e F and if T is a stopping time, then
MT e F).8
Theorem 35. Let F be a closed subspace o/M2. Then the following are
equivalent.
(a) F is closed under the following operation. For M ? F, (M - МгIл ? F
for Л ? Tt, any t > 0.
(b) F is a stable subspace.
Recall that Mj = MtAT, and Mo = 0.

3 Martingale Representation 179
(c) If M 6 F and H is bounded, predictable, then (f* HsdMs)t>o = H ¦ M e
F.
(d) If M e F and H is predictable with ?{/0°° H2d[M,M]s} < oo, then
H -M ? F.
Proof Property (d) implies (c), and it is simple that (c) implies (b). To get
(b) implies (a), let T = tA, where
Then T — tA is a stopping time when A 6 Tt, and (M - МгIа = M — MT;
since F is assumed stable, both M and MT are in F. It remains to show only
that (a) implies (d). Note that if H is simple predictable of the special form
г=\
with Ai ? Tti, 0 < i < n, 0 = t0 < tx < ¦ ¦ ¦ < tn+1 < oo, then H ¦ M e F
whenever M ? F. Linear combinations of such processes are dense in bL which
in turn is dense in bV under <^м(-, •) by Theorem 2. But then hV is dense in
the space of predictable processes Ti such that E{f^° H^d[M, M]s} < oo, as
is easily seen (cf., Theorem 14). Therefore (a) implies (d) and the theorem is
proved. ?
Since the arbitrary intersection of closed, stable subspaces is still closed
and stable, we can make the following definition.
Definition. Let Л be a subset of M2. The stable subspace generated by
Л, denoted <S(.4), is the intersection of all closed, stable subspaces containing
Л.
As already noted on page 178, we can identify a martingale M e M2
with its terminal value M^ e L2. Therefore another martingale ./V € M2 is
(weakly) orthogonal to M if ^{Л^ооМоо} = О. There is however another,
stronger notion of orthogonality for martingales in M2.
Definition. Two martingales N, M e M2 are said to be strongly orthog-
orthogonal if their product L = NM is a (uniformly integrable) martingale.
Note that if JV, M € M2 are strongly orthogonal, then NM being a
(uniformly integrable) martingale implies that [N, M] is also a local martin-
martingale by Corollary 2 of Theorem 27 of Chap. II. It is a uniformly integrable
martingale by the Kunita-Watanabe inequality (Theorem 25 of Chap. II).
Thus M,N ? M2 are strongly orthogonal if and only if [M, N] is a uni-
uniformly integrable martingale. If N and M are strongly orthogonal then
^{Л^ооМоо} = S{Loo} = E{L0} = 0, so strong orthogonality implies or-
orthogonality. The converse is not true however. For example let M e M2, and

180 IV General Stochastic Integration and Local Times
let Y e To, independent of M, with P(Y = 1) = P(Y = -1) = \. Let
Nt = YMU t > 0. Then TV G M2 and
EiN^M^} = E{YMl) = E{Y}E{Ml} = 0,
so M and TV are orthogonal. However MN = YM2 is not a martingale (unless
M = 0) because ?{yM2|JF0} = y?{Mt2|.Fo} ^ о = M02.
Definition. For a subset A of M2 we let Ax (resp. Ax) denote the set of all
elements of M2 orthogonal (resp. strongly orthogonal) to each element of A.
Lemma 1. If A is any subset of M2, then Ax is (closed and) stable.
Proof. Let Mn be a sequence of elements of Ax converging to M, and let
TV 6 A. Then MnN is a martingale for each n and Ax will be shown to be
closed if MN is also one, or equivalently that [M, TV] is a martingale. However
E{\[Mn,N] - [M,N}t\} = E{\[Mn-M,N]t\}
< (E{[Mn -M,Mn- \t]t})^2(E{[N, Nit}I/2
by the Kunita-Watanabe inequalities. It follows that [Mn,N]t converges to
[M, N]t in L1, and therefore [M, N] is a martingale, and Ax is closed. Also Ax
is stable because M e Ax, N 6 A implies [MT, N] = [M, N]T is a martingale
and thus MT is strongly orthogonal to N. D
Lemma 2. Let N, M be in M2. Then the following are equivalent.
(a) M and N are strongly orthogonal.
(b) S(M) and N are strongly orthogonal.
(c) S(M) and S(N) are strongly orthogonal.
(d) S(M) and N are weakly orthogonal.
(e) S(M) and S(N) are weakly orthogonal.
Proof. If M and N are strongly orthogonal, let A = {./V} and then M e Лх.
Since Ax is a closed stable subspace by Lemma 1, S(M) с {Аг}х- Therefore
(b) holds and hence (a) implies (b). The same argument yields that (b) implies
(c). That (c) implies (e) which implies (d) is obvious. It remains to show that
(d) implies (a).
Suppose N is weakly orthogonal to S(M). It suffices to show that [N, M]
is a martingale. By Theorem 21 of Chap. I it suffices to show E{[N, М]т} = О
for any stopping T. However E{[N,M]T} = E{[N, M7}^ = 0, since TV is
orthogonal to MT which is in S(M). О
Theorem 36. Let M1,..., Mn 6 M2, and suppose Мг, iVP are strongly or-
orthogonal for i ф j. Then S(M1,... ,M") consists of the set of stochastic
integrals
n
H1 ¦ M1 + ¦ ¦ ¦ + Hn ¦ Mn = ]T #* • M\

3 Martingale Representation 181
where Hl is predictable and
Е{П{Н\)Ч{М\М%}
Jo
poo
< oo, 1 < i < n.
Proof. Let I denote the space of processes Yl7=i ^% ' Мг, where Нг satisfy
the hypotheses of the theorem. By Theorem 35 any closed, stable subspace
must contain X. It is simple to check that X is stable, so we need to show only
that X is closed. Let
/•OO
= {HeV:E{ H*d[M, M}s} < oo}.
Jo
Then the mapping (ff^ff2,... ,#") —> J2"=i H% ' Ml is an isometry from
L2Mi © • • • © L2Mn into M2. Since it is a Hilbert space isometry its image X is
complete, and therefore closed. ?
Theorem 37. Let Abe a subset of M2 which is stable. Then Ax is a stable
subspace, and if M 6 A then M is strongly orthogonal to A. That is, A-1 =
Ax, andS(A) = A^ =AxX =AXX.
Proof. We first show that Ax - Ax. Let M ? A and N ? Ax. Since TV is or-
orthogonal to S(M), by Lemma 2, TV and M are strongly orthogonal. Therefore
Ax С Ax. However clearly Ax С Ах, whence Ax = Ах, and thus Ах = Ax
is a stable subspace by Lemma 1.
By the above applied to Ax, we have that (Ах)х = (Ax)x.
It remains to show that S(A) — Ахх. Since Axx = A, the closure of A
in M2, it suffices to show that A is stable. However it is simple to check that
condition (a) of Theorem 35 is satisfied for A, since it already is satisfied for
A, and we conclude A is a stable subspace. ?
Corollary 1. Let A be a stable subspace of M2. Then each M e M2 has a
unique decomposition M = A + B, with A e A and В G Ax.
Proof. A is a closed subspace of M2, so each M e M2 has a unique decom-
decomposition into M = A + В with A ? A and В e Ax. However Ax = Ax by
Theorem 37. ?
Corollary 2. Let M, TV e M2, and let L be the projection of TV onto S(M),
the stable subspace generated by M. Then there exists a predictable process
H such that L = H -M.
Proof. We know that such an L exists by Corollary 1. Since {M} consists of
just one element we can apply Theorem 36 to obtain the result. D

182 IV General Stochastic Integration and Local Times
Definition. Let A be finite set of martingales in M2. We say that A has the
(predictable) representation property if T = M2, where
n
1={X : X = ]?я* • M*, Af* e A},
each Hг predictable such that
poo
E{ I {HiJd[M\M%} < oo, 1 < i < n.
Jo
Corollary 3. Let A = {M1,...,Mn} С М2, and suppose M\M^> are
strongly orthogonal for i ф j. Suppose further that if N 6 M2, N ± A in the
strong sense implies that N = 0. Then A has the predictable representation
property.
Proof. By Theorem 36 we have S(A) = I. The hypotheses imply that
= {0}, hence S(A) = M2. ?
Stable subspaces and predictable representation can be considered from
an alternative perspective. Up to this point we have assumed as given and
fixed an underlying space (Q, F, (^)г>о, Р), and a set of martingales A in
M2. We will see that the property that S(A) = M2, intimately related to
predictable representation (cf., Theorem 36), is actually a property of the
probability measure P, considered as one element among the collection of
probability measures that make L2 (ft)t>o martingales of all the elements of
A.
Our first observation is that since the filtration F = (J-t)t>o is assumed to
be P complete, it is reasonable to consider only probability measures that are
absolutely continuous with respect to P.
Definition. Let А С M2. The set of M2 martingale measures for A,
denoted Ai2(A), is the set of all probability measures Q defined on Vo<t<<x> -^t
such that
(i) Q « P,
(ii) д = Роп^0)
(Hi) if X G A then X e M2(Q),
where M2(<3) denotes all (Q,?) L2 martingales.9
Lemma. The set M.2 is a convex set.
9 Note that while F satisfies the usual hypotheses under P, Jo does not contain all
of the Q-null sets, and thus F does not completely satisfy the usual hypotheses
under Q. This does not create any problems, however.

3 Martingale Representation 183
Proof. Let Q and R ? M2(A), and let S = \Q + A - \)R, 0 < A < 1. Then
for X e M2(A),
supEs{M2} = sup[A?Q{M2} + A - \)ER{M2}} < oo,
t t
since Q, R e M2(A). Also if Я ? hTs, s <t, then
Es{MtH} = \EQ{MtH} + A - А)?Я{М4Я}
= A?q{Ms#} + A - \)ER{MSH}
= ES{MSH},
and 5 ? Л<2(Л). П
Definition. A measure Q e M2(A) is an extremal point of M2(A) if
whenever Q = \R + A - AM with R, S ? Л<2(Л), i? ^ 5, 0 < A < 1, then
A = 0or 1.
Theorem 38. ?e? А С M2. If S(A) = M2 Йеп Р is an extremal point of
M2(A).
Proof. Suppose P is not extremal. We will show that S(A) Ф M2. Since P is
not extremal, there exist Q,R? M2(A), Q Ф R, such that P = A<3 + A-A)i?,
0 < A < 1. Let
Loo = dP'
and let Lt = E{^p\J-t}- Then 1 = ^ = Aioo + A~A)^ > A-Lqo a.s., so that
-^oo < \ a.s. Therefore L is a bounded martingale with Lo = 1 (since Q = P
on .Fo), and thus L - Lo is a nonconstant martingale in M2(P). However, if
X ? A and Я € hTv,, then X is a Q martingale and for s < t,
EP{XtLtH} = EpiXtL^H} = EQ{XtH} = EQ{XSH}
= Ep{XsLooH}
= EP{XSLSH},
and XL is a P martingale. Therefore X(L - Lo) is a P martingale, and
L - Lo 6 M2 and it is strongly orthogonal to A. By Theorem 37 we cannot
have S(A) = M2. ?
Theorem 39. Let А С M2. If P is an extremal point of ЛЛ2{А), then every
bounded P martingale strongly orthogonal to A is null.
Proof. Let L be a bounded nonconstant martingale strongly orthogonal to A.
Let с be a bound for \L\, and set
dQ = (l -)dP and dR=(l + —)dP.
2c 2c
We have Q, R 6 M2(A), and P — \Q + ^R is a decomposition that shows
that P is not extremal which is a contradiction. D

184 IV General Stochastic Integration and Local Times
Theorem 40. Let Л = {M1,..., M"} С М2, with Мг continuous and M\
Mi strongly orthogonal for i ф j. Suppose P is an extremal point of Л42(А).
Then
(a) every stopping time is accessible;
(b) every bounded martingale is continuous;
(c) every uniformly integrable martingale is continuous; and
(d) Л has the predictable representation property.
Proof, (a) Suppose T is a totally inaccessible stopping time and P(T < oo) >
0. By Theorem 22 of Chap. Ill, there exists a martingale M with АМт =
1{Т<оо} and M continuous elsewhere. Moreover, the martingale M can be
taken of finite variation with Mo — 0. Therefore, since each M% is continuous,
[M, Мг] = О. Moreover taking (for example)
Rn = inf{t > 0 : \Mt
R
\MR | < n + 1 and thus M is locally bounded. Indeed, we then have M
are bounded martingales strongly orthogonal to Ml. By Theorem 39 we have
MR = 0 for each n. Since limn^oo-R" = o° &-s., we conclude M = 0, a
contradiction.
(b) Let M be a bounded martingale which is not continuous, and assume
Mq = 0. Let TE = inf{? > 0 : |AMt| > e}. Then there exists e > 0 such
that for T = Te, P{\AMT\ > 0} > 0. By part (a) the stopping time T is
accessible, hence without loss we may assume that T is predictable. Therefore
MT~ is a bounded martingale by the corollary to Theorem 29, whence N =
MT — MT~ = AMj1{ojj is also a bounded martingale. However N is a
finite variation bounded martingale, hence [N, Мг] = 0 each i. That is, N is
a bounded martingale strongly orthogonal to Л. Hence N = MT — MT~ = 0
by Theorem 39, and we conclude that M is continuous.
(c) Let M be a uniformly integrable martingale closed by M^. Define
Then Mn are bounded martingales and therefore continuous by part (b).
However
1
Pjsup \МГ - Mt\ > e} < - E{\M" - Moo }
t ?
by an inequality of Doob10, and the right side tends to 0 as n tends to сю.
Therefore there exists a subsequence (n^) such that limfc^oo M"k = Mt a.s.,
uniformly in t. Thus M is continuous.
(d) By Corollary 3 of Theorem 37 it suffices to show that if N ? Ax then
N = 0. Suppose N ? Ax. Then N is continuous by (c). Therefore N is locally
bounded, and hence by stopping, N must be 0 by Theorem 39. ?
See, for example, Breiman [23, page 88].

3 Martingale Representation 185
The next theorem allows us to consider subspaces generated by countably
infinite collections of martingales.
Theorem 41. Let M e M2, Y" € M2, n > 1, and suppose Y^ converges
to Уоо in L2, and that there exists a sequence Hn e L(M) such that Y™ =
j0 H™dMs, n > 1. Then there exists a predictable process H ? L(M) such
thatYt = f*HsdMs.
Proof. If Y? converges to Y"oo in L2, then Y" converges to Y in M2. By
Theorem 36 we have that S{M) = X(M), the stochastic integrals with respect
to M. Moreover Y" e S(M), each n. Therefore Y is in the closure of <S(M);
but S(M) is closed, so У € S(M) = I(M), and the theorem is proved. ?
Theorem 42. Let Л = {Ml,M2,..., M",...}, with M' ? M2, and suppose
there exist disjoint predictable sets A1 such that lAid[Ml,Ml] = d[Mi,Ml],
i>l.LetAt = YZLi /о иЛ»Нм^м%- Suppose that
(a) ElAoo} < oo; and
(b) for J* С J^oo such that for any Xi € ЪР, we have Xlt = Е{Х*\Тг} =
/0 H\dM\, t > 0, for some predictable process Hl.
Then M = Yl'iLi Мг exists and is in M2, and for any Yeb\JiJri,ifYt =
E{Y\Ft), we have that Yt = f0HsdMs, for the martingale M = ?Si Mi
and for some H € L(M).
Proof Let Nn = Y?i=iM\ Then [Nn,Nn}t = EtiS^MW]
hence E{(N^J} = E{{Nn, N"}^} < ?{Ax,}, and Nn is Cauchy in M2 with
limit equal to M. By hypothesis we have that if Xх 6 ЪТг then
= Х\= I H\dM\ = f \Ai
Jo Jo
= f iAiHt
Jo
Therefore if i ф j we have that
= f lAiHilAJ
Jo
- / 1л*пл>ЩЩ<1[М,М\.
Jo
= 0,
since Лг П A> =0, by hypothesis. However using integration by parts we have

186 IV General Stochastic Integration and Local Times
xixi = f x\_dxi + [ xi_dx\ + [x\
Jo Jo
= / X\_dX{ + j Xi_dX\
Jo Jo
/
o
i*dMa,
f
Jo
= f Hi*
Jo
where H1^ is defined in the obvious way. By iteration we have predictable
representation for all finite products \\i<n Xх. The Monotone Class Theorem
together with Theorem 41 then yields the result. D
We conclude this section by applying these results to a very important
special case, namely n-dimensional Brownian motion. This example shows
how these results can be easily extended to locally square integrable local
martingales.
Theorem 43. Let X = (X1,..., Xn) be an n-dimensional Brownian motion
and let F — (ft)o<t<oo denote its completed natural filtration. Then every
locally square integrable local martingale M for F has a representation
where Hl is (predictable, and) in L(Xl).
Proof. Fix to, 0 < t < to, and assume X is stopped at to- Then letting
Л = {Xx,...,Xn}, we have that Л С M2. Let М2(Л) be all probability
measures Q such that Q «; P, and under Q all of Л С M2(<2).n Since
[X1, Xi] is the same process under Q as under P we have by Levy's Theorem
(Theorem 40 of Chap. II) that X = {X1,...,Xn) is a Q-Brownian motion
as well. Therefore, for bounded Borel functions /i,..., /n, we have that for
t\ < t2 • ¦ • < tn < to, fi(Xtl) ПГ=1 fi{Xti+i —Хъ) have the same expectation
for Q as they do for P. Thus P = Q on Fto, and we conclude that M2(A)
is the singleton {P}. The probability law P is then trivially extremal for
Л42(Л), and therefore by Theorem 40 we know that Л has the predictable
representation property.
Suppose M is locally square integrable and Mt = MtM0 ¦ Let (Т*%>о be
a sequence of stopping times increasing to oo a.s., To = 0, and such that
MT € M2, each к > 0. Then by the preceding there exist processes H''k
such that
мтк = Тн*>к-.
11 Recall that M2(Q) denotes all L2 Q martingales.

3 Martingale Representation 187
By defining Hl = YX=\ нг'к\тк~\т^ we have that W e Ь{Хг) and also
м =
Next let (te)e>o be fixed times increasing to oo with to = 0. For each tt we
know there exist Нг'е such that for a given locally square integrable local
martingale N
*=i
By defining W = YT=iHi'ei(U-uU] one easily checks that W € ЦХг) and
that
П
Nt = ^Hl -X1. D
Corollary 1. As in Theorem 43, let F be the completed natural filtration of
an n-dimensional Brownian motion. Then every local martingale M for F is
continuous.
Proof. In the proof of Theorem 43 we saw that the underlying probability
law P is extremal for Л = {X1,... ,Xn}. Therefore by Theorem 40(c), ev-
every uniformly integrable martingale is continuous. The corollary follows by
stopping. D
Corollary 2. Let X — (X1,..., Xn) be an n-dimensional Brownian motion
and let F be its completed natural filtration. Then every local martingale M
for F has a representation
Mt = Mo + J2 I H\
where Hl are predictable.
Proof. By Corollary 1 any local martingale M is continuous, hence it is locally
square integrable. It remains only to apply Theorem 43. D
Corollary 3. Let X = {X1,..., Xn) be an n-dimensional Brownian motion
and let F be its completed natural filtration. Let Z e Т^ be in L1. Then
there exist H% predictable in L(X%) with /0°°(i/*Jds < oo a.s. such that
Htdxt.

188 IV General Stochastic Integration and Local Times
Proof. Let Zt = E{Z\Jr{\, taking the cadlag (and hence continuous) version.
By Corollary 2 we have
By Theorem 42 of Chap. II we have that Zt = B\ztz]t for some Brownian
motion B, 0 < t < oo. Letting t tend to oo shows that Z^ = B\\mt^oo^z,z\t^
which shows that \Z,Z\OO < oo a.s. However,
< oo a.s.
" /-OO " l-t
V / {Hlfds = lim V / {Hlfds = lim [Z,Z]t = [Z.Z]
Finally, take t = oo, observe that Z^ = Z and that Tq is a.s. trivial. Hence
Zq is constant and therefore Zq = E{Z}. О
Corollary 4. Let X = (X1,... ,Xn) be an n-dimensional Brownian motion
and let F be its completed natural filtration. Let Z e I/1(^roo) and Z > 0 a.s.
Then there exist Jl predictable with J^°(JlJds < oo a.s. such that
Proof. By Corollary 3 there exist predictable H% such that if Z( = E{Z\Tt\,
then
Zt = ?{*} ? Г
Therefore
+ E Г
(Note that since (^t)t>o is continuous and never 0, -^ is locally bounded.)
The proof is completed by setting J* = -%-Нг8 and taking exponentials of both
sides. G
Corollary 5. Let F be the completed natural filtration of an n-dimensional
Brownian motion. If T is a totally inaccessible stopping time, then T = oo
a.s.
Proof. This is merely Theorem 40 (a). D
We end this section with a quite general martingale representation result,
which while often true, nevertheless requires a countable number of martin-
martingales to have the representation. We then apply it to give a condition for
compensators to have absolutely continuous paths. But first we state a defi-
definition from measure theory that is not widely known.

3 Martingale Representation 189
Definition. A measurable space (f2, T) is said to be separable if there exists
a countable family of functions on (or subsets of) Q which generate the a-
algebra T.
Theorem 44. Let (fi, J7, F, P) be a filtered complete probability space satisfy-
satisfying the usual hypotheses. Assume also that the o-algebra T — T^ and that it is
separable. Then there exists a countable sequence of martingales (M\,M2, ¦ • •)
in L2 such that they are orthogonal, J2i>i E{[M\ M*]oo} < oo, and such that
if N is any L2 martingale, then there exists a sequence of predictable pro-
processes (HX,H2,...) such that Nt = J2i>i /о Щ^^а- That is, the martingales
{Мг,г > 1} have martingale representation.
Proof. Since J- is separable, there exists a countable basis of L2, call it
(M°,M1, M2,...), where M° is constant and Е{М*} = 0 for г ф 0. One can
take such a basis to be orthonormal in the usual way. That is, Е{МгМ:>} = О
for i ф j, and E{(M1J} = 1 for all i. If one then multiplies Mi by 2~г and
defines the martingales Ml = E{M%\Tt}, we have the construction. The space
of stochastic integrals with respect to all of the Ml where the sum converges
in L2 is easily seen to be a stable subspace. Next let N be any martingale in
L2 with No = 0. Then we have
rt
I + Lt
where L is orthogonal to the stable subspace generated by the countable
collection of martingales (Ml)i>\. This implies that L^ is orthogonal to all
of the random variables M^ — Ml. Since these random variables form a basis
of L2, this implies that L must be constant. Since Lo = 0 we have that L is
identically zero. ?
Before we apply Theorem 44 to compensators, we need to recall and for-
formalize two definitions, and prove Lebesgue's change of time formula. The first
definition we have already seen in Exercise 7 of Chap. Ill, and the second one
we saw in Theorem 42 of Chap. II as well as Exercises 31 through 36 of that
chapter.
Definition. A (right continuous) filtration F on a filtered probability space
satisfying the usual hypotheses is called quasi left continuous if for every
predictable stopping time T one has Тт = Ft- ¦
Note that if T is a predictable stopping time, and if Tn is an announcing
sequence of stopping times for T, and if X e L1, then limn^oo E{X\J-j-n} =
E{X\J-t-}. If M is a uniformly integrable martingale, then
Е{Мт\^т-} = lim E{MT\TTn} = lim MTn = MT--
n—>oo n—>oo
Therefore E{&Mt\Ft-} = 0 a.s. and if АМт is Тт- measurable, it must
be 0 almost surely. So if F is quasi left continuous, no martingales can jump

190 IV General Stochastic Integration and Local Times
at predictable times. Since the accessible part of any stopping time can be
covered by a countable sequence of predictable times, we conclude that if
F is quasi left continuous then martingales jump only at totally inaccessible
stopping times.
Definition. Let A — {At)t>o be an adapted, right continuous increasing
process, which need not always be finite-valued. The change of time (also
known as a time change) associated to A is the process
rt = inf{s > 0 : As > t}.
Some observations are in order. We have that t н-> rt is non-decreasing
and hence rt_ exists. Also, since {At > s} = U?>o{^< > s + ?}> we nave
that t н-> rt is right continuous. It is continuous if A has strictly increasing
paths. Moreover ATt > ATt_ > t, and A(Tt_)_ < -A(rt)_ < t. (Here we use the
convention tq- = 0.) Finally note that {rs_ < i) = {s < At}, which implies
that T(_ is a stopping time, and since Tj = lim?_>o T(t+?)- we conclude that r^
is also a stopping time.
Theorem 45 (Lebesgue's Change of Time Formula). Let a be a positive,
finite, right continuous, increasing function on [0, oo). Let с denote its right
continuous inverse (change of time). Let f be a positive Borel function on
[0, oo). If G is any positive, finite, right continuous function on [0, oo) with
G(O-) = 0, then
/•OO /*OO
/ f(s)dG(a(s)) = / f(c(s-))l{c(s_)<oo]dG(s),
Jo Jo
and in particular
лОО ЛОО
/ f(s)da(s) = / f(c(s))l{c(s)<oo}ds.
Jo Jo
Here the integrals are taken over the set [0, oo).
Proof. First consider / of the form f(s) = l[0)U](s), for 0 < и < со. The
left side of the equation then reduces to G(a(u)). For the right side note that
/(c(s-))l{c(s_)<oo} = l{c(s_)<u} = l{s<a(«)} and it follows that the right side
is also equal to G(a(u)). By subtraction and linearity we also have the result
for / of the form 1 («,„]. The Monotone Class Theorem gives the result for any
/ positive with compact support, and approximations then gives the general
case. If we take G(s) = s we get /0°° f(s)da(s) - /0°c/(c(s-))l{c(s_)<oo}ds,
and since с only jumps at most countably often because it is increasing, and
since ds does not charge countable sets, we have /0°° f(c(s—))l{c(s-)<oo}^s =
Jo° /(c(s))l{c(s)<oo}^s and the second statement is proved, whence the theo-
theorem. D

3 Martingale Representation 191
Corollary. Let a be a positive, finite, continuous, strictly increasing function
on [0,oo). Let с denote its continuous inverse (change of time). Let / be a
positive Borel function on [0,00). Then
fC(t) ft
/ f(s)da(s) = / f(c(s))ds.
Jo Jo
Proof. It suffices to rewrite the left side of the equation as
l[o,c(t)](s)f(s)da(s)
r°
/
Jo
/
Jo
and observe that с is also continuous and strictly increasing, which implies
VcWiW5)) = 1[o,*](s)- a
Our goal, stated informally in words, is if a filtration is quasi left contin-
continuous, then modulo a change of time, all compensators of adapted counting
processes with totally inaccessible jumps have paths which are absolutely con-
continuous. This is achieved in Theorem 47. We begin with two definitions.
Definition. Let (Q,J-, F, P) be a filtered probability space satisfying the
usual hypotheses, and let <S denote the space of all square integrable martin-
martingales with continuous paths a.s. It is easy to see that <S is a stable subspace.12
For an arbitrary square integrable martingale M, let Mc denote the orthogo-
orthogonal projection of M onto <S. Then Mc is called the continuous martingale
part of M. If we write M = Mc + (M — Mc) as its orthogonal decomposition,
then we have Md = (M-Mc) where Md is called the purely discontinuous
part of the martingale M.
Note that if a local martingale is locally square integrable, we can extend
the definition of continuous part and purely discontinuous part trivially, by
stopping. Also note that the term "purely discontinuous" is misleading: it
is not a description of a path property of a martingale, but rather simply
refers to the property of being orthogonal to the stable subspace of continuous
martingales. See Exercises 6 and 7 in this regard.
Definition. Let (fi, ,F,F, P) be a filtered probability space satisfying the
usual hypotheses. We call it an absolutely continuous space if for any
purely discontinuous locally square integrable martingale M, d{M, M)t <C dt.
Theorem 46. Let (fi, T, F, P) be an absolutely continuous space. Then the
compensators for all adapted counting processes with totally inaccessible jump
times and without explosions, are absolutely continuous.
Proof. Let N be an adapted counting process and let N be its compensator,
so that X = N — N is a locally square integrable local martingale. Since N is
continuous we have [X,X]t = Y,s<t(^NsJ = Nt, since (&NSJ = ANS = 1
12 See Exercise 6.

192 IV General Stochastic Integration and Local Times
when N jumps at s. Therefore N = (X,X), and since (X, X) has absolutely
continuous paths by hypothesis, we are done. D
Remark (Yan Zeng). With a little more work, one can show the following:
Let (П, J-, F, P) satisfy the usual hypotheses. If it is an absolutely continuous
space, then all jump times of square integrable martingales are totally inac-
inaccessible. Moreover it is an absolutely continuous space if and only if for every
totally inaccessible stopping time T, if Nt = l{t>r}> then dNt <c dt a.s.
If we take the uninteresting counting process Nt = J2i>i !{*>»} which
jumps at constant (and hence predictable) times, then its compensator is
simply Nt = Nt, and the martingale X = N - N is identically zero. To avoid
these trivialities we assume that the process N is continuous. Note however
that N is not a priori absolutely continuous. This is to explain our hypotheses
in the next theorem:
Theorem 47 (Absolutely Continuous Compensators). Let (?l,F,W,P)
be a filtered probability space satisfying the usual hypotheses. Assume that the
a-algebra J- = J-oq and that it is separable}3 Assume further that F is quasi
left continuous. To prevent trivialities assume also that it is not an absolutely
continuous space. Then there exists a change of time Tt and a new filtration G
given by Qt = TTt, such that if N is a G adapted counting process with totally
inaccessible jump times and without explosions, then dNt <C dt.
Proof. Let (MJ)i>i be the countable sequence constructed in Theorem 44.
Define the strictly increasing process At = t + ^2i>i[M%, M%]t, and note that
E{At\ < oo for 0 < t < oo. Let С denote the compensator of A. One can
check that Ct = t + '^2li>i{M%,M%)t. Then С is both strictly increasing and
continuous, since F is quasi left continuous. Let
Tt = inf{s > 0 : Cs > t} and then Ct = M{s > 0 : rs > t},
where r is the time change of the theorem statement.
Let N be a G adapted counting process and let X = N — N. As-
Assume, by stopping if necessary, that X^ e L2. We write (X, X)G to de-
denote the sharp bracket taken using the G filtration. Note that Lt = Nct
is also a counting process, and since r continuous implies that Qct С Tt
(see Exercise 38 of Chap. II), we conclude L is adapted to F. We then have
[L,L]t = Es<t(ALsJ = Lt, hence Lt = (L,L)*t. But also [X,X]t = Nt, and
we have
[X,X]t-{X,X)f = Nt-{X,X)f.
Hence
NCt-(X,X)%=Lt-(X,X)%,
A separable сг-algebra is defined on page 189.

4 Martingale Duality and Jacod-Yor Theorem 193
and by the uniqueness of the compensator of L we have (X, X)^t = (L, L)\.
On the other hand, by Theorem 44 we know that Lt - Lt = J2i>i /о
for some predictable processes (Нг), and thus
ST f i 2 i i?
I
[
o
= / J.dC
for some (predictable) nonnegative processes cl whose existence is assured
because A{М\М% « dCt a.s. The process Js = ?4>1(Я]Jс', ЬУ Fubini's
Theorem. Next, note that since (L-L, L-Lft = (X, X)%t, we have (X, X)f =
(L — L,L — L)?Tt = /on JsdCs, which by the corollary of Lebesgue's change of
time formula, implies {X,X)f = JQ JTsds. D
4 Martingale Duality and the Jacod-Yor Theorem on
Martingale Representation
We have already defined the space H2 for semimartingales; we use the same
definition for martingales, which in fact historically came before the definition
for semimartingales. We make the definition for all p, p > 1.
Definition. Let M be a local martingale with cadlag paths. We define the
W norm of M to be ||M||P = ?{[M,M]^2}1/p- If \\Щн* < oo, we say that
M is in HP. Finally we call Jiv the space of all cadlag martingales with finite
W norm.
It is simple to verify that || • ||WP is a norm if we identify martingales
that are almost surely equal. Indeed, the only property that is not obvi-
obvious is the subadditivity. For the case p = 1 for example, we have that
[M + N,M + N}1/2 < [M,M]^2 + [N,N]V2 since by the Kunita-Watanabe
inequality [M,N] < [M, M]1/2^,^]1/2, which gives the subadditivity. It is
also simple to verify that 7ip is a linear space. We state without proof the
important martingale inequalities known as the Burkholder-Davis-Gundy in-
inequalities, and often referred to by the acronym BDG. (For a proof one can
consult many different sources, including [146] and [44].)
Theorem 48 (Burkholder-Davis-Gundy Inequalities). Let M be a mar-
martingale with cadlag paths and let p > 1 be fixed. Let Mt* = sups<t |MS|. Then
there exist constants cp and Cp such that for any such M
E{[M,M]}}$ <cpE{{m;Y}^ <CpE{[M,M}f}p
for all t, 0 < t < oo. The constants are universal: they do not depend on the
choice of M.

194 IV General Stochastic Integration and Local Times
We have the following simple results concerning Ti1.
Theorem 49. Ti2 С Ti1 and local martingales of integrable variation are a
subset ofTt1.
Proof. First note that if M is a cadlag martingale, then E{^[M,M\OO\ <
(E{[M, Mjoo}I/2 which gives the first statement. For the second, if M has
integrable variation then [M,M]oo = ^8(ДМ8J < (%2S |AMS|J, whence
\/[M, M]oo < /0°° |<iMs|, and taking expectations, we have the result. D
Theorem 50. Ti2 is dense in Ti1 and bounded martingales are dense in Ti1.
Proof. Let M be a local martingale in Ti1. By the Fundamental Theorem of
Local Martingales (Theorem 25 of Chap. Ill) we have M = N + U where
N is a local martingale with jumps bounded by 1 and U has paths of locally
integrable variation. If (Tn)n>i is a sequence of stopping times increasing to oo
a.s., then ||MTn||-fti < ||M||^i, and also MTn converges to M in Ti1. Choose
Tn to be the first time [U, U]t has total variation larger than n, and also
I[N,N}T"| < n-1; that is, Tn = inf{t > 0 : /0*[17,U}t > n, or [N,N]t>n-1}
with Tn increasing to oo. Then [N, N]Tn < n and thus iVTn is in Ti1 for each n.
Therefore we turn out attention to UTn, and since [U, U]Tn < n + \А11тп\, we
need consider only the one jump of U at time Tn : AUrn ¦ Let ? denote
Letting ?k = ?l{|?|<fc}) since ^ is bounded we can compensate ?а
and call the resulting martingale V?. The compensator of ^1{(>т„} is m W2
because ?k is bounded. We have then that V? is in H2 and converges to Vn
as к —> oo in total variation norm, and hence also in Ti.1. This proves that Ti.2
is dense in Ti1, and since bounded martingales are dense in Ti.2, they too are
dense in Ti1. ?
Theorem 51. Let M be a local martingale. Then M is locally in Ti1.
Proof. By the Fundamental Theorem of Local Martingales we know that M =
N + U, where N has jumps bounded by a constant C and U is locally of
integrable variation. By stopping, we thus assume that N is bounded and
U has paths of integrable variation. The result then follows by the previous
theorem (Theorem 49). D
One can further show, by identifying Ttp with Lp through the terminal
value of the martingale, that Tip is a Banach space (in particular it is complete)
for each p > 1. One can further show that the dual space of continuous linear
functional on Tip is TiQ, where - + - = l,l<p<oo. This continues nicely
the analogy with IP except for the case p = 1. It can be shown (see [47])
that the dual of Ti1 is not Ti°°. It turns out that a better analogy than IP
is that of Hardy spaces in complex analysis, where C. Fefferman showed that
the dual of Ti1 can be identified with the space of functions of bounded mean
oscillation, known by its acronym as BMO. With this in mind we define the
space of BMO martingales.

4 Martingale Duality and Jacod-Yor Theorem 195
Definition. Let M be a local martingale. M is said to be in BMO if M is
in H2 and if there exists a constant с such that for any stopping time T we
have
?{(Моо - МТ-?\ТТ} < c2 a.s,
where Mo- = 0 by convention. The smallest such с is defined to be the BMO
norm of M, and it is written ЦМЦвмо- If the constant с does not exist, or if
M is not in H2, then we set ЦМЦвмо = со.
Note that in the above definition E{M^c\Jro\ < c2 (with the convention
that Mo_ = 0) and therefore ||M||W2 < ||М||вмо- Note in particular that
\\M\\bmo = 0 implies that M = 0. •
Let T be a stopping time and A ? J~t- Replacing Г with Тд shows that
the above definition is equivalent to the statement
?{(Moo - MT_J} < c2P(T < oo)
for every stopping time T. This in turn gives us an equivalent description of
the BMO norm:
where the supremum is taken over all stopping times T. Note that this second
characterization gives that || • Цвмо is a semi-norm, since it is the supremum
of quadratic semi-norms. An elementary property of BMO martingales is that
M € BMO if and only if all of the jumps of M are uniformly bounded. Thus
trivially, continuous I? martingales and bounded martingales are in BMO.
The next inequality is quite powerful.
Theorem 52 (Fefferman's Inequality). Let M and N be two local mar-
martingales. Then there exists a constant с such that
/•o
E{
./o
BMO-
Fefferman's inequality is a special case of the following more general result.
Theorem 53 (Strengthened Fefferman Inequality). There exists a con-
constant с such that for all local martingales M and N, and U an optional process,
E{ Г \Us\\d[M,N]s\} < cE{{ Г C
Jo Jo
Proof. Let Ct = /0* U2d[M, M]s and define H and К by
Ht = /7T- ,—7^=j=l{c!>o}, Kt =
y/Ct + /Gtl{t0}
BMO-

196 IV General Stochastic Integration and Local Times
Using integration by parts yields
Hfd[M,M}t = i{Ct>^J
From the definitions of H and К we have
н?к? > \u2i{Cl>0}.
The Kunita-Watanabe inequality implies
/•OO />OO
/ \Us\l{Cs=0}\d[M,N]s\<{ U2l{
Jo Jo
/•OO
= ( / l{cs
Jo
and since \d[M,N]s\ is absolutely continuous with respect to d[M,M]s as a
consequence of the Kunita-Watanabe inequality, we have
1 /-OO
}s\} = -^E{J \l{c3>o}Us\\d[M,N}s\}
= 0 a.s.,
/•OO
<
/•OO
E{ \HsKs\\d[M,N}s\}
Jo
< \\E{ I H^d[M,M]s} \\E{ I K2sd[N,N]s}.
But
/•OO /*O
E{ / fl?d[M, M]s} < E{ /
Jo Jo
and
/•OO />OO
/ K2d[N,N}s} = E{ ([N,N]00-[N,N}e-)dK?}
o Jo
= E{ f
Jo
But EiEUN^oolF.} - [N,N]S-} is bounded by \\N\\%MO on @, oo), hence
we have that
E{ Г K2d[N,N}s} = E{ r
Jo Jo
and the result follows. О

4 Martingale Duality and Jacod-Yor Theorem 197
Remark. The constant с in Theorems 52 and 53 can be taken to be y/2, as
can be seen from an analysis of the preceding proof.
Theorem 54. Let N e H2 ¦ Then N is in BMO if and only if there is a
constant с > 0 such that for all M € 7i2,
\E{[M,NU}\<c\\M\\nl.
Moreover \\N\\bmo < л/бс.
Proof. If N is in BMO, then we can take с = y/2 ^N\\bmo from Fefferman's
inequality (Theorem 52) and the remark following the proof of Theorem 53.
Now suppose that M is in H2 and that |.E{[M, JVjoo}! < c||M||^i; we want
to show that N is in BMO. We do this by first showing that |ЛГ0| < с a.s.,
and then showing that N has bounded jumps.
Let Л = {|JV0| > c}. Suppose P(A) > 0. Let $ = 5®^ilyl. Then ?;{|^|} =
1, and if we define the trivial martingale Mt = ? for all t > 0, then M ? 7i2
and ||M||wi = E{\?\} = 1, whence
J> С ——
This of course is a contradiction and we conclude |JV0| < с a.s.
We next show |AJV| < 2c. Since every stopping time T can be decomposed
into its accessible and totally inaccessible parts, and since each accessible time
can be covered by a countable collection of predictable times with disjoint
graphs, we can assume without loss of generality that T is either totally inac-
inaccessible or predictable. We further assume P(T > 0) = 1. Suppose then that
P(|AJVT| > 2c) > 0, and set
sign(AJVT)
Let M be the martingale consisting of ?l{t>r} minus its compensator. Then
M is in 7i2 and has at most one jump, which occurs at T. The jump is given
by
.}, T is predictable,
AMT = { s
I ? T is totally inaccessible.
Note that we also have ||M||^i is less than the expected total variation of M,
which in turn is less than 2.E{|?|} = 2. If T is totally inaccessible then
U} = E{AMTANT} = E{$ANT} =
P{\ANT\ > 2c) J
>2c>c\\M\\ni,
which is a contradiction. On the other hand, if T is predictable, then we know
that E{ANt\J~t-} = 0 and thus we are reduced to the same calculation and

198 IV General Stochastic Integration and Local Times
the same contradiction. We conclude that P(\ANt\ > 2c) = 0, and thus JV
has jumps bounded by 2c.
Last let T be any stopping time. Let M = N — NT and 77 = [N, JVjoo —
[iV,iV]T. Then M is in H2 and [M^M]^ = [M.iVjoo = rj. By our hypotheses
it now follows that
E{v] = ЕЦМ^и} <c\\M\\w =
= сЕ{л/п1{Т«х,}}
< сЕ{г,}г/2Р{Т < o
which in turn implies
E{[N,NU - [N,N}T} = E{V} < c2P{T < 00).
This then implies
Eil^NU^r} - [N,N]T < c2 a.s.,
and since
E^Noo - JVT_J|^T} = E{[N,NU\FT} - [N,N]T + iVo2l{T=o} + (AiVTJ
almost surely, we obtain
E{Nl~N2_}<6c2
which yields the result. ?
During the proof of Theorem 54 we proved, inter alia, that the jumps of a
local martingale in BMO are bounded. We formalize this result as a corollary,
which in turn has its own corollary.
Corollary 1. Let N be a local martingale in BMO. Then N has bounded
jumps.
Corollary 2. Let N be a local martingale in BMO. Then N is locally
bounded.
Proof. Let с be a bound for the jumps of N which we know exists by the
previous corollary. Next let Tn = inf{i > 0 : |iVt| > n]. Then |JVTn| < n + c,
and N is locally bounded. ?
The key result concerning Ti1 and BMO is the Duality Theorem which is
Theorem 55 that follows. First let us lay the foundation. For N chosen and
fixed in BMO we define the operator Ln from Ti1 to К by
LN(M) = E{[M,N}OO}
for all M in Ti1. Then one can easily check to see that Ln is linear, and Feffer-
man's inequality proves that it is bounded as well, and therefore continuous.
If BMO is the Banach space dual of H1 then it is also complete, a fact that
is apparently not easy to verify directly.

4 Martingale Duality and Jacod-Yor Theorem 199
Theorem 55 (The Dual of W} is BMO). The Banach space dual of all
(bounded) linear functionals on H1 can be identified with BMO. Moreover if
Ln is such a functional then the norms \\Lpf\\ and \\N\\bmo are equivalent.
Proof. Let N be in BMO. By Fefferman's inequality we have
\LN(M)\ = \E{[M,NU}\ <
for all M in Л1. This shows that Ln is in the dual of H1 and also that
\\Ln\\ < c\\N\\bmo- Note further that Ln cannot be trivial since Ln(N) =
E{[N, iVjoo} > 0 unless N is identically 0. Therefore the mapping F(N) = LN
is an injective linear mapping from BMO into Ti1*, the dual of Ti1.
Let L be an arbitrary linear functional in the dual of Ti1. We have
This means that L is also a bounded linear functional on Ti2. Since Ti2 is
isomorphic as a Hilbert space to the L2 space of the terminal random variables
of the martingales in Ti2, we have that there must exist a unique martingale
N in Ti2 such that for any M in Ti1 we have:
\LN{M)\ = ^{MooAUl = \E{[M,NU}\ <
Clearly Ln = L on Ti2, and since Ti2 is dense in Ti1 by Theorem 50, we
have that L and Ln are the same functional on Ц1. This shows that BMO
equipped with the norm \\LN\\ is isomorphic to H1* and thus, being the dual
of a Banach space, it is itself a Banach space and in particular it is complete.
Combining equation (*) with Theorem 54 we have that \\Lpf\\ and ЦЛ^Цдмо
are equivalent norms. This completes the proof. ?
While Fefferman's inequality, the space of BMO martingales, and the du-
duality of 7i°° and BMO are all of interest in their own right, we were motivated
to present the material in order to prove the important Jacod-Yor Theorem on
martingale representation, which we now present, after we recall the version
of the Hahn-Banach Theorem we will use.
Theorem 56 (Hahn-Banach Theorem). Let X be a Banach space and let
Y be a closed linear subspace. Then Y = X if and only if the only bounded
linear functional L which has the property that L(Y) = 0 is the functional
which is identically zero.
Theorem 57 (Jacod-Yor Theorem on Martingale Representation).
Let Л be a subset of H2 containing constant martingales. Then S(A), the
stable subspace of stochastic integrals generated by A, equals H2 if and only
if the probability measure P is an extremal point of Л42(А), the space of
probability measures making all elements of A square integrable martingales.

Then Q and R are both in M2(A), and P = \Q + \R shows that P
200 IV General Stochastic Integration and Local Times
Proof. The necessity has already been proved in Theorem 38. By the Hahn-
Banach Theorem, Ti1 = S{A) if and only if L(S{A)) = 0 implies L is identi-
identically zero, where L is a bounded linear functional. Let Ibea bounded linear
functional which is such that L{S{A)) = 0. Then there exists a martingale
N in BMO such that L = Ln- The local martingale N is locally bounded,
so by stopping we can assume it is bounded and that JVo = 0. (See the two
corollaries of Theorem 54.) Let us also assume it is not identically zero, and
let с be a bound for N. We can then define two new probability measures Q
and R by
2c ' 2c
ft Л О / Л \ Ч -М—v 1 S^L . 1 "I—fc » . 1 -M—v *
IS
not extremal in Л42(А), a contradiction. Therefore we must have that L is
identically zero and we have that Ti1 = S{A). As far as Ti2 is concerned, it
is a subspace of Ti1, hence Ti2 С <S(.4). But by construction of S(A), it is
contained in H2, and we have martingale representation. ?
5 Examples of Martingale Representation
In Sect. 3 we have already seen the most important example of martingale
representation, that of Brownian motion. In this section we give a method
to generate a family of examples which are local martingales with jumps,
and which have the martingale representation property. The limitations are
that the family of examples is one dimensional (so that we exclude vector-
valued local martingales such as n-dimensional Brownian motion), and that
the descriptions of the jumps are all of the same rather simple kind.
The idea is to construct a class of local martingales Ti such that for 1бН
we have both that Xq = 0 and the compensator of [X, X]t is At = t. If X has
the martingale representation property, then there must exist a predictable
process H such that
[X,X]t-t= / HsdXs. (*)
Jo
The above equation is called Emery's structure equation, and it is written
(in a formal sense) in differential notation as
In order to establish that solutions to Emery's structure equation actually
exists we write it in a form resembling a differential equation:
d[X,X]t = dt + <j>(Xt-)dXt {**)
Equation (**) is unusual and different from the stochastic differential equa-
equations considered later in Chap. V, since while the unknown is of course the

5 Examples of Martingale Representation 201
local martingale X (and part of the structure equation is to require that any
solution X be a local martingale), no a priori stochastic process is given in
the equation. That is, it is lacking the presence of a given stochastic driving
term such as, for example, a Brownian motion, a compensated Poisson pro-
process, or more generally a Levy process. Since therefore no probability space
is specified, the only reasonable interpretation of equation (**) is that of a
weak solution. That is, we want to show there exists a filtered probabil-
probability space (ft, J-, F, P) satisfying the usual hypotheses, and a local martingale
X, such that X verifies equation (**). It would also be nice to have weak
uniqueness which means that if X and Y are solutions of (**) for a given ф,
possibly defined on different filtered probability spaces, then X and Y have
the same distribution as processes. That means that for every A, a Borel
set on the function space of cadlag functions mapping R+ to R, we have
P{uj : t t-> Xt{ui) G A) = Q(uj : t i-> Xt(oj) e A), where P and Q are the
probability measures where X and У are respectively defined.
Inspired by knowledge of stochastic differential equations, it is natural to
conjecture that such weak solutions exist and are unique if the coefficient
ф is Lipschitz continuous.14 This is true for existence and was proven by
P. A. Meyer [179]; see alternatively [136]. Since the proof uses weak conver-
convergence techniques which are not within the scope of this book, we omit it.
Theorem 58 (Existence of Solutions of the Structure Equation). Let
ф : R —» К be Lipschitz continuous. Then Emery's structure equation
[X,X]t-t= [ ф{Ха.NХа (***)
Jo
has a weak solution with both (Xt)t>o and (/0 ф(Х3-)AХа)г>о local martin-
martingales.
The issue of uniqueness is intriguing. Emery has shown that one has
uniqueness when ф is linear, but uniqueness for others ф'з, including the Lip-
Lipschitz case, is open. The next theorem collects some elementary properties of
a solution X.
Theorem 59. Let X be a (weak) solution of (* * *). Then the following hold.
(i) E{X%} = E{[X,X]t} = t, and X is a square integrable martingale on
compact time sets.
(ii) All jumps of X are of the form AXt = </>(Xt_).
(Hi) X has continuous paths if and only if ф is identically 0, in which case X
is standard Brownian motion,
(iv) If a stopping time T is a jump time of X, then it is totally inaccessible.
Proof. We prove the statements in the order given. Since a solution X
and the integral term are both required to be local martingales, we know
Lipschitz continuity is defined and discussed in Chap. V.

202 IV General Stochastic Integration and Local Times
there exists a sequence (Tn)n>i of stopping times increasing to oo such that
/otAT" 4>{Xs-)dXs is in L\ Therefore E{[X,X}tATJ = E{tATn}, and apply-
applying the Monotone Convergence Theorem to each side of the equality in this
equation yields -E{pf, X]t} = t which further implies that X is a martingale
on [0,t] for each t < oo and that E{X?} = E{[X,X]t} = t.
For the second statement, recall that
s<t
and hence we have A[X,X]t = (AXtJ = ф{Хг-)АХи and dividing both
sides by AXt (when it is not zero) gives the result.
For the third statement, suppose ф is identically zero. Then by the second
statement X has no jumps and must be continuous. Since X is then a con-
continuous local martingale with [X, X\t = t, it is Brownian motion by Levy's
Theorem. For the converse, if we know that X is continuous, if it is non-trivial
it has paths of infinite variation since it is a local martingale. Thus so too does
the term /0 ф(Xs-)dXs. But notice that this term is the right side of equa-
equation (* * *) which is of finite variation, and we have a contradiction, so we
must have that ф is zero.
For the fourth statement, we implicitly stop X so that it is a uniformly
integrable martingale. Next let T be a jump time of X. Then T = Та Л Tg
where A is the accessible part of T and В is the totally inaccessible part of
T. Since Та can be covered by a countable sequence of predictable times with
disjoint graphs, we can assume Та is predictable without loss of generality.
Thus it suffices to show P(A) — 0. However since Та is predictable, and
X is a uniformly integrable martingale, we have E{AXta\Fta-} = 0. But
АХтА = ф(ХтА-) by part (ii) of this theorem, and ф(Хтл-) € У~тл- since
Та is predictable, which implies that the jump of X at Та is zero, which in
turn implies that P(A) =0. ?
Let us now consider a special class of structure equations where ф is
assumed to be an affine function. That is, we assume ф is of the form
ф(х) — a + fix. We analyze these special cases when a and /3 vary. Emery has
named the solutions corresponding to affine structure equations the Azema
martingales, since J. Azema's work on Markov processes and expansion of
filtrations led him to the amazing formula of "the" Azema martingale given
later in Sect. 7. Equation (**) now becomes
d{X,X}t = dt + (a + pXt^)dXu
and when /3 = 0, it reduces to
d[X, X\t = dt + otdXt,
and when in addition a = 0 we have seen that X is standard Brownian mo-
motion. Note that Levy's Theorem gives us weak uniqueness in this case, since

5 Examples of Martingale Representation 203
any solution with a = /3 = 0 must have the same distribution, namely that of
Wiener measure. We have much more, as we see in the next theorem. However
it has a long and difficult proof. Rather than present it, we refer the inter-
interested reader to the excellent treatment of M. Emery, in his original paper [69]
proving the result.
Theorem 60 (Emery's Uniqueness Theorem). Let X be a local martin-
martingale solution of the structure equation
d[X,X]t = dt + (a + pXt-)dXu Xo = x0.
Then X is unique in law. That is, any other solution Y must have the same
distribution as does X. Moreover X is a strong Markov process.
The uniqueness is especially significant in light of the next theorem. By
martingale representation we mean that every square integrable martin-
martingale can be represented as a stochastic integral with respect to one fundamen-
fundamental local martingale.
Theorem 61. Consider the equation
[X,X]t-t= I
Jo
on a filtered probability space (Q,J-,?,P) which satisfies the usual hypotheses.
Then X has martingale representation for its completed natural filtration if
and only if the law P is an extreme point of the convex set of all probabilities on
@, J-, F) for which X is a martingale and verifies the equation. Moreover if the
equation with fixed initial condition Xq has weak uniqueness of solutions, then
every solution X of the equation has martingale representation with respect
to the smallest filtration satisfying the usual hypotheses and to which X is
adapted.
Proof. By the Jacod-Yor Theorem (Theorem 57) we need to verify that P is
extremal in the set M2 of all probability measures such that X is a square
integrable martingale. It is clearly true if P is extremal. Suppose then that P
is not extremal, and let Q and R both be in Л42, such that P = AQ + A — X)R,
with 0 < Л < 1. Both Q and R are absolutely continuous with respect to P,
so under Q and R the terms [X, X] and JQ (fi(Xs-)dXs are the same a.s. (resp.
dQ and dR). Therefore X satisfies equation (®) for both Q and R as well as
P. Thus if P is not extremal in Л42 then it is also not extremal in the set of
probability measures such that X satisfies equation (®).
To prove the second statement, let О denote the canonical path space of
cadlag paths, with X being the projection process given by Xt(uj) = uj(t). We
have just seen that among the solutions of equation (<8>), the ones having mar-
martingale representation are those whose law constitutes an extremal probability
measure. But if weak uniqueness holds, the collection of all such probabilities
consists of only one, and thus extremality is trivial. ?

204 IV General Stochastic Integration and Local Times
We now examine several special cases as a and /3 vary. Let a = /3 = 0,
and we have seen that X is standard Brownian motion. Because of Theo-
Theorem 61 we conclude that (one dimensional) Brownian motion has martingale
representation, recovering a special case of Theorem 43.
Next suppose a — 1 and /3 = 0. In this case the equation of Theorem 60
with Xo = 0 becomes, in integral form,
Therefore Xt — [X,X]t — t and hence X is a finite variation martingale.
Moreover AXt = 1, so X only jumps up, with jumps always of size 1. Now let
N be a standard Poisson process with arrival intensity A = 1, and let Xt —
Nt~t, the compensated Poisson process. Then X satisfies the equation, and by
weak uniqueness all such X are compensated Poisson processes with A = 1.
We conclude that a compensated standard Poisson process has martingale
representation with respect to its natural (completed) filtration. For general
a (and not just a = 1), but still with C = 0, it is simple to check that
is the unique solution of the equation of Theorem 60 if N is a standard Poisson
process. Note that (as is well known) Xa converges (weakly) to Brownian
motion asa-tO.
We now consider the more interesting cases where /3^0. We repeat the
equation of Theorem 60 here (in integrated form) for ease of reference:
[X,X]t=xo + t +
Jo
Observe that X is a solution of the above equation if and only if X + j is a
solution of
with initial condition Xo = xo + § • Therefore without loss of generality we
can assume that a = 0 and we do this from now on.
We have two explicit examples for equation (®®). When /3 = -1, we take
Xt = -=Mt = V-sign (Bt)y/(t-gt),
where M is Azema's martingale15, В is standard Brownian motion, and
gt = sup{s < t : Bs = 0}. By Theorem 86 of Sect. 8 of this chapter, we know
that [X,-X]? = 0 and [X,-X]t = 2gt. Integration by parts gives /0 Xs-dXs —
(t - 9t) ~ 9t = t - [X,X]t, since [X,X]t = J23<t(AXaJ, because of course
[X, X]% = 0. This proves the assertion that for /3 = — 1, Xt = -4=Mt, where
15 Azema's martingale is treated in detail in Sect. 8 later in this chapter.

6 Stochastic Integration Depending on a Parameter 205
M is Azema's (original) martingale as presented in Sect. 8, and thus we have
martingale representation for Azema 's martingale.
Our second (and last) explicit example is for /3 = —2. In this case our
equation becomes
[X,X]t-t = -2 [ Xs-
Jo
and using integration by parts we obtain 2 /0 Xs-dXs = X"l — [X, X\t- Equat-
Equating terms gives Xf = t, and since E{Xt} = 0 for all t because xq = 0 we
deduce that
p(xt = уД) = p(xt = -Vt) = I
for all t > 0. The jumps of X occur from a change of sign, and they arrive
according to a Poisson process with intensity j^dt. Such a process can be seen
to be a martingale (as in [204]) by constructing a process X with a distribution
as above and with filtration F. Then for 0 < s < t,
where N is independent of the cr-algebra J-s and has a Poisson distribution
with parameter Л = \ ln(|). In this way it is easily seen to be a martingale.
We call it the parabolic martingale. Once again we are able to conclude:
we have martingale representation for the parabolic martingale.
6 Stochastic Integration Depending on a Parameter
The results of this section are of a technical nature, but they are needed for our
subsequent investigation of semimartingale local times. Nevertheless they have
intrinsic interest. For example, Theorems 64 and 65 are types of Fubini Theo-
Theorems for stochastic integration. A more comprehensive treatment of stochastic
integration depending on a parameter can be found in Strieker-Yor [219] and
Jacod [103]. Throughout this section (А, A) denotes a measurable space.
Theorem 62. Let Yn(a,t,tj) be a sequence of processes that are (i) A®
Б(К+) <8> T measurable, and (ii) for each fixed a the process Yn(a,t,uj) is
cadlag. Suppose Yn(a,t, •) converges in ucp for each a € A. Then there exists
an A® B(R+) <8>F measurable process Y = Y(a, t,ui) such that
(a) Y(a,t, ¦) — Ншп-юо Yn(a,t, ¦) with convergence in ucp;
(b) for each a e A, Y is a.s. cadlag.
Moreover there exists a subsequence nk{a) depending measurably on a such
that lim^a^oo Y^ = Yt uniformly in t on compacts, a.s.

206 IV General Stochastic Integration and Local Times
Proof. Let S^itj = supt<u |Уг(М, •) - Y^>(a,t,-)\. Since Yi is cadlag in t
the function (a,oj) >—> S^ j is A <8> J7 measurable. By hypothesis we have
Ит^-юо S">itj = 0 in probability. Let no(a) = 1, and define inductively
nk(a) = inf{m > max(&,rafc_i(a)) : sup P{S^ti>j > 2~k) < 2~~k}.
i,j>m
We then define
Zk(a,t,u)=Yn^(a,t,c).
Since each a t—> rifc(a) is measurable, so also is Zk. Define
t<u
then also (а,ш) н-> Т"^(о>) is jointly measurable, since Z* have cadlag paths
(in t). Moreover by our construction P(Tgkk+m > 2~k) < 2~k for any m > 1.
The Borel-Cantelli Lemma then implies that Ит^-юо Т*г^ = О almost surely,
which in turn implies that
lim Zl(a,t, •) exists a.s.,
t—>oo
with convergence uniform in t. Let Ла be the set where Z% converges uniformly
(note that Aa G Л <8> T and Р(Л°) = 1, each fixed a), and define
Then У is cadlag thanks to the uniform convergence, and it is jointly mea-
measurable. D
Theorem 63. Let X be a semimartingale with Xq =0 a.s. and let H(a, t, из) =
Ht{uj) be A®V measurable16 and bounded. Then there is a function Z(a,t,u>)
in A® B(R+) <S> T such that for each a & A, Z(a,t,uj) is a cadlag, adapted
version of the stochastic integral /0 HgdXs.
Proof. Let H = {H G bA<S>V such that the conclusion of the theorem holds}.
If К = K(t,uj) e hV and / = f(a) e ЪА, and if H(a,t,u) = f(a)K(t,w),
then
/ H(a,s,-)dXs= [ f(a)K(s,-)dX, = f(a) [ K(s,-)dXs,
o Jo Jo
and thus clearly H = fK is in Ц. Also note that 7i is trivially a vector space,
and that Я of the form H = fK generate ЪА <8> V.
Next let Hn ? Ti and suppose that Hn converges boundedly to a process
Я G ЪА <8> V. By Theorem 32 (for example) we have that Я" • X converges
uniformly in t in probability on compacts, for each a. Therefore H ? 7i, and
an application of the Monotone Class Theorem yields the result. ?
16 Recall that V denotes the predictable cr-algebra.

6 Stochastic Integration Depending on a Parameter 207
Corollary. Let X be a semimartingale (Xo = 0 a.s.), and let H(a,t,u>) =
Щ(и>) G A <g> V be such that for each a the process Ha G L(X). Then there
exists a function Z(a, t,uj) = Z? 6 Д<8>??(]ВЦ_) <g> .F such that for each a, Z?
is an a.s. cadlag version of /0 HfdXs.
Proof. By Theorem 32 the bounded processes Za<k = Hal^Ha.\<kyX converge
to HaX in мер, each a. But Za>k can be chosen cadlag and jointly measurable
by Theorem 63. The result now follows by Theorem 43. D
Theorem 64 (Pubini's Theorem). Let X be a semimartingale, Щ =
H(a,t,u>) be a bounded A®V measurable function, and let jjl be a finite mea-
measure on A. Let Z? = /0 HfdXs be A® B(R+) <8>T measurable such that for
each a, Za is a cadlag version of Ha ¦ X. Then Yt = fA Zf/j,(da) is a cadlag
version of H ¦ X, where Ht = JA H?n(da).
Proof. By pre-stopping we may assume without loss of generality that X G
H2, and because the result holds for the finite variation part of the canonical
decomposition of X by the ordinary Stieltjes Fubini Theorem, we may further
assume that X is a martingale with E{[X,X]^} < сю. Next suppose Щ
is of the form H(a,t,u>) = K(t,u>)f(a) where К G hV and / is bounded,
measurable. Then К G L(X) and / \f(a)\n(da) < сю. In this case we have
Zta = f(a)K ¦ X, and moreover
/ Z?n{da) = / f{a)K • Xn{da) = К ¦ X f f{a)n{da)
= (J' f(a)i*(da)K) ¦ X
= HX.
By linearity the same result holds for the vector space V generated by pro-
processes of the form K(t,u>)f(a) with К G ЪР and / bounded, measurable.
By the Monotone Class Theorem it now suffices to show that if Hn G V
and limn^oo#n = H, then the result holds for H. Let Z%t = Щ ¦ X, the
cadlag version. Then by Jensen's and the Cauchy-Schwarz inequalities,
[
a t
[ sup |
a t
= / ?{suP |Z» t - Z»|>(da)
Ja t
< 4 I E{{Zant00 - Z^fj^da) = 4 I E{[Zan - Z\ Zan - Z°]oo}Ai(do)
Ja Ja
by Doob's quadratic inequality for the martingales Z" and Za, and by Corol-
Corollary 3 of Theorem 27 of Chap. II. Continuing, the preceding equals
= 4 f E{ Г {HI, - H^2d{X,X]s}v(da),
Ja Jo

208 IV General Stochastic Integration and Local Times
and the above tends to 0 by three applications of the Dominated Convergence
Theorem.
We conclude from the preceding that
/ sup \Zl t - Z?\n(da) < oo a.s.
Ja t
and therefore JA \Z?\/j,(da) < oo for all t, a.s. Moreover
?{sup | / Zantlx{da) - f Z?n(da)\} < E{ f sup |Z»t - Z
t Ja Ja Ja t
which tends to 0. Therefore taking Hn^t = J H^tn(da) we have Hn ¦ Xt =
fA Z^t^{da) converges in ucp to f Z^/j,(da). Since Hn ¦ X converges to H ¦ X
by Theorem 32, we conclude H X = J Z?n(da). ?
The version of Fubini's Theorem given in Theorem 64 suffices for the
applications of it used in this book. Nevertheless it is interesting to determine
under what more general conditions a Fubini-type theorem holds.
Theorem 65 (Fubini's Theorem: Second Version). Let X be a semi-
martingale, let H^ = H(a,t,u>) be A®V measurable, let/j, be a finite positive
measure on A, and assume
Letting Z% = /0* H%dXs be A® B(R+) ® T measurable and Za cadlag for
each a, then Yt = fA Z^n(da) exists and is a cadlag version of H ¦ X, where
f
Proof. By pre-stopping we may assume without loss of generality that X G H2
and that \\Ha\\L2(dfl) is (H2,X) integrable. Let X = iV + Л be the canonical
decomposition of X. Then
E{ (H°Jv(da)d[N,N}s}+E{( \\H°\\L4dll)\dAs\J}
Jo Ja Jo
< oo.
Next observe that
лОО /*ОО
E{ \\H°\\L4dfl)\dAs\}>E{\\ / \Has
Jo Jo
/
o
= с j E{ Г \Hf\\dAs\Mda).
Ja Jo
Also

6 Stochastic Integration Depending on a Parameter 209
E{ Г j {Has?v{da)d[N,N]s} = f E{ Г{HasL[N,N}s}^da),
Jo Ja Ja Jo
and therefore ?{/0°° \H°\\dAs\} < oo and E{f™(H°Jd[N,N]s} < oo for
уц-almost all a G A. Whence Ha G L(X) for /j, almost all a G A.
Next define Hn = i71{|^|<n}, and the proof of Theorem 64 works as well
here. ?
The hypotheses of Theorem 65 are slightly unnatural, since they are not
invariant under the transformation
Я - -±-Ha pi -> фЫёа)
<p(a)
where <p is any positive function such that f ip(a)n(da) < сю. This can
be alleviated by replacing the assumption (JA(HaJyu(da)I/2 e L(X) with
(/ ^H(J) <"(^a)I</2 G L(X) for some positive <p e Ll{dn). One can also relax
the assumption on ц to be cr-finite rather than finite.
Example. The hypotheses of Theorem 65 are a bit strange, however they
are in some sense best possible. We give here an example of a parameterized
process (.Hra)a6A) a positive, finite measure jjl on A, and a semimartingale X
such that
(i) (a, t) —» Щ is A <g> V measurable,
(ii) Ha e L(X), each a ? A, and
(Hi) fA\H°\p{da) e ЦХ)
but such that if Z" = Jo H"dXs then JA Z^/j,(da) does not exist as a Lebesgue
integral. Thus a straightforward extension of the classical Fubini Theorem for
Lebesgue integration does not hold.
Indeed, let A = N = {1, 2,3,... } and let ц be any finite positive measure
on A such that ц({а}) > 0 for all a ? A. Let X be standard Brownian motion,
let to < t\ < ti < ¦ ¦ ¦ be an increasing sequence in [0,1], and define
Щ = -
a
Then
1 a
a=l
is in L2(dt), whence Ht = fA \H^\/j,(da) G L(X), and moreover if t > 1,
я • xt =
a=l
where the sum converges in I?. However if t > 1 then

210 IV General Stochastic Integration and Local Times
z? = Ha-xt = -MWrHta -ta_i
a
and
/
jA
= °° a-s-
a=l
because {ta—ta-i) 1^2(Xta -Xta_1) is an i.i.d. sequence and J2T=ia 1 = °°-
Note that this example can be modified to show that we also cannot replace
the assumption that
/
A
with the weaker assumption that {jA{Hf)pn{da))l/'p e L(X) for some p < 2.
7 Local Times
In Chap. II we established Ito's formula (Theorem 32 of Chap. II) which
showed that if / : R —> R is C2 and X is a semimartingale, then f(X) is again
a semimartingale. That is, semimartingales are preserved under C2 transfor-
transformations. This property extends slightly: semimartingales are preserved under
convex transformations, as Theorem 66 below shows. (Indeed, this is the best
one can do in general. If В = (Bt)t>o is standard Brownian motion ai}d
Yt = f(Bt) is a semimartingale, then / must be the difference of convex func-
functions. (See Cinlar-Jacod-Protter-Sharpe [34].) We establish a related result in
Theorem 71, later in this section.) Local times for semimartingales appear in
the extension of Ito's formula from C2 functions to convex functions (Theo-
(Theorem 70).
Theorem 66. Let f : R —» К be convex and let X be a semimartingale. Then
f(X) is a semimartingale and one has
f(Xt) - f(X0) = / f'(Xs_)dXs + At
Jo+
where f is the left derivative of f and A is an adapted, right continuous,
increasing pmcess. Moreover AAt = f(Xt) — f(Xt_) — f'(Xt-)AXt.
Proof. First suppose \X\ is bounded by n, and in H2, and that Xo = 0.
Let g be a positive C°° function with compact support in (—oo,0] such that
/!^5(s)^s = 1- Let fn{t) = n jTtx, f(t + s)g(ns)ds. Then /n is convex and
C2 and moreover f'n increases to /' as n tends to сю. By Ito's formula
U(Xt) - /„(Xo) = f fn{Xs-)dXs + Ant
Jo

7 Local Times 211
where
2 Jo
The convexity of / implies that An is an increasing process. Letting n tend
to oo, we obtain
-I'1'
Jo
f(xt) - f(x0) = / f'(xs_)dxs + A (*)
./o
where limn^oo Л" = At in L2, and where the convergence of the stochastic
integral terms is in H2 on [0, t].
We now compare the jumps on both sides of the equation (*). Since
f° f'(Xs-)dXs = 0 we have that Ao = 0. When t > 0, the jump of the
left side of (*) is f(Xt) — f{Xt-), while the jump of the right side equals
f'(Xt_)AXt + AAt. Therefore AAt = f(Xt) - f(Xt-) - f'{Xt-)AXt, and
the theorem is established for \X\ bounded by n and in H2.
Now let X be an arbitrary semimartingale with Xo — 0. By Theorem 13 we
know there exists a sequence of stopping times (Tn)n>i, increasing to сю a.s.
such that XT ~ e TL2 for each n. An examination of the proof of Theorem 13
shows that there is no loss of generality in further assuming that \XT ~\ < n,
also. Then let Yn = JO[0T™) and we have
f(Ytn) - f(Yon) = f f(y,n_)dY? + Ant,
Jo
which is equivalent to saying
f(Xt) - /(Xo) = Г f'{X,-)dX, + A?
Jo
on [0,Tn). One easily checks that (A"+1)T"- = (An)T"-, and we can define
A = An on [0,Tn),each n.
The above extends without difficulty to functions g : K2 —» R of the form
g(Xt,H) where H is an Tq measurable random variable and x ^ g(x,y) is
convex for every y. For general X we take Xt = Xt — Xo, and then f(Xt) =
f(Xt + Xo) = g(Xt, Xo), where g(x, y) = f(x + y). This completes the proof.
?
Notation. For x a real variable let x+, x~ be the functions x+ = max(:r,0)
and x~ = — min(:r,0). For х, у real variables, let x V у = max(x) y) and
x Л у = min(a;, у).
Corollary 1. Let X be a semimartingale. Then \X\, X+, X~ are all semi-
martingales.

212 IV General Stochastic Integration and Local Times
Proof. The functions f(x) — \x\, g(x) = x+, and h(x) = x~ are all convex, so
the result then follows by Theorem 66. ?
Corollary 2. Let X, Y be semimartingales. Then X V Y and X A Y are
semimartingales.
Proof. Since semimartingales form a vector space and xVy = ~(\x — y\+x + y)
and x Ay = T,(x + у — \x — y\), the result is an immediate consequence of
Corollary 1. ?
We can summarize the surprisingly broad stability properties of semi-
semimartingales.
Theorem 67. The space of semimartingales is a vector space, an algebra, a
lattice, and is stable under C2, and more generally under convex transforma-
transformations.
Proof. In Chap. II we saw that semimartingales form a vector space (The-
(Theorem 1), an algebra (Corollary 2 of Theorem 22: Integration by Parts), and
that they are stable under C2 transformations (Theorem 32: Ito's Formula).
That they form a lattice is by Corollary 2 above, and that they are stable
under convex transformations is Theorem 66. G
Definition. The sign function is defined to be
... /l, ifar>0,
sign(x) = i
1—1, if x < 0.
Note that our definition of sign is not symmetric. We further define
ho(x) = \x\ and ha(x) = \x — a\. (*)
Then sign(:r) is the left derivative of ho(x), and sign(:r—a) is the left derivative
of ha{x). Since ha(x) is convex by Theorem 66 we have for a semimartingale
X
ha(Xt) = \Xt -a\ = \X0-a\+ f sign(Xs_ - a)dXs + Aat, {**)
where A^ is the increasing process of Theorem 66. Using (*) and (**) as
defined above we can define the local time of an arbitrary semimartingale.
Definition. Let X be a semimartingale, and let ha and Aa be as defined in
(*) and (**) above. The local time at a of X, denoted Щ = La(X)t, is
defined to be the process given by
Lat =Aat- Y, iha(Xs) - K{XS_) - h'a(Xs_)AXs}.
0<s<t
Notice that by the corollary of Theorem 63 the integral fQ+ sign(Xs_ — a)dXs
in (**) has a version which is jointly measurable in (a,t,u>) and cadlag in

7 Local Times 213
t. Therefore so does (A")t>o, and finally so too does the local time L". We
always choose this jointly measurable, cadlag version of the local time, without
any special mention. We further observe that the jumps of the process Aa
defined in (**) are precisely ^23<t{ha(Xa) - ha(Xs_) — ha(Xs_)AXs} (by
Theorem 66), and therefore the local time (!<" )t>o is continuous in t. Indeed,
the local time La is the continuous part of the increasing process Aa.
The next theorem is quite simple yet crucial to proving the properties of
La that justify its name.
Theorem 68. Let X be a semimartingale and let La be its local time at a.
Then
i-t
(Xt -a)+ - (Xo -a)+ = / l{x,_>a}dXs+ 2^ l{xa->a}{Xs - a)~
Jo+ o<s<t
0<s<t
(Xt - a)' - (X
o -a)-=- l{Xs.<a}dXs + V
Jo+ o<s<t
Jo+ o<s<t
0<s<t
Proof. Applying Theorem 66 to the convex functions f(x) = (x — a)+ and
g(x) — (x — a)~ we get
f(Xt) = /(Xo) + Г f'(X,-)dX, + C+
Next let
g(Xt) = g(X0) + j g'{Xs.)dXs + C~
Jo+
D+ =C+- ^ {/№) " /№-) " f'(Xa-)AX.}t
0<s<t
0<s<t
and subtracting the formulas we get Ct+ - Ct~ — 0 and hence Df = Dt~. Also
Df + ДГ = Lf, so that Dt+ = Dt~ = |l/", and the proof is complete. ?
The next theorem, together with the "occupation time density" formula
(Corollary 1 of Theorem 70), are the traditional justifications for the termi-
terminology "local time."

214 IV General Stochastic Integration and Local Times
Theorem 69. Let X be a semimartingale, and let L% be its local time at the
level a, each a 6 №. For a.a. w, the measure in t, dL^(u), is carried by the set
{s:Xs_(to)=Xs{to) = a}.
Proof. Since L" has continuous paths, the measure dL^(u) is diffuse, and since
{s : Xs-(u>) = a} and {s : Xs-(w) = Xs(w) = a} differ by at most a countable
set, it will suffice to show that dL^(u>) is carried by the set {s : Xs^(u>) = a}.
Suppose 5, T are stopping times and that 0 < S < T such that [5, T) с
{(s,u) : A~s_(w) < a} = {A"_ < a}. Then X < a on [S,T) as well. Hence by
the first equation in Theorem 68 we have
(A -O)f -(Л -OM = / l{X,_>a}ttAs+ > l{Xs_>a}(^s -a)
/ Q '
^ i
S<s<T
However the left side of the above equation equals zero, and all terms on the
right side except possibly ^{Lj, — Las) also equal zero. Therefore ^{Lj, — Las) =
0, whence Щ,~Ь8.
Next for r€Q, the rationale, define the stopping times Sr(u>), r > 0, by
олл-/г' ifAV-M<a,
I сю, it Xr-(u>) > a.
Then define
Tr{to) = ini{t > Sr(uj) : Xt-(w) > a}.
Then [5,.,^) С {A"_ < a}, and moreover the interior of the set {X- <
a} equals (JreQ r>o(^'^)- As we have now seen, dLa does not charge the
interior of the set {X- < a}. This set is open on the left, hence it differs from
its interior by an at most countable set, and since dLa is diffuse it doesn't
charge countable sets. Thus dLa does not charge the set {A~_ < a} itself.
Analogously one can show dLa does not charge {A~_ > a}. Hence its
support is contained in the set {a~_ = a}, and we are done. ?
The next theorem gives a very satisfying generalization of Ito's formula
(Theorem 32 of Chap. II).
Theorem 70 (Meyer-Ito Formula). Let f be the difference of two convex
functions, let f be its left derivative, and let jjl be the signed measure (when
restricted to compacts) which is the second derivative of f in the generalized
function sense. Then the following equation holds:
f(Xt)-f(X0)= f f'(Xs_)dXs+
Jo+
0<s<t
1 Z
- /
г J-
(*)

7 Local Times 215
where X is a semimartingale and L" = Ц:{Х) is its local time at a.
Proof. Notice that equation (*) is trivially true if / is an affine function (i.e.,
f{x) — ax + b). Next let us first assume that ц is a signed measure with finite
total mass on a compact interval. Define a function g by
{x) = - J \x- y\n(dy).
It is well known that / and g differ by at most an affine function h(x) = ax+b.
Therefore without loss of generality we can assume that the function / in (*)
is of the form f(x) — \ j \x — y\/j,(dy). Then f'(x) = | J*sign(:r — y)n(dy) and
f"(x) — /j,(dx). Moreover if
J*= J2 \Xs-y\-\Xs.-y\-sign(Xs_~y)AXs,
0<s<t
then
0<s<t
Also, letting H\ = \Xt — y\ — \X0 - y\, one has
l- f Jytlx{dy) = J2 {/№) - f(Xs-) - f'(Xs_)AXs}. A)
0<s<t
. B)
Next consider Zt" = /0*+sign(Xs_ - y)dXs, and let Zy be the B(M) ®
T measurable version, which we know exists by Theorem 63. By
Pubini's Theorem (Theorem 64) we have
^ C)
Since Ц = Щ - J? - Z%, we have
\ I L^(da) = ~J HMda) -\J JtKda) - \ J Zatix{da) D)
and combining D) with A), B), and C), yields the formula (*).
It remains to reduce the general case to that of jjl supported on a compact
interval with finite total mass. By linearity it further suffices to treat the case
where / is convex. Define
[/(«) + f'(n)(x-n), ifx>n,
fn{x) = < f(x), if -n<x<n,
[f(-n)+f'(-n)(x + n), \ix<-n.
Then /n is also convex and its second (generalized) derivative jjin{dx) agrees
with jj, on \—n,ri) and is of finite total mass. Let У" = fn(X), and Tn =

216 IV General Stochastic Integration and Local Times
inf{* > 0 : \Xt\ > n}. Note that Yn = f{X) on [0,Tn), and that Щп = О for
all a, \a\ > n, by Theorem 69. Therefore / L?nn{da) = fLfn{da) for t < Tn,
and by the preceding (*) holds for Y = fn{X) = f(X) on [0,Tn). Therefore
(*) holds on [0,T"). Since the stopping times (Tn)n>i increase to сю a.s., we
have (*) holds on all of R+ x fi, and the theorem is proved. ?
The next formula gives an interpretation of semimartingale local time as
an occupation density relative to the random "clock" d[X,X}^.17
Corollary 1. Let X be a semimartingale with local time (La)aeK- Let g be
a bounded Borel measurable function. Then a.s.
/ Latg(a)da= / g(Xe-)d[X,X]ct.
J-oo Jo
Proof. Let / be convex and C2. Comparing (*) of Theorem 70 with Ito's
formula (Theorem 32 of Chap. II) shows that
/ LaJ"(a)da= / f»(Xa-)d[X,X]ct
J—oo JO
where jji{da) is of course f"(a)da. Since the above holds for any continuous
and positive function /", a monotone class argument shows that it must hold,
up to a P-null set, for any bounded, Borel measurable function g. D
We record here an important special case of Corollary 1.
Corollary 2. Let X be a semimartingale with local time (La)aeu- Then
[x,xyt= Г
J-c
Ltc
Corollary 3 (Meyer-Tanaka Formula). Let X be a semimartingale with
continuous paths. Then
ft
у у I I I • / у \ i у i 7" U
Jo+
Proof. This is merely Theorem 70 with f(x) = \x\, which implies n(da) =
2eo(da), point mass at 0. The formula also follows trivially from the definition
of L°. ?
If Xt ~ Bt is a standard Brownian motion, then Mt — fQ sign(Bs)dBs is a
continuous local martingale, and [M,M]t — /0 sign(BsJd[B,B]s = [B,B]t =
t. Therefore by Levy's Theorem (Theorem 39 of Chap. II) we know that Mt
17 Recall that [X, X]c denotes the path-by-path continuous part oft^ [X, X]t with
[X,X]c0 = 0.

7 Local Times 217
is another Brownian motion. We therefore have, where Bq = x for standard
Brownian motion,
\Bt\ = \BQ\+Pt+Lu (**)
where fit = /0 sign(Bs)dBs is a Brownian motion, and Lt is the local time of
В at zero. Formula (**) is known as Tanaka's Formula.
Observe that if f(x) = \x\, then /" = 25(x), where 5 is the "delta
function at 0," which of course is a generalized function, or "distribution."
Thus Corollaries 1 and 3 give the intuitive interpretation of local time as
Ц = /0* 5{Xs)d[X,X]s, and Ц = /0* S(XS - a)d[X,X]s, for continuous semi-
martingales.
Local times also allow us to give simple examples of functions / that are
only slightly more general than convex functions and are such that f(X) need
not be a semimartingale when X is one. Let f(x) = |ж|а, 0 < a < 1. Then
/ cannot be expressed as the difference of two convex functions because the
slope of / becomes vertical as x decreases to zero. The proof of this result is
simpler if we restrict a to 0 < a < 1/2.
Theorem 71. Let X be a continuous local martingale with Xq — 0, and let
0 < a < 1/2. Then Yt — \Xt\a is not a semimartingale unless X is identically
zero.
Proof. Let us suppose that Y = \X\a is a semimartingale. Let /3 = I/a > 2.
We then have
=Yf = C f YtldYs + ^^ f
Jo l Jo
since Y has continuous paths. Using Theorem 69 and the Meyer-Tanaka for-
formula (Corollary 3 of Theorem 70) we have, where Lt — L® is the local time
of X at 0,
Lt = l{xs=o}dLs
Jo
= - / l{xs=0}sign(Xs)dX5+ / l{Xs=0)d\X\s
Jo Jo
The integral Jo l{xs=o} sign(Xs)dXs is identically zero. For if T™ is a sequence
of stopping times increasing to сю a.s. such that XT" is a bounded martingale,
then
tAT" ЛЛТ"
/
/tAT ЛЛТ
l{Xs=0} sign(Xs)dXsJ} = E{ / l{x3=0}d[X, X]s}
Jo
L?ATnl{0}(a)da} = 0.

218 IV General Stochastic Integration and Local Times
Therefore Lt = L°t = /0* 1{Хв=0)й\Х\8. Since {Xs = 0} equals {Ys = 0}, this
becomes
= l{Y3=o}d\X\s
Jo
Vi{ys=0}(iF8 + t^ Г
A Jo
Jo
= C [t
Jo
= 0.
Using the Meyer-Tanaka formula again we conclude that
\Xt\ = f sign(Xs)dXs.
Jo
Since X is a continuous local martingale, so also is the stochastic integral on
the right side above (Theorem 30); it is also non-negative, and equal to zero
at 0. Such a local martingale must be identically zero. This completes the
proof. D
It is worth noting that if X is a bounded, continuous martingale with
Xo = 0, then Yt = \Xt\a, 0 < a < 1/2 is an example of an asymptotic
martingale, or AMART, which is not a semimartingale.18
We next wish to determine when there exists a version of the local time
L% which is jointly continuous in (a, t) *->¦ L" a.s., or jointly right continuous
in (a,t) и-> Lf. We begin with the classical result of Kolmogorov which gives
a sufficient condition for joint continuity. There are several versions of Kol-
mogorov's Lemma. We give here a quite general one because we will use it
often in Chap. V. In this section we use only its corollary, which can also be
proved directly in a fairly simple way.
Before the statement of the theorem we establish some notation. Let A
denote the dyadic rational points of the unit cube [0,1]™ in R™, and let Am
denote all x e A whose coordinates are of the form k2~m, 0 < к < 2т.
Theorem 72 (Kolmogorov's Lemma). Let (E,d) be a complete metric
space, and let Ux be an E-valued random variable for all x dyadic rationals
in R™. Suppose that for all x, y, we have d(Ux, Uy) is a random variable and
that there exist strictly positive constants e, C, /3 such that
Then for almost all и the function x н-> Ux can be extended uniquely to a
continuous function from M.n to E.
18 For a more elementary example of an asymptotic martingale that is not a semi-
semimartingale, see Gut [86, page 7].

7 Local Times 219
Proof. We prove the theorem for the unit cube [0,1]™ and leave the extension
to R™ to the reader. Two points x and у in Дт are neighbors if supj \хгуг\ —
2~m. We use Chebyshev's inequality on the inequality hypothesized to get
P{d(Ux,Uy) > 2~am} <
Let
Am = {u : 3 neighbors x,y e Am with d{Ux(u), Uy{u)) > 2~ат}.
Since each x € Дт has at most 3™ neighbors, and the cardinality of Дт is
2mn, we have
P{Am) < c2m{a?-0)
where the constant с = 3nC. Take a sufficiently small so that ae < /3. Then
Р(Лт) < c2~mS
where 5 = C—a.e > 0. The Borel-Cantelli Lemma then implies Р(Лт infinitely
often) = 0. That is, there exists an mo such that for m > mo and every pair
(u, v) of points of Дт that are neighbors,
d{Uu,Uv) <2-am.
We now use the preceding to show that x i—» Ux is uniformly continuous
on Д and hence extendable uniquely to a continuous function on [0,1]™. To
this end, let x,y ? A be such that \\x — y\\ < 2~k~1. We will show that
d(Ux, Uy) < c2~ak for a constant c, and this will complete the proof.
Without loss of generality assume к > mo- Then x = (x1,... ,xn) and
у = (у1,... ,yn) in Д with \\x — y\\ < 2~k~1 have dyadic expansions of the
form
xi = иг +^o}2-J'
where a*-, b^ are each 0 or 1, and u,v are points of A^ which are either equal
or neighbors.
Next set uo = и, щ = u0 + aic+i2~k~1, щ = щ + Ofe+22~fe'2, We also
make analogous definitions for vo,vi,vz, Then u,_i and щ are equal or
neighbors in Ak+i, each i, and analogously for «j_i and w,. Hence
j=k

220 IV General Stochastic Integration and Local Times
and moreover
d(Uu{u),Uv{u))<2-ak.
The result now follows by the triangle inequality. ?
Comment. If the complete metric space (E,d) in Theorem 72 is separable,
then the hypothesis that d(Ux, Uy) be measurable is satisfied. Often the metric
spaces chosen are one of R, Rrf, or the function space С with the sup norm
and these are separable.
A complete metric space that arises often in Chap. V is the space E = Vn
of cadlag functions mapping [0, сю) into R™, topologized by uniform conver-
convergence on compacts. While this is a complete metric space, it is not separable.
Indeed, a compatible metric is
oo
<*(/.») = ? ^AЛ sup \f(s)-g(s)\).
However if fa(t) = l[QiOO)(*), then d{fa,f0) = 1/2 for all a, /3 with 0 < a <
/3 < 1, and since there are uncountably many such a, /3, the space is not
separable. Fortunately, however, the condition that d{Ux, Uy) be measurable
is nevertheless satisfied in this case, due to the path regularity of the functions
in the function space Z>™. (Note that in many other contexts the space Vn is
endowed with the Skorohod topology, and with this topology T>n is a complete
metric space which is also separable; see for example Ethier-Kurtz [71] or
Jacod-Shiryaev [110].)
We state as a corollary the form of Kolmogorov's Lemma (also known as
Kolmogorov's continuity criterion) that we will use in our study of local times.
Corollary 1 (Kolmogorov's Continuity Criterion). Let {X^)t>o,aeun
be a parameterized family of stochastic processes such that 1i—> Xf is cadlag
a.s., each о ? R™. Suppose that
\\
s<t
for some e, /3 > 0, C{t) > 0. Then there exists a version X? of Xf which
is B(M.+) <g> B(M.n) <g> T measurable and which is cadlag in t and uniformly
continuous in о on compacts and is such that for all о € R™, t > 0,
X?=X<1 a.s.
(The null set {Xf ф X"} can be chosen independently of t.) In this sec-
section we will use the above corollary for parameterized processes Xa which
are continuous in t. In this case the process obtained from the corollary of
Kolmogorov's Lemma, Xa, will be jointly continuous in (a, t) almost surely.
In particular, Kolomogorov's Lemma can be used to prove that the paths
of standard Brownian motion are continuous.

7 Local Times 221
Corollary 2. Let В be standard Brownian motion. Then there is a version
of В with continuous paths, a.s.
Proof. Since Bt - Bs is Gaussian with mean zero and variance t — s, we know
that E{\Bt - Bs\4} < c(t - sJ. (One can give a cute proof of this moment
estimate using the scaling property of Brownian motion.) If we think of time
as the parameter and the process as being constant in time, we see that the
exponent 4 is strictly positive, and that the exponent on the right, 2, is strictly
bigger than the dimension, which is of course 1. Corollary 2 now follows from
Corollary 1. D
Hypothesis A. For the remainder of this section we let X denote a semi-
martingale with the restriction that 53o<s<t I^-Xal < °° a-s-> eacn * > 0-
Observe that if (?l,Jr,(J-'t)t>o,P) is a probability space where {J-t)t>o is
the completed minimal filtration of a Brownian motion В = (Bt)t>o, then
all semimariingales on this Brownian space verify Hypothesis A. Indeed, by
Corollary 1 of Theorem 43 all the local martingales are continuous. Thus if
X is a semimartingale, let X — M + A be a decomposition with M a local
martingale and A an FV process. Then the jump processes AX and AA are
equal, hence
0<s<t 0<s<t
since A is of finite variation on compacts.
Let X be a semimartingale satisfying Hypothesis A, and let
Jt =
0<s<t
that is, J is the process which is the sum of the jumps. Because of our hypoth-
hypothesis J is an FV process and thus also a semimartingale. Moreover Yt = Xt - Jt
is a continuous semimartingale with Yo = 0, and we let
Y = M + A
be its (unique) decomposition, where M is a continuous local martingale and
Л is a continuous FV process with Mo = Ao — 0. Such a process M is then
uniquely determined and we can write
M = XC,
the continuous local martingale part of X.
Notation. We assume given a semimartingale X satisfying Hypothesis A. If
Z is any other semimartingale we write
= Г
Jo+
»->o}
dZs (a?

222 IV General Stochastic Integration and Local Times
This notation should not be confused with that of Kolmogorov's Lemma (The-
(Theorem 72). It is always assumed that we take the В(Ж)®В(М.+)®3Г measurable,
cadlag version of Za (cf., Theorem 63).
Before we can prove our primary regularity property (Theorem 75), we
need two preliminary results. The first is a very special case of a family of
martingale inequalities known as the Burkholder-Davis-Gundy inequalities.
Theorem 73 (Burkholder's Inequality). Let X be a continuous local mar-
martingale with Xo =0, 2 < p < oo, and T a finite stopping time. Then
where Cp = {gP(Hfcii)}P/2, „ah I + A = 1.
Proof. By stopping, it suffices to consider the case where X and [X, X] are
bounded. By Ito's formula we have
P{P 1} f
\XT\P=p[ signfJQI^r1^ + P{P~ 1} f \Xs\r-4[X,X)s.
Jo * Jo
By Doob's inequalities (Theorem 20 of Chap. I) we have (with - + | — 1)
< qpE{\XTf}
ll Г \Xsr2d[x,x)s}
Jo
with the last inequality by Holder's inequality. Since Е{(Хт)р}х~* < оо, we
divide both sides by it to obtain
and raising both sides to the power p/2 gives the result. ?
Actually much more is true. Indeed for any local martingale (continuous
or not) it is known that there exist constants cp, Cp such that for a finite
stopping time T
E{(XWYlp < cpE{[X,X}pT/2}^p < CpE{{X*TYY'i>
for 1 < p < oo. See Sect. 3 of Chap. VII of Dellacherie-Meyer [46] for these
and related results. When the local martingales are continuous some results
even hold for 0 < p < 1 (see, e.g., Barlow-Jacka-Yor [9, Table 4.1 page 162]).

7 Local Times 223
Theorem 74. Let X be a semimartingale satisfying Hypothesis A. There ex-
exists a version of(Xc)f such that (a,t,u>) i—> (Xc)"(w) is B(M.)®V measurable,
and everywhere jointly continuous in (a, t).
Proof. Without loss of generality we may assume X — Xo € H2. If it is not,
we can stop X — Xo at Tn—. The continuous local martingale part of
is then just (XC)T". Suppose -oo < о < b < oo, and let
at{a,b) = E{{ f l{b>Xa_>a}d[X,X)csJ}.
Jo
By Corollary 1 of Theorem 70 we have
at{a,b) -
Г
J a
. .2 ,. 1
b —
2 f 1
ЧЬ-с
by the Cauchy-Schwarz inequality. The above implies
By the definition,
bt
and therefore
Е{{ЦJ} <_
<
at(a, b) < (b- a)
<A?<\Xt-XQ\-
Z2E{\Xt-X0\2}l
сЩХ-Х0\\Ъ+4:
2 sup -B{(L");
ue(a,b)
- / sign(Xs_ ¦
-2E{([ sign(J
Jo+
Г ud J\
a Ja *
pb
I Ja *
- lt)dXs
te_ - «)dXsJ}
¦ (Л — Л0)\\пз
X0\\^2 <oo,
and the bound is independent of u. Therefore
at{a,b)<{b-aJr,
for a constant Г < oo, and independent of t. Next using Burkholder's inequal-
inequality (Theorem 73) we have
/•OO
E{sup\(Xc)bs - (Xc)as\4} < C4E{( / l{b>Xs__>a}d[X,X}csJ}
s Jo
< Ci sup at(a, b)
t
< С4Г{Ъ - aJ.
The result now follows by applying Kolmogorov's Lemma (the corollary of
Theorem 72). D

224 IV General Stochastic Integration and Local Times
We can now establish our primary result.
Theorem 75. Let X be a semimartingale satisfying Hypothesis A. Then there
exists a B(M.)®V measurable version of(a,t,u>) >—* L^(u>) which is everywhere
jointly right continuous in a and continuous in t. Moreover a.s. the limits
Ц~ — limf,_,aif,<aLj exist.
Proof. Since X satisfies Hypothesis A, the process Jt = J2o<s<t ^^* is an
FV semimartingale, and Y = X — J is a continuous semimartingale. We let
Y = M + A be the (unique) decomposition of Y, with Mo = Ao = 0.
Then X - M + A + J. Further define
0<s<t 0<s<t
Observe that \S?\ < ?0<s<t \AX*\ < oo. By Theorem 68 we have
(Xt - o)+ - (Xo - a)+ = {M)at + (i)? + (J)at + S? + ±Ц.
Consider
We have
and
/'
Jo+
a<b
0+
l{x._>b}dAe
l{x._>b)dAe
where the convergence is uniform in t on [0, r], and r > 0. We have analogous
results for (J)f and also for Sf, because it is dominated by J2o<s<t l^^sl <
oo. Since we already know that (M)" is continuous, the proof is complete. ?
The next three corollaries are simple consequences of Theorem 75. Re-
Recall that for a semimartingale X satisfying Hypothesis A, we let Jt =
J2o<s<t ^Xs, Y = X - J, and Y = M + A be the (unique) decomposition of
Y, with Ao = 0.
Corollary 1. Let X be a semimartingale satisfying Hypothesis A. Let X =
M + A + J be a decomposition with M and A continuous and J the jump
process of X, and let (L")t>0 be its local time at the level a. Then
at-Lat~ =2 f l{Xs_
Jo
= 2 f l{Xs=
Jo
dAs.

7 Local Times 225
An example of a semimartingale satisfying Hypothesis A but having a
discontinuous local time is JQ = \Bt\ where В is standard Brownian motion
with Bo = 0. Here La{X) = La(B) + L~a(B) for о > 0, L°(X) = L°(B), and
La(X) — 0 for о < 0. J. Walsh [227] has given a more interesting (but more
complicated) example.
Corollary 2. Let X be a semimartingale satisfying Hypothesis A. Let A be
as in Corollary 1. The local time (La) is continuous in t and is continuous at
о = oo if and only if
/ l{Xs=ao}\dAs\ = 0.
Jo
Observe that if X = B, a Brownian motion (or any continuous local mar-
martingale) , then A = 0 and the local time of X can be taken everywhere jointly
continuous.
Corollary 3. Let X be a semimartingale satisfying Hypothesis A. Then for
every (a, t) we have
and
Lt =
lim- / l{a<xs<a+s}d[X,X}cs, a.s.
^° ? Jo
I ft
lim)-/ 1{a-s<xs<a}d[X,X}cs, a.s.
?^° e Jo
The lack of symmetry in the above formula stems from the definition of
local time, where we defined sign(a:) in an asymmetric way:
x > 0,
x < 0.
A symmetrized result follows trivially, namely
T a j_ то,—
^Уо 4\xs~a\<e}d[X,X\l, a.s.
Corollary 3 is intuitively very appealing, and justifies thinking of local time
as an occupation time density. Also if X = B, a Brownian motion, then
Corollary 3 becomes the classical result
as, a.s.
Local times have interesting properties when viewed as processes with
"time" fixed, and the space variable "a" as the parameter. The Ray-Knight
Theorem giving the distribution of (Ьу~а)о<а<ъ the Brownian local time
sampled at the hitting time of 1, as the square of a two dimensional Bessel
diffusion, is one example.

226 IV General Stochastic Integration and Local Times
The Meyer-Ito formula (Theorem 70) can be extended in a different di-
direction, which gives rise to the Bouleau-Yor formula, allowing non-convex
functions of semimartingales. The key idea is that the function a i—» Ьц in-
induces a measure on R if L is the local time of a semimartingale X satisfying
Hypothesis A and U is a positive random variable.
Theorem 76. Let X be a semimartingale satisfying Hypothesis A, U a pos-
positive random variable, La the local times of X. Then the operation
n
J
where f(x) = J2 fi^(a,i,ai+1](x)> can be extended uniquely to a vector measure
on В(Ж) with values in L°.
Proof. Recall that L° denotes finite-valued random variables. By Theorem 68
we know that
1 _ - /"*
-Lat = (Xt -a) -(X0-a) + l{xs_<
1 Jo+
0<s<t 0<s<t
We write Sf for the last two sums on the right side of the equation above.
Note first that (Хц — a)~ — (Xo — a)~ is Lipschitz continuous in о and
hence absolutely continuous in a. Therefore it induces a measure in a. Also
the function а м 5^ is cadlag and of finite variation in a, and moreover
f \daSu\ < 2j2o<s<u\^Xs\- Finally consider the stochastic integral. Let
1 ~Xu)-Jo
= i
и
0+
и
Therefore by the Dominated Convergence Theorem for stochastic integrals
(Theorem 32) we have that Yli fi(Xy'+1 — X^) extends to a measure with
values in L° and moreover / f{a)daXfj = J0+f(Xs^)dXs, for / bounded,
Borel measurable on R. Thus daLfj can be defined as the sum of three L°-
valued measures on (R, Z?(R)), and the theorem is proved. D
In the proof of Theorem 76 we proved the following result, which is im-
important enough to state as a corollary.

8 Azema's Martingale 227
Corollary. Let X be a semimartingale satisfying Hypothesis A, U a positive
random variable, and X" = Jo l{x3-<a}dXs. Then daXy can be defined as
an L°-valued measure, and for / e Ы?(Й) we have
/ f{a)daXb = I f{Xs_)dXs.
Ju Jo+
Combining Theorem 76 and its corollary yields the Bouleau-Yor formula.
We leave its proof to the reader.
Theorem 77 (Bouleau-Yor Formula). Let X be a semimartingale satisfy-
satisfying Hypothesis A, U a positive random variable, f a bounded, Borel function,
and F(x) = /oz f(u)du. Then
F(Xu) - F(X0) = / f(Xs_)dXs - \ ff(a)daLau
+ Y, {F(Xs)-F(Xs_)-f(Xs_)AXs}.
0<s<U
Combining Theorem 77 with the Meyer-Ito formula (Theorem 70) we ob-
obtain the following relationship.
Corollary. Let A" be a semimartingale satisfying Hypothesis A, and let T be
a finite-valued random variable. Let La be the local times of X and let / be
a C1 function. Then
/ f'{a)LaTda =- f{a)daLaT.
J—oo J—oo
Remark. The Meyer-Ito formula (Theorem 70) and the Bouleau-Yor formula
can be extended in interesting ways. This was first done in [77], which inspired
subsequent work by N. Eisenbaum [62] and others (e.g., [6]). See [80] for a
compilation of selected results in these areas and also new ones.
8 Azema's Martingale
In order to prove the regularity properties of local times in Sect. 7, we needed
Hypothesis A: a semimartingale X satisfies Hypothesis A if X is such that
^0<s<t|AA"s| < oo a.s., each t > 0. This hypothesis was needed to prove
Theorems 74 through 77 and their corollaries. We present here a counterex-
counterexample, known as Azema's martingale, that shows that these results do not
hold for general semimartingales. In particular, Theorems 81 and 82 show
that Azema's martingale has a local time that is zero at every level a except
a = 0, and therefore there is no regular version in the space variable.
Let (п, T,F, (Bt)t>o,P) be a standard Brownian motion, Bo = 0, with
the filtration F = {Tt)t>o completed (and hence right continuous). We define

228 IV General Stochastic Integration and Local Times
/l, ifa:>0,
sign(x) = <
v ; [-1, ifx<0.
Set g% = cr{sign(Bs); s < t}, and let G = (&)t>o denote the filtration (?to)t>o
completed. (It is then right continuous as a consequence of the strong Markov
property of Brownian motion.)
Let Mt = E{Bt\Qt},t > 0. Then for s<t,
E{Mt\gs} = E{Bt\gs} = E{E{Bt\rs}\gs} = E{B8\gs} = ms,
and MisaG martingale. We always take the cadlag version of M.
Definition. The process Mt — E{Bt\Qt} is called Azema's martingale.
Fundamentally related to Azema's martingale is the process which gives
the last exit from zero before time t. We define
gt = sup{s < t : Bs - 0}.
To study the process g we need the reflection principle for Brownian motion,
which we proved in Chap. I (Theorem 33 and its corollary) and which we
recall here for convenience.
Theorem 78 (Reflection Principle for Brownian Motion). Let В be
standard Brownian motion, Bo = 0, and с > 0. Then
P(supBs > c) = 2P(Bt > c).
s<t
An elementary conditioning argument provides a useful corollary.
Corollary. Let В be standard Brownian motion, Bo = 0, and 0 < s < t.
Then
P( sup Bu > 0, Bs < 0) = 2P(Bt >0,Bs< 0).
s<u<t
Theorem 79 is known as Levy's arcsine law for the last exit from zero.
Theorem 79. The process gt is G adapted, and P(gt < s) = ^arcsiny^,
0 < s <t.
Proof. For s<twe have almost surely
{<?*<*} = ( П {s«>o» LJ( П {в»<°})
s<u<t s<u<t
from which it follows that gt is Qt adapted.
To calculate the distribution of gt we observe for s < t (using the symmetry
ofB),

8 Azema's Martingale 229
P(gt >s) = 2P{gt >s,Bs< 0)
= 2P( sup Bu >0,Bs< 0)
s<«<t
= 4P(Bt>0,Bs<0),
by the corollary of Theorem 78. Since (Bs,Bt) are jointly Gaussian, there
exist X, Y that are independent N@,1) random variables with
Bs = y/sX and Bt = л/sX + ^t-sY.
Then {Bs < 0} = {X < 0}, and {Bt > 0} = {Y > --j?Lx}. To calculate
P({Y > -^Lx,X < 0}), use polar coordinates:
P(Bt > 0, Bs < 0) = —
7Г —arcsin
1 ,7Г /7
= —( arcsinA/-)
1 1 /I
= arcsin \ -.
4 2тг V *
Therefore 4P(Bt > 0,Bs < 0) = 1 - ^ arcsin yf and P(gt < s) =
- arcsin л/Т. D
Theorem 80. Azema's martingale M is given by Mt = sign(??t)y^J\/t — gt,
t>0.
Proof. By definition,
Mt = Е{Б*|а*} = E{sign(Bt)\Bt\\Gt} = si
However ?{|Bt||(?t} = JB{|Bt||9t; sign(Bs), s < *}. But the process sign(Bs) is
independent of \Bt\ for s < gt, and sign(??s) = sign(Bt) f°r 9t < s <t. Hence
sign(Bs) is constant after gt. It follows that
E{\Bt\\Gt} = E{\Bt\\gt}.
Therefore if we can show 2?{|.Bt||9t = s} = \f\^/t — s, the proof will be
complete.
Given 0 < s < t we define
T = t Л inf{u > s : Bu = 0}.
Note that T is a bounded stopping time. Then

230 IV General Stochastic Integration and Local Times
E{\Bt\;gt<s} = 2E{Bt;gt<s and Bs > 0}
= 2E{Bt;T = t and Bs > 0}
= 2E{BT;Bs>0}
= 2E{Bs;Bs>0},
since BT = 0 on {T ф t}, and the last step uses that В is a martingale, and
s < T. The last term above equals E{\BS\} = \j\\ft- Therefore
?P(9t < s)
since j^P{gt < s) — —,1 , by Theorem 79. D
Let T{® = a{Ms; s < t), the minimal filtration of Azema's martingale M,
and let H = CHt)t>o be the completed filtration.
Corollary. The two filtrations H and G are equal.
Proof. By definition Mt G Qt, each t > 0. Hence Tit С ^t. However, by
Theorem 80, sign(Mt) = sign(Bt), so that Qt С Tit- ?
Next we turn our attention to the local times of Azema's martingale. For
a semimartingale (Xt)t>0, let (L^(X))t>0 = La{X) denote its local time at
the level a.
Theorem 81. The local time at zero of Azema's martingale is not identically
zero.
Proof. By the Meyer-Ito formula (Theorem 70) for f(x) = \x\ we have
\Mt\ = f sign(Ms_)dMs +L°t(M) + Y, {\Ms\ - \Ms-\- sign(Ms_)AMs}.
J° 0<s<t
Set Yt = \Mt\. Since sign(Ms_)AMs = AYS, we have that \Mt\ - Ц(М)
is a martingale. If L°(M) were the zero process, then Y = \M\ would be a
non-negative martingale with Yo = 0, hence identically zero. Since Y is not
identically zero, the theorem is proved. D
Theorem 82. The local times at all levels except zero of Azema's martingale
are 0. That is, Ц(М) = 0, t > 0, if а ф 0.

8 Azema's Martingale 231
Proof. By Theorem 69 we know that the measure dL^(u>) on K+ is carried by
the set {s : Ms_(o>) = М3(ш) = a}, a.s., for each a. However if а ф 0, this set
is countable. Since s н-> L" is a.s. continuous, it cannot charge a countable set.
Therefore dLas is the zero measure for а ф 0. Since Lg = 0, we have L" = 0
for а ф 0. D
Theorems 81 and 82 together show that Azema's martingale provides a
counterexample to a general version of Theorem 75, for example. Therefore a
hypothesis such as Hypothesis A is necessary, and also we see that Azema's
martingale does not satisfy Hypothesis A. Since all semimartingales for the
Brownian motion nitration F do satisfy Hypothesis A, as noted in Sect. 7, we
conclude the following.
Theorem 83. Azema's martingale M is not a semimartingale for the Brown-
Brownian filtration F.
While the local time at zero, L°(M), is non-trivial by Theorem 81, more
information is available. It is actually the same as the Brownian local time,
as Theorem 85 below shows. First we need a preliminary result.
Theorem 84. The local time L°(B) of Brownian motion is G adapted.
Theorem 84 will be proved if one can express L°(B) as the a.s. limit of
measurable functionals of the zero set of Brownian motion. Such a result is
classical: see, e.g., Kingman [126, page 730].
Theorem 85. The local times at zero of Brownian motion and Azema's mar-
martingale are the same. That is, L°(B) = L°(M).
Proof. As we saw in the proof of Theorem 81, \Mt\ - L®(M) is a martingale.
Since the process L°(M) is non-decreasing, continuous and adapted, it is
natural. If A is another natural process such that \M\ - A is a martingale,
then L°(M) = Aby (for example) Theorem 30 of Chap. III. Thus, it suffices
to show that \M\ — L°(B) is a martingale for the filtration G.
By Tanaka's formula, \Bt\ - L^(B) = iVt, where iV is an F martingale.
Then
E{\Bt\ - L°t(B)\Gt} = E{Nt\gt}
and by Theorem 84
E{\Bt\\gt} - L°t(B) is a martingale.
Since
E{\Bt\\gt} = E{sign(Bt)Bt\gt} = sign(Bt)E{Bt\gt}
= sign(Bt)Mt
= \щ,
we have \Mt\ — L^(B) is a martingale, and the theorem is proved. D

232 IV General Stochastic Integration and Local Times
Corollary 1. The process \M\ - L°(B) is a G martingale.
Proof. This corollary is proved at the end of the proof of Theorem 85. D
Corollary 2. The process l{B(>o}\/fV^ ~ 9t ~ \L®{B) is a G martingale.
Proof. Since 1{ms^>o}M~ — 1{ms_<o}^s+ = 0> it follows from Theorem 68
that Mt+ — ^L®(M) is a martingale. However Mt+ = 1{.в4>о}\/|лА — 9t by
Theorem 80, and L?(M) = L?(B) by Theorem 85, and the theorem follows.
D
The next theorem shows that Azema's martingale is quadratic pure jump.
Theorem 86. For Azema's martingale M, [M,M]C = 0, and [M,M]t — \gt-
Proof. By Corollary 2 of Theorem 70,
/•OO
[M,M]ct= / Latda.
J — OO
By Theorem 82, [M,M]C = 0. We conclude that [M,M}t = *?0<s<t(AMsJ,
and it follows from Theorem 80 that [M, M}t = |gt. ~ П
Theorem 86 allows us to compute the compensation of [M, M]; that is, the
unique predictable, increasing process A such that [M, M] — A is a martingale.
This process A is denoted (M,M) in the literature, and it was defined in
Sect. 4 of Chap. III.
Theorem 87. Let M be Azema's martingale. Then [M,M]t — jt is a mar-
martingale. That is, {M,M)t = ~t.
Proof. Recall that Mt2 - [M, M]t is a martingale (cf., Theorem 27 of Chap. II).
By Theorems 80 and 86,
Since the above is a martingale, it remains only to observe that \gt - \t is a
martingale. D
Note that if Xt = sign(Bt)\/2V* ~ fft> then X is a quadratic pure jump
martingale with {X, X)t = t. Recall that if N is a Poisson process with
intensity one, then Nt - t is a martingale. Since [N, N]t = Nt, we have
[iV, N]t — t is a martingale; thus {N, N)t = t. Another example of a martin-
martingale Y with (У, Y)t = t is Brownian motion. Since [B,B]t = t is continuous,

9 Sigma Martingales 233
9 Sigma Martingales
We saw with Emery's example (which precedes Theorem 34) a stochastic
integral with respect to an Ti2 martingale that was not even a local martingale.
Yet "morally" we feel that it should be one. It is therefore interesting to
consider the space of stochastic integrals with respect to local martingales.
We know these are of course semimartingales, but we want to characterize
the class of processes that arise in a more specific yet reasonable way.
By analogy, recall that in measure theory a er-finite measure is a measure /j,
on a space E, B) such that there exists a sequence of measurable sets УЦ е В
such that (Ji Ai = S and ц(Лг) < oo, each i. Such a measure can be thought
of as a special case of a countable sum of probability measures. It is essentially
a similar phenomenon that brings us out of the class of local martingales.
Definition. An Kd-valued semimartingale X is called a sigma martingale if
there exists an Rd-valued martingale M and a predictable K+-valued process
H e L(M) such that X = H-M.
In the next theorem, H will always denote a strictly positive predictable
process.
Theorem 88. Let X be a semimartingale. The following are equivalent:
(i) X is a sigma martingale;
(ii) X = H ¦ M where M is a local martingale;
(iii) X = H • M where M is a martingale;
(iv) X = H ¦ M where M is a martingale in H1 ¦
Proof. It is clear that (iv) implies (iii), that (iii) implies (ii), and that (iii)
obviously implies (i), so we need to prove only that (i) implies (iv). Without
loss assume that Mo = 0. Since A" is a sigma martingale it has a representation
of the form X = H ¦ M where M is a martingale. We know that we can
localize M in Ti1 by Theorem 51, so let Tn be a sequence of stopping times
tending to oo a.s. such that MT" e H1 for each n. Set To = 0 and let Nn =
1(т„_1,т„] ¦ MT". Let an be a strictly positive sequence of real numbers such
that iZn^W^Wn1 < °°. Define N = ^ZnanNn and one can check that N
is an Jil martingale. Set J = 1{#=о} + Н^2пО!Й11(т„_1,т„]- Then it is simple
to check that X = J ¦ N and that J is strictly positive. D
Corollary 1. A local sigma martingale is a sigma martingale.
Proof. Let A" be a local sigma martingale, so that there exists a sequence of
stopping times tending to oo a.s. such that XTn is a sigma martingale for
each n. Since XT" is a sigma martingale, there exists a martingale Mn in
H1 such that XTn = Hn ¦ Mn, for each n, where Hn is positive predictable.
Choose фп positive, predictable so that \\фпНп ¦ MT"||Wi < 2"n. Set Го = О
and ф° = ф^1щ. Then ф = ф° + ^2п>1Фп1(тп~1,т„] ls strictly positive and
predictable. Moreover ф ¦ X is an Ti1 martingale, whence X = 4 • ф • X, and
the corollary is established. ?

234 IV General Stochastic Integration and Local Times
Corollary 2. A local martingale is a sigma martingale.
Proof. This is simply a consequence of the fact that a local martingale is
locally a martingale, and trivially a martingale is a sigma martingale. D
The next theorem gives a satisfying stability result, showing that sigma
martingales are stable with respect to stochastic integration.
Theorem 89. If X is a local martingale and Я e L(X), then the stochastic
integral H ¦ X is a sigma martingale. Moreover if X is a sigma martingale
(and a fortiori a semimartingale) and H e L{X), then H ¦ X is a sigma
martingale.
Proof. Clearly it suffices to prove the second statement, since a local mar-
martingale is already a sigma martingale. But the second statement is simple.
Since A" is a sigma martingale we know an equivalent condition is that there
exists a strictly positive predictable process ф and a local martingale M such
that X = ф ¦ M. Since H is predictable, the processes Я1 = Я1{#>0} + 1
and H2 = — Я1{я<0} + 1 are both strictly positive and predictable. Moreover
H = Я1 - H2. The processes Я1 and H2 are both in L{X) because H is. By
associativity and linearity of the stochastic integral,
H ¦ X = Н{ф ¦ M) = (Нф) ¦ M = Н1ф ¦ M - Н2ф ¦ M,
and since Нгф are both strictly positive for i = 1, 2 we have by Theorem 88
that Нгф ¦ M is a sigma martingale for each « = 1,2. Sigma martingales form
a vector space, and we are done. ?
We also would like to have sufficient conditions for a sigma martingale to
be a local martingale.
Theorem 90. A sigma martingale which is also a special semimartingale is
a local martingale.
Proof. Any sigma martingale is a semimartingale by definition; here we also
assume it is special. Thus it has a canonical decomposition X = M + A where
M is a local martingale and A is a process of finite variation on compacts
which is also predictable. We want to show A = 0. We assume Ao = 0.
Choose H predictable, H > 0 everywhere, such that H ¦ X is a martingale,
which we can easily do using Theorem 88. Then (Я Л 1) ¦ X = (^-) • (Я • X)
is a local martingale, and hence without loss of generality we can assume H
is bounded. Then Я • A is a finite variation predictable process, and we have
H ¦ A = H ¦ X — H ¦ M which is a local martingale. Thus we conclude that
H ¦ A = 0. If we replace H with JH where J is predictable, \J\ < 1, and is
such that /0 JsdAs = JQ \dAs\, then we conclude that /0 Hs\dAs\ — 0, and
since H > 0 everywhere, we conclude that A = 0 establishing the result. D
We now have that any criterion that ensures that the semimartingale X
is special, will also imply that the sigma martingale A" is a local martingale.

Bibliographic Notes 235
Corollary 1. If a sigma martingale X has continuous paths, then it is a local
martingale.
Corollary 2. If A" is a sigma martingale and if either Xf = sups<t \XS\ or
Yt = sups<t \AXS\ is locally integrable, then X is a local martingale.
Note that as a corollary to Corollary 2 we have the following.
Corollary 3. If A" is a sigma martingale with bounded jumps, then A" is a
local martingale.
Bibliographic Notes
The extension by continuity of stochastic integration from processes in L to
predictable processes is presented here for essentially the first time. How-
However the procedure was indicated earlier in Protter [201, 202]. Other ap-
approaches which are closely related are given by Jacod [103] and Chou-Meyer-
Stricker [31] (see an exposition in Dellacherie-Meyer [46, page 381]).
The "H" in the space Ti2 of semimartingales comes from Hardy spaces,
and this bears explaining. When the semimartingale X G H2 is actually a
martingale, the space of Hp martingales has a theory analogous to that of
Hardy spaces. While this is not the case for semimartingales, the Hp norms
for semimartingales are, in a certain sense, a natural generalization of the W
norms for martingales, and so the name has been preserved. The 7ip norm
was first introduced by Emery [64], though it was implicit in Protter [197].
Most of the treatment of stochastic integration in Sect. 2 of this chapter is
new, though essentially all of the results have been proved elsewhere with dif-
different methods. The author benefited greatly throughout by discussions with
S. Janson. Theorem 1 (that H2 is a Banach space) is due to Emery [66]. The
equivalence of the two pseudonorms for Ti2 (the corollary of Theorem 24)
is originally due to Yor [241], while the fact that U2{Q) С Н2(Р) if $ is
bounded is originally due to Lenglart [144]. Theorem 25 is originally due to
Jacod [103, page 228]. The concept of a predictable cr-algebra and its impor-
importance to stochastic integration is due to Meyer [166]. The local behavior of
the stochastic integral was first investigated by McShane for his integral [154],
and then independently by Meyer [171], who established Theorem 26 and its
corollary for the semimartingale integral. The first Dominated Convergence
Theorem for the semimartingale integral appearing in print seems to be in
Jacod [103].
The theory of martingale representation dates back to Ito's work on mul-
multiple stochastic integrals [96], and the theory for M2 presented here is largely
due to Kunita-Watanabe [134]. A more powerful theory (for M1) is presented
in Dellacherie-Meyer [46]. Absolutely continuous spaces are defined here for
the first time. The author benefited from discussions with Yan Zeng on this
subject. That the dual of Ti} is BMO for martingales is due to the combined

236 Exercises for Chapter IV
efforts of many researchers, culminating in Meyer's paper [170]; we follow the
treatment in [87]. The Jacod-Yor Theorem is from [111], and Sect. 9 largely
follows Emery [69].
The theory of stochastic integration depending on a parameter is due to
the fundamental work of Strieker-Yor [219], who used a key idea of Doleans-
Dade [48]. The Fubini Theorems for stochastic integration have their origins
in the book of Doob [55] and Kallianpur-Striebel [120] for the ltd integral,
Kailath-Segall-Zakai [119] for martingale integrals, and Jacod [103] for semi-
martingales. The counterexample to a general Fubini Theorem presented here
is due to Janson.
The theory of semimartingale local time is of course abstracted from Brow-
nian local time, which is due to Levy [149]. It was related to stochastic inte-
integration by Tanaka (see McKean [153]) for Brownian motion and the ltd inte-
integral. The theory presented here for semimartingale local time is due largely to
Meyer [171]. See also Millar [181]. The measure theory needed for the rigorous
development of semimartingale local time was developed by Strieker-Yor [219],
and the Meyer-Ito formula (Theorem 70) was formally presented in Yor [242],
as well as in Jacod [103]. Theorem 71 is due to Yor [244], and has been ex-
extended by Ouknine [188]. A more general version (but more complicated) is
in Qinlar-Jacod-Protter-Sharpe [34]. The proof of Kolmogorov's Lemma given
here is from Meyer [177]. The results proved under Hypothesis A are all due
to Yor [243] except the Bouleau-Yor formula [22]. See Bouleau [21] for more
on this formula.
Azema's martingale is of course due to Azema [4], though our presenta-
presentation of it is new and it is due largely to Janson. For many more interesting
results concerning Azema's martingale, see Azema-Yor [5], Emery [69], and
Meyer [179].
Sigma martingales date back to the work of Emery [67], and to Chou [29].
The importance of sigma martingales was clarified through the work in math-
mathematical finance of Delbaen and Schachermayer [38]. See [110] for a more
comprehensive treatment.
Exercises for Chapter IV
Exercise 1. Let A" be a semimartingale. Show that X is special if and only
if the increasing process Ct — [X, X}t is locally integrable; that is, there
exists a sequence of stopping times Tn increasing to infinity a.s. such that
E{[X,Х]т„} < oo for each time Tn.
* Exercise 2. Let A" be a special semimartingale with canonical decomposi-
decomposition Xt = Xo + Mt + At. Show that the following two inequalities hold:
E{[A,AU} < ЕЦХ^и) and

Exercises for Chapter IV 237
Exercise 3. Let A2, T, P) be a complete probability space. Let Xn be a
sequence of random variables such that P(|Xn| < сю) = 1, all n. Show that
there exists another probability Q which is equivalent to P in the sense that
the null sets are the same for both measures, with j^ bounded, and every
random variable Xn is in Ll(dQ).
Exercise 4. Let X be a semimartingale. Show that there is a probability
measure Q, equivalent to P, such that under Q, X has a decomposition X =
M + A, where M is a martingale with EQ([M,M]t) < oo for each t, 0 <
t < oo, and A is a predictable process of finite variation on compacts with
Eq(J0 |cL4,j|) < oo for each t, 0 < t < oo. (Hint: Use Exercises 1, 2, and 3.)
*Exercise 5. Let Xn be a sequence of semimartingales. Show that there
exists one probability measure Q, equivalent to P, such that under Q, each
Xn has a decomposition Xn = Mn + An, where Mn is a martingale with
Ео([Мп, Mn]t) < oo for each t, 0 < t < oo and each n, and An is a predictable
process of finite variation on compacts with Eq(J0 \dA™\) < oo for each t,
0 < t < oo and each n.
Exercise 6. Let S denote the space of all square integrable martingales with
continuous paths a.s., on a given filtered complete probability space satisfying
the usual conditions. Show that S is a stable subspace. (This exercise was
referred to in Definition 3.)
Exercise 7. Let Z be a Levy process with ?^{|Zt|} < oo and E{Zt} = 0 for
all t > 0. Show that Z is a martingale, and that Zc is either 0 or Brownian
motion, where Zc is defined in Exercise 6. Conclude that any martingale Levy
process Z that has no Brownian component is purely discontinuous.
Exercise 8. Let N be a standard Poisson process with arrival intensity A.
Show that the martingale Nt — Xt is purely discontinuous. (Note that this gives
an example of a purely discontinuous martingale whose sample paths are not
purely discontinuous in the sense that they do not change only by jumps.)
Exercise 9. Let M be a square integrable martingale. Show that [M, M}% =
[Mc,Mc]t = [Mc,M]t, all t > 0, almost surely.
Exercise 10 (example of orthogonal projection). Let T be a stopping
time. Show that the collection S of all square integrable martingales stopped
at T is a stable subspace. Also show that if M is an arbitrary square integrable
martingale, then its orthogonal decomposition with respect to this subspace
is M = MT + (M — MT). Give a description of the orthocomplement of S.
Exercise 11 (example of orthogonal projection). Let T be a totally
inaccessible stopping time and let M be a square integrable martingale. Let
Ut = ДМт1{4>т} and let L be the martingale of U minus its compensator.
We denote L by ПМ. Show that ПМ and M - ПМ are orthogonal. Let S
denote the space of all square integrable martingales which are continuous at
the instant T. Show that S is a stable subspace, and that ПМ represents the
orthogonal projection of M onto the orthocomplement of S.

238 Exercises for Chapter IV
* Exercise 12. Let T be an arbitrary stopping time, T > 0, and let X be
a random variable in L1. Show that M = Xl{t>x} is a uniformly integrable
martingale if and only if E{X\J~t-} = 0.
*Exercise 13. Let T be a stopping time, T > 0, and let M be a square
integrable martingale. Show that the decomposition
Mt = [(MT - E{MT\TT-})l{t>T}] + [Mt - (MT - Е{Мт\Рт-}I{г>т})]
is an orthogonal decomposition. Show that one of the stable subspaces in
question is the space of all square integrable martingales M such that Mt is
Tt- measurable. (Hint: Use Exercise 12.)
*Exercise 14. Let Z be a Levy process on a complete probability space
(Sl,T, P) and let F be its minimal filtration completed (and satisfying the
usual hypotheses). Show that (Sl,J-, F, P) is an absolutely continuous space.
Exercise 15. Consider the structure equation d[X,X]t = dt + /3Xt-dXt.
Show that this equation has a scaling property: if X is a martingale solu-
solution, then so too is jX\2t for every А ф 0. Note that by the uniqueness in
distribution of the solution, the processes (Xt)t>o and (j-XA2t)t>o have the
same distribution.
* Exercise 16. Let X be a martingale solution of the structure equation of
Exercise 15. Show that Xе = 0, where Xе is defined in Exercise 6.
Exercise 17. Let M be a local martingale in 7ip for some p > 1. Show
that M is a uniformly integrable martingale. Show further that if p = 1 then
E{MQO] < c||M||fti where the constant с is universal. That is, с does not
depend on M.
Exercise 18. Suppose that M is a bounded martingale. Show that
This can improved to replace \/5 with 2.
Exercise 19. Show that there are martingales with finite BMO norm which
are not bounded. (Hint: Let M be a bounded martingale and Я be a pre-
predictable process such that |#| < 1, and let N = H ¦ M. Then it is easy to see
that N is in BMO, and to construct such an N that is not bounded.)
Exercise 20. Prove this version of the Fundamental Theorem of Local Mar-
Martingales. Let M be a uniformly integrable martingale. Show that there exist N
and U such that N is in BMO, U is of integrable variation, and M = U + V.
If M is a local martingale this holds locally.
Exercise 21. Show that an arbitrary local martingale is a semimartingale by
using the Burkholder-Davis-Gundy inequalities instead of the Fundamental
Theorem of Local Martingales. (See [61].)

Exercises for Chapter IV 239
Exercise 22. Let M be a local martingale. If T is a predictable stopping time,
then Ijq^^s) € ЪР so that Mj~ = Jo l[olr)(s)dMs is a local martingale. Give
an example of a local martingale M and a stopping time T such that MT~ is
not a local martingale.
Exercise 23. Let X be a continuous semimartingale. Show that
[X, X]. (Hint: Use local times.)
* Exercise 24. Let В be standard Brownian motion and let a > 0. Show that
/0 \Bt\~adt < oo a.s. if and only if a < 1. (Hint: Use local times.)
Exercise 25. Let X be a semimartingale. Show that L%(X) = ±L°(X+)
where X+ — max(X, 0) and L\(X) denotes the local time of X at time t and
level 0.
Exercise 26. Let A be adapted, continuous, and of finite variation on com-
compacts, and let В be standard Brownian motion. Let A denote Lebesgue mea-
measure on the line. Show that A(s : Bs = As) = 0 a.s.
Exercise 27. Let X be a continuous semimartingale. Show that L0. € Tt ,
each t > 0, where T\ denotes the smallest right continuous completed fil-
filtration generated by the process
Exercise 28. Let X be a semimartingale. Show that for each fixed (u>, t) the
section x i—> L*(X) has compact support.
Exercise 29 (Skorohod's Lemma). This result will be useful for the next
two exercises. Let x be a real valued continuous function on [0, сю) such that
x@) > 0. Show there exists a pair of functions (y, a) on [0, сю) such that
x — у + а, у is positive, and a is increasing, continuous, a@) = 0, and the
measure das induced by a is carried by the set {s : y(s) = 0}. Show further
that the function a is given by a(t) = sups<t(—x(s) V 0).
Exercise 30. Let M be a continuous local martingale with Mo = 0. Let
\Mt\ = Nt + L%(M). Show that N is a continuous local martingale with
No = 0, and that L°t(M) = sups<t(-Ns).
*Exercise 31. Let X be a continuous semimartingale, and suppose Y satisfies
the equation dYt = -s\gn(Yt)dXt. Show that \Yt\ = Xt* - Xt, where X* =
supu<tXu. (Hint: Use Tanaka's formula and Exercise 29.)
Exercise 32. Suppose that X and Y are continuous semimartingales such
that LPt(Y -X) = 0. Show that
L°t(X V Y) = f l{YtmdL°s(X) + f l{Xa<0]dL°s(Y).
Jo Jo
Exercise 33. Let X and Y be continuous semimartingales. Show that
L°t(X AY)
= f l{Ys>0]dL°s(X) + f l{Xs>0}dL°s(Y) + f l{Xs=Ys=o}dLos(Y+-X+).
Jo Jo Jo

240 Exercises for Chapter IV
Exercise 34 (Ouknine's Formula). Let X and Y be continuous semi-
martingales. Show that
L°t{X V Y) + L°t(X Л Y) = L°t(X) + L°t(Y).
* Exercise 35 (Emery-Perkins Theorem). Let В be standard Brownian
motion and set Lt = L^(B). Let X = В + cL, с € R fixed. Assume that
there exists a set D С М+ х п such that (i) D is predictable for Tx; (ii)
P(w : (t,w) e D) = 1 each t > 0; and (iii) P(w : (Tt(w),w) € D) = 0 each
t > 0, where Tt = inf(s > 0 : Ls > t).
(a) Show that Bt - J* lD(s)dXs. Also show that Bt 6fx, each t > 0, and
conclude that JFB = Tx = JTs+ci, all cEM.
(b) Let
„ИтХ
™~* fc=i
Use the fact that s i—> (BsATt,LsATt) and s н-> (—BTt_s,LiTt — Ьтг„3) have
the same joint distribution to show that D satisfies (i), (ii), and (iii).
* Exercise 36. With the notation of Exercise 35, let Y = \B\+cL. Show that
TY С JTlBl,for all с
*Exercise 37. With the notation of Exercise 36, Pitman's Theorem states
that У is a Bessel process of order 3, which in turn implies that for TY, with
c=l,
f
J0
for an ^"y-Brownian motion /3. Use this to show that TY ф T^B\ for с = 1.
(Note: One can show that TY = T^B\ for all с ф 1.)
* Exercise 38 (Barlow's Theorem). Let X solve the equation
Xt = Xo + [ u(Xs)dBs + [ b(Xs)ds
J J
[ [
o Jo
where В is a standard Brownian motion. The above equation is said to have
a unique weak solution if whenever X and У are two solutions, and if
fix is defined by цх{Л) = Р{ш '¦ t i—> Xt(u>) € A}, then fix — Hy- The
above equation is said to have a unique strong solution if whenever both
X and Y are two solutions on the probability space on which the Brownian
motion В is denned, then P{u : f н> 1((ш) = f и У4(ш)} = 1. Assume that
the equation has a unique weak solution with the property that if X, Y are
any two solutions defined on the same probability space, then for all t > 0,
L®(X — Y) — 0. Show that the equation has a unique strong solution.

Exercises for Chapter IV 241
Exercise 39. Consider the situation of Exercise 38, and suppose that a and
b are bounded Borel. Further, suppose that (c(x) — u(y)J < p(\x — y\) for all
x, y, where p : [О, сю) —+ [0, сю) is increasing with J* -4^-du = +oo for every
e > 0. Show that if X, Y are two solutions, then L%(X - Y) = 0.
*Exercise 40. Let Б be a standard Brownian motion and consider the equa-
equation
Xt = / sign(Xs)dBs.
Jo
Show that the equation has a unique weak solution, but that if X is a solution
then so too is —X. (See Exercise 38 for a definition of weak and strong solu-
solutions.) Hence, the equation does not have a unique strong solution. Finally,
show that if X is a solution, then X is not adapted to the nitration TB. (Hint:
Show first that Bt = JQ l^Xa^oyd\Xs\.)
Exercise 41. Let X be a sigma martingale and suppose X is bounded below.
Show that X is in fact a local martingale.
*Exercise 42. Show that any Levy process which is a sigma martingale is
actually a martingale (and not just a local martingale).
Exercise 43. Let (Un)n>i be i.i.d. random variables with P(U\ = 1) =
P(UX = -1) = 1/2, and let X = Е„>1 2""t/«l{t>9n} where (qn)n>i is an
enumeration of the rationale in @,1). Let Ht = X)n>i ^^-{t>gn} ап<^ show
that X ? H2 and H € L(X), but also that X is of finite variation and
Y = H ¦ X has infinite variation. Thus the space of finite variation processes
is not closed under stochastic integration!
* Exercise 44. Let Xn be a sequence of semimartingales on a filtered complete
probability space (fi, T, F, P) satisfying the usual hypotheses. Show there ex-
exists one probability Q, which is equivalent to P, with a bounded density such
that under Q each Xn is a semimartingale in H2.
Exercise 45. Let Б be a standard Brownian motion and let Lx be its local
time at the level x. Let
Л = {u> : dLxs (u>) is singular as a measure with respect to ds]
where of course ds denotes Lebesgue measure on R+. Show that Р(Л) = 1.
Conclude that almost surely the paths s i—> Lx are not absolutely continuous.
(Hint: Use Theorem 69.)
Exercise 46. Let В be a standard Brownian motion and let N be a Poisson
process with parameter A = 1, with В and N independent as processes. Let
L be the local time process for В at level 0, and let X be the vector process
Xt — (Bt, NLt). Let F be the minimal filtration of X, completed in the usual
way.
(a) Show that X is a strong Markov process.

242 Exercises for Chapter IV
(b) Let T be the first jump time of X. Show that the compensator of the
process Ct = l{t>T} is the process At = LtAT-
Note that this gives an example of a compensator of the first jump time of a
strong Markov process which has paths that are almost surely not absolutely
continuous.
**Exercise 47. Let Z be a Levy process. Show that if Z is a sigma martingale,
then Z is a martingale. (Note: This exercise is closely related to Exercise 29
of Chap. I.)

V
Stochastic Differential Equations
1 Introduction
A diffusion can be thought of as a strong Markov process (in Rn) with contin-
continuous paths. Before the development of Ito's theory of stochastic integration
for Brownian motion, the primary method of studying diffusions was to study
their transition semigroups. This was equivalent to studying the infinitesimal
generators of their semigroups, which are partial differential operators. Thus
Feller's investigations of diffusions (for example) were actually investigations
of partial differential equations, inspired by diffusions.
The primary tool to study diffusions was Kolmogorov's differential equa-
equations and Feller's extensions of them. Such approaches did not permit an
analysis of the paths of diffusions and their properties. Inspired by Levy's in-
investigations of sample paths, Ito studied diffusions that could be represented
as solutions of stochastic differential equations1 of the form
Xt = Xo + f o{Xs)dBs + f b(Xs)ds
Jo Jo
(*)
where Б is a Brownian motion in R™, a is an n x n matrix, and b is an n-vector
of appropriately smooth functions to ensure the existence and uniqueness of
solutions. This gives immediate intuitive meaning. If (Ft)t>o is the underlying
filtration for the Brownian motion B, then for small e > 0
-XI- eb\Xt)){Xl+? - X{ - eV(Xt))\Ft} = (aa'f(Xt)s + o(e),
where a' denotes the transpose of the matrix a.
Ito's differential "dB" was found to have other interpretations as well. In
particular, "dB" can be thought of as "white noise" in statistical communi-
communication theory. Thus if ?t is white noise at time t, Bt = /0 ?sds, an equation
1 More properly (but less often) called "stochastic integral equations."

244 V Stochastic Differential Equations
which can be given a rigorous meaning using the theory of generalized func-
functions (cf., e.g., Arnold [2]). Here the Markov nature of the solutions is not as
important, and coefficients that are functionals of the paths of the solutions
can be considered.
Finally it is now possible to consider semimartingale driving terms (or
"semimartingale noise"), and to study stochastic differential equations in full
generality. Since "dB" and adf are semimartingale differentials, they are
always included in our results as special cases. While our treatment is very
general, it is not always the most general available. We have at times preferred
to keep proofs simple and non-technical rather than to achieve maximum
generality.
The study of stochastic differential equations (SDEs) driven by general
semimartingales (rather than just by dB, dt, dN, and combinations thereof,
where N is a Poisson process) allows one to see which properties of the so-
solutions are due to certain special properties of Brownian motion, and which
are true in general. For example, in Sect. 6 we see that the Markov nature of
the solutions is due to the independence of the increments of the differentials.
In Sects. 8 and 10 we see precisely how the homeomorphic and diffeomorphic
nature of the flow of the solution is a consequence of path continuity. In Sect. 5
we study Fisk-Stratonovich equations which reveal that the "correction" term
is due to the continuous part of the quadratic variation of the differentials. In
Sect. 11 we illustrate when standard moment estimates on solutions of SDEs
driven by Brownian motion and dt can be extended to solutions of SDEs
driven by Levy processes.
2 The HJ3 Norms for Semimartingales
We defined an 7i2 norm for semimartingales in Chap. IV as follows. If X is a
special semimartingale with Xq = 0 and canonical decomposition X = N + A,
then
\ /
Jo
/0
We now use an equivalent norm. To avoid confusion, we write H_ instead
of TL2. Moreover we will define H_p, 1 < p < oo. We begin, however, with a
different norm on the space E> (i.e., the space of adapted cadlag processes).
For a process H € D we define
H* = sup \Ht
t
Occasionally if H is in L (adapted and caglad) we write ||i?||sp as well, where
the meaning is clear.

2 The Яр Norms for Semimartingales 245
If A is a semimartingale with paths of finite variation, a natural definition
of a norm would be \\A\\P = || Jo°° |cL4s|||?P, where |cL4s(a;)| denotes the total
variation measure on R+ induced by s i—> As(ui). Since semimartingales do
not in general have such nice paths, however, such a norm is not appropriate.
Throughout this chapter, we will let Z denote a semimartingale with Zq = 0,
a.s. Let Z be an arbitrary semimartingale (with Zq = 0). By the Bichteler-
Dellacherie Theorem (Theorem 43 of Chap. Ill) we know there exists at least
one decomposition Z = N + A, with N a local martingale and A an adapted,
cadlag process, with paths of finite variation (also Щ = Aq = 0 a.s.). For
1 < p < oo we set
/>OO
jp(N,A) = \\[N,N}1Ji2+ / |dA,|||bP.
Definition. Let Z be a semimartingale. For 1 < p < oo define
where the infimum is taken over all possible decompositions Z — N + A where
N is a local martingale, A ? D with paths of finite variation on compacts,
and Aq — No = 0.
The corollary of Theorem 1 below shows that this norm generalizes the
IP norm for local martingales, which has given rise to a martingale theory
analogous to the theory of Hardy spaces in complex analysis. We do not pursue
this topic (cf., e.g., Dellacherie-Meyer [46]).
Theorem 1. Let Z be a semimartingale (Zo = 0). Then \\[Z, Z}H2\\LP <
и?, A <P< oo).
Proof. Let Z = M + A, Mo = Ac, = 0, be a decomposition of Z. Then
s
OO
/•OO
<{M,M]l?2+ \dA.\,
Jo
where the equality above holds because A is a quadratic pure jump semi-
1 /9
martingale. Taking Lp norms yields \\[Z,Z]^ \\lp < jp{M, A) and the result
follows. ?
Corollary. If Z is a local martingale (Zo = 0), then \\Z\\h_p = \\{Z, Z]U2\\lp-

246 V Stochastic Differential Equations
Proof. Since Z is a local martingale, we have that Z = Z+0 is a decomposition
of Z. Therefore
By Theorem 1 we have \\[Z, Z]H2\\lp < \\Z\\hp, hence we have equality. D
Theorem 2 is analogous to Theorem 5 of Chap. IV. For most of the proofs
which follow we need only the case p = 2. Since this case does not need
Burkholder's inequalities, we distinguish it from the other cases in the proof.
Theorem 2. For 1 < p < oo there exists a constant cp such that for any
semimartingale Z, Zq = 0,
Proof. A semimartingale Z is in ГО, so ||Z||s" makes sense. Let Z — M + A
be a decomposition with Mo = Aq — 0. Then
(-O
/
JO
using (o + 6)p < 2p-1(ap + ЬР).
In the case p = 2 we have by Doob's maximal quadratic inequality that
) = 4E{[M,MU}.
For general p, 1 < p < сю, we need Burkholder's inequalities, which state
for a universal constant cp which depends only on p and not on the local
martingale M. For continuous local martingales and p > 2 we proved this
using Ito's formula in Chap. IV (Theorem 73). For general local martingales
and for all finite p > 1 see, for example, Dellacherie-Meyer [46, page 287].
Continuing, letting the constant cp vary from line to line we have
/•OO
/ \dAs\f}
Jo
лОО
+ ( \dAa\y}
Jo
<cp{jp(M,A)f,
and taking p-th roots yields the result. D
Corollary. On the space of semimartingales, the H_p norm is stronger than
the 5P norm, 1 < p < oo.
Theorem 3 (Emery's Inequality). Let Z be a semimartingale, H € L,
l + l = I A < p < oo, l<q <oo). Then

)\\Lr
Г
2 The M_p Norms for Semimartingales 247
Proof. Let H ¦ Z denote (Jo HsdZs)t>Q. Recall that we always assume Z§ = 0
a.s., and let Z = M + A be a decomposition of Z with Mo = Ao = 0 a.s. Then
H ¦ M + H ¦ A is a decomposition of H ¦ Z. Hence
\\н-г\\ш<зЛн-м,н-А).
Next recall that [Я • M, H ¦ M] = / H2sd[M, M]s, by Theorem 29 of Chap. II.
Therefore
/¦OO ЛОО
jr(H-M,H-A) = \\( H2d[M,M}sI'2 + \Hs\\dAs\\\Lr
Jo Jo
<\\H*OO([M,M}1JJ+ Г \dA
Jo
< \\H*OO\\LP\\(\M,M}^2+ Г \dAs\)\\L4
Jo
= \\H\\gpjq(M,A),
where the last inequality above follows from Holder's inequality. The foregoing
implies that
\\H-Z\\&r<\\H\\gPjg(M,A)
for any such decomposition Z = M + A. Taking infimums over all such de-
decompositions yields the result. ?
For a process XeD and a stopping time T, recall that
X = -Х"Д[о,г) + -Xrl[r,oo)>
XT~ = -Х"Д[0)г) +-Х"г-1[г,оо)-
A property holding locally was defined in Chap. I, and a property holding
prelocally was defined in Chap. IV. Recall that a property тг is said to hold
locally for a process X if Хтп\{хп>0) has property тг for each n, where Tn
is a sequence of stopping times tending to oo a.s. If the process X is zero at
zero (i.e., Xo = 0 a.s.) then the property тг is said to hold prelocally if XT"~
has property тг for each n.
Definition. A process X is locally in 5P (resp. H_p) if there exist stopping
times (T")n>i increasing to oo a.s. such that XTnl{Tn>oy is in ^p (resp. Др)
for each n, 1 < p < oo. If Xo = 0 then X is said to be prelocally in S_p (resp.
?LP) if XT"- is in if (resp. Kp) for each n.
While there are many semimartingales which are not locally in H?, all
semimartingales are prelocally in H_p. The proof of Theorem 4 below closely
parallels the proof of Theorem 13 of Chap. IV.
Theorem 4. Let Z be a semimartingale (Zo = 0). Then Z is prelocally in
Hp,l<p<oo.

248 V Stochastic Differential Equations
Proof.. By the Fundamental Theorem of Local Martingales (Theorem 25 of
Chap. Ill) and the Bichteler-Dellacherie Theorem (Theorem 43 of Chap. Ill)
we know that for given e > 0, Z has a decomposition Z = M+A, Mo = Ao = 0
a.s., such that the jumps of the local martingale M are bounded by e. Define
inductively
Tk : [M, M)l/2 + / |<Me| > * + 1}.
./o
The sequence (Tk)k>x are stopping times increasing to oo a.s. Moreover
- AMTkl[TkiOo)) = N + C
is a decomposition of ZTk~. Also, since [М,М]тк = [М,М]тк- + (АМткJ,
we conclude
joo(N,C) = ||[7V,7V]V2+ Г \dC.\U
Jo
= \\([М,М}Тк- + (AMrJ2I/2 + / * \dC.\\\L*
Jo
(fc + e)||L~ <oo.
Therefore ZTk~ € K°° and hence it is in 2?P as well, 1 < p < oo. ?
Definition. Let Z be a semimartingale in H_°° and let a > 0. A finite sequence
of stopping times 0 = To < Tx < ¦ ¦ ¦ < Tk is said to a-slice Z if Z = ZTk~
and || (Z — ZTi)Ti+1~\\H°° <a, 0<i<k — I. If such a sequence of stopping
times exists, we say Z is a-sliceable, and we write Z ? S(a).
Theorem 5. Let Z be a semimartingale with Z$ = 0 a.s.
(i) For a > 0, if Z € S(a) then for every stopping time T, ZT € S(a) and
ZT~ e SBa).
(ii) For every a > 0, there exists an arbitrarily large stopping time T such
thatZT~ ?S(a).
Proof. Since ZT~ = MT + (AT- -AMTl[Tt0o)), and since ||ZT||g=o < ||Z||^~
always, one concludes ||ZT~||#°° < 2||Z||#°°, so that (i) follows.
Next consider (ii). If semimartingales Z and Y are a-sliceable, let T?
and T? be two sequences of stopping times respectively a-slicing Z and Y.
By reordering the points T? and Tj and using (i), we easily conclude that
Z + Y is 8a-sliceable. Next let Z = M + A, Mo = Ao = 0 a.s., with the
local martingale M having jumps bounded by the constant /3 = a/24. By the
preceding observation it suffices to consider M and A separately.

3 Existence and Uniqueness of Solutions 249
For A, let To = 0, Tk+1 = inf{* > Tk : /^ \dAs\ > a/8 or /„* |<Me| > *}.
Then ATk~ € <S(a/8) for each k, and the stopping times (Tk) increase to oo
a.s.
For M, let Ro = 0, Rk+l = inf{t > Rk : [M,M]t - [M,M}Rk >
01 or [M,M]t > k}. Then MRk~ e R°°, each k, and moreover
(M - MR-)R>+*- = MR^ - MR* - (AMRk+1l{Rk+1>Rk})llRk+uOo).
Hence
1 - [M,M]HJ1/2 +
2 + [M,M}Rk+l_ - [M,M]HJ1/2 + \AMRk+1\\\b
Thus for each k, MRk~ € <S(A + \/2)/3), and since C = a/24, the result
follows. ?
3 Existence and Uniqueness of Solutions
In presenting theorems on the existence and uniqueness of solutions of stochas-
stochastic differential equations, there are many choices to be made. First, we do not
present the most general conditions known to be allowed; in exchange we are
able to give simpler proofs. Moreover the conditions we do give are extremely
general and are adequate for the vast majority of applications. For more gen-
general results the interested reader can consult Jacod [103, page 451]. Second,
we consider only Lipschitz-type hypotheses and thus obtain strong solutions.
There is a vast literature on weak solutions (cf., e.g., Stroock-Varadhan [220]).
However, weak solutions are more natural (and simpler) when the differen-
differentials are the Wiener process and Lebesgue measure, rather than general semi-
martingales.
A happy consequence of our approach to stochastic differential equations
is that it is just as easy to prove theorems for coefficients that depend not
only on the state Xt of the solution at time t (the traditional framework), but
on the past history of the process X before t as well.
We begin by stating a theorem whose main virtue is its simplicity. It is a
trivial corollary of Theorem 7 which follows it. Recall that a process H is in
L if it has caglad paths and is adapted.
Theorem 6. Let Z be a semimartingale with Zo = 0 and let / : R+ x п x R —>
R be such that
(i) for fixed x, (t,ui) н-> f(t,w,x) is in L; and

250 V Stochastic Differential Equations
(ii) for each (t,oj), \f(t,w,x)—f(t,oj,y)\ < К(ш)\х—у\ for some finite random
variable K.
Let Xq be finite and J-o measurable. Then the equation
Xt=X0+ [ f{s,-,Xs_)dZs
Jo
admits a solution. The solution is unique and it is a semimartingale.
Of course one could state such a theorem for a finite number of differentials
dZ?, 1 < j< d, and for a finite system of equations.
In the theory of (non-random) ordinary differential equations, coefficients
are typically Lipschitz continuous, which ensures the existence and the unique-
uniqueness of a solution. In stochastic differential equations we are led to consider
more general coefficients that arise, for example, in control theory. There are
enough different definitions to cause some confusion, so we present all the def-
definitions here in ascending order of generality. Note that we add, for technical
reasons, the non-customary condition (ii) below to the definition of Lipschitz
which follows.
Definition. A function / : R+ x R™ —> R is Lipschitz if there exists a (finite)
constant к such that
(i) \f(t,x)-f(t,y)\ < k\x-y\, each teM.+ , and
(ii) t i-> f(t,x) is right continuous with left limits, each x € I".
/ is said to be autonomous if f(t,x) = f(x), all t > 0.
Definition. A function / : R+ x п x R™ —> R is random Lipschitz if /
satisfies conditions (i) and (ii) of Theorem 6.
Let D™ denote the space of processes X = (X1,..., Xn) where each PeD
A <i <n).
Definition. An operator F from D™ into D1 = D is said to be process
Lipschitz if for any X, Y in D™ the following two conditions are satisfied:
(i) for any stopping time T, XT" = Yr~ implies F(X)T~ = F(Y)T-, and
(ii) there exists an adapted process К ? L such that
\\F(X)t-F(Y)t\\<Kt\\Xt-Yt\\.
Definition. An operator F mapping Dn to D1 = D is functional Lipschitz
if for any X, Y in D™ the following two conditions are satisfied:
(i) for any stopping time T, XT" = YT~ implies F(X)T~ = F(Y)T~, and
(ii) there exists an increasing (finite) process К = (Kt)t>o such that \F(K)t -
F(Y)t\ < Kt\\X - Y||* a.s., each t > 0.

3 Existence and Uniqueness of Solutions 251
Note that if g(t,x) is a Lipschitz function, then f(t,x) = g{t—,x) is ran-
random Lipschitz. A Lipschitz, or a random Lipschitz, function induces a process
Lipschitz operator, and if an operator is process Lipschitz, then it is also
functional Lipschitz.
An autonomous function with a bounded derivative is Lipschitz by the
Mean Value Theorem. If a function / has a continuous but not bounded
derivative, / will be locally Lipschitz; such functions are defined and con-
considered in Sect. 7 of this chapter.
Let A = (j4t)t>o be continuous and adapted. Then a linear coeffi-
coefficient such as f(t,oj,x) = At(ui)x is an example of a process Lipschitz co-
coefficient. A functional Lipschitz operator F will typically be of the form
F(X) = f(t,w;Xs, s < t), where / is defined on [0,t] xdx D[0,t] for each
t > 0; here -D[0,?] denotes the space of cadlag functions defined on [0,t]. An-
Another example is a generalization of the coefficients introduced by Ito and
Nisio [101], namely
F(X)t= f g(u,u>,Xu)iJ,(u!,du)
Jo
for a random signed measure /i and a bounded Lipschitz function g with
constant C(w). In this case, the Lipschitz process for F is given by Kt(ui) =
C(a>)||/x(u;)t||, where ||/x(a»)t|| denotes the total mass of the measure fi{w,du)
on [0,t].
Lemmas 1 and 2 which follow are used to prove Theorem 7. We state and
prove them in the one dimensional case, their generalizations to n dimensions
being simple.
Lemma 1. Let 1 < p < oo, let J € 5P, let F be functional Lipschitz with
F@) = 0, and suppose supt |/ft(w)| < к a.s. Let Z be a semimartingale in
Я°° such that \\Z\\h°° < ^Tk • Then the equation
Xt = Jt+ f F(X)s_dZs
Jo
has a solution in S_p. It is unique, and moreover
Proof. Define A : gp -> gp by Л(ХL = Jt + /0* F(X)sdZs. Then by Theo-
Theorems 2 and 3 the operator is 1/2 Lipschitz, and the fixed point theorem gives
existence and uniqueness. Indeed
\\X\\gP <\\J\\gP + \\ I F(X)s^dZs\\§r

252 V Stochastic Differential Equations
Since \\F(X)\\gP = \\F(X) - F@)\\&, we have ||X||r < ||J||r + ?
which yields the estimate. ?
Lemma 2. Let 1 < p < oo, let J € 5P, let F be functional Lipschitz with
F@) = 0, and suppose supt |^t(w)| < к < oo a.s. Let Z be a semimartingale
such that Z € ^г^Га-)- Then the equation
Xt = Jt
/'
./о
has a solution in 5P. It is unique, and moreover ||X||gp < C(k, Z)\\ J||gp, where
C(k, Z) is a constant depending only on к and Z.
Proof. Let z = \\Z\\h-» and j = \\J\\sp. Let 0 = T$,Ti,... ,Te be the slicing
times for Z, and consider the equations, indexed by i = 0,1,2,...,
X = JTi~ + JF(X)s_dZj--: г = 0,1,2,.... (t)
Equation (г) has the trivial solution X = 0 since J°~ = Z°~ = 0 for all t,
and its S_p norm is 0. Assume that equation (г) has a unique solution Хг,
and let Xх = ||X*||sp. Stopping next at T, instead of T, —, let Y% denote the
unique solution of Y% = JTi + JF(Yl)s^.dZji, and set y% = ||F*||sp. Since
Yl = Хг + {AJTi + Р(Хг)т,-Агт,}1[т.<оо), we conclude that
Гг || ^ \\vi\\ i oil Til i IIZTYVMII II ^ll
IISf -S ЦЛ ||gp + Z|| J ||<Jp + \\r (Л. )\\Sp\\Zi \\Hf°
< xi + 2j + kx'z
= x*(l + kz) + 2j;
hence
We set for U e D, DtU = (U - ит')т^-. Since each solution X of
equation (г + 1) satisfies XTi = Y% on [0,Ti+1), we can change the unknown
by U = X - (Yi)T'+1-, to get the equations U = DiJ+J F(Yi + U)s-dDiZs.
However since F{Yi + 0) need not be 0 we define Gt(-) = F(Yi + •) - F^),
and thus the above equation can be equivalently expressed as
U = {DiJ+ f FfX^a-dDiZ,) + I Gi{U)s_dDiZs.
We can now apply Lemma 1 to this equation to find that it has a unique
solution in S_p, and its norm ul is majorized by
«' < 2\\DiJ + f
)

3 Existence and Uniqueness of Solutions 253
We conclude equation (г + 1) has a unique solution in SJ' with norm хг+1
dominated by (using (*))
xi+l <иг + уг < Aj + 2y* < 8j + 2A + kz)x\
Next we iterate from г = 0 to ? — 1 to conclude that
Finally, since Z = ZTt ~, we have seen that the equation X = J+f F(X)s-dZs
has a unique solution in S_p, and moreover X = Xе + J — JTi~. Therefore
\\X\\& < xe + 2j, and hence C(k, Z) < 2 + ЧЩ^1}- ?
Theorem 7. Given a vector of semimartingales Z = (Z1,... ,Zd), Zo = 0
processes J' eD, 1 < г < n, and operators Fj which are functional Lipschitz
A < i < n, 1 < j < d), the system of equations
A < г < n) has a solution in D™, and it is unique. Moreover if (Jl)i<n is a
vector of semimartingales, then so is (Xl)i<n.
Proof The proof for systems is the same as the proof for one equation pro-
provided we take F to be matrix-valued and X, J and Z to be vector-valued.
Hence we give here the proof for n = d — 1. Thus we will consider the equation
f
Jo
F(X)s.dZs.
Assume that maxjj supt Kl'J(ui) < к < оо a.s. Also, by considering the equa-
equation
Xt = (Jt+ f F@)s_dZs) + I G(X)s_dZs,
Jo Jo
where G(X) = F(X) - F@), it suffices to consider the case where F@) = 0.
We also need Lemmas 1 and 2 only for p = 2. In this case C2 = y/8.
Let T be an arbitrarily large stopping time such that JT~ e ^2 and such
that ZT~ € S( .]gk )¦ Then by Lemma 2 there exists a unique solution in 52
of
X(T)
= J?~ + f
Jo
By the uniqueness in ^2 one has, for R>T, that X(R)T~ = X(T)T-, and
therefore we can define a process lonllx [0,oo) by X = X(T) on [0,T).
Thus we have existence.

254 V Stochastic Differential Equations
Suppose next Y is another solution. Let S be arbitrarily large such that
(X — Y)s~ is bounded, and let R — min(Sr, T), which can also be taken
arbitrarily large. Then XR~ and YR~ are both solutions of
= JH~+
Jo
and since ZR~ e S( \- ), we know that XR~ = YR~ by the uniqueness
established in Lemma 2. Thus X = Y, and we have uniqueness.
We have assumed that maxij supt К}'3 (ш) < к < oo a.s. By proving
existence and uniqueness on [O,toL for to fixed, we can reduce the Lips-
chitz processes K\'3 to the random constants Kl'^(oj), which we replace with
K(w) = maxjj¦ KI'oj(lo). Thus without loss of generality we can assume we
have a Lipschitz constant K(oj) < oo a.s. Then we can choose a constant с
such that P[K < c) > 0. Let П„ = {К < с + n}, each n = 1,2, 3,.... Define
a new probability Pn by Pn(A) = P(A П П„)/Р(П„), and note that Pn <C P.
Moreover for n > m we have Pm <C Pn- From now on assume n > m in the
rest of this proof. Therefore we know that all P semimartingales and all Pn
semimartingales are Pm semimartingales, and that on Qm a stochastic inte-
integral calculated under Pm agrees with the the same one calculated under Pn,
by Theorem 14 of Chap. II. Let Yn be the unique solution with respect to Pn
which we know exists by the foregoing. We conclude Yn = Ym on flm, a.s.
(dPm). Define
Yt =
n=l
and we have Y = Yn a.s. (dPn) on Qn, and hence also a.s. (dP) on Qn, each
n. Since fl = U^L1(f2n \ iin-i) a-s. (dP), we have that on Qn:
Yt = Jt+ f F(Yn)s_dZs
Jo
= Jt+ f F{Y)s^dZs
Jo
a.s. (dP), for each n. This completes the proof. ?
Theorem 7 can be generalized by weakening the Lipschitz assumption on
the coefficients. If the coefficients are Lipschitz on compact sets, for example,
in general one has unique solutions existing only up to a stopping time T;
at this time one has limsupt^T |Xt| = oo. Such times are called explosion
times, and they can be finite or infinite. Coefficients that are Lipschitz on
compact sets are called locally Lipschitz. Simple cases are treated in Sect. 7
of this chapter (cf., Theorems 38, 39, and 40), where they arise naturally in
the study of flows.

3 Existence and Uniqueness of Solutions 255
We end this section with the remark that we have already met a funda-
fundamental stochastic differential equation in Chap. II, that of the stochastic
exponential equation
I
Jo
Z0=0.
There we obtained a formula for its solution (thus a fortiori establishing the
existence of a solution), namely
0<s<t
The uniqueness of this solution is a consequence of Theorem 7, or of Theo-
Theorem 6.
A traditional way to show the existence and uniqueness of solutions of
ordinary differential equations is the Picard iteration method. One might well
wonder if Picard-type iterations converge in the case of stochastic differential
equations. As it turns out, the following theorem is quite useful.
Theorem 8. Let the hypotheses of Theorem 7 be satisfied, and in addition
let (X°y = Hl be processes in D A < i < n). Define inductively
m+l\i _
{X,
and let X be the solution of
xi = Ji + ? / FWs-dzi (i < i < n).
3 = 1 J°
Then Xm converges to X in ucp.
Proof We give the proof for d = n = 1. It is easy to see that if итт^оо Хт =
X prelocally in S_ , then Итт^оо Xm = X in ucp; in any event this is proved
in Theorem 12 in Sect. 4. Thus we will show that limm^oo Xm = X prelocally
in ;S2. We first assume supt Kt < a < oo a.s. Without loss of generality we
can assume Z e S(a), with a - -57^, and that J € §^. Let 0 — To < T\ <
• • • < Tfc be the stopping times that a-slice Z. Then (Xm)Tl~ and XTl~ are
in S2 by Lemma 1. Then
by Theorems 2 and 3. Therefore

256 V Stochastic Differential Equations
so that lim^oo \\(Xm+1 - X)Tl-\\s2 = 0. We next analyze the jump at Tx.
Since
we have
MX™*1 -xf% <
where z = ||?||я~ < oo. Therefore
\\(Xm+1 - X)T- ||i2 < _J_
Next suppose we know that
where 7 = IKX1 - Х)Т"~\\з2. Then
f
JTt
JTt
where ^+1 = (Z - Z7''O'^1-. Therefore by iterating on ?, 0 < I < k,
\\(Xm+1 -Xf^'Wg, < \\(Xm+l -X)T'\\s2 + aV8a\\{Xm -Xf'+^
2Ш-1
Note that the above expression tends to 0 as m tends to oo. Therefore Xm
tends to X prelocally in S2 by a (finite) induction, and hence limm^oo Xm —
X in ucp.
It remains to remove the assumption that supt Кt < a < oo a.s. Fix a
t < oo; we will show ucp on [0,t]. Since t is arbitrary, this will imply the
result. As we did at the end of the proof of Theorem 7, let с > 0 be such that
P{Kt < c) > 0. Define Пп = {w : Kt(w) < c + n}, and Pn(A) = Р(Л\Пп).
Then Pn <C P, and under Pn, limm^ooXm = X in ucp on [0, t]. For e > 0,
choose N such that n > N implies P(&n) < ?- Then
- X)l >5)< Pn((Xm - X)*t >5) + e,
hence Итт^оо P((Xm — XI > S) < e. Since e > 0 was arbitrary, the limit
must be zero. ?

4 Stability of Stochastic Differential Equations 257
4 Stability of Stochastic Differential Equations
Since one is never exactly sure of the accuracy of a proposed model, it is
important to know how robust the model is. That is, if one perturbs the model
a bit, how large are the resulting changes? Stochastic differential equations
are stable with respect to perturbations of the coefficients, or of the initial
conditions. Perturbations of the differentials, however, are a more delicate
matter. One must perturb the differentials in the right way to have stability.
Not surprisingly, an H? perturbation is the right kind of perturbation. In this
section we will be concerned with equations of the form
f
n+ / Fn(Xn)s-dZ™ (*n)
Jo
w
JO
' F(X)s^dZs, (*)
/o
where Jn, J are in D, Zn, Z are semimartingales, and Fn, F are functional
Lipschitz, with Lipschitz processes Kn, K, respectively. We will assume that
the Lipschitz processes Kn, К are each uniformly bounded by the same con-
constant, and that the semimartingale differentials Zn, Z are always zero at 0
(that is, Zq = 0 a.s., n > 1, and Zq = 0 a.s.).
For simplicity we state and prove the theorems in this section for one
equation (rather than for finite systems), with one semimartingale driving
term (rather than a finite number), and for p = 2. The generalizations are
obvious, and the proofs are exactly the same except for notation. We say a
functional Lipschitz operator F is bounded if for all H G D, there exists a
non-random constant с < oo such that F{H)* < c.
Theorem 9. Let J, Jn e D; Z, Zn be semimartingales; and F, Fn be func-
functional Lipschitz with constants K, Kn, respectively. Assume that
(i) J, Jn are in g2 (resp. Щ2) and limn^oo Jn = J in ?2 (resp. ??*);
(ii) Fn are all bounded by the same constant c, and limn^oo Fn(X) = F(X)
in S2, where X is the solution of (*); and
(Hi) max.(supn Kn, К) < о < oo a.s. (a not random), Z e ^{^js~)i (^™)«>i
are in H2, and limn^oo Zn = Z in H_2.2
Then Нтп-юо Xn = X in S2 (resp. in H2), where Xn is the solution of (*n)
and X is the solution of (*).
Proof. We use the notation H ¦ Zt to denote JQ HsdZs, and H ¦ Z to denote
the process (H • Zt)t>o- We begin by supposing that J, (Jn)n>i are in 52 and
J™ converges to J in 52. Then
2 S(a) is defined in Sect. 2 on page 248.

258 V Stochastic Differential Equations
X - Xn = J - Jn + (F(X) - Fn(X))_ ¦ Z
+ (Fn(X) - P(I"))_ • Z + Fn(Xn)_ ¦ (Z - Zn).
Let Yn = (F(X) - Fn(.X"))_ • Z + Fn(Xn) ¦ (Z - Zn). Then
X -Xn = J-Jn + Yn + {Fn(X) - Fn(I"))_ • Z. (**)
For U G D define Gn by
Gn(U) = Fn(X) - Fn{X - U).
Then Gn{U) is functional Lipschitz with constant a and Gn@) = 0. Take
U = X — Xn, and (**) becomes
U = (J - Jn + Yn) + Gn(U)- ¦ Z.
By Lemma 2 preceding Theorem 7 we have
II* - Xn\\ga < C(a, Z)\\J -Jn + Yn\\g2 < C(a, Z){\\J - Jn\\g* + \\Yn\\g2}.
Since C(a, Z) is independent of n and Игг^-^ \\J - J™||52 = 0 by hypothesis,
it suffices to show limn^oo ||УИ||52 = О. But
\\Yn\\g, < \\{F{X) - Fn(X))_ ¦ Z\\& + \\Fn(Xn)_ ¦ (Z - Zn)\\g,
by Theorem 2 and Emery's inequality (Theorem 3). Since Ц^Ця00 < oo by
hypothesis and since
Xm^ \\F{X) - F"(X)||r = Jiit^ \\Z - Zn\\m = 0,
again by hypothesis, we are done.
Note that if we knew J™, J G Я2 and that Jn converged to J in Я2, then
\\X - Xn\\^ < || J - JI^ + ||yn||^2 + C\\X - Xn\\g, |
We have seen already that limn-^oo \\X — Xn\\g2 = 0, hence it suffices to show
limn-xx, ||^п||н;2 = 0. Proceeding as in (* * *) we obtain the result. D
Comment. Condition (ii) in Theorem 9 seems very strong; it is the per-
perturbation of the semimartingale differentials that make it necessary. In-
Indeed, the hypothesis cannot be relaxed in general, as the following example
shows. We take О = [0,1], P to be Lebesgue measure on [0,1], and {Ft)t>o
equal to T, the Lebesgue sets on [0,1]. Let ф{?) = min(?, 1), t > 0. Let
fn(u>) > 0 and set Ztn(u>) = <t>(t)fn(u>), u> G [0,1], and Zt{J) = 4>(t)f(u).

4 Stability of Stochastic Differential Equations 259
Let Fn(X) = F{X) = X, and finally let J? = Jt = 1, all t > 0. Thus the
equations (*n) and (*) become respectively
/
/o
Xt = l+ [ Xs_dZs,
J
= l+ [
Jo
which are elementary continuous exponential equations and have solutions
We can choose /„ such that limn^oo E{f%} = 0 but lim^oo E{f%} ф 0 for
p > 2. Then the Z™ converge to 0 in H2 but Xй does not converge to X = 1
(since / = 0) in ?p, for any p > l.Tndeed, limn^oo E{f%} Ф 0 for p > 2
implies linin-^oo ?{e*^" } ф 1 for any ? > 0.
The next result does not require that the coefficients be bounded, because
there is only one, fixed, semimartingale differential. Theorem 10, 11, and 13
all have H_2 as well as S2 versions as in Theorem 9, but we state and prove
only the 52 versions.
Theorem 10. Let J, Jn G D; Z be a semimartingale; F, Fn be functional
Lipschitz with constants K, Kn, respectively; and let Xn, X be the unique
solutions of equations (*n) and (*), respectively. Assume that
(i) Jn, J are in 52 and Нп^-^ Jn = J in S2;
(ii) lim Fn(X) = F(X) in S2, where X is the solution of (*); and
n—>oo —
(Hi) max(supn Kn,K) < a < oo a.s. for a non-random constant a, and Z €
Then limn^oo Xn = X in Q2 where Xn is the solution of (*n) and X is
the solution of (*).
Proof Let Xn and X be the solutions of equations (*n) and (*), respectively.
Then
X - Xn = J - Jn + (F(X) - Fn{X))_ ¦ Z + (Fn(X) - Fn(Xn))_ ¦ Z.
We let Yn = (F(X) - Fn(X))_ ¦ Z, and we define a new functional Lipschitz
operator Gn by
Gn{U) = Fn{X) - Fn(X - U).
Then Gn@) = 0. If we set U = X - Xn, we obtain the equation
U = J-Jn + Yn + Gn{U)- ¦ Z.

260 V Stochastic Differential Equations
Since Z e K°°, by Emery's inequality (Theorem 3) we have Yn -> 0 in Я2,
and hence also in 52 (Theorem 2). In particular ||yn||s2 < oo, and therefore
by Lemma 2 in Sect. 3 we have
\\U\\g. <C{a,Z)\\J - Jn + Yn\\g.,
where C(o, Z) is independent of n, and where the right side tends to zero as
n —> oo. Since U = X — Xn, we are done. G
We now wish to localize the results of Theorems 9 and 10 so that they hold
for general semimartingales and exogenous processes Jn, J. We first need
a definition, which is consistent with our previous definitions of properties
holding locally and prelocally (defined in Chap. IV, Sect. 2).
Definition. Processes Mn are said to converge locally (resp. prelocally)
in ?p (resp. Rp) to M if Mn, M are in ?p (resp. Др) and if there exists a
sequence of stopping times Tk increasing to oo a.s. such that limn^^, \\(Mn —
M)Tkl{Tk>o}h» = 0 (resp. lim^oo \\(Mn - M)Tk~\\gJ, = 0) for each к > 1
(resp. if replaced by Rp).
Theorem 11. Let J, Jn ? D; Z be a semimartingale (Zo — 0); and F, Fn
be functional Lipschitz with Lipschitz processes K, Kn, respectively. Let Xn,
X be solutions respectively of
[ Fn(Xn)s_dZs, (*n)
f F{X)s
Jo
(i) Jn converge to J prelocally in S2;
Xt = Jt+ f F{X)s_dZs.
Assume that
(ii) Fn(X) converges to F(X) prelocally in S2 where X is the solution of(*);
and
(Hi) max(supn Kn, К) < о < oo a.s. (a not random).
Then linin^oo Xn = X prelocally in 52 where Xn is the solution of (*n) and
X is the solution of (*).
Proof. By stopping at T- for an arbitrarily large stopping time T we can
assume without loss of generality that Z G ?(^7g~) by Theorem 5, and that
J™ converges to J in S2 and F(Xn) converges to F(X) in 52, by hypothesis.
Next we need only to apply Theorem 10. ?
We can recast Theorem 11 in terms of convergence in ucp (uniform conver-
convergence on compacts, in probability), which we introduced in Sect. 4 of Chap. II
in order to develop the stochastic integral.

4 Stability of Stochastic Differential Equations 261
Corollary. Let Jn, J G D; Z be a semimartingale (Zo = 0); and F, Fn be
functional Lipschitz with Lipschitz processes K, Kn, respectively. Let X, Xn
be as in Theorem 11. Assume that
(i) J™ converges to J in ucp,
(ii) Fn(X) converges to F(X) in ucp, and
(iii) max(supn Kn,K) < a < oo a.s. (o not random).
Then lim Xn = X in ucp.
n—»oo
Proof. Recall that convergence in ucp is metrizable; let d denote a distance
compatible with it. If Xn does not converge to 0 in ucp, we can find a subse-
subsequence n' such that infn/ d(Xn , 0) > 0. Therefore no sub-subsequence (Xn")
can converge to 0 in ucp, and hence Xn cannot converge to 0 prelocally in
S2 as well. Therefore to establish the result we need to show only that for
any subsequence n', there exists a further subsequence n" such that Xn' con-
converges prelocally to 0 in §_ . This is the content of Theorem 12 which follows,
so the proof is complete. G
Theorem 12. Let Hn, Я G D. For Hn to converge to H in ucp it is necessary
and sufficient that there exist a subsequence n' such that linv^oo Hn' — H,
prelocally in S2.
Proof. We first show the necessity. Without loss of generality, we assume that
H = 0. We construct by iteration a decreasing sequence of subsets (Nfc) of
N= {1,2,3,...}, such that
lim sup |Я™| = О a.s.
"^°°0<S<fc
By Cantor's diagonalization procedure we can find an infinite subset N' of N
such that
Iw^ sup |Я™| =0 a.s.,
each integer к > 0. By replacing N with N' we can assume without loss of
generality that Я™ tends to 0 uniformly on compacts, almost surely. We next
define
Tn=inf{t>0:|H?|>l},
Sn = inf Tm.
m>n
Then Tn and Sn are stopping times and the Sn increase to oo a.s. Indeed,
for each к there exists N(w) such that for n > N(u>), sup0<g<fc |Я"(и>)| < 1.
Hence Tn{w) > к and S^^(u;) > k. Next, define
— (/3 ) .

262 V Stochastic Differential Equations
Then
{Lm)* < sup |tf™
0<s<n
which tends to 0 a.s. Moreover, when ra > n,
(Lmy< sup |ЯП< sup |ЯП<1.
0<s<Sn 0<s<Tm
Hence by the Dominated Convergence Theorem we have that (Lm)* tends to
0 in L2, which implies the result.
For the sufficiency, suppose that Hn does not converge to H in ucp. Then
there exist t0 > 0, 8 > 0, e > 0, and a subsequence n' such that P{(Hn —
Н)ь> > 5} > ? for each "'• Then II (Я"' - я)?011ь2 > Sy/ё for all n'. Let Tk
tend to oo a.s. such that (Hn' - H)T ~ tends to 0 in S2, each k. Then there
exists К > 0 such that P(Tk < t0) < Щ- for all k> K. Hence
hm ||(Я"-Я)?0|ия< lim
We conclude 5-^/i < ^^, a contradiction. Whence Hn converges to H in
ucp. ?
Recall that we have stated and proven our theorems for the simplest case
of one equation (rather than finite systems) and one semimartingale driving
term (rather than a finite number). The extensions to systems and several
driving terms is simple and essentially only an exercise in notation. We leave
this to the reader.
An interesting consequence of the preceding results is that prelocal ^p
and prelocal H_p convergence are not topological in the usual sense. If they
were, then one would have that a sequence converged to zero if and only if
every subsequence had a sub-subsequence that converged to zero. To see that
this is not the case for S2 for instance, consider the example given in the
comment following Theorem 9. In this situation, the solutions Xn converge to
X in ucp. By Theorem 12, this implies the existence of a subsequence n' such
that ИгПп'-юоХ™' = X, prelocally in 52. However we saw in the comment
that Xn does not converge to X in S2. It is still a priori possible that Xn
converges to X prelocally in ^2, however. In the framework of the example a
stopping time is simply a non-negative random variable. Thus our counter-
counterexample is complete with the following real analysis result (see Protter [199,
page 344] for a proof). There exist non-negative functions fn on [0,1] such
that НгПп-^оо Jo /„(xJrfx = 0 and limsupn_>oo/yl(/n(x))p(ix = +oo for all
p > 2 and all Lebesgue sets Л with strictly positive Lebesgue measure. In
conclusion, this counterexample gives a sequence of semimartingales Xn such
that every subsequence has a sub-subsequence converging prelocally in S_p,
but the sequence itself does not converge prelocally in 5P, A < p < oo).

4 Stability of Stochastic Differential Equations 263
Finally, we observe that such non-topological convergence is not as un-
unusual as one might think at first. Indeed, let Xn be random variables which
converge to zero in probability but not a.s. Then every subsequence has a sub-
subsequence which converges to zero a.s., and thus almost sure convergence
is also not topological in the usual sense.
As an example of Theorem 10, let us consider the equations
X? = J?+ [ Fn(Xn)sdWs + f Gn{Xn)sds,
Jo Jo
Xt = Jt+ I F{X)sdWs + f G{X)sds
Jo Jo
where W is a standard Wiener process. If (Jn — J)j converges to 0 in L2, each
t > 0, and if Fn, Gn, F, G are all functional Lipschitz with constant К < oo
and are such that {Fn{X) - F{X))*t and {Gn{X) - G{X))t converge to 0 in
L2, each t > 0, then (Xn - X)% converges to 0 in L2 as well, each t > 0. Note
that we require only that Fn{X) and Gn(X) converge respectively to F(X)
and G(X) for the one X that is the solution, and not for all processes in D.
One can weaken the hypothesis of Theorem 9 and still let the differentials
vary, provided the coefficients stay bounded, as the next theorem shows.
Theorem 13. Let Jn, J e D; Z, Zn be semimartingales (Zg = Zo = 0 a.s.);
andF, Fn be functional Lipschitz with Lipschitz processes К, Кп, respectively.
Let Xn, X be solutions of (*n) and (*), respectively. Assume that
(i) Jn converges to J prelocally in S2;
(ii) Fn(X) converges to F(X) prelocally in S_2, and the coefficients Fn, F
are all bounded by с < oo;
(Hi) Zn converges to Z prelocally in H2; and
(iv) max(supn Kn,К) < о < oo a.s. (a not random).
Then lim Xn = X prelocally in S2.
n-»oo =
Proof. By stopping at Г— for an arbitrarily large stopping time T we can
assume without loss that Z G ^{wj5~) by Theorem 5, and that J™ converges
to J in ;g2, Fn(X) converges in g2 to F(X), and Zn converges to Z in Я2,
all by hypothesis. We then invoke Theorem 9, and the proof is complete. D
The assumptions of prelocal convergence are a bit awkward. This type of
convergence, however, leads to a topology on the space of semimartingales
which is the natural topology for convergence of semimartingale differentials,
just as ucp is the natural topology for processes related to stochastic integra-
integration. This is exhibited in Theorem 15.
Before defining a topology on the space of semimartingales, let us recall
that we can define a "distance" on D by setting, for Y, Z еШ

264 V Stochastic Differential Equations
r(Y) = ? 2""Я{1 Л sup
and d(Y, Z) — r{Y — Z). This distance is compatible with uniform convergence
on compacts in probability, and it was previously defined in Sect. 4 of Chap. II.
Using stochastic integration we can define, for a semimartingale X,
f(X) = sup r(tf • X)
where the supremum is taken over all predictable processes bounded by one.
The semimartingale topology is then defined by the distance d(X, Y) =
f(X - Y).
The semimartingale topology can be shown to make the space of semi-
martingales a topological vector space which is complete. Furthermore, the fol-
following theorem relates the semimartingale topology to convergence in HJ". For
its proof and a general treatment of the semimartingle topology, see Emery [66]
or Protter [201].
Theorem 14. Let 1 < p < oo, let Xn be a sequence of semimartingales, and
let X be a semimartingale.
(i) If Xn converges to X in the semimartingale topology, then there exists a
subsequence which converges prelocally in Hp'.
(ii) If Xn converges to X prelocally in H_p, then it converges to X in the
semimartingale topology.
In Chap. IV we established the equivalence of the norms ||^||н2 an(i
suP|h|<i 11-^" -^Ils2 ш the corollary to Theorem 24 in Sect. 2 of that chapter.
Given this result, Theorem 14 can be seen as a uniform version of Theorem 12.
We are now able once again to recast a result in terms of ucp convergence.
Theorem 13 has the following corollary.
Corollary. Let J™, JeB; Zn, Z be semimartingales (Z? = Zo = 0); and
jP™, F be functional Lipschitz with Lipschitz processes K, Kn, respectively.
Let Xn, X be solutions of (*n) and (*), respectively. Assume that
(i) J™ converges to J in ucp,
(ii) Fn(X) converges to F(X) in ucp where X is the solution of (*), and
moreover all the coefficients Fn are bounded by a random с < oo,
(iii) Zn converges to Z in the semimartingale topology, and
(iv) max(supn Kn, K) < a < oo a.s.
Then lim Xn = X in ucp.
П—ЮО
Proof. Since Zn converges to Z in the semimartingale topology, by Theo-
Theorem 14 there exists a subsequence n' such that Zn converges to Z prelocally
in Я2. Then by passing to further subsequences if necessary, by Theorem 12
we may assume without loss that J™ converges to J and Fn(X) converges

4 Stability of Stochastic Differential Equations 265
to F(X) both prelocally in S2, where X is the solution of (*). Therefore, by
Theorem 13, Xn converges to X prelocally in S2 for this subsequence. We
have shown that for the sequence (Xn) there is always a subsequence that
converges prelocally in S2. We conclude by Theorem 12 that Xn converges to
X in ucp. П
The next theorem extends Theorem 9 and the preceding corollary by re-
relaxing the hypotheses on convergence and especially the hypothesis that all
the coefficients be bounded.
Theorem 15. Let Jn, J G D; Zn, Z be semimartingales (Z? = Zo = 0); and
Fn, F be functional Lipschitz with Lipschitz process K, the same for all n.
Let Xn, X be solutions respectively of
[ F"(Xn)s_dZs", (*n)
o
Xt = Jt+ [ F(X)s.dZs. (*)
Jo
Assume that
(i) Jn converges to J in ucp;
(ii) Fn{X) converges to F(X) in ucp, where X is the solution of (*); and
(Hi) Zn converges to Z in the semimartingale topology.
Then Xn converges to X in ucp.
Proof. First we assume that supt Kt{uj) < a < oo. We remove this hypothesis
at the end of the proof. By Theorem 12, it suffices to show that there exists
a subsequence n' such that Xn converges to X prelocally in S2. Then by
Theorem 12 we can assume with loss of generality, by passing to a subsequence
if necessary, that Jn converges to J and Fn(X) converges to F(X) both
prelocally in 52. Moreover by Theorem 14 we can assume without loss, again
by passing to a subsequence if necessary, that Zn converges to Z prelocally
in H2, and that Z G <S( .Jg )¦ Thus all the hypotheses of Theorem 13 are
satisfied except one. We do not assume that the coefficients Fn are bounded.
However by pre-stopping we can assume without loss that |F(X)| is uniformly
bounded by a constant с < oo.
Let us introduce truncation operators Tx defined (for x > 0) by
TX(Y) = min(x,sup{-x,Y)).
Then Tx is functional Lipschitz with Lipschitz constant 1, for each x > 0.
Consider the equations
Ytn =
/ (Ta+c+1Fn)(Yn)s-dZ^.
Jo

266 V Stochastic Differential Equations
Then, by Theorem 13, Yn converges to X prelocally in S2. By passing to yet
another subsequence, if necessary, we may assume that Fn(X) tends to F(X)
and Yn tends to X uniformly on compacts almost surely. Next we define
Sk = inf{t > 0 : 3m > jfc : \Ytm - Xt\ + \Fm(X)t - F(X)t\ > 1}.
The stopping times Sk increase a.s. to oo. By stopping at Sk —, we have for
n> к that (Yn - X)* and {Fn{X) - F{X))* are a.s. bounded by 1. (Note
that stopping at Sk- changes Z to being in S{-^-jw~) instead of S{ j4g-) by
Theorem 5.) Observe that
\Fn{Yn)\ < \Fn(Yn) - Fn{X)\ + \Fn(X) - F(X)\ + \F(X)\
< a{Yn - X)* + (Fn{X) - F{X))* + F(X)*
< a + 1 + c,
whence (Ta+c+1Fn)(Yn) = Fn(Yn). We conclude that, for an arbitrarily
large stopping time R, with Jn and Zn stopped at R, Yn is a solution of
Ytn =
s:
which is equation (*n). By the uniqueness of solutions we deduce Yn = Xn
on [0, R). Since Yn converges to X prelocally in 52, we thus conclude Xn
converges to X prelocally in S2.
It remains only to remove the hypothesis that supt Kt{uS) < о < oo. Since
we are dealing with local convergence, it suffices to consider supg<t Ks < Kt,
for a fixed t. Since Kt < oo a.s., let a > 0 be such that P(Kt < a) > 0, and
define
^m = {w : Kt{uj) < a + m}.
Then Om increase to О a.s. and as in the proof of Theorem 7 we define Pm
by Pm(A) = Р{А\Пт), and define J^ = ^|nm, the trace of Tt on Пт. Then
Pm <c P, so that by Lemma 2 preceding Theorem 25 in Chap. IV, if Zn
converges to Z prelocally in Я2(Р), then Zn converges to Z prelocally in
H_ (Pm) as well. Therefore by the first part of this proof, Xn converges to X
in ucp under Pm, each m > 1.
Choose e > 0 and m so large that -P(O?J < e. Then
lim P((Xn - X)*t >5)< lim Pm((Xn - X)* > S) + Р(Пст)
< lim Pm({Xn - X)\ > 6) + г
and since e > 0 was arbitrary, we conclude that Xn converges to X in ucp on
[0, t]. Finally since t was arbitrary, we conclude Xn converges to X in ucp. ?

4 Stability of Stochastic Differential Equations 267
Another important topic is how to approximate solutions by difference
solutions. Our preceding convergence results yield two consequences (Theo-
(Theorem 16 and its corollary).
The next lemma is a type of Dominated Convergence Theorem for stochas-
stochastic integrals and it is used in the proof of Theorem 16.
Lemma (Dominated Convergence Theorem). Let p,q,r be given such
that - + - = ?, where 1 < r < oo. Let Z be a semimartingale in H4, and let
Hn e§* such^hat \Hn\ < Y € §?, all n > 1. Suppose lim^^ Щ_{ш) = О,
all (t,u>). Then
lim
Proof. Since Z e H_q, there exists a decomposition of Z, Z = N + A, such
/
Jo
/ |e||U <oo.
Jo
Let Cn be the random variable given by
cn = (Г°(явп_)МадвI/2 + Г \н?_\\ма
Jo Jo
The hypothesis that \Hn\ <Y implies
/ \dAa\) a.s.
/ \
Jo
/
o
However
/»OO /»OO
/ \dAa\)hr < \\Y*\\lA\IN,n№ + / |dAe|||L,
Jo Jo
,A) <oo.
Thus Cn is dominated by a random variable in Lr and hence by the Dominated
Convergence Theorem Cn tends to 0 in Lr. D
We let <7ra denote a sequence of random partitions tending to the identity.3
Recall that for a process Y and a random partition а = {0 = Го < Т\ < ¦ ¦ ¦ <
Tfen}, we define
к
Note that HY is adapted, cadlag (i.e, Y € Щ, then (Y°)s>o is left continuous
with right limits (and adapted, of course). It is convenient to have a version
of Ya e D, occasionally, so we define
3 Random partitions tending to the identity are defined in Chap. II, preceding
Theorem 21.

268 V Stochastic Differential Equations
Theorem 16. Let J € 5 , let F be process Lipschitz with Lipschitz process
К < a < oo a.s. and F@) € S2. Let Z be a semimartingale in S( j- ), and
let X(a) be the solution of
rt
Xt = Jt+ F(X°+)°dZs, (*CT)
Jo
for a random partition a. If an is sequence of random partitions tending to
the identity, then X(an) tends to X in S2, where X is the solution of (*) of
Theorem 15.
Proof. For the random partition an (n fixed), define an operator Gn on D by
Gn{H) = F{Ha"+)a"+.
Note that Gn(H) € © for each Я e D and that СП(Я)_ =
Then Gn is functional Lipschitz with constant К and sends S2 into itself, as
the reader can easily verify. Since F@) € S2, so also are Gn@) e S2, n > 1,
and an argument analogous to the proof of Theorem 10 (though a bit simpler)
shows that it suffices to show that /0 Gn(X)s_dZs converges to /0 F(X)s_dZs
in S2, where X is the solution of
Xt = Jt+ I F(X)s-dZs.
I F(X)s-t
Jo
Towards this end, fix (t,oj) with t > 0, and choose e > 0. Then there exists
5 > 0 such that \Хи(ш) - Xt-(w)\ < e for all и € [t - 25, t). If mesh((r) < 6,
then also \X?+(w) - Xu(co)\ < 2e for all и e [t - 5, t). This then implies that
\F(Xa+)(oj) - F(X)(oj)\ < 2ae. Therefore (F{Xa"+) - F(X))°"(uj) tends to
0 as mesh(crn) tends to 0. Since, on @,oo),
lim
mesh(CTn)—
we conclude that
lim
mesh(an)—>0
where convergence is pointwise in (t, u>). Thus
lim G»(X)-=F(X)-.
meshfcr,,)—>0
However we also know that
< F@)' + aX*
which is in S_ and is independent of an. Therefore using the preceding lemma,
the Dominated Convergence Theorem for stochastic integrals, we obtain the
convergence in S2 of /0 Gn{X)s-dZs to /0 F(X)s-dZs, and the proof is com-
complete. D

4 Stability of Stochastic Differential Equations 269
Remark. In Theorem 16 and its corollary which follows, we have assumed
that F is process Lipschitz, and not functional Lipschitz. Indeed, Theorem 16
is not true in general for functional Lipschitz coefficients. Let Jt = l{t>i}>
Zt - t Л 2, and F(Y) = Yil{t>i}. Then X, the solution of (*) of Theorem 15
is given by Xt — (t Л 2)l{t>ij, but if a is any random partition such that
Tfc ф 1 a.s., then (X(a)a+)~t = 0 for t < 1, and therefore F{X(a)a+) = 0,
and X{o~)t = Jt = l{t>i}. (Here X(a) denotes the solution to equation (*a)
of Theorem 16.)
Corollary. Let J e D; F be process Lipschitz; Z be a semimartingale; and
let an be a sequence of random partitions tending to the identity. Then
lim X(an) = X in ucp
n—>oo
where X(an) is the solution of (*a) and X is the solution of (*), as in Theo-
Theorem 16.
Proof. First assume К < a < oo, a.s. Fix t > 0 and e > 0. By Theorem 5 we
can find a stopping time T such that ZT~ € S( L ), and P(T < ?) < e. Thus
without loss of generality we can assume that Z € S( ^ ). By letting 5fc =
inf{? > 0 : | Jt\ > k}, we have that Sk is a stopping time, and lim/^oo Sk = oo
a.s. By now stopping at Sk— we have that J is bounded, hence also in S_2,
and Z e S(tjs-)- ^n analogous argument gives us that F@) can be assumed
bounded (and hence in ^2) as well; hence Z e ^(тАт-). We now can apply
Theorem 16 to obtain the result.
To remove the assumption that К < a < oo a.s., we need only apply
an argument like the one used at the end of the proofs of Theorems 7, 8
and 15. D
Theorem 16 and its corollary give us a way to approximate the solution of
a general stochastic differential equation with finite differences. Indeed, let X
be the solution of
Xt = Jt+ f F(X)s-dZ
J
f F(X)s
Jo
where Z is a semimartingale and F is process Lipschitz. For each random
partition an = {0 = Tff < T™ < ¦ ¦ • < T? }, we see that the random variables
Х(ап)т? verify the relations (writing a for an, X for X(an), Tk for Tg)
Xt0 = Jo,
ХТк+1 = ХТк + JTk+1 - JTk + F(X°+)Tk(ZTk+1 - ZTk).
Then the solution of the finite difference equation above converges to the
solution of (*), under the appropriate hypotheses.
As an example we give a means to approximate the stochastic exponential.

270 V Stochastic Differential Equations
Theorem 17. Let Z be a semimartingale and let X = ?{Z), the stochastic
exponential of Z. That is, X is the solution of
Xs-dZs.
Let an be a sequence of random partitions tending to the identity. Let
Then lim Xn — X in ucp.
n—>oo
Proof. Let Yn be the solution of
f
Jo
equation (*a) of Theorem 16. By the corollary of Theorem 16 we know that
Yn converges to X — ?{Z) in ucp. Thus it suffices to show Yn = Xn.
Let an = {0 = Ton < If < ¦ ¦ • < TfcnJ. On (Tf ,T?+1] we have
Ytn = Y?r
Inducting on i down to 0 we have
for T? < t < T?+v Since ZTi*+i - ZT? = 0 for all j > i when T? <t<T?+1,
we have that Yn = Xn, and the theorem is proved. D
5 Fisk-Stratonovich Integrals and Differential Equations
In this section we extend the notion of the Fisk-Stratonovich integral given
in Chap. II, Sect. 7, and we develop a theory of stochastic differential equa-
equations with Fisk-Stratonovich differentials. We begin with some results on the
quadratic variation of stochastic processes.
Definition. Let H, J be adapted, cadlag processes. The quadratic covari-
covariation process of H, J denoted [H, J] = ([H, J]t)t>o, if it exists, is defined
to be the adapted, cadlag process of finite variation on compacts, such that
for any sequence an of random partitions tending to the identity,

5 Fisk-Stratonovich Integrals and Differential Equations 271
lim Sa (tf,J) = lim tfoJo + Y(HT^ - HT")(JT"^ - JT")
n—*oo n—>oo ^—'
г
= [tf,J]
with convergence in ucp, where on is the sequence 0 = Tff < T™ < • • • < Tg .
A process H in ID is said to have finite quadratic variation if [H, H]t exists
and is finite a.s., each t > 0.
If H, J, and H+J in ID have finite quadratic variation, then the polarization
identity holds:
[tf, J] = ^([tf + J, tf + J] - [tf,tf] - [J, J]).
For X a semimartingale, in Chap. II we defined the quadratic variation of X
using the stochastic integral. However Theorem 22 of Chap. II shows every
semimartingale is of finite quadratic variation and that the two definitions are
consistent.
Notation. For H of finite quadratic variation we let [H, H]c denote the con-
continuous part of the (non-decreasing) paths of [H,H]. Thus,
[tf,tf]t = [tf,tf]tc+ ? A[H,H]S,
0<s<t
where Д[#, H}t = [Я, H]t - [H, H]t-, the jump at t.
The next definition extends the definition of the Fisk-Stratonovich integral
given in Chap. II, Sect. 7.
Definition. Let H ? D, X be a semimartingale, and assume [H, X] exists.
The Fisk-Stratonovich integral of H with respect to X, denoted /0 #s_ о
dXs, is defined to be
f tfs_ о dXs = f Hs-dXs + \ [Я, X\%.
Jo Jo z
To consider properly general Fisk-Stratonovich differential equations, we
need a generalization of Ito's formulas (Theorems 32 and 33 of Chap. II).
Since Ito's formula is proved in detail there, we only sketch the proof of this
generalization.
Theorem 18 (Generalized Ito's Formula). Let X = (X1,.. .,Xn) be an
n-tuple of semimartingales, and let f : WL+ x Q x W1 —> WL be such that
(i) there exists an adapted FV process A and a function g such that
t,u>,x) = / g(s,w,
Jo
f(t,u>,x) = / g(s,oj,x)dAs,
Jo
(s,oj) \—> g(s,cj,x) is an adapted, jointly measurable process for each x,
and Jo supxe^ \g(s,oj,x)\\dAs\ < oo a.s. for compact sets K.

272 V Stochastic Differential Equations
(ii) the function g of (i) is C2 in x uniformly in s on compacts. That is,
sup{\g(s,oj,y) -y2
<rt(W)||x-y||)||x-y||2
a.s., where rt : f!xl| —> R+ is an increasing function with\imuiort(u) =
0 a.s., provided к ranges through a compact set (rt depends on the compact
set chosen).
(in) the partial derivatives fXi, fxiXj, 1 < i, j < n all exist and are continu-
continuous, and moreover
fXt(t,w,x)= / gXi(s,w,x)dAs,
Jo
fXiXj(t,ui,x)= / gXiXj(s,u,x)dAs.
Jo
Then
/t n -t
g(s,ui,Xs)dAs + y2 .
t=i Jo
1 n rt
0<s<t
n
- g(s,u>,Xs)AAs - ^J/Xi(s-,w,Xs_)AXJ}.
Proof. We sketch the proof for n = 1. We have, letting 0 = to < t\ < ¦ ¦ ¦ <
tm — t be a partition of [0,t], and assuming temporarily \X\ < к for all s < t,
к a constant,
f(t,w,Xt)-f(p,w,X0)
m — \ m—1
fe=0 fe=0
m~1 rtk+i m-1
-1 ptk+1 m-1
= 52 9(u,uj,Xtk+1)dAu+J2f(tk,uj,Xtk+1)~f(tk,oj,Xtk) (*)
fe=0 Jtk k=0
Consider first the term Y^k=o /*к+1 Э^т^тXtk+1)dAu. The integrand is not
adapted, however one can interpret this integral as a path-by-path Stieltjes

к к
5 Fisk-Stratonovich Integrals and Differential Equations 273
integral since A is an FV process. Expanding the integrand for fixed (u,oj)
by the Mean Value Theorem yields
g(u,w,Xtk+1) = д(и,ш,Хи) + gx(u,u>,Xu)(Xtk+1 - Xu)
where Xu is in between Xu and Xtk+1. Therefore
g(u,oj,Xtk+1)dAu
/•tk+l ftk + 1
= Y1 I 9(u,oj,Xu)dAu + V / gx(u,u,Xu)(Xtk+1 - Xu)dAu,
к Jtk к Jtk
and since A is of finite variation and X is right continuous, the second sum
tends to zero as the mesh of the partitions tends to zero. Therefore letting тгп
denote a sequence of partitions of [0, t] with limn_,oo meshGrn) = 0,
JLim^ Y^ / 9(u,oj,Xtk+1)dAu= / g(u,u),Xu)dAu(uj) a.s.
Next consider the second term on the right side of (*), namely
m-l
V ^ xf-t. v \ ?/4. v \
/ ^ J{tk,uj,Ji.tk+x) — j[tk,uJ,Ji.tk).
fe=0
Here we proceed analogously to the proof of Theorem 32 of Chap. II. Given
e > 0, t > 0, let A(e, t) be a set of jumps of X that has a.s. a finite number
of times s, and let В = B(e, t) be such that ^g?B(AXsJ < e2, where A U В
exhaust the jumps of X on @,t]. Then
m-l
2_^/(?fc,w,^tk+i) - f(tk,w,xtk)
fe=0
) - f(tk,w,Xtk) + 2_;f(tk,w,Xtk+1) - f(tk,w,Xtk)
k,A k,B
where J2k,A denotes J2tke*n l{An(tk,tk+1]^}- Thus
- J[tk,w,Xtk) = у f(s-,u,Xs) - /(s-,w,As_).
k,A
By Taylor's formula, and letting AkX denote Xtk+1 — Xtk,

274 V Stochastic Differential Equations
, w,Xtk+1)~ f(tk, u>, Xtk) (**)
k,B
к,А
k,B
By Theorems 21 and 30 of Chap. II, the first sums on the right above con-
converge in ucp to /0 fx(s-,u>,Xs-)dXs and \ /0 fxx(s-,u>,Xs^)d[X,X]s, re-
respectively. The third sum converges a.s. to
seA
By condition (ii) on the function g, we have
limsup Yl R(tk,u,Xth,Xtk+1)<rt{w,e+)[X,X]t
n tke-wn,B
where
rt(u),e+) = limsup rt(u>,d).
Sle
Next we let e tend to 0; then rt(cj,e+)\X,X]t tends to 0 a.s., and finally
combining the two series indexed by A we see that
{f(s-,oj,Xs) - f(s-,oj,Xs_) - fx(s-,uj,Xs_)AXs
s?A(e,t)
tends to the series
0<s<t
-\fxx{s-,u,Xs_){AXsJ}
which is easily seen to be an absolutely convergent series (cf., the proof of The-
Theorem 32 of Chap. II). Incorporating - J2o<s<t \fxx{s-,u),Xs-){AXsJ into
\ /o fxx{s-,u,Xs-.)d{X,X]s yields the term \ f* fxx(s-,w,Xa-)d[X,X]ca.
We further note that f(s,u>,Xs) - f(s—,w,Xs) = g(s,u>,Xs)AAs, and
since J3o<s<t5(s'w'^s)Ayls is absolutely convergent, the theorem is proved.
The assumption that \XS\ < к for s < t is removed as it was in Theorem 32
of Chap. II. D

5 Fisk-Stratonovich Integrals and Differential Equations 275
Note that a consequence of Theorem 18 is that for / satisfying the hy-
hypotheses, we have f(t, -,Xt) is a semimartingale when X = (X1,... ,Xn) is
an n-tuple of semimartingales. Also, an important special case is when the
process As = s; in this case the hypotheses partly reduce to assuming / is
absolutely continuous in t.
The next theorem allows an improvement of the Fisk-Stratonovich change
of variables formula given in Chap. II (Theorem 34).
Theorem 19. Let X = (X1,... ,Xd) be a vector of semimartingales and let
f : R+ x п x Rd -> R be such that
(i) there exists an adapted FV process A and a function g such that
f(t,u,x) = / g(s,u>,x)dAs
Jo
where (s,oj) >—> <?(s,w,x) is an adapted and jointly measurable process for
each x.
(ii) for each compact set K, JQ supxeK |#(s,w,x)||cL4s| < oo a.s.
(Hi) fxi exists and is continuous in x and fXi(t,u,x) = JQ gXi(s,co,x)dAs.
Let
Yt = f(t,oj,X\...,Xd).
Then Y is an adapted process of finite quadratic variation, and moreover
г j 0<s<t
Proof. First assume that for fixed (t, w) the function / and all its first partials
are bounded functions of x. Then by optional stopping at times T— we can
assume without loss of generality that / and all its first partials (in x) are
bounded functions. Let fu be a sequence of functions on R+ x п х Rd which are
C2 on M.d such that fk and its first partials converge uniformly on M.d respec-
respectively to / and its corresponding first partials. Moreover we can take all the
fk bounded and Lipschitz continuous with Lipschitz constant c, independent
of к. For simplicity take d = 1. Let an be a sequence of random partitions
tending to the identity. We write, for a process Z,
where
an = {0 = Ton < T\n < • • • < 2?}.
Since fk and their first partials converge to / and its first partials uni-
uniformly, we have fk - fe is Lipschitz with constant eke, which tends to 0 as к, ?
tend to oo. Therefore, letting r denote an for a given n and writing fk(X) for
fk{t,u,Xt), we have

276 V Stochastic Differential Equations
\ST(fk(X)) - ST(fe(X))\
< sT((fk - h){X)) + 2{sT(fk(X))sT((fk - h){X))Y'2
2ceM)ST{X)
by the Lipschitz properties. Since fk{X) and fe(X) are semimartingales, we
know (by Theorem 22 of Chap. II) that San(fk(X)) and San(fi(X)) converge
in ucp respectively to [fk(X),fk(X)} and [fe(X),fe(X)]. By restricting our
attention to an interval (s,t), we then have from (*) that
\{[fk(X)Jk(X)]t - lfk(X),fk(X)]s} - {\fe(X),fe(X)]t - [ft{X)Jt(X)]a}\
2ceke){[X,X}t-[X,X}s}.
Since this is true for all 0 < s < t < oo, we deduce that
f \d[fk(X)Jk(X)}s - d[fe(X),fe(X)]a\ < {e2u + 2ceu)[X,X\t.
Jo
Therefore d[fk(X),fk(X)] is a Cauchy sequence of random measures, con-
converging in total variation norm on @,oo), a.s. In addition, by Ito's formula
(Theorem 18) and Theorem 29 of Chap. II we have that
from Theorem 23 of Chap. II we also know that
Therefore
[fk(X),fk(X)}t= [^±(з-,о;,
°X 0<s<t
and since we have a.s. convergence in total variation norm, we can pass to the
limit to obtain
Vt= [()[];
^ o<s<t
Our identification of the limit removes the dependence on the representation
of Y by /, because
t - Vt\ < \SaM(X))t - SaMk(X))t\ + \Vt - [fk(X),fk(X)]
+ \saMk(X))t-[fk(X),fk(X)]t\,
and by taking к large enough the first two terms on the right side can be taken
small in probability independently of n; one then takes n large enough in the

5 Fisk-Stratonovich Integrals and Differential Equations 277
third term to make it small in probability, and we have limn_,oo Scrn (f(X))t =
Vt, in probability. Therefore Vt = [У,^]< and this completes the proof for /
and its first partials bounded functions of x, and d = 1. The proof for general
d is exactly analogous. For general / satisfying the hypotheses, let gm : Md —>
Md be C°° such that gm(x) — x if |x| < m, and gm has compact support.
Defining Ytm = f(t,uj,gm(Xt)), let T"» = inf{t > 0 : Yt{tj) ф У/7»)-
Then Tm increase to oo a.s. Also, since the quadratic variation is a path
property, [Ym,Ym]t(uj) = [Y,Y]t(w), for all t < Tm(w). Therefore Y is of
finite quadratic variation, and since by the preceding (for d = 1)
= f* ^-(s-,LJ,gm(Xs-))g>m(Xs-)d[X,X}cs+ ? Af(s,w,gm(Xs)J
J° 0<s<t
which, on [0,Tm), is equal to
= f^{s-,u>,Xs-)d[X,X]cs+ ? Af(s,XsJ;
J° 0<s<t
whence the result. D
Note that the subspace of processes Y that have a representation Y =
f(Xl,... ,Xd), where X = (X1,... ,Xd) is a finite-dimensional semimartin-
gale (d is not fixed) and / € C1, is a vector subspace of the space of processes
of finite quadratic variation. As an immediate consequence of Theorem 19 we
have the following extension of Theorem 34 of Chap. II.
Theorem 20. Let X be a semimartingale and let f be C2. Then
f(Xt)-f(X0)= f f'(Xt.)odXa+ J2 {f(Xs)-f(Xs-)-f'(Xs_)AXs}.
J0+ n^.^^
0<s<t
Proof. Note that /' is C1, so that f'(X) is in the domain of definition of the
F-S integral by Theorem 19. Also by definition we have
f f(
/
By Theorem 19 and polarization we know that
[f{X),Xft= f f\
Jo
and thus the result follows by Ito's formula (Theorem 32, Chap. II). D
Theorem 20 has an obvious generalization to the multi-dimensional case.
We omit the proof which is an analogous consequence of Theorem 33 of
Chap. II.

278 V Stochastic Differential Equations
Theorem 21. Let X = (X1,... ,Xn) be an n-tuple of semimartingales, and
let f : Mn —> R have second order continuous partial derivatives. Then /(X)
is a semimartingale and the following formula holds:
/(Xt)
As a corollary of Theorem 21, we have the Stratonovich integration by
parts formula.
Corollary 1 (Stratonovich Integration by Parts Theorem). Let X and
Y be semimartingales. Then
XtYt-X0Y0= f Xs^odYs+ f Ys_odXs+ V AXSAYS.
Jo+ Jo+ o<s<t
Proof. Let f(x, y) = xy and apply Theorem 21. D
Corollary 2. Let X and Y be semimartingales, with at least one of X or Y
continuous. Then
XtYt - X0Y0 = f *5_ odYs + f Ys_odXs.
Jo+ Jo+
Recall that since Xo- = 0 by convention for a cadlag process X, we did
not really need to write Jo, ; the formula also holds for Jo.
For stochastic differential equations with Fisk-Stratonovich differentials we
are limited as to the coefficients we can consider, because the integrands must
be of finite quadratic variation. Nevertheless we can still obtain reasonably
general results. We first describe the coefficients.
Definition. A function / : M+ x Cl x Rd —» M is said to be Fisk-Stratonovich
acceptable if
(i) there exists an adapted FV process A and a function g such that
f(t,w,x) = / g(s,uj,x)dAs
Jo
where (s,w) i—» g(s, lj,x) is an adapted, jointly measurable process for
each x;
(ii) for each compact set K, Jo supxeA: |<?(s,w,x)||cL4.s| < oo a.s.;
(iii) fXi is C1 and
fXl(t,w,x)= / gXi(s,w,x)dAs;
Jo

5 Fisk-Stratonovich Integrals and Differential Equations 279
(iv) for each fixed (t, ш), the functions x i-> f(t, ш, х) and хи (J? •/)(*, u>, x)
are all Lipschitz continuous with Lipschitz constant K(w), К < oo a.s.
A < i < d).A
We will often write "F-S acceptable" in place of "Fisk-Stratonovich ac-
acceptable."
Theorem 22. Given a vector of semimartingales Z = (Z1,... ,Zk), semi-
martingales Jl A < г < d), and F-S acceptable functions /j A < i < d, 1 <
j < k), then the system of equations
x; = Jt + Y] f) (s-, w, xs_) о ^ (*)
has a unique semimartingale solution. Moreover the solution X of (*) is also
the (unique) solution of
T~> Jo
s
E [\-l§;-f™)(s-,u>,Ks-)dlZ\Ziys (**)
j,n=l ro=l ^°
Proof. We note that equation (**) has a unique solution as a trivial conse-
consequence of Theorem 7. Since X is a d-dimensional semimartingale, we know
that fj(s—, ¦, Xs_) is in the domain of definition of the F-S integral by The-
Theorem 19. Further, as a consequence of Theorem 19, we have that5
d /-t
/j(.,u,,X.),Z']tc= E f ^i{s^,
and the equivalence of (*) and (**) follows. Therefore the existence of a unique
semimartingale solution of (**) is equivalent to the existence of a unique
semimartingale solution of (*), and we are done. Note that if Jm is of finite
variation, the terms involving [Jm, Zi]c disappear. ?
4 By (-gjj -f)(t,w,x.) we mean the usual product of functions J^-(t,o;,x) -f(t,w,x.).
5 Since we are taking the continuous parts of the quadratic variations, we need not
write f(s— ,ш,Ха-), etc.

280 V Stochastic Differential Equations
In Chap. II we studied the stochastic exponential of a semimartingale. The
F-S integral allows us to give a version that has a more natural appearance.
Theorem 23. Let Z be a semimartingale, Zo = 0. The unique solution of
the equation
Xt = Xo + / Xs- о dZs
Jo
is given by
Xt = Xoexp{Zt} Ц A + AZs)e~AZ>,
0<s<t
and it is called the Fisk-Stratonovich exponential, Xq?fs{Z).
Proof. By Theorem 22 the equation above is equivalent to
Xt=X0+ f Xs^dZs + \ f Xs_d{Z, Z\%
Jo z Jo
xs_d(zs + hz,z\cs).
Jo z
Therefore
[Z,Zf)
= ?{Z)?{\[Z,Z\C)
where the second equality is by Theorem 38 of Chap. II. Using the formula for
the stochastic exponential established in Theorem 37 of Chap. II, the result
follows. ?
The most interesting case is when the semimartingale is continuous.
Corollary 1. Let Z be a continuous semimartingale, Z$ = 0. Then the unique
solution, ?-p.$(Z), of the exponential equation
Xt = 1 + / X, о dZ,
/
Jo
is given by ?F.s(Z)t = exp{Zt}.
Corollary 2. Let В be a Brownian motion with Bq = 0. Then the unique
solution of
rt
Xs о dД,
/o
is given by Xt = Xo exp{i?t}.
f
Jo

5 Fisk-Stratonovich Integrals and Differential Equations 281
The simplicity gained by using the F-S integral can be surprisingly helpful.
As an example let Z = (Z1,..., Zn) be an n-dimensional continuous semi-
martingale and let x = (x1,..., xn)' be a column vector in Rn. (Here' denotes
transpose.) Let / be the identity n x n matrix and define
(The matrix a(x) represents projection onto the hyperplane normal to x.)
Note that x'a(x) = 0. We want to study the system of F-S differential equa-
equations
Xt = x0 + / a(Xs)odZs, \<i<n. (* * *)
Jo
where X.= (X1,...,Xn)' and Z = (Z\...,Zn)'.
Theorem 24. The solution X of equation (* * *) above exists, is unique, and
it always stays on the sphere of center 0 and radius ||xo||.
Proof. Since a is singular at the origin, we need to modify it slightly. Let g(x)
be a C°° function equal to a(x) outside of a ball centered at the origin, No,
such that x0 ф No- Let Y be the solution of
Yt=x0+ / g(Ys)odZs.
Then Y exists and is unique by Theorem 22. Let T = inf{? > 0 : Yt G No}-
Since Yq = Xo and Y is continuous, P(T > 0) = 1. However for s < T,
g(Ys) = a(Ys). Therefore it suffices to show that the function Ы ||Yt|| is
constant, since ||Yo|| = ||xo||. This would imply that Y always stays on the
sphere of center 0 and radius ||xo||, and hence P(T — oo) = 1 and g(Y) =
a(Y) always. To this end, let /(x) = Х)"=1(ж4J, where x = (ж1,... ,xn)'.
Then it suffices to show that df(Yt) =0,(t> 0). Note that / is C2. We have
that for t < T,
df(Yt) = 2Y'todY = 2Y; • a(Yt) о dZt = 0.
Therefore ||У^|| = ||жо|| for t < T and thus by continuity T = oo a.s. ?
Corollary. Let В = (В1,... ,Bn)' be n-dimensional Brownian motion, let
a(x) — I — f^p, and let X be the solution of
Xt =xo+ / a(Xs)odB3.
Jo
Then X is a Brownian motion on the sphere of radius ||xo||.

282 V Stochastic Differential Equations
Proof. In Theorem 36 of Sect. 6 we show that the solution X is a diffusion. By
Theorem 24 we know that X always stays on the sphere of center 0 and radius
||xo||. It thus remains to show only that X is a rotation invariant diffusion,
since this characterizes Brownian motion on a sphere. Let U be an orthogonal
matrix. Then UB is again a Brownian motion, and thus it suffices to show
d(UX) = a(UX)od(UB).
The above equation shows that UX is statistically the same diffusion as is X,
and hence X is rotation invariant. The coefficient a satisfies
a{Ux) = Ua(x)U',
and therefore
d(UX) = UodX = Ua(X) odB = Ua(X)U'Uо<ffi
= a{UX)od{UB),
and we are done. ?
The F-S integral can also be used to derive explicit formulas for solutions
of stochastic differential equations in terms of solutions of (non-random) or-
ordinary differential equations. As an example, consider the equation
Xt=x0+ [ f(Xs) о dZs + [ g(Xs)ds, (*4)
Jo Jo
where Z is a continuous semimartingale, Zo = 0. (Note that Jo g(Xs) о ds =
Jo g(Xs)ds, so we have not included Ito's circle in this term.) Assume that /
is C2 and that /, g, and //' are all Lipschitz continuous. Let и — u(x, z) be
the unique solution of
du
-K-(x,z) = f(u(x,z))
u(x,0) — x.
Then J^ = f'(u(x,z))%±, and §?(ж,0) = 1, from which we conclude that
д-(ж, z) = exp{ / f'(u(x, v))dv}.
Ox Jo
Let Y = {Yt)t>o be the solution of
ft fZ.
Yt = x0+ exp{- / f(u(Ys, v))dv}g(u(Ys, Zs))ds
Jo Jo
g(u(x Zs))
which we assume exists. For example if ^ц ' — is Lipschitz, this would
suffice.

5 Fisk-Stratonovich Integrals and Differential Equations 283
Theorem 25. With the notation and hypotheses given above, the solution X
of (*4) is given by
Xt=u(Yt,Zt).
Proof. Using the F-S calculus we have
u(Yt,Zt) = u(xo,O) + f ^{Ys,Zs)odYs + f ^-{Ys,Zs)odZs
ft fZ, rt
= xo+ exp{/ f'(u(Ys,v))dv}odYs+ f{u{Ys,Zs))odZs.
Jo Jo Jo
Since
we deduce
dY» rZ
= exp{-/ f'(u(Ys,v))dv}g(u(Ys,Zs)),
as Jo
u(YuZt) = x0+ [ g(u(Ys,Zs))ds+ [ f(u(Ys,Zs)) odZs.
Jo Jo
By the uniqueness of the solution, we conclude that Xt = u(Yt, Zt). D
We consider the special case of a simple Stratonovich equation driven by
a (one dimensional) Brownian motion B. As a corollary of Theorem 25 we
obtain that the simple Stratonovich equation
Xt=x0+ [ f(Xs)odBs
Jo
has a solution Xt — h~l{Bt + /i(Xo)), where
-^r-rds + С
The corresponding Ito stochastic differential equation is
dXt = l-f{Xt)f{Xt)dt + f(Xt)dBt.
By explicitly solving the analogous ordinary differential equation (without
using any probability theory) and composing it, we can obtain examples of
stochastic differential equations with explicit solutions. This can be useful
when testing simulations and numerical solution procedures. We give a few
examples.
Example. The equation
dXt = -\a2Xtdt + aJl -
has solution Xt = sin(aBt + arcsin(X0)).

284 V Stochastic Differential Equations
Example. The following equation has only locally Lipschitz coefficients and
thus can have explosions: for m ф 1,
\
dXt = -a2mX;m~xdt + aXmdBt
has solution Xt = (X^~m - a(m -
Example. Our last example can also have explosions:
dXt = Xt(l + X*)dt + A + X*)dBt
has the solution Xt — tan(? + Bt + arctan(Xo)).
The Fisk-Stratonovich integrals also have an interpretation as limits of
sums, as Theorems 26 through 29 illustrate. These theorems are then useful
in turn for approximating solutions of stochastic differential equations.
Theorem 26. Let H be cadlag, adapted, and let X be a semimartingale.
Assume [H,X] exists. Let an = {0 = To™ < Tf < • ¦ • < T?n} be a sequence
of random partitions tending to the identity. If H and X have no jumps in
common (i.e., Ylo<s<t AHSAXS = 0, all t > 0), then
equals the F-S integral /0 i?5_ о dXs in ucp.
Proof. It follows easily from the definition of [H, X] at the beginning of this
section that A[H,X]t = AHtAXt and lim^oo Е(ят^+1 - HTn){XT?+* -
XT?) = [H,X] — HqXo- Thus if H and X have no jumps in common we
conclude [H,X] = [H,X]C +H0X0. Observing that
the result follows from Theorems 21 and 22 of Chap. II. ?
Corollary 1. If either H or X in Theorem 21 is continuous, then
lim Y\{HTn+HTn ){XT^ -XT?) = [ #5_ odXs,
where Ho- = 0, in ucp.

5 Fisk-Stratonovich Integrals and Differential Equations 285
Corollary 2. Let X and Y be continuous semimartingales, and let an =
{0 = Ttf < T™ < • ¦ • < Tg } be a sequence of random partitions tending to
the identity. Then
,- odX
s,
with convergence in ucp.
Proof. By Theorem 22 of Chap. II, any semimartingale Y has finite quadratic
variation. Thus Corollary 2 is a special case of Theorem 26 (and of Corol-
Corollary 1). ?
Theorem 27. Let H be cadlag, adapted, of finite quadratic variation, and
suppose J2o<s<t |Ajffs| < oo a.s., each t > 0. Let X be a semimartingale,
and let an = ¦pTlo<i<fcn be a sequence of random partitions tending to the
identity. Assume [H, X] exists. Then
f
In
и
lim {T \(HT~ +#t- - T AHS)(XT^ -XT")} = f tfs_ odXs,
г р+1
with convergence in ucp.
Proof. First observe that Ht = Ht — Ylo<s<t AHS defines a continuous process
of finite quadratic variation and that [H, X] = [H, X]c + H0X0. Next we note
that
\{HTr + HTr+i - Y ^ 52
Therefore by Theorems 21 and 22 of Chap. II, or directly from the definition
as in the proof of Theorem 26, we have
= lim > Hxn(X г+1 — X г )+ lim -
n—>oo -^—J г n—>oo Z
i
o+ z
= / Я5_ оdX5 + I[H, X}c = / Я5_ о dXs, where Яо_ = Яо,
Jo * Jo
and the result follows. D

286 V Stochastic Differential Equations
For the general case we have a more complicated result. Let H be a cadlag,
adapted process of finite quadratic variation. For each e > 0 we define
0<s<t
Then JE is also a cadlag, adapted process, and it has paths of finite variation
on compacts.
Theorem 28. Let H be cadlag, adapted, of finite quadratic variation, and let
X be a semimartingale. Assume [H, X] exists. Let an = {TJ1} be a sequence
of random partitions tending to the identity. Let JE be as defined above. Then
Jo
with convergence in ucp.
Proof. Let HE = H — JE. Then the jumps of HE are bounded by s. Moreover,
Therefore the first limit at t > 0 is
f
Hs-dXs+\[H,X]ct+l- f
2 2t? Jo 2o<s<t
and letting e tend to 0 gives the result. That is, since
\AHESAXS\ < [Н*,НЕI/2[Х,Х}\/2 < со,
and since [HE,H% = Eo<s<t(AHsJ, we have lim^o Eo<5<t |АЯ|ДХв| -
0. П
The next result is useful, for example, for approximating integrals of the
form fQ f(Bs) °dBs, where В is a Brownian motion. It can be generalized to
Y a continuous, adapted process of finite quadratic variation, but we do not
do so here.
Theorem 29. Let X be a semimartingale and Y a continuous semimartin-
semimartingale, and let f beC1. Let ц be a probability measure on [0,1] let a = J Xfi(dX),
and let an = {T™}i>o be a sequence of random partitions tending to the iden-
identity. Then

5 Fisk-Stratonovich Integrals and Differential Equations 287
lim V / f(YTr.
« f f'(Y3)d[Y,X}3,
Jo+
0+ J0+
with convergence in ucp. In particular if a = 1/2 then the limit is the F-S
integral /0+ f(Y3) о dX3.
Proof. We begin by observing that
~\,-XT")
Г
jo
V
f
The first sum on the right side of the above equation tends to /0+ f(Ys-)dX3
in ucp. Using the Fundamental Theorem of Calculus, the second sum on the
right above equals
J2 f MdA) / ds\f'{YTr + Xs(YTr+1 -Y^))(YTn -Г
i Jo Jo
which in turn equals
\(YTr+1 - YTn)(XT^ - XT")|,
i
where
Fan(aj) < 8ир{|/'(Утг + Xs(YTr+i - уг„)) - f(YTr)\}.
Since /' and Y are continuous, FOn (w) tends to 0 on compact time intervals.
Also, on [0,t],
-XT")\ < [F,F]t1/2[X,X]t1/2,
and the result follows by Theorem 30 of Chap. II. G
Corollary. Let X be a semimartingale, К be a continuous semimartingale,
and /gC1. Let an = {1Т}о<1<кп be a sequence of random partitions tending
to the identity. Then

288 V Stochastic Differential Equations
with convergence in ucp.
Proof. Let n(d\) = ?{1/2} (dX), point mass at 1/2. Then a = /0 Xfi(dX) = 1/2,
and we need only apply Theorem 29. D
For Brownian motion, the Fisk-Stratonovich integral is sometimes defined
as a limit of the form
l\mTf(BiJ±h±1)(Bti+1 - Bti) = j f(Bs)odBs;
2 Jo
that is, the sampling times are averaged. Such an approximation does not hold
in general even for continuous semimartingales (see Yor [238, page 524]), but
it does hold with a supplementary hypothesis on the quadratic covariation, as
Theorem 30 reveals.
Theorem 30. Let X be a semimartingale and Y be a continuous semimartin-
gale. Let ц be a probability measure on [0,1] and let a = J X/j,(dX). Further
suppose that \X, Y]t = Jo Jsds; that is, the paths of [X, Y] are absolutely con-
continuous. Let an = {i™} be a sequence of non-random partitions tending to the
identity. Let f be C1. Then
/
f(Ys)dXs+a f f'(Ys)d[Y,X}s,
o+ Jo+
with convergence in ucp. In particular if a = 1/2 then the limit is the F-S
integral /0+ f(Ys) о dXs.
Proof. We begin with a real analysis result. We let t± denote i, + A(*i+i - ti),
where the ti are understood to be in an. Suppose a is continuous on [0, t].
Then
> / a(s)ds — > a(ti)X(ti+i — ti) < > /
iJu V i Ju
which tends to 0. Therefore
lim У^ / a(s)ds = X a(s)ds. (*)
»-»Y^ Jo
Moreover since continuous functions are dense in L1([0,t},ds), the limiting
result (*) holds for all a in L1([0,t},ds).

5 Fisk-Stratonovich Integrals and Differential Equations 289
Next suppose Я is a continuous, adapted process, and set
H 'n =
Then taking limits in ucp we have
limУ^Ни{[Х,У]г" - [X,Y}U} = lim
71 n
o
and using integration by parts, this equals
lim{#A'n ¦ (XY) - (HX'nX-) ¦ Y - (Hx'nY) ¦ X}.
71
By Theorem 21 of Chap. II, this equals
However since [X, Y]t = /0 Jsds, by the result (*) we conclude
u{[X,Y]ti - [X,Y]*<} (**)
= A / Hsd[X,Y]s.
Jo
We now turn our attention to the statement of the theorem. Using the
Mean Value Theorem we have
» - X*') (* * *)
U)+Yl f\(d\)f'(Yti)(Yt} -Yti
i Jo
The first sum tends in ucp to / f(Ys)dXs by Theorem 21 of Chap. II. The
second sum on the right side of (* * *) can be written as

290 V Stochastic Differential Equations
The first sum above converges to
/ /x(dA)A / f'(Y3)d[Y,X]s
Jo Jo
by (**)) and the second sum can be written as
ту\.п v"
Л • Л,
where
Then limn KX'n ¦ X converges to 0 locally in ucp by the Dominated Conver-
Convergence Theorem (Theorem 32 of Chap. IV).
Finally consider the third sum on the right side of (***). Let
Fn(oj) = sup sup \f(Yti + a(Ytx - Yu)) - f(Yu)\.
ti€crn s€[0,l]
Then
r-l
-X
= 0
since lim Fn = 0 a.s. and the summations stay bounded in probability. This
completes the proof. Since Y is continuous, [У, X] = [Y, X]c, whence if а = 1/2
we obtain the Fisk-Stratonovich integral. D
Corollary. Let X be a semimartingale, Y a continuous semimartingale, and
/ be C1. Let [X, Y] be absolutely continuous and let on = {?"} be a sequence
of non-random partitions tending to the identity. Then
lim "?f(yh±h±i)(XU+1-XU)= I f(Ys)°dXs
with convergence in ucp.

6 The Markov Nature of Solutions 291
Proof. Let /j.(dX) = ?{1/2}(dX), point mass at 1/2. Then / A/x(rfA) = 1/2, and
apply Theorem 30. ?
Note that if Y is a continuous semimartingale and В is standard Brownian
motion, then [У, В] is absolutely continuous as a consequence of the Kunita-
Watanabe inequality. Therefore, if / is C1 and un are partitions of [0, t], then
lim Y.f(Yu+u+1)(Bti+, -Bti)= f f(Ys)odBs,
with convergence in probability.
6 The Markov Nature of Solutions
One of the original motivations for the development of the stochastic integral
was to study continuous strong Markov processes (that is, diffusions), as
solutions of stochastic differential equations. Let В = {Bt)t>o be a standard
Brownian motion in Mn. K. Ito studied systems of differential equations of the
form
Xt =X0+ / f{s,Xs)dBs+ [ g{s,Xs)ds,
Jo Jo
and under appropriate hypotheses on the coefficients /, g he showed that a
unique continuous solution exists and that it is strong Markov.
Today we have semimartingale differentials, and it is therefore natural to
replace dB and ds with general semimartingales and to study any resulting
Markovian nature of the solution. If we insist that the solution itself be Markov
then the semimartingale differentials should have independent increments (see
Theorem 32); but if we need only to relate the solution to a Markov process,
then more general results are available.
For convenience we recall here the Markov property of a stochastic pro-
process which we have already treated in Chap. I. Assume as given a filtered
probability space (fi,.F, {J~t)t>o,P) satisfying the usual hypotheses.6
Definition. A process Z with values in Md and adapted to F = (.Ft)t>o is a
simple Markov process with respect to F if for each t > 0 the cr-fields J~t
and a{Zu;u > t} are conditionally independent given Zt.
Thus one can think of the Markov property as a weakening of the property
of independent increments. It is easy to see that the simple Markov property
is equivalent to the following. For и > t and for every / bounded, Borel
measurable,
E{f(Zu)\Ft}=E{f(Zu)\a{Zt}}. (*)
One thinks of this as "the best prediction of the future given the past and the
present is the same as the best prediction of the future given the present."
See Chap. I, Sect. 1 for a definition of the "usual hypotheses" (page 3).

292 V Stochastic Differential Equations
Using the equivalent relation (*), one can define a transition function
for a Markov process as follows, for s < t and / bounded, Borel measurable,
let
Note that if/(x) = 1a(x), the indicator function of a set A, then the preceding
equality reduces to
Identifying 1^ with A, we often write Ps<t(Zs,A) on the right side above.
When we speak of a Markov process without specifying the filtration of o-
algebras (^rt)t>o, we mean implicitly that T^ = o{Zs;s < t}, the natural
filtration generated by the process.
It often happens that the transition function satisfies the relationship
Ps,t = Pt-s
for t > s. In this case we say the Markov process is time homogeneous, and
the transition functions are a semigroup of operators, known as the transition
semigroup (Pt)t>o- In the time homogeneous case, the Markov property
becomes
A stronger requirement that is often satisfied is that the Markov property
hold for stopping times.
Definition. A time homogeneous simple Markov process is strong Markov
if for any stopping time T with P(T < oo) = 1, s > 0,
P(ZT+aeA\FT) = Pa(ZT,A)
or equivalently
E{f(ZT+s)\FT} = Ps(ZT,f),
for any bounded, Borel measurable function /.
The fact that we defined the strong Markov property only for time ho-
homogeneous processes is not much of a restriction, since if X is an M.d-valued
simple Markov process, then it is easy to see that the process Zt = (Xt,t) is
an Md+1-valued time homogeneous simple Markov process.
Examples of strong Markov processes (with respect to their natural fil-
trations of cr-algebras) are Brownian motion, the Poisson process, and indeed
any Levy process by Theorem 32 of Chap. I. The results of this section will
give many more examples as the solutions of stochastic differential equations.
Since we have defined strong Markov processes for time homogeneous pro-
processes only, it is convenient to take the coefficients of our equations to be
autonomous. We could let them be non-autonomous, however, and then with
an extra argument we can conclude that if X is the solution then the process
Yt = (Xt, t) is strong Markov.
We recall a definition from Sect. 3 of this chapter.

6 The Markov Nature of Solutions 293
Definition. A function / : M+ x Mn —> M is said to be Lipschitz if there
exists a finite constant к such that
(i) \f(t,x)-f(t,y)\ <k\x-y\,eachteR+, and
(ii) t •—> f(t, x) is right continuous with left limits, each x G M".
/ is said to be autonomous if f(t,x) = f(x).
In order to allow arbitrary initial conditions, we need (in general) a larger
probability space than the one on which Z is defined. We therefore define
ft = Mn x U
V = ey x P
where В denotes the Borel sets of Mn, and ey denotes the Dirac point mass
measure at y. For a point w = (y, w) ? Q, we further define
Xq(u) = y, when w = (y,w). (*)
Finally let J-t = (")«>* 3~и- A random variable Z defined on $7 is considered to
be extended automatically to $7 by the rule Z(w) = Z{uj), when To = {y,w).
We begin with a measurability result which is an easy consequence of
Sect. 3 of Chap. IV.
Theorem 31. Let Z^ be semimartingales A < j> < d), Hx a vector of adapted
processes in D for each x € Шп, and suppose (x,t,u>) н-» Щ{и>) is В® B+ ® J-
measurable.7 Let Fj be functional Lipschitz and for each x G M", Xх is the
unique solution of
\At ) = \Щ ) +
3 = 1
There exists a version of Xх such that (x,t,oj) ь-> Xх(u>) is В ® B+ ® J7
measurable, and for each x, Xх is a cadlag solution of the equation.
Proof. Let X°(x, t,ui) = Hx{u>) and define inductively
Xn+I(x,t,u>y = Hx + J2 I Fi(Xn(x,-,.))s-dZi.
i=i^
The integrands above are in L, hence by Theorem 63 in Chap. IV there exists
measurable, cadlag versions of the stochastic integrals. By Theorem 8 the
processes X" converge ucp to the solution X for each x. Then an application
of Theorem 62 of Chap. IV yields the result. ?
7 В denotes the Borel sets on Rn; B+ the Borel sets on

294 V Stochastic Differential Equations
We state and prove the next theorem for one equation. An analogous result
(with a perfectly analogous proof) holds for finite systems of equations.
Theorem 32. Let Z = (Z1,..., Zd) be a vector of independent Levy pro-
processes, Zo = 0, and let (/j) 1 < j < d, 1 < i < n, be Lipschitz functions. Let
Xq be as in (*) and let X be the solution of
J2 fj(s~,Xs_)dZi
(**)
P
Then X is a Markov process, under each P and X is strong Markov if the
f!j are autonomous.
Proof We treat only the case n = 1. Let T be an F stopping time, T < oo a.s.
Define QT = a{Z3T+u - Z3T; и > 0,1 < j < d}. Then QT is independent of TT
under P , since the ZJ are Levy processes, as a consequence of Theorem 32
of Chap. I. Choose a stopping time T < oo a.s. and let it be fixed. For и > 0
define inductively
Y°(x,T,u)=x,
d j-T+u
x + Y, /
Also, let X(x,T,u) denote the unique solution of
d /-Т+И
^2 /
d /-Т+И
^2 / fj(v~,X(x,T,v-))dZi,
i=i Jt
taking the jointly measurable version (cf., Theorem 31). By Theorem 8 we
know that X(x, T, u) is QT measurable. By approximating the stochastic
integral as a limit of sums, we see by induction that Yn(x,T, u) is QT
measurable as well. Under P we have X(Xo,T,u) = X(x,T,u) a.s., and
Yn(Xo,T,u) = Yn(x,T,u) Px-a.s., also. By uniqueness of solutions and us-
using Theorem 31, for all и > 0 a.s.
X(X0,0,T + u) = X(X(X0,0,T),T,u).
There is no problem with sets of probability zero, due to (for example) the
continuity of the flows. (See Theorem 37.) Writing Ex to denote expectation
on ft with respect to P , and using the independence of Тт and QT (as well
as of J~t and GT), we have for any bounded, Borel function h
Ex{h(X(X0,0, T + u))\TT) = E{h(X(x, 0, T +
= E{h(X(X(x,0,T),T,u))}l&

6 The Markov Nature of Solutions 295
where j(y) = E{h(X(y,T, «)}. The last equality follows from the elementary
fact that E{F(H, -)|W} = /(#), where f(h) = E{F(h, •)}, if-F is independent
of H and H is H measurable. This completes the proof, since the fact that
Ex{h(X(X0,0, T + u))\TT} is a function only of X(x,0, T) implies that
Ex{h(X(X0,0, T + v))\TT} = Ex{h(X(X0,0, T + и))\Х(Х0,0, Т)}. П
It is interesting to note that Theorem 32 remains true with Fisk-Straton-
ovich differentials. To see this we need a preliminary result.
Theorem 33. Let Z = (Zl,...,Zd) be a vector of independent Levy pro-
processes, Zo = 0. Then [Z\Zj]c =Oifi^j, and \Z\Z% = at, where
Proof First assume that the jumps of each Zl are bounded. Then the mo-
moments of Z% of all orders exist (Theorem 34 of Chap. I), and in particular
M\ = Z\ - E{Zi) is an L2 martingale for each i, with E{Z\~) = tE{Z\]. By
independence M%M.i is also a martingale and hence [Мг, ЛР] = 0 by Corol-
Corollary 2 of Theorem 27 of Chap. II. Therefore [Z\Zj]ct = [M\Mj)ct = 0 as
well.
Next consider A\ = [Zl, Zl]t = [M\M%]t. It is an immediate consequence
of approximation by sums (Theorem 22 of Chap. II) that A1 also has inde-
independent increments. Since
and the process J\ = J2o<s<t(^^lJ a^so dearly has independent incre-
increments, we deduce that [Мг,Мг]$ has independent increments as well. There-
Therefore [Мг,Мг]^ — ?{[Ml,M']4c} is a finite variation continuous martingale,
hence constant by Theorem 27 of Chap. II. Since [Ml, Мг]$ = 0, we deduce
that \1\г% = \М\М% = Е{[М\М1)$}. Since \M\M% is also a Levy
process we have E{[M%,Ml]t} = ?.Е{[М*,М*]],} (by the stationarity of the
increments), and also
E{ ? (AMiJ} = E{( ? AMtJ} = tE{ ? (AMIJ}
0<s<t 0<s<t 0<s<1
by the same reasoning. Therefore \Z\Z% = ЬЕ{[г*,г*]$} by subtraction.
If the jumps of Z are not bounded, let J\ = ^20<s<t AZll{\&.zi\>i}- Then
Zl = Zl — J% and Jl are independent Levy processes, and Jl is a quadratic
pure jump semimartingale. It therefore follows that [Z*,ZJ] = [Zl,Z^\ = 0
and that [Z\ Zl]c = [Z\ Z1}0, and the theorem is proved. ?
Theorem 34. Let Z = (Zl,...,Zci) be a vector of independent Levy pro-
processes, Zo = 0, and let (/]) 1 < j < d, 1 < i < n, be non-random F-S accept-
acceptable functions.8 Let X$ be as in Theorem 32 and X be the solution of
8 F-S acceptable functions are defined on page 278.

296 V Stochastic Differential Equations
d
3=1 Jo
Then X is a Markov process and X is strong Markov if the coefficients /j are
autonomous.
Proof. We treat only the case n = 1. By Theorem 33 [Z\ Zj\$ = 0 if i ф j and
\Z%,Z%\'i = a^, ai > 0. Therefore, by Theorem 22 the equation is equivalent
to
J2 f Ы*-,Х-№3. + \ Z> / ф ¦ fj)(s-,
j=1Jo * j=1 Jo ox
X.-)ds,
and since the process Yt = t is a Levy process we need only to invoke Theo-
Theorem 32 to complete the proof. ?
If the differentials are not Levy processes but only strong Markov pro-
processes which are semimartingales, then the solutions of equations such as (**)
in Theorem 32 need not be Markov processes. One does, however, have the
following result.
Theorem 35. Let Z be a strong Markov processes with values in K, Zq = 0,
such that Z is a semimartingale. Let / and g be Lipschitz functions. Let X$
be as in Theorem 32 and X be the solution of
I f(s-,X.-)dZ.+ f g(s,Xs-)ds.
Jo Jo
Then the vector process (X, Z) is Markov under P , each у € Ш, and strong
Markov if f and g are autonomous.
Proof. First recall that X is defined on $7 = К x Q and Z is automatically
extended to f2 as explained at the beginning of this section. The probability
Pv = ey x P is such that PV{Xq = y) = 1, each у е К. As in the proof
of Theorem 32 for a fixed stopping time T (T < oo a.s.) and x e M define
inductively for и > 0
Y°(x,T,u) =x
i-T+u
Yn+\x,T,u)=x+ / f(v-,Yn(x,T,v-))dZv
Jt
/
Jt
T+u
g(v-,Yn(x,T,v-))dv,
IT
and let X(x, T, u) denote the unique solution of
rT+u rT+u
X(x,T,u)=x+ f(v-,X(x,T,v-))dZv+ g{v-,X{x,T,v-))dv.
Jt Jt

6 The Markov Nature of Solutions 297
Next define QT = a{Zx+u — %т\ и > 0} for the same stopping time T. As in
the proof of Theorem 32 we see that Yn(x,T,u) is QT measurable for each
n > 1 and that X(x, T, u) is QT measurable as well. Next let h be Borel
measurable and bounded, and let G ? QT and bounded.
Observe that
E{h(XT)G\TT} = h{XT)E{G\TT}
= h(XT)E{G\ZT}
where j : R2 —> R is Borel. Since XT+U — X(XT,T,u) by the uniqueness
of the solution, the theorem now follows from the Monotone Class Theorem
(Theorem 8 of Chap. I). D
Theorem 35 can also be shown to be true with Stratonovich differentials,
but the proof is more complicated since the quadratic variation process is an
additive functional of Z, rather than a deterministic process (as is the case
when Z is a Levy process). For this type of result we refer the reader to
Cinlar-Jacod-Protter-Sharpe [34].
Each of Theorems 32, 34 and 35 can be interpreted with X having an
arbitrary initial distribution /i. Indeed for /j, a probability measure on (R, B),
define P" on fi by Р"(Л) = JRPx(A)[x(dx), for Л € В <g> Т. Then the conclu-
conclusions of the three theorems are trivially still valid for PM, and the distribution
of Xo is /i.
Traditionally the most important Markovian solutions of stochastic dif-
differential equations are diffusions. Suppose as given a space (fi, F, (Jrt)t>o, P)
satisfying the usual hypotheses.
Definition. An adapted process X with values in R™ is a diffusion9 if it has
continuous sample paths and if it satisfies the strong Markov property.
A restatement of Theorems 32 and 34 yields the following.
Theorem 36. Let В = (B1,...,Bd) be a standard Brownian motion on
M.d, Bo = 0, and let (/j) 1 < j < d, 1 < i < n, gl be autonomous Lipschitz
functions. Let Xq be as in Theorem 32 and let X be the solution of
*t = Xo + E / fAXs)dBi + f 9i{Xs)ds. (* * *)
J
E / fAXs)dBi + f
j=1jo Jo
ThenX is a diffusion. If the coefficients (fj)i<j<d, 1 < i < n, are non-random
F-S acceptable functions and if Y is the solution of
Ytl = Yj + Y, f f){Ys) о dBi + f gi{Ys)ds,
~[ Jo Jo
The definition of a diffusion is not yet standardized. We give a general definition.

298 V Stochastic Differential Equations
then Y is a diffusion.
In equation (***) of Theorem 36 the coefficients /j are called the diffusion
coefficients and the coefficients g% are called the drift coefficients.
Note that a diffusion need not be semimartingale, even though of course
the solutions of equations such as (* * *) are semimartingales. Indeed any
deterministic continuous function with paths of unbounded variation is a dif-
diffusion which is not a semimartingale. Another interesting example is provided
by Tanaka's formula. The process
rt
•t\= sign(B s)dBs + L°t
Jo
1Д
is a diffusion and it is a semimartingale, where В is standard Brownian motion
on R. Since the paths of L° are singular with respect to Lebesgue measure, and
since a semimartingale decomposition is unique for continuous processes (the
corollary of Theorem 31 of Chap. Ill), we see that \Bt\ cannot be represented
as a solution of (* * *). Another example is that of I-B^1/3 which is a time
homogeneous diffusion but not a semimartingale (Theorem 71 of Chap. IV).
An intuitive notion of a diffusion is to imagine a pollen grain floating down-
downstream in a river. The grain is subject to two forces: the current of the river
(drift), and the aggregate bombardment of the grain by the surrounding wa-
water molecules (diffusion). The coefficient f(t,x) then represents the sensitivity
of the particle at time t and place x to the diffusion forces. For example, if
part of the river water is warmer at certain times and places (due to sunlight
or industrial effluents, for example), then / might be larger. Analogously д
would be larger when the river was flowing faster due to a steeper incline.
We give three simple examples of diffusions.
Example. The stochastic exponential exp{Bt - \t} is a diffusion where
f(t,x) = x and g(t,x) = 0.
Example. Consider the simple system
Vt = V0+ f adBs + f aVsds,
Jo Jo
Xt = X0+ [ Vsds.
Jo
The process X can be used as a model of Brownian motion alternative to
Einstein's. It is called the Ornstein-Uhlenbeck Brownian motion, or simply
the Ornstein-Uhlenbeck process. Note that here the process X has paths
of finite variation and hence the process (Vt)t>o is a true velocity process for
X. Using integration by parts we can verify that
Vt = eat(V0
/¦
Jo

6 The Markov Nature of Solutions 299
is an explicit solution for V. Indeed
ft rt
e~atVt = V0+ Vs(-ae-as)ds + e~QWs
Jo Jo
f-t Pt rt
= Vo — a / V3e~a3ds + / e~asaVsds + / <
Jo Jo Jo
= Vo+
f e-a
Jo
and we are done. Since Vj) and the Brownian motion В are independent (by
construction), we see that when Vo has a Gaussian distribution then V is a
Gaussian process. If a is negative and we take a1 = —l/Ba), then V is а
stationary Gaussian process.
Example. Consider next the equation
Xt = / -ds + BU @<t< 1)
for S a standard Brownian motion.
For each to,O < to < 1, there is a solution which is a diffusion on [0, fo]. By
the uniqueness of solutions if t\ < t0 then the solution for [0, to] agrees with
the solution for [0, t\] on the interval [0, t\]. Thus we have a solution on [0,1).
If we can show that limt_>i Xt = 0, then the solution extends by continuity to
[0,1] and we will have constructed a diffusion X on [0,1] with Xo = X\ = 0,
known as the Brownian bridge.10
An application of integration by parts shows that the solution of the Brow-
Brownian bridge equation is given by
-^—dBs @<*<l).
Write Xt = f(t)Mt, where f(t) = A-t) and Mt = /„(l-s)^. To see that
limt_>i Xt = 0, we study Mt, making the change of variables t — u(l + u).
Then 0 < t < 1 corresponds to 0 < и < оо, and define
лг„ = м_^_, ди = t^l-, o<«<oo.
N is clearly a continuous Qu martingale, and moreover [N, N]u = u; hence N
is a standard Brownian motion by Levy's Theorem (Theorem 39 of Chap. II).
It is then easy to show, as a consequence of the Strong Law of Large Numbers,
that11
Nt
lim — =0 a.s.
t-»oo t
10 The Brownian bridge is also known as tied down Brownian motion, and alterna-
alternatively as pinned Brownian motion.
11 See, for example, Breiman [23, page 265].

300 V Stochastic Differential Equations
Let g(t,u>) = Nt(u>)/t so that Iimt_o5(*~1) w) = 0 a.s. Thus, limt_»o tNi/t = 0
a.s. We then have, replacing t with A — i)
t—> 1
_t) = lim(l - t)Ml/{2_t) = 0 a.s.,
and therefore limu_1(l - ы)М„ = 0 a.s., and hence limt_>i Xt = 0 a.s.
We now know that X is a continuous diffusion on [0,1], and that Xq =
Xi = 0 a.s. Also X is clearly a semimartingale on [0,1), but it is not obvious
that X is a semimartingale on [0,1]. One needs to show that the integral
Jo ?^ds < °° a-s- To see this calculate E{X?}, 0<t<l,
E{X2t) = f(tJE{M?} = f(tJE{[M,M]t}
= (l-t)
-t
By the Cauchy-Schwarz inequality
E{\Xt\}<E{X?}l/2 =
Therefore
Pl \Xs\^_ Г'ЕЦХ
ч^-г-г-
ds
s
whence Jo -j^ds < oo a.s. Therefore the solution X is a semimartingale on
[0,1].
Finally we remark that one can construct a similar Brownian bridge from
any value a to any value b, on an interval of arbitrary finite length т. Using
the interval [0, r] one has the equation
Bt, 0<t<r,
-dBs, 0<t<r,
with
solution
xt =
xt
L
= a +
= b.
Г Ь- Xg
Jo т — s

7 Plows of SDE: Continuity and Differentiability 301
7 Flows of Stochastic Differential Equations: Continuity
and Differentiability
Consider a stochastic differential equation of the form
F(X)s_dZs.
f
Jo
Obviously there is a dependence on the initial condition, and we can write the
solution in the form X(t, w, x), or Xх (u>). The study of the flow of a stochastic
differential equation is the study of the functions ф : x —» X(t,w,x) which can
be considered as mapping R™ —> R™ for (t,u>) fixed, or as mapping R" -» P",
where Vn denotes the space of cadlag functions from R+ to M™, equipped with
the topology of uniform convergence on compacts. It is important to distin-
distinguish between Vn and Dn. The former is a function space, and it is associated
in the literature with weak convergence results (see, e.g., Billingsley [17], or
Ethier-Kurtz [71], or Kurtz-Protter [136] for recent results in a semimartin-
gale context); the latter is the space of stochastic processes with cadlag paths,
and which are adapted to the underlying filtration.
Note that we have already encountered flows in Sect. 6 (cf., Theorem 31),
where we proved measurability of the solution with respect to a parameter
(which can be taken to be, of course, the initial condition). We will be inter-
interested in several properties of the flows: continuity, differentiability, injectivity,
and when the flows are diffeomorphisms of Mn.
We begin with continuity. We consider a general system of equations of
the form
XI = Щ + f F{Xx)s_dZs
Jo
(*)
where Xf and Hf are column vectors in R™, Z is a column vector of m
semimartingales with Zo = 0, and F is an n x m matrix with elements (F^).
For x fixed, for each у we have that Xt — Xf —Xf is a solution of the equation
Xt = Щ - Щ + f F(X)s-dZs, (**)
Jo
where ~F(Y) = F{XX + Y)- F(XX).
Theorem 37. Let Hx be processes in D™, and let x ь-> Hx : R™ -> Dn be
prelocally Lipschitz continuous in S_p, some p > n. Let F be an n x m matrix
of functional Lipschitz operators (F^), l<«<n, 1 < a < m.12 Then there
exists a function X(t,w,x) on R+ x fi x R™ such that
(i) for each x the process Хх(ш) = X(t, to, x) is a solution of (*), and
(ii) for almost all u>, the flow x к-> X(-,u>,x) from R" into T>n is continuous
in the topology of uniform convergence on compacts.
See page 250 for the definition of functional Lipschitz.

302 V Stochastic Differential Equations
Proof. We recall the method of proof used to show the existence and unique-
uniqueness of a solution (Theorem 7). By stopping at a fixed time to, we can
assume the Lipschitz process is just a random variable К which is finite
a.s. Then by conditioning13 we can assume without loss of generality that
this Lipschitz constant is non-random, and we call it с < оо. By replac-
replacing Щ with X? + f* F@)s^dZs, and then by replacing F with G given by
G(Y)t = F(Y)t — F@)t, we can further assume without loss of generality that
F@) = 0. Then for /3 = Cp(c), by Theorem 5 we can find an arbitrarily large
stopping time T such that ZT~ e <S(/3), and Hx is Lipschitz continuous in
S_p on [0,T). Then by Lemma 2 (preceding Theorem 7) we have that for the
solution X of (**)
\\XT~\\gP < Cp(c,Z) ||(Я* - Н»)т-\\&, for any p > 2,
and some (finite) constant Cp(c, Z). Choose p > n, and we have
E{sup Щ - ХУ\*} < Cp(c, Z)K\\x - y||p, (* * *)
s<T
due to the Lipschitz hypothesis on x *-> Hx. By Kolmogorov's Lemma (The-
(Theorem 72 of Chap. IV) we have the result on R™ x [0, T). However since T was
arbitrarily large, the result holds as well on R™ x Q x R+. ?
For the remainder of this section we will consider less general equations.
Indeed, the following will be our basic hypotheses, which we will supplement
as needed.
Hypothesis (Hi). Za are given semimartingales with Zft = 0, 1 < a < m.
Hypothesis (H2). fla : R" -> R are given functions, 1 <i <n, l<a<m,
and f(x) denotes the n x m matrix (/„(x)).
We will study the system of equations
Jo
which we also write
ft
n
Xt = x+ f(Xs_)dZs
Jo
where it is understood that Xt and x are column vectors in R™, /(Xs_) is an
n x m matrix, and Z is a column vector of m semimartingales.
To study the differentiability of the flow we will need a more general theo-
theorem on existence and uniqueness of solutions. Indeed, we wish to replace our
customary (global) Lipschitz condition with a local Lipschitz condition.
See the proofs of Theorems 7, 8, or 15 for this argument.

7 Flows of SDE: Continuity and Differentiability 303
Definition. A function / : R™ —» R is said to be locally Lipschitz if there
exists an increasing sequence of open sets Ak such that (Jfc Ak = R™ and / is
Lipschitz with a constant K^ on each Ak.
For example, if / has continuous first partial derivatives, then it is locally
Lipschitz, while if its continuous first partials are bounded, then it is Lipschitz.
If /, g, are both Lipschitz, then their product fg is locally Lipschitz. These
coefficients arise naturally in the study of Fisk-Stratonovich equations (see
Sect. 5).
Theorem 38. Let Z be as in (HI) and let the functions (/?) in (H2) be
locally Lipschitz. Then there exists a function ((x, u>) : R™ x fi —> [0, oo] such
that for each x ((x, •) is a stopping time, and there exists a unique solution of
;-)dZs (®®)
up to C(ar, •) with limsupj^^^.) ]|Xt|| = oo a.s. on {? < oo}. Moreover
x >—> ?(ar, ui) is lower semi-continuous, strictly positive, and the flow of X
is continuous on [0,?(z, •)).
Remark. Before proving the theorem we comment that for each x fixed
the stopping time T(u>) = (,(x,u>) is called an explosion time. Thus Theo-
Theorem 38 assures the existence and uniqueness of a solution up to an explosion
time; and at that time, the solution does indeed explode in the sense that
limsup^y ||Xt|| = -boo on {T < oo}. Note however that if the coefficients
(/?) in (H2) are (globally) Lipschitz, then a.s. ( = oo for all x (Theorem 7).
Proof. Let Ai be open sets increasing to R™ such that there exist (he), a
sequence of C°° functions with compact support mapping Rn to [0,1] such
that hi = 1 on A%. For each I we let Xt(t, u>, x) denote the solution (continuous
in x) of the equation
Xt =
/ ge(X..)dZa
Jo
where the matrix ge(x) is defined by h^(x)f(x).
Define stopping times, for x fixed,
Se(w,x) - inf{t > 0 : Xt(t,w,x) ? Ae or Xe(t-,w,x) <? Ae}.
By Theorem 37 the flow is uniformly continuous on compact sets, hence the
functions Si are lower semi-continuous. Then on [0,?^) we have that JQ =
Xe+i by the uniqueness of the solutions, since they both satisfy equation
). We wish to show that the relation
Хе(-,ш,х) = Xt+i(;<jj,x) on[0,Se(u),x)) (L)
holds for all ieE" simultaneously. Choose an x and let

304 V Stochastic Differential Equations
Ax = {w: Хе(-,ш,х) = Xe+i(-,uj,x) on [0,S*(w,x))}.
Then Р(ЛХ) = 1. We set A = UzeQ" ^ where Q™ denotes the rationale in
Rn. Then P(A) = 1 as well, and without loss of generality we take A — fi.
Next for arbitrary у e R", let yn -» y, with у„ е Q". Then
Se(w,y) < liminf Se(uj,yn),
n—>oo
and therefore the relation (L) holds for у as well, by the previously established
continuity.
Observe that Sf(uj,x) < Sg+i(u>,x), and let C(x,w) — supf Sg(u>,x),
for each x. Then ? is lower semi-continuous because the Sp are, and it is
strictly positive. Further, we have shown that there exists a unique func-
function X(-,u,x) on [0, ?(x,w)) which is a solution of (<E><?>), and it is equal
to X^(-,w,x) on [0,Si(uj,x)), each ? > 1. Indeed, on [0,Se(uJ,x)) we have
X(-,w,x) = Xt(-,w,x), and since X?(-,w,x) e Л^, we have ht(Xe(-,w,x)) = 1,
wherefore X% is a solution of (<8><g>) on [0,5^).
By our construction we have that X(Se(u>, x), u>, x) belongs to A\. Indeed,
letting Se(w, x) = s, then X(s—,w, x) = Xt(s—,w, x) = и ? Ag, for some value
u. Since и e Ag we have he (u) = 1, and thus X and JQ have the same jump at
s and we can conclude X(s,u>,x) = Xe(s,w,x). But Xt(s,uj,x) or Xt(s—,uj,x)
must be in A\ by right continuity and the definition of Sg; therefore X(s, w, x)
or X(s-,w,x) is in Л^. This shows that limsupt^ \\Xt\\ = ooon {С < оо}. П
It is tempting to conclude that if there are no explosions at a finite time for
each initial condition x a.s. (null set depending on x), then also C(#r) = oo
a.s. (with the same null set for all initial values x). Unfortunately this is not
true as the next example shows.
Example. Let В be a complex Brownian motion. That is, let B1 and B2
be two independent Brownian motions on R and let Bt = ВI + iB%, where
i2 — — 1. Consider the stochastic differential equation
Jo
Z2.dB,
where the initial value z is complex. This equation has a closed form solution
given by
Indeed, if we set f(x) = z(l + zx) and Zt = f(Bt), then f(Bt) = -Z?,
and since Bt = B\ + iB%, we have by Ito's formula
Zt = z- f Z2sdBs + \ f f"{Bs){d[B\B%+i2d\B2,B2}s);
Jo l Jo
since d[B1,B1]s = d[B2, B2]s = ds, we see that Zt = z - /„* Z2dBs.

7 Flows of SDE: Continuity and Differentiability 305
For z fixed we know that P{3t : Bt — —1/z) = 0, since Brownian motion
in R2 a.s. does not hit a specified point. Therefore Z does not have explosions
in a finite time for each fixed initial condition Zo = z. On the other hand
for any ujq and to fixed we have BtQ(w0) = zq for some zq G C, and thus for
the initial value z — —1/zo we have an explosion at the chosen finite time tg.
Thus each trajectory has initial conditions such that it will explode at a finite
time, and P{u> : 3 z with C(u>, z) < oo} = 1.
We next turn our attention to the differentiability of the flows. To this end
we consider the system of n + n2 equations (assuming that the coefficients (/?)
are at least C1)
"' ft
a=l J°
m n rt
e*=l j = l'
A < i < n) where D denotes an n x n matrix-valued process and 8\. = 1 if
i = к and 0 otherwise (Kronecker's delta). A convenient convention, sometimes
called the Einstein convention, is to leave the summations implicit. Thus the
system of equations (D) can be alternatively written as
I ra{
We will use the Einstein convention when there is no ambiguity. Note that in
equations (D) if X is already known, then the second system is linear in D.
Also note that the coefficients for the system (D) are not globally Lipschitz,
but if the first partials of the (/?) are locally Lipschitz, then so also are the
coefficients of (D).
Theorem 39. Let Z be as in (HI) and let the functions (/?) in (H2) have
locally Lipschitz first partial derivatives. Then for almost all u> there exists a
function X(t,u>,x) which is continuously differentiable in the open set {x :
?(ar, u>) > t}, where ? is the explosion time (cf., Theorem 38). If (/?) are
globally Lipschitz then C, = oo. Let Dk(t,u),x) = -^-X{t,u),x). Then for each
x the process (X(¦, to, x), D(-, u>, x)) is identically cadlag, and it is the solution
of equations (D) on [0, ?(z, •)).
Proof. We will give the proof in several steps. In Step 1 we will reduce the
problem to one where the coefficients are globally Lipschitz. We then resolve
the first system (for X) of (D), and in Step 2 we will show that, given X, there
exists a "nice" solution D of the second system of equations, which depends

306 V Stochastic Differential Equations
continuously on x. In Step 3 we will show that D\ is the partial derivative in
Xk of Xх in the distributional sense.14 Then since it is continuous (in x), we
can conclude that it is the true partial derivative.
Step 1. Choose a constant N. Then the open set {x : ((x,lj) > N} is a
countable union of closed balls. Therefore it suffices to show that if В is one
of these balls, then on the set Г = {u> : Vx ? B, ?(ar, u>) > N} the function
x t—* X(t,u>,x) is continuously differentiable on B. However by Theorem 38
we know that for each w e Г, the image of X as x runs through В is compact
in Vn with 0 <t < N, hence it is contained in a ball of radius R in R™, for R
sufficiently large.
We fix the radius R and we denote by К the ball of radius R of R™ centered
at 0. Let
A = {w : for x e В and 0 < t < N, X(t, ш, x) ? K}.
We then condition on A. That is, we replace P by Рл, where Рл(А) =
Р(А\Л) = Рр?д^ ¦ Then Рл <С Р, so Z is still a semimartingale with re-
respect to Рл (Theorem 2 of Chap. II). This allows us to make, without loss of
generality, the following simplifying assumption: if x ? В then ?(z, w) > N,
and X(t,w,x) eK,0<t<N.
Next let h : M™ —> R be C°° with compact support and such that h(x) = 1
if x e К and replace / with fh. Let Z be implicitly stopped at the (constant
stopping) time N. (That is, ZN replaces Z.) With these assumptions and
letting Рл replace P, we can therefore assume—without loss of generality—
that the coefficients in the first equation in (D) are globally Lipschitz and
bounded.
Step 2. In this step we assume that the simplifying assumptions of Step 1
hold. We may also assume by Theorem 5 that Z ? <S(/3) fc>r a /3 which will be
specified later. If we were to proceed to calculate formally the derivative with
respect to Xk of X1, we would get
dX\ _ ^ д
dxk
Therefore our best candidate for the partial derivative with respect to Xfc is
the solution of the system
m rt n
*4)
? / E ^
a=iJo 3=1 axi
14 These derivatives are also known as derivatives in the generalized function sense.

7 Flows of SDE: Continuity and Differentiability 307
and let D be the matrix (Dlk). Naturally Xs = X(s,ui,x), and we can make
explicit this dependence on x by rewriting the above equation as
Ukt -
where the summations over a and j are implicit (Einstein convention). We
now show that Dk = (D\,..., D%) is continuous in x. Fix x,y ? Rn and let
Vs(u>) =Dk(s,w,x) -Dk(s,Lj,y). Then
Jo oxj
where У/» = ЕГ=1/0* l^(x«-)dZ"- Note that ЬУ SteP 1 we know that
|^-(X^_) is bounded; therefore since Za e <S(/3), the Yj are in 5(c/3) for a
constant c. If C is small enough, by Lemma 2 (preceding Theorem 7) we have
that for each p > 2 there exists a constant CP(Z) such that
\\g,<Cp{Z)\\H\\g,.
However we can also estimate ||ЯЦ^р. If we let
then Я4г = ЕГ=1 /о JasdZf' an<^ therefore by Emery's inequality (Theorem 3)
we have that
which in turn implies
% < Cp(Z)\\J\\gr.
We turn our attention to estimating || J||sp. Consider first the terms
дР дР
dx~{ s~> ~ dx

308 V Stochastic Differential Equations
By the simplifying assumptions of Step 1, the functions -^ are Lipschitz in
K, and Xх takes its values in K. Therefore Theorem 37 applies, and as we
saw in its proof (inequality (* * *)) we have that
<K\\Xx-Xy\\s^<K(p,Z)\\x-y\\.
Next consider the terms DJkx. We have seen that these terms are solutions of
the system of equations
f
Jo
and therefore they can be written as solutions of the exponential system
with Yg = /„* %&(XZ_)dZZ. As before, by Lemma 2,
\\Dix\\g,P<C2p{Z).
Recalling the definition of J and using the Cauchy-Schwarz inequality gives
<C2p(Z)\\x-yl
which in turn combined with previous estimates yields
||V||r <C(p,Z)\\x-y\\.
Since V was defined to be Vs(ui) = Dk(s,ui,x) — Dk(s,u>,y), we have shown
that (with p > n)
E \sup \\Dk(s,u>,x) - Dk(s,uj,y)\\A < C(p,Z)P\\x - y\\P,
{s<N J
and therefore by Kolmogorov's Lemma (Theorem 72 of Chap. IV) we have
the continuity in x of Dk(t,ui,x).
Step 3. In this step we first show that Dk(t,u>,x), the solution of equations
(*4) (and hence also the solution of the n2 equations of the second line of (D))
is the partial derivative of X in the variable xk in the sense of distributions
(i.e., generalized functions). Since in Step 2 we established the continuity of
.Dfc in x, we can conclude that Dk is the true partial derivative.

7 Flows of SDE: Continuity and Differentiability 309
Let us first note that with the continuity established in Step 2, by increas-
increasing the compact ball K, we can assume further that Dk(s,ui,x) € К also, for
s < N and all x e B.
We now make use of Cauchy's method of approximating solutions of dif-
differential equations, established for stochastic differential equations in The-
Theorem 16 and its corollary. Note that by our simplifying assumptions, Y =
(X,D) takes its values in a compact set, and therefore the coefficients are
(globally) Lipschitz. The process Y is the solution of a stochastic differential
equation, which we write in vector and matrix form as
Yt=V
Г' f(Y.-)dZ..
Jo
Let ar be a sequence of partitions tending to the identity, and with the no-
notation of Theorem 16 let Y(a) = (X(a),D(a)) denote the solution of the
equation of the form
f f{Y°+)°dZs.
f
Jo
For each (ay) the equations (D) become difference equations, and thus trivially
dX(ar)
dxk
= Dk{ar).
The proof of the theorem will be finished if we can find a subsequence rq
such that limg_>ooX(crr.?) = X and lim^oo Dk(<?rq) = Dk, in the sense of
distributions, considered as functions of x.
We now enlarge our space Q, exactly as in Sect. 6 (immediately preceding
Theorem 31):
U = R2n
x п
u>t
where B2n denotes the Borel sets of R2n. Let A be normalized Lebesgue mea-
measure of K. Finally define
P = A x P.
We can as in the proof of Step 2 assume that Z e S@) for /? small enough
and then,
lim (X(<rr), D(ar) = (X, D) in 52,
J >OO —
by Theorem 16. Therefore there exists a subsequence rq such that
oo
M = J2^P\\(X(rq),D(rq)) - (X,D)\\ e Ll(dP).
9 = 1

310 V Stochastic Differential Equations
The function M = M(w,x) is in L1(X x P), and therefore for P-almost all ui
the function x t—> M(ui,x) € L1(dX). For u> not in the exceptional set, and t
fixed it follows that
limo(X(rq),D(rq)) = (X,D)
A a.e. Further, it is bounded by the function M(w, ¦) + \\(X(t,w, -),D(t,u>, -))||
which is integrable by hypothesis. This gives convergence in the distributional
sense, and the proof is complete. ?
We state the following corollary to Theorem 39 as a theorem.
Theorem 40. Let Z be as in (HI) and let the functions (/?) in (H2) have
locally Lipschitz derivatives up to order N, for some N, 0 < N < oo. Then
there exists a solution X(t,w,x) to
l = Xi + J2 ra
which is N times continuously differentiable in the open set {x : <^(x, w) > t},
where ? is the explosion time of the solution. If the coefficients (/?) are globally
Lipschitz, then С = oo.
Proof If N = 0, then Theorem 40 is exactly Theorem 38. If N = 1, then
Theorem 40 is Theorem 39. If N > 1, then the coefficients of equations (?>)
have locally Lipschitz derivatives of order N — 1 at least. Induction yields
(X,D)eCN-\ whence X e CN. О
Note that the coefficients (fla) in Theorem 40 are locally Lipschitz of order
N if, for example, they have N + 1 continuous partial derivatives; that is, if
/*, G CW+1(R"), for each г and a, then (/?) are locally Lipschitz of order N.
8 Flows as Diffeomorphisms: The Continuous Case
In this section we will study a system of differential equations of the form
X\ = Xi + V f Fi{X)s-dZf, 1 < i < n,
(*)
where the semimartingales Za are assumed to have continuous paths with
Zo = 0. The continuity assumption leads to pleasing results. In Sect. 10
we consider the general case where the semimartingale differentials can have
jumps. The flow of an equation such as (*) is considered to be an K"-valued
function ip : Rn —» R™ given by ip(x) = X(t,w,x), for each (t,w). We first
consider the possible injectivity of ip, of which there are two forms.

8 Flows as Diffeomorphisms: The Continuous Case 311
Definition. The flow ip of equation (*) is said to be weakly injective if for
each fixed x, у е R™, x ф у,
P{w :3t: X(t,LJ,x) = X(t,w,y)} = 0.
Definition. The flow ip of equation (*) is said to be strongly injective
(or, simply, injective) if for almost all ui the function ц> : x —> X(t,ui,x) is
injective for all t.
For convenience we recall here a definition from Sect. 3 of this chapter.
Definition. An operator F from Ю™ into D is said to be process Lipschitz
if for any X, Y € Ю™ the following two conditions are satisfied.
(i) For any stopping time T, XT" = YT" implies F(X)T~ = F(Y)T~.
(ii) There exists an adapted process К е L such that
Actually, process Lipschitz is only slightly more general than random Lip-
Lipschitz. The norm symbols in the above definition denote Euclidean norm, and
not sup norm. Note that if F is process Lipschitz then F is also functional
Lipschitz and all the theorems we have proven for functional Lipschitz coeffi-
coefficients hold as well for process Lipschitz coefficients. If / is a function which
is Lipschitz (as defined at the beginning of Sect. 3) then / induces a process
Lipschitz operator. Finally, observe that by Theorem 37 we know that the
flow of equation (*) is continuous from R™ into R™ or from R" into T>n a.s.,
where T>n has the topology of uniform convergence on compacts.
Theorem 41. Let Za be continuous semimariingales, 1 < a < m, H a
vector of adapted cadlag processes, and F an nxm matrix of process Lipschitz
operators. Then the flow of the solution of
is weakly injective.15
Proof. Let x, у G R™, x ф у. Let Xх, Xy denote the solutions of the above
equation with initial conditions x, у respectively. We let и = x — у and U =
Xх - XV. We must show P{w : 3t : Ut(u>) = 0} = 0. Set V = F(XX)- -
F(Xy)_. Then V G L and \V\ < K\U\. Further, the processes U and V are
related by
Ut = u+ [ VsdZs.
Let T = inf{t > 0 : Ut = 0}; the aim is to show P(T = oo) = 1. Since U is
continuous the stopping time T is the limit of a sequence of increasing stopping
We are using the Einstein convention on sums.

312 V Stochastic Differential Equations
times Sk strictly less than T. Therefore the process 1[о,т) = lim^oo l[o,sfc] is
predictable.
We use Ito's formula (Theorem 32 of Chap. II) on [О, Г) for the function
f(x) =log||a;||. Note that
df 1 d\\x\\ 1 xt xt
г)т llrll г)т llrll llrll llrll2'
ихг \\x\\ ихг уху \\x\\ \\x\\
d2f 1 Ixj d\\x\\ _ 1 2x±
32f 2xiXj
\\x\\A '
Therefore on [0,T),
+ 5 E/ йр*-, </•!. - E/o
Since <Я7* = ^a V^dZ", the foregoing equals
log Ht^H-tog ||«
* 1
-E
i,j,a,0
All the integrands on the right side are predictable and since ||V|| <
they are moreover bounded by К and K2 in absolute value. However on
{T < 00} the left side of the equation, log ||J7t|| - log ||и||, tends to -00 as t
increases to T; the right side is a well-defined non-exploding semimartingale
on all of [0,00). Therefore P(T < 00) = 0, and the proof is complete. D
In the study of strong injectivity the stochastic exponential of a semi-
semimartingale (introduced in Theorem 37 of Chap. II) plays an important role.
Recall that if Z is a continuous semimartingale, then XqS(Z) denotes the
(unique) solution of the equation
Xt = Xq + / X.dZs,
/
Jo
and ?(Z)t = exp{Zt - \[Z,Z\t}. In particular, P(\nis<tS{Z)s > 0) = 1.
Theorem 42. For iel", let Hx be in Dfc such that they are locally bounded
uniformly in x. Assume further that there exists a sequence of stopping times
i increasing to oo a.s. such that \\(H^_ — й"^)т'||^'- < K\\x — y\\, each

8 Flows as Diffeomorphisms: The Continuous Case 313
(- > 1> for a constant К and for some r > n. Let Z = (Z1,... ,Zk) be к
semimartingales. Then the functions
ft
т t—> / TJxrl7
JO
x ь-» [Hx ¦ Z, Hx ¦ Z]t
have versions which are continuous as functions from R" into T>, with T>
having the topology of uniform convergence on compacts.
Proof. By Theorem 5 there exists an arbitrarily large stopping time T such
that ZT~ G H°°. Thus without loss of generality we can assume that Z e H°°,
and that Hx is bounded by some constant K, uniformly in x. Further we
assume \\H1 - Hl\\sr < K\\x - y\\. Then
up || /* Hx_dZs - f H»_dZ,\\r} < CE{snp \\HX_ - Щ_\\г}\\г\\гн~
t Jo Jo t =
where we have used Emery's inequality (Theorem 3). The result for /Hf_dZs
now follows from Kolmogorov's Lemma (Theorem 72 of Chap. IV). For the
second result we have
|| [Hx_ -Z,HX-Z\- [Hi -Z,Hy_- Z] ||r
)+Ht(H:i
<2к\\г\\%г\\нх-ну\\§:,
and the result follows. D
Theorem 43. Let F be a matrix of process Lipschitz operators and Xх the
solution of (*) with initial condition x, for continuous semimartingales Za,
1 < a < m. Fix x, у ? R". For r ? R there exist for every x, у G R™ with
x ф у (uniformly) locally bounded predictable processes Ha(x,y), Ja'P(x,y),
which depend on r, such that
where
Ar
(x,
V)t =
ft
Jo
ft
Jo

314 V Stochastic Differential Equations
Proof. Fix x, у e Rn and let U = Xх - X*, V = F{XX)- - F(X»)_. Ito's
formula applies since U is never zero by weak injectivity (Theorem 36). Using
the Einstein convention,
r4UlUi +5)\\Us\\T-2}d[U\U%.
Let (•, •) denote Euclidean inner product on R™. It suffices to take
H?(x,y)=r\\Us\\-2(Us,Vsa),
(where Va is the a-th column of V); and
One checks that these choices work by observing that dU\ —
Finally the above allows us to conclude that
Л
Jo
and the result follows. ?
Before giving a key corollary to Theorem 43, we need a lemma. Let H00
be the space of continuous semimartingales X with Xo = 0 such that X has
a (unique) decomposition
X = N + A
where N is a continuous local martingale, A is a continuous process of finite
variation, Nq = Aq = 0, and such that [NjN]^ and Jo°° \dAs\ are in L°°.
Further, let us define
coo
\dAs\\\L°°-
\\Х\\ЙОО = \\[N,N}U? +
Jo
Lemma. For every p,a < oo, there exists a constant C(p,a) < oo such that
if \\Х\\Й„ < a, then \\S(X)\\iP < C(p,a).
Proof. Let X = N + A be the (unique) decomposition of X. Then
\\PSP = E{supexp{p{Xt - ]:[X,X]t)}}
- t 2
< E{epX'} (recall that X* = sup \Xt\)
t
< E{exp{pN* + pa}}
= epaE{epN'},

8 Flows as Diffeomorphisms: The Continuous Case 315
since \At\ < a, a.s. We therefore need to prove only an exponential maximal
inequality for continuous martingales. By Theorem 42 of Chap. II, since N
is a continuous martingale, it is a time change of a Brownian motion. That
is, Nt = B[jv,jv]t) where В is a Brownian motion defined with a different
filtration. Therefore since [N, N}oo < a2, we have
and hence E{epN'} < E{exp{pB*2}}. Using the reflection principle (Theo-
(Theorem 33 of Chap. I) we have
2 2
E{exp{pB*a2}} < 2E{exp{pBa2}} = ^|
D
Note that in the course of the proof of the lemma we obtained C{p, a) =
2
Corollary. Let — oo < r < oo and p < oo, and let (Лг(х, y)t)t>o be as given
in Theorem 43. Then ?(Ar(x, y)) is locally in ^p, uniformly in x, y.
Proof. We need to show that there exists a sequence of stopping times Тц
increasing to oo a.s., and constants C( < oo, such that ||5(Лг(а;, y))Tt \\s_p < Ci
for all x, у е К™, x фу. ~
By stopping, we may assume that Za and {Za,Z@] are in H_ and that
\Ha\ and \Ja^\ < b for all (x,y), 1 < a, C < m. Therefore \\Лг(х^у)\\йоо < С
— OO
for a constant C, since if X € Я and if К is bounded, predictable, then
К ¦ X e Я°° and \\K ¦ XWjja* < \\К\\з°°\\Х\\йоо, as can be proved exactly
analogously to Emery's inequalities (Theorem 3). The result now follows by
the preceding lemma. ?
Comment. A similar result in the right continuous case is proved by a dif-
different method in the proof of Theorem 62 in Sect. 10.
Theorem 44. Let Za be continuous semimartingales, 1 < a < m, and F an
n x m matrix of process Lipschitz operators. Then the flow of the solution of
Xt=x+ f F(X)s-dZs
Jo
is strongly injective on K™.
Proof. It suffices to show that for any compact set Ccl", for each N, there
exists an event of probability zero outside of which for every x,y € С with
Ы\\Х(8,и,х)-Х(8,и,у)\\>0.
s<N

316 V Stochastic Differential Equations
Let x, y, s have rational coordinates. By Theorem 43 a.s.
The left side of the equation is continuous (Theorem 37). As for the right side,
?(Лг(х, у)) will be continuous if we can show that the processes H"(x, y) and
Jf >/3(ж, у), given in Theorem 43, verify the hypotheses of Theorem 42. To
this end, let В be any relatively compact subset of R™ x R™ \ {(x,x)} (e.g.,
В = Bxx B2 where Bi,B2 are open balls in K" with disjoint closures). Then
|jar — y\\r is bounded on В for any real number r. Without loss we take r = 1
here. Let U(x,y) = Xх -X», V(x,y) = F(X*)_ -F(X»)_, and let Va(x,y)
be the a-th column of V. Then for (x,y) and (x',yr) in В we have
Ha(x,y)-Ha(x',y') (**)
= (\\U(x,y)\\-2 - \\U(x',y')\\-2)(U(x,y),Va(x,y))
+ \\U(x', y')\\-2(U(x, y) - U(x', y'), Va(x, y))
+ \\U(x',y')\\-2(U(x',y>),Va(x,y) - Va(x',y')).
The first term on the right side of (**) above is dominated in absolute value
by
\\\U(x,y)\\ - \\U(x',y')\\\(\\U(x,y)\\ + \\U(x',y')\\\) уШуа(х v)n
<K\\U(x,y)-U(x',y')\\(\\U(x,y)\\ + \\U(x',y')\\)\\U(x',y')\\-2,
where we are assuming (by stopping), that F has a Lipschitz constant K.
Since U(x, y) — U(x', y') = U(x, x') — U(y, y'), the above is less than
K{\\U{x,x')\\+\\U{y,y')\\){\\U{x,y)\\
= K{\\x - x'\\S{Ax{x,x')) + \\y - j
{\\x - у\\?(Лг(х,у)) + \\x' - у'ЩЛ^х^у'Шх' - у'\\-2?{Л_2{х',у')
<К1\\(х,у)-(х',у')\\(?(Л1(х,х'))+?(Л1(у,у')))(?(М^У))
By the lemma following Theorem 43, and by Holder's and Minkowski's
inequalities we may, for any p < oo, find stopping times Ti increasing to oo a.s.
such that the last term above is dominated in 5P norm by K( || (x, y) — (x', y') \\
for a constant Ki corresponding to Ti. We get analogous estimates for the
second and third terms on the right side of (**) by similar (indeed, slightly
simpler) arguments. Therefore Ha satisfies the hypotheses of Theorem 42, for
(x,y) € B. The same is true for Ja^, and therefore Theorem 42 shows that
Ar and [yl^ylr] are continuous in (x,y) on B. (Actually we are using a local
version of Theorem 42 with (x, y) ? В С R2n instead of all of R2n; this is not
a problem since Theorem 42 extends to the case x ? W open in R™, because

8 Flows as Diffeomorphisms: The Continuous Case 317
Kolmogorov's Lemma does—recall that continuity is a local property.) Finally
since Ar and [Лг, Лг] are continuous in (x, y) G В we deduce that ?(Лг(х, y))
is continuous in {(x, y) G M2n : x ф у}.
We have shown that both sides of
\\X(s,w,x)-X(s,w,y)\\r = \\x-y\\r?(Ar(x,y))s
can be taken jointly continuous. Therefore except for a set of probability zero
the equality holds for all (x, y, s)et"xI"xR|. The result follows because
?(Ar(x, y))t is defined for all t finite and it is never zero. D
Theorem 45. Let Za be continuous semimartingales, 1 < a < m, and let F
be an n x m matrix of process Lipschitz operators. Let X be the solution of
(*). Then for each N < oo and almost all ш
lim inf \\X(s,u),x)\\ = oo.
¦ Ill ^ »r II \ ' ' /II
||x||-»oo s<N
Proof By Theorem 43 the equality
is valid for all r G R.
For x ф 0 let Yx = \\XX - Х°\\~1. (Note that Yx is well-defined by
Theorem 41.) Then
(x,0)) (***)
\yx -Yy\ < ц^0101
Define Y°° = 0. The mapping x i-> x||x||~2 inspires a distance (I on Rn \ {0}
by d(x,y) = |ff|j||||. Indeed,
У ll2 (\\x-y\\'2
11 № h\r VN
By Holder's inequality we have that
\\?(Л1(х,у))?(Л.1(х,0)?(Л-1(у,0))\\ш-
and therefore by the corollary to Theorem 43 we can find a sequence of stop-
stopping times (Te)e>i increasing to oo a.s. such that there exist constants Ce with
(using (* * *))
т
Next set

318 V Stochastic Differential Equations
0 < ||x|| < oo,
||x|| = 0.
Then ||(УХ - Yy)Te\\rsr < Crt\\x - y\\r on Rn, and by Kolmogorov's Lemma
(Theorem 72 of Chap. IV), there exists a jointly continuous version of
(t,x) i-> Ytx, on Rn. Therefore Нтцхц_>0Угх exists and equals 0. Since
(У1) = ||ХЖ"ЖИ - Х°\\, we have the result. ?
Theorem 46. Let Za be continuous semimartingales, 1 < a < m, and F be
an n x m matrix of process Lipschitz operators. Let X be the solution of (*).
Let ip : Rn —> Rn be the flow ip(x) = X(t,w,x). Then for almost allw one has
that for all t the function <p is surjective and moreover it is a homeomorphism
from Rn to Rn.
Proof. As noted preceding Theorem 41, the flow ip is continuous from Rn to
Vn, topologized by uniform convergence on compacts; hence for a.a. lj it is
continuous from Rn to Rn for all t.
The flow tp is injective a.s. for all t by Theorem 44.
Next observe that the image of Rn under ip, denoted y(Rn), is closed.
Indeed, let y>(Rn) denote its closure and let у € y(Rn). Let (xk) denote a
sequence such that lim^o, ^(ij) = y. By Theorem 45, limsup^^^ ||х*,|| <
oo, and hence the sequence (xk) has a limit point x e Rn. Continuity implies
ip(x) = y, and we conclude that ip(Rn) = y(Rn); that is, ip(Rn) is closed.
Then, as we have seen, the set {xk} is bounded. If Xk does not converge to
x, there must exist a limit point z ф x. But then ip(z) = у = <р(х), and this
violates the injectivity, already established. Therefore ip~ * is continuous.
Since ip is a homeomorphism from Rn to </?(Rn), the subspace fp(M.n) of
Rn is homeomorphic to a manifold of dimension n in Rn; therefore by the
theorem of the invariance of the domain (see, e.g., Greenberg [84, page 82]),
the space y(Rn) is open in Rn. But </?(Rn) is also closed and non-empty. There
is only one such set in Rn that is open and closed and non-empty and it is
the entire space Rn. We conclude that y>(Rn) = Rn. ?
Comment. The proof of Theorem 46 can be simplified as follows: extend ip
to the Alexandrov compactification R" = Rn U {oo} of Rn to Tp by
I oo, x = oo.
Then Ip is continuous on R" by Theorem 45, and obviously it is still injective.
Since R™ is compact, Tp is a homeomorphism of R" onto 7p(R%). However R"
is topologically the sphere Sn, and thus it is not homeomorphic to any proper
subset (this is a consequence of the Jordan-Brouwer Separation Theorem (e.g.,
Greenberg [84, page 79]). Hence ^(R?) = R™.

8 Flows as Diffeomorphisms: The Continuous Case 319
We next turn our attention to determining when the flow is a diffeomor-
phism of Rn. Recall that a diffeomorphism of Rn is a bijection (one to one
and onto) which is C°° and which has an inverse that is also C°°. Clearly the
hypotheses on the coefficients need to be the intersection of those of Sect. 7
and process Lipschitz.
First we introduce a useful concept, that of right stochastic exponentials,
which arises naturally in this context. For given n, let Z be an n x n matrix
of given semimartingales. If X is a solution of
X* = 1+1 Xs-dZs,
f Xs-c
Jo
where X is an n x n matrix of semimartingales and / is the identity matrix,
then X = ?{Z), the (matrix-valued) exponential of Z. Since the space of n x n
matrices is not commutative, it is also possible to consider right stochastic
integrals, denoted
(Z :H)t= f (dZs)Hs,
Jo
where Z is an n x n matrix of semimartingales and H is an n x n matrix of
(integrable) predictable processes. If ' denotes matrix transpose, then
(Z : Я) = (H' ¦ Z'Y,
and therefore right stochastic integrals can be defined in terms of stochastic
integrals. Elementary results concerning right stochastic integrals are collected
in the next theorem. Note that / Y-dZ and [Y, Z] denote nxn matrix-valued
processes here.
Theorem 47. Let Y, Z be given nxn matrices of semimartingales, H an
nxn matrix of locally bounded predictable processes. Then,
(i) YtZt - Y0Z0 = /„* Ys_dZs + J*(dYs)Zs_ + [У, Z]t;
(ii) [H-Y,Z] =H-[Y,Z); and
(Hi) [Y,Z:H} = [Y,Z]:H.
Moreover ifF is an nxn matrix of functional Lipschitz operators, then there
exists a unique nxn matrix of 3-valued processes which is the solution of
/'
./о
(dZs)F(X)s_.
Proof The first three identities are easily proved by calculating the entries
of the matrices and using the results of Chap. II. Similarly the existence and
uniqueness result for the stochastic integral equation is a simple consequence
of Theorem 7. ?
Theorem 47 allows the definition of the right stochastic exponential.

320 V Stochastic Differential Equations
Definition. The right stochastic exponential of ал п х п matrix of semi-
martingales Z, denoted ?H(Z), is the (unique) matrix-valued solution of the
equation
Xt = I+ f (dZe)Xe..
Jo
We illustrate the relation between left and right stochastic exponentials
in the continuous case. The general case is considered in Sect. 10 (see Theo-
Theorem 63). Note that ?R(Z) = ?{Z')'.
Theorem 48. Let Z be an n x n matrix of continuous semimartingales with
Zo = 0. Then ?{Z) and SR{-Z + [Z, Z]) are inverses; that is, S(Z)?R(-Z +
[Z,Z]) = I.
Proof. Let U = ?{Z) and V = ?R(-Z + [Z,Z]). Since U0VQ = I, it suffices
to show that d(UtVt) = 0, all t > 0. Note that
dV = (-dZ + d[Z,Z])V,
(dU)V = (UdZ)V, and
d[U,V] = Ud[Z,V] = -Ud[Z,Z}V.
Using Theorem 47 and the preceding,
d(UV) = UdV + (dU)V + d{U, V]
= U(-dZ + d[Z, Z])V + UdZV - Ud[Z, Z)V
= o,
and we are done. D
The next theorem is a special case of Theorem 40 (of Sect. 7), but we state
it here as a separate theorem for ease of reference.
Theorem 49. Let (Z1,..., Zm) be continuous semimartingales with Z^ = 0,
1 < i < m, and let (fa), 1 < i < n, 1 < a < m, be functions mapping Шп to
R, with locally Lipschitz partial derivatives up to order N, 1 < N < oo, and
bounded first derivatives. Then there exists a solution X(t,u),x) to
J0
fia{Xe)dZf,
such that its flow <p : x —> X(x, t,uj) is N times continuously differentiable on
Шп. Moreover the first partial derivatives satisfy the linear equation
a=lj-1
where S-j, is Kronecker's delta.

9 General Stochastic Exponentials and Linear Equations 321
Observe that since the first partial derivatives are bounded, the coefficients
are globally Lipschitz and it is not necessary to introduce an explosion time.
Also, the value N = oo is included in the statement. The explicit equation for
the partial derivatives comes from Theorem 39.
Let D denote the n x n matrix-valued process
Dt = (Dit)i<i<n,i<fc<n- (®)
The process D is the right stochastic exponential ?R(Y), where Y is defined
by
Combining Theorems 48 and 49 and the above observation we have the im-
important following result.
Theorem 50. With the hypotheses and notation of Theorem 49, the matrix
Dt is non-singular for allt> 0 and ieR", a.s.
Theorem 51. Let (Z1,..., Zm) be continuous semimartingales and let (fa),
1 < i < n, 1 < a < m, be functions mapping Шп to Ш, with partial derivatives
of all orders, and bounded first partials. Then the flow of the solution of
a=l
fa(Xs)dZ», 1 < * < n,
is a diffeomorphism from Шп to Rn.
Proof. Let (p denote the flow of X. Since (/„)i<i<n,i<a<m have bounded first
partials, they are globally Lipschitz, and hence there are no finite explosions.
Moreover since they are C°°, the flow is C°° on Rn by Theorem 49. The
coefficients (fa) are trivially process Lipschitz, hence by Theorem 46 the flow
ip is a homeomorphism; in particular it is a bijection of Rn. Finally, the matrix
Dt (defined in (®) preceding Theorem 50) is non-singular by Theorem 50, thus
ip is C°° by the Inverse Function Theorem. Since <p~l is also C°°, we conclude
ip is a diffeomorphism of Rn. ?
9 General Stochastic Exponentials and Linear Equations
Let 2bea given continuous semimartingale with Zq = 0 and let ?{Z)t denote
the unique solution of the stochastic exponential equation
Xt = 1 + / XsdZs. (*)
Jo
Then Xt = ?{Z)t = exp{Zt - \[Z,Z]t} (cf., Theorem 37 of Chap. II). It is
of course unusual to have a closed form solution of a stochastic differential

322 V Stochastic Differential Equations
equation, and it is therefore especially nice to be able to give an explicit
solution of the stochastic exponential equation when it also has an exogenous
driving term. That is, we want to consider equations of the form
=Ht+ f
Jo
Xt=Ht+ f Xs_dZs, (**)
Jo
where H G D (cadlag and adapted), and Z is a continuous semimartingale. A
unique solution of (**) exists by Theorem 7. It is written ?jj{Z).
Theorem 52. Let H be a semimartingale and let Z be a continuous semi-
semimartingale with Zo = 0. Then the solution ?h(Z) of equation (**) is given
by
eH(Z)t = ?(Z)t{H0 + ( ?(г);Ч(н3 - [н, z\s)}.
Proof. We use the method of "variation of constants." Assume the solution is
of the form Xt = CtUt, where Ut = ?(Z)t, the normal stochastic exponential.
The process С is cadlag while U is continuous. Using integration by parts,
dXt = Ct-dUt + UtdCt + d[C, U]t
= Ct-UtdZt + UtdCt + Utd[C, Z]t
If X is the solution of (**), then equating the above with (**) yields
dHt + Xt.dZt = Xt_dZt + Utd{Ct + [C, Z]t)
or
dHt = Utd{Ct + [C,Z]t}.
Since U is an exponential it is never zero and 1/U is locally bounded. Therefore
Calculating the quadratic covariation of each side with Z and noting that
[[C, Z],Z]= 0, we conclude
Therefore equation (* * *) becomes
dH = dC+
and Ct = Jl U~ld{H8 - [Я, Z)s). Recall that Ut = ?{Z)t and Xt = CtUt, and
the theorem is proved. ?

9 General Stochastic Exponentials and Linear Equations 323
Since ?(Z)^1 — l/?(Z)t appears in the formula for ?jj{Z), it is worthwhile
to note that (for Z a continuous semimartingale)
/ 1 \=dZ-d[Z,Z]
\?{Z)) ?{Z)
and also
A more complicated formula for ?h{Z) exists when Z is not continuous
(see Yoeurp-Yor [236]). The next theorem generalizes Theorem 52 to the case
where H is not necessarily a semimartingale.
Theorem 53. Let H be cadlag, adapted (i.e., H G Ш), and let Z be a con-
continuous semimartingale with Zq = 0. Let Xt — ?jj(Z)t be the solution of
,t
Xt = Hf + I Xs-dZs.
Jo
Then Xt = ?H(Z)t is given by
Xt=Ht+ ?(Z)t [ ?{Z)-\Hs_dZs - Hs_d[Z, Z]s).
Jo
Proof. Let Yt = Xt - Ht. Then Y satisfies .
[
= f Hs_dZs + [ Ys^
Jo Jo
Yt=
= Kt [
Jo
where К is the semimartingale Я_ • Z. By Theorem 52,
Yt = ?(Z)t{K0 + f ?{г)-Ч{Ка - [K,Z].)}
Jo+
and since KQ = 0 and [K, Z)t = f* H..d[Z, Z]s,
Yt = ?(Z)t f ?{Z)-\Hs_dZs - Hs^d[Z,Z]s),
Jo
from which the result follows. ?
Theorem 54 uses the formula of Theorem 52 to give a pretty result on the
comparison of solutions of stochastic differential equations.

324 V Stochastic Differential Equations
Lemma. Suppose that F is functional Lipschitz such that if Xt{oS) = 0, then
F(X)t-(w) > 0 for continuous processes X. Let С be a continuous increasing
process and let X be the solution of
Xt = x0 + j F(X)s_dCs,
Jo+
with xQ > 0. Then P{31 > 0 : Xt < 0} = 0.
Proof. Let Г = inf {t > 0 : Xt = 0}. Since XQ > 0 and X is continuous, Xs > 0
for all s < T on {T < oo}. The hypotheses then imply that F(X)T- > 0 on
{T < сю}, which is a contradiction. ?
Comment. In the previous lemma if one allows xq = 0, then it is necessary
to add the hypothesis that С be strictly increasing at 0. One then obtains the
same conclusion.
Theorem 54 (Comparison Theorem). Let (Za)i<a<:m be continuous
semimartingales with Zft — 0, and let Fa be process Lipschitz. Let A be a
continuous, adapted process with increasing paths, strictly increasing att = O.
Let G and H be process Lipschitz functional^ such that G(X)t_ > H{X)t_
for any continuous semimartingale X. Finally, let X and Y be the unique
solutions of
Xt = x0 + f G(X)s_dAs + f F(X)s_dZs,
Jo+ Jo
H(Y)s_dAs+ f F(Y)s_dZs
o+ Jo
where x0 > y0 and F and Z are written in vector notation. Then P{4t > 0 :
Xt < Yt} = 0.
Proof. Let
ft
Nt
= j {F(X)S- - F(Y)S^}(XS - Ys)-4{x^Yi]dZs, and
Jo
Ct=xo-yo+ [ {G(X)S_ - H(Y)s_}dAs.
Jo+
Then Ut = Ct + /0* Us-dNs, and by Theorem 52
Ut = ?(N)t{(x0 -Ы+/
Next set

9 General Stochastic Exponentials and Linear Equations 325
vt =
and define the operator К on continuous processes W by
K(W) = G(W?(N) + Y)- H(Y).
Note that since G and H are process Lipschitz, if Wt = 0 then G(W?(N) +
Y)t- = G(Y)t... Therefore К has the property that Wt(w) = 0 implies that
K(W)t- > 0. Note further that K(V) = G(U + Y)~ H(Y) = G(X) - H(Y).
Next observe that
= x0 - yo + / ?{NO1dCs
Jo+
?(N);1{G(X)s--H(Y)s.}dAs
= xo-yo+ f K(V)s-?(N):xdAs
Jo+
K(V)s-dDs,
о
where Dt = J0,?(N)~ldAs. Then D is a continuous, adapted, increasing
process which is strictly increasing at zero. Since V satisfies an equation of
the type given in the lemma, we conclude that a.s. Vt > 0 for all t. Since
?{N)^1 is strictly positive (and finite) for all t > 0, we conclude Ut > 0 for
all t > 0, hence Xt > Yt for all t > 0. ?
Comment. If Xq > г/о (i-e-> ^o = 2/0 is n°t allowed), then the hypothesis that
A is strictly increasing at 0 can be dropped.
The theory of flows can be used to generalize the formula of Theorem 52.
In particular, the homeomorphism property is used to prove Theorem 55.
Consider the system of linear equations given by
Xt=Ht+yl A{_Xs_dZi (*4)
where H = (Нг), 1 < i < n is a vector of n semimartingales, X takes values
in Rn, and Л-3 is an n x n matrix of adapted, cadlag processes. The processes
Z-', 1 < j <m, are given, continuous semimartingales which are zero at zero.
Define the operators Fj on Dn by
F(X)t = A\Xt
where A> is the nxn matrix specified above. The operators Fj are essentially
process Lipschitz. (The Lipschitz processes can be taken to be \\АЦ\ which

326 V Stochastic Differential Equations
is cadlag, not caglad, but this is unimportant since one takes F(X)t- in the
equation.)
Before examining equation (*4), consider the simpler system A < i,k <
n),
where
xi-
0, гфк.
Letting I denote the nxn identity matrix and writing the preceding in matrix
notation yields
/ M-Us-dZi, (*5)
E /
j=i Jo
where U takes its values in the space of n x n matrices of adapted processes
inD.
Theorem 55. Let A^, 1 < j < m, be n x n matrices of cadlag, adapted
processes, and let U be the solution of (*5). Let Xf be Hie solution of (*4)
where Ht = x, x еШп. Then X* = Utx and for almost all w, for all t and x
the matrix Ut(cj) is invertible.
Proof. Note that Ut is an n x n matrix for each (t,uj) and x G Rn, so that
UfX is in Rn. If X* = Utx, then since the coefficients are process Lipschitz we
can apply Theorem 46 (which says that the flow is a homeomorphism of Шп)
to obtain the invertibility of Ut(co).
Note that U is also a right stochastic exponential. Indeed, U = ?R(V),
where Vj = Jo YlT=i Ai-dZJs7 and therefore the invertibility also follows from
Theorem 48.
Thus we need to show only that Xf =¦ Utx. Since Utx solves (*4) with
Ht = x, we have Utx = Xf a.s. for each x. Note that a.s. the function
x >—> U(uj)x is continuous from Rn into the subspace of T>n consisting of
continuous functions; in particular (t,x) i—* Ut(cj)x is continuous. Also as
shown in the proof of Theorem 46, (x,t) i—> Xf is continuous in x and right
continuous in t. Since Utx = Xf a.s. for each fixed x and t, the continuity
permits the removal of the dependence of the exceptional set on x and t. ?
Let U^1 denote the nxn matrix-valued process with continuous trajec-
trajectories a.s. denned by (U~l)t(co) =
Recall equation (*4)
m „t
3=1 Jo
И)

9 General Stochastic Exponentials and Linear Equations 327
where H is a column vector of n semimartingales and Zq = 0. Let [H, Z3}
denote the column vector of n components, the ith one of which is [Hl, Z3].
Theorem 56. Let И be a column vector of n semimartingales, Z] A < j <
m) be continuous semimartingales with Zq = 0, and let A], 1 < j < m be
nxn matrices of processes in ID. Let U be the solution of equation (*5). Then
the solution XH of (*A) is given by
XtH = UtH0 + Ut / U;\dHs - J2 A{-d[H, Z%)
Jo+ i=1
Proof. Write XH as the matrix product UY. Recall that U~l exists by Theo-
Theorem 48, hence Y = U~1XH is a semimartingale, that we need to find explicitly.
Using matrix notation throughout, we have
d(UY) = dH + J2 AiX
Integration by parts yields (recall that U is continuous)
m
(dU)Y- + U(dY) + d[U, Y] = dH + ^ Aj_U-Y-dZj,
by replacing X with UY on the right side above. However U satisfies (*5) and
therefore
m
{dU)Y_ = J2A-u-Y-dZJi
3 = 1
and combining this with the preceding gives
U(dY)+d[U,Y] =dH,
or equivalently
dY = U-1dH-U-1d[U,Y]. (*6)
Taking the quadratic covariation of the preceding equation with Z, we have
since \U~1d\U,Y},Z3] = 0, 1 < j < m. However since U satisfies (*5),
d[U,Y\ -
3=1 3=1
3 = 1

328 V Stochastic Differential Equations
since U equals ?/_. Substitute the above expression for d[U,Y] into (*6) and
we obtain
m
dY = U~\dH -Y,A-d[H,Zj}),
and since XH = UY, the theorem is proved. D
10 Flows as Diffeomorphisms: The General Case
In this section we study the same equations as in Sect. 8, namely
X\ = Xi + JT f Fi(X).-dZf, 1 < i < n, (*)
except that the semimartingales {Za)\<a<m are no longer assumed to he con-
continuous. For simplicity we still assume that Zo = 0. In the general case it is
not always true that the flows of solutions are diffeomorphisms of Rn, as the
following example shows.
Example. Consider the exponential equation in R,
f*
Xt — x + / Xs-dZs.
Jo
Let Z be a semimartingale, Zq = 0, such that Z has a jump of size —1 at a
stopping time Г, Т > 0 a.s. Then all trajectories, starting at any initial value
x, become zero at T and stay there after T, as is trivially seen by the closed
form of the solution with initial condition x:
Xt=x exp{Zt--[Z,Z]ct} TT (l + AZs)e-AZ\
2 A A
0<s<t
Therefore, injectivity of the flow fails, and the flow cannot be a diffeomorphism
We examine both the injectivity of the flow and when it is a diffeomorphism
of Rn. Recall the hypotheses of Sect. 7, to which we add one, denoted (H3).
Hypothesis (HI). Za are given semimartingales with Zff = 0, 1 < a < m.
Hypothesis (H2). /? : Rn —> R are given functions, I <i < n, 1 < a < m,
and f(x) denotes the n x m matrix (/„(a:)).
The system of equations
* =xi + E f ш
(*)

10 Flows as Diffeomorphisms: The General Case 329
may also be written
[
Jo
[ f(Xs-)dZs
Jo
where Xt and x are column vectors in Rn, /(Xs_) is an n x m matrix, and Z
is a column vector of m semimartingales.
Hypothesis (H3). / is C°° and has bounded derivatives of all orders.
Note that by Theorem 40 of Sect. 7, under (H3) the flow is C°°. The key
to studying the injectivity (and diffeomorphism properties) is an analysis of
the jumps of the semimartingale driving terms.
Choose an e > 0, the actual size of which is yet to be determined. For
(Za)i<a<m we can find stopping times 0 = To < T\ < T2 < ¦ ¦ ¦ tending a.s.
to oo such that
have an H°° norm less than e (cf., Theorem 5). Note that by Theorem 1,
[Za'j, Z'^'ijoo < e as well, hence the jumps of each Za>i are smaller than e.
Therefore all of the "large" jumps of Za^ occur only at the times (Tj), j > 1.
Let X3t (x) denote the solution of (*) driven by the semimartingales Za>i.
Outside of the interval (Tj-i,Tj) the solution is
Next define the linkage operators
using vector and matrix notation. We have the following obvious result.
Theorem 57. The solution X of (*) is equal to, for Tn <t < Tn+1,
Xt(x)=X?+\xn+),
where
xo+ = x
xi- =Xfx_(a:), x1+ = H1(x1-)
X2+ = H2(X2~)
xn- = X?n_(z(n_i)+), xn+ = Hn{xn-).
Theorem 58. The flow ip : x —> Xt(x,uf) of the solution X of (*) is a
diffeomorphism if the collections of functions
x i—у X3t (x,lj) x i—> H3 (x, w)
are diffeomorphisms.

330 V Stochastic Differential Equations
Proof. By Theorem 57, the solution (*) can be constructed by composition of
functions of the types given in the theorem. Since the composition of diffeo-
morphisms is a diffeomorphism, the theorem is proved. D
We begin by studying the functions x —> X3T.{x,w) and x —> X™+1(x,lj).
Note that by our construction and choice of the times Tj, we need only to
consider the case where Z = Z? has a norm in H°° smaller than e.
The following classical result, due to Hadamard, underlies our analysis.
Theorem 59 (Hadamard's Theorem). Let g : Rn -> Rn be C°°. Suppose
(i) Нт^ц^оо ||5(a;)|| = oo, and
(ii) the Jacobian matrix g'(x) is an isomorphism o/Rn for all x.
Then g is a diffeomorphism o/Rn.
Proof. By the Inverse Function Theorem the function g is a local diffeomor-
diffeomorphism, and hence it suffices to show it is a bijection of Rn.
To show that g is onto (i.e., a surjection), first note that <?(Rn) is open and
non-empty. It thus suffices to show that <?(Rn) is a closed subset of Rn, since
Rn itself is the only nonempty subset of Rn that is open and closed. Let (?j)j>i
be a sequence of points in Rn such that lirn^oo g(xi) = у € Rn. We will show
that у e <7(Rn). Let Xi — tiVi, where U > 0 and ||vj|| = 1. By choosing a
subsequence if necessary we may assume that Vi converges to и € Sn, the
unit sphere, as i tends to oo. Next observe that the sequence (ij)«>i must be
bounded by condition (i) in the theorem: for if not, then tt = \\xi\\ tends to oo
along a subsequence and then ||<7(?ifc)|| tends to oo by (i), which contradicts
that limj^oo g(xi) = y. Since (?»)»>! is bounded we may assume lim^oo tt =
to E Rn again by taking a subsequence if necessary. Then limj—oo xt = tov,
and by the continuity of g we have у — lim^oo g(Xi) = g(tov).
To show g is injective (i.e., one-to-one), we first note that g is a local
homeomorphism, and moreover g is finite-to-one. Indeed, if there exists an
infinite sequence (a;n)n>i such that g(xn) = yo, all n, for some t/o, then by
condition (i) the sequence must be bounded in norm and therefore have a
cluster point. By taking a subsequence if necessary we can assume that xn
tends to x (the cluster point), where g(xn) = yo, all n. By the continuity of g
we have g{x) = г/о as well. This then violates the condition that g is a local
homeomorphism, and we conclude that g is finite-to-one.
Since g is a finite-to-one surjective homeomorphism, it is a covering map.16
However since Rn is simply connected the only covering space of Rn is Rn (the
fundamental group of Rn is trivial). Therefore the fibers g~1(x) for x € Rn
each consist of one point, and g is injective. D
The next step is to show that the functions x н-> Xj,.(x,ui) and x i—>
X™+1(x,u>) of Theorem 58 satisfy the two conditions of Theorem 59 and are
16 For the algebraic topology used here, the reader can consult, for example,
Munkries [183, Chapter 8].

10 Flows as Diffeomorphisms: The General Case 331
thus diffeomorphisms. This is done in Theorems 62 and 64. First we give a
result on weak injectivity which is closely related to Theorem 41.
Theorem 60. Let Za be semimartingales, 1 < a < m with Zft — 0, and let F
be an n x m matrix of process Lipschitz operators with non-random Lipschitz
constant K. Let Нг e D, 1 < i < n (cadlag, adapted). //?™=1 ||-?а||/г» < е,
for e > 0 sufficiently small, then the flow of the solution of
Xt=x+Ht+ f
Jo
is weakly injective.
17
Proof. Let x, у & M.n, and let Xх, Xy denote the solutions of the above equa-
equation with initial conditions x, y, respectively. Let и = x - y, U = Xх - Xy,
and V = F(XX)_ - F{Xy)-. Then V e L and |V| < K\U-\. Also,
Ut = u+ f VsdZs.
Jo
Therefore AUS = J2a V?&Zf and moreover (using the Einstein convention
to leave the summations implicit)
<\\\Us-\\
if e is small enough. Consequently \\US\\ > |||C/S_||. Define T = infjt > 0 :
Ut- = 0}. Then C/t_ ^Oon [0,T) and the above implies Ut ф 0 on [0,T) as
well. Using Ito's formula for f(x) = log \\x\\, as in the proof of Theorem 41 we
have
log ||?/t||-log|H (**)
yi,ayi,f3
л jji yi,a i ft yi,ayi,f3
~ / IIГ7 II2 s 9 / ИГ/ Ц2 dlZ '
Jo \\Us-\\ ? Jo \\Us-\\
ft jji jji y
-Jo \\u.-
ft jji jji yi,ayj,P
0<s<t "Us
For s fixed, let
17 We are using vector and matrix notation, and the Einstein convention on sums.
The Einstein convention is used throughout this section.

332 V Stochastic Differential Equations
Js = {log\\U.\\ -log||C/s_|| - -p=
\\us-\\
so that the last sum on the right side of equation (**) can be written
Eo<s<t Js- ВУ Taylor's Theorem
d2f
2 —J—(x)\
OXOX
where f(x) = log||a;||, and Is denotes the segment with extremities Us and
?/,_. Since
1
and since ||AC/S|| < ^||C/S_||, we deduce
which in turn implies
\Js\<C\\Us-f\\AZs
Js" \\ut.p-
Therefore
0<s<t 0<s<t а
on [О, Г). Returning to (**), as t increases to T, the left side tends to — oo on
{T < oo} and the right side remains finite. Therefore P(T < oo) = 0, and U
and U- never vanish, which proves the theorem. D
Theorem 61. Let (^a)i<a<m be semirnartingales, Zff = 0, F an n x m
matrix of process Lipschitz operators with a non-random Lipschitz constant,
and Hl G D, 1 < г < п. /f E™=1 Ц^Цд00 < e for e >0 sufficiently small,
then for r ? Ш there exist uniformly locally bounded predictable processes
Ha(x,y) and Ka<P(x,y), which depend onr, such that
where Xх is the solution of
Xt = x + Ht+ f F(X)s_dZs.
Jo
The semimartingale AT is given by

10 Flows as Diffeomorphisms: The General Case 333
Ar(x,y)t = f H?(x,y)dZ? + f
Jo Jo
where Jt = Ylo<s<t ^s' aru^ w^ere ^s is an adapted process such that \AS\ <
Cr(AZ«J.
Proof. Fix x, у e Rn, x ф y, and let U = Xх - Xy, V = F{XX)- - F(Xy)-.
By Theorem 60 U is never zero. As in the proof of Theorem 43, by Ito's
formula,
\\Uy\\r = \\x - y\\r + f r\\Us-\\r
Jo
+ \ [\{(г-2)\\и^\
& Jo
0<s<t
Let Ls denote the summands of the last sum on the right side of the above
equation. If g(x) = \\x\\r, then
\LS\<C\\AUS\\
where Is is the segment with boundary Us and C/s_. However,
да л ¦ n
II = r\(r — 2)l|a;||'— xx¦ + 5*||o;llr~ I
which is less than Cr||a;||r~2. However as in the proof of Theorem 60, we
have ||Af/s|| < |||?78_||, which implies (for all r positive or negative) that
||я||г < Сг||^8-1Г for all x between Us- and Us. Hence (the constant С
changes)
<C\\Us4r\\AZs\\2.
Therefore, let As = ||C/s_||~rI/s, and we have
\A.\< E
0<s<t 0<s<t
an absolutely convergent series with a bound independent of (x,y). To com-
complete the proof it suffices to take
and

334 V Stochastic Differential Equations
as in the proof of Theorem 43. Note that l{c/s_^o} is indistinguishable from
the zero process by weak injectivity (Theorem 60). These choices for Ha and
Ka'$ are easily seen to work by observing that Щ = Z)™=i /o K*'Q?^?> ancl
the preceding allows us to conclude that
\\Ut\\r = \\x-y\\r+ f \\Us_\\4Ar(x,y)s,
Jo
and the result follows. D
Theorem 62. Let (Za)i<a<m be semimartingales, Z§ = 0, F an n x m
matrix of process Lipschitz operators with a non-random Lipschitz constant,
and W e D, 1 < i < n. Let X = X(t,w, x) be the solution of
[
Jo
F(X)s_dZs. (*)
If 51Г=1 Il-Z^lltf00 < ? for ? > 0 sufficiently small, then for each N < oo and
almost all ш
lim inf ||X(s,o;,a;)|| = oo.
||ж||—юо s<N
Proof. The proof is essentially the same as that of Theorem 45, so we only
sketch it. For x ф 0 let Yx = \\XX - X0^1, which is well-defined by weak
injectivity (Theorem 60). Then
\yx\ = WxW
\YX-Yy\ < ||ХХ
by Theorem 61, where Лг(х, у) is as defined in Theorem 61. Set Y°° = 0.
Since H^H/foe < e each Za has jumps bounded by e, and the process Jt
defined in Theorem 61 also has jumps bounded by Cre2. Therefore we can
stop the processes Лг(х,у) at an appropriately chosen sequence of stopping
times (Tg)t>\ increasing to oo a.s. such that each Лг(х, у) & S(e) for a given e,
and for each ?, uniformly in (x, y). However if Z is a semimartingale in S(e),
then since ?(Ar(x,y)) satisfies the equation Ut = 1 + /0 Us^dAr(x,y)s, by
Lemma 2 of Sect. 3 of this chapter we have
\\S(Ar(x,y))\\sp <C(p,z)<<x,

10 Flows as Diffeomorphisms: The General Case 335
where C(p, z) is a constant depending on p and z = ||Лг(а;, у)\\н°° < ке: the
bound for ?, provided of course that e is sufficiently small. We conclude that
for these Tg there exist constants Ce such that
wherep > n, and where d is the distance on Rn\{0} given by d(x, y) = K,.^),,
Set
0 < ||ж|| < oo,
Y°° = 0,
Then ||(Yx - Yv)T'\\%, < C*\\x - y\\P on Rn, and by Kolmogorov's Lemma
(Theorem 72 of Chap. IV) we conclude that Итцх||_>о Yx exists and it is zero.
Since (У*) = ||XxHx|1 - X°\\, the result follows. D
If (p is the flow of the solution of (*), Theorem 62 shows that
lira \\(p(x)\\ = +oo,
\\x\\-* oo
and the first condition in Hadamard's Theorem (Theorem 59) is satisfied.
Theorem 63 allows us to determine when the second condition in Hadamard's
Theorem is also satisfied (see Theorem 64), but it has an independent interest.
First, however, some preliminaries are needed.
For given n, let Z be an n x n matrix of given semimartingales. Recall that
X = ?{Z) denotes the (matrix-valued) exponential of Z, and that ?R(Z) de-
denotes the (matrix-valued) right stochastic exponential of Z, which was defined
in Sect. 8, following Theorem 47.
Recall that in Theorem 48 we showed that if Z is an n x n matrix of
continuous semimartingales with Zq = 0, then ?{Z)?R{—Z + [Z, Z]) = /, or
equivalently ?{-Z + [Z, Z])?R{Z) — I. The general case is more delicate.
Theorem 63. Let Z be an n x n matrix of semimartingales with Zq = 0.
Suppose that Wt = -Zt + [Z,Z]ct + E0<s<t(/ + Д^ГЧД^J is a well-
defined semimartingale. Then
?(W)t?R(Z)t = I
for all t > 0.
Proof Let U = ?(W), V = ?R(Z), and Jt = Eo<s<t(/ +
Then,
dU = U-(-dZ + d[Z, Z)c + dJ),
dV = (dZ)V-,
and therefore

336 V Stochastic Differential Equations
= U-(-dZ + d[Z, Z]c
d[U,V] = U-d[W,
= -U-d[Z, Z]V- + U-d[J, Z]V-,
since d[Z, [Z, Z]c] = 0. By Theorem 47
d(UV) = U_dV + (dU)V- + d[U, V];
using the above calculations several terms cancel, yielding
d(UV) = U_d[Z,Z}cV_ + U-(dJ)V. - U-d[Z,Z]V- + U.d[J,Z]V-.
Since [Z, Z]t = [Z, Z\4 + E0<s<t(A^J, and
0<s<t 0<s<t
the preceding becomes
d(UV) - -U_
Since
(AZSJ = ((AZSJ + (AZSK)(I + AZs)-\
the above equation implies d(UV)t = 0, all t > 0. However UoVo = I, and
therefore UtVt = I, all t > 0. П
Corollary. Let Z be a square matrix of semimartingales. If ||Z||#oo < e for
г > 0 sufficiently small, then ?R(Z)t is invertible for all t > 0.
Proof. If || Z\\h°° < ? then the jumps of Z are bounded by e > 0 and therefore
the process W of Theorem 63 is a well-defined semimartingale. D
Theorem 64. Let (^a)i<a<m be semimartingales with Zo = 0, and let f be
a matrix of coefficients satisfying Hypotheses (H2) and (H3). Let X be the
unique solution of
Xt = x+ f f(X.-)dZe. (*)
Jo
The Jacobian matrix Dlk{t,u),x) = -^-Х1^,и),х) is invertible for each t > 0
provided \\Z\\h°° < ? for sufficiently small e > 0.

10 Flows as Diffeomorphisms: The General Case 337
Proof. By Theorem 39 (in Sect. 7) the Jacobian matrix D satisfies the right
stochastic exponential equation
and the matrix semimartingale differential -g^(Xs-)dZ? satisfies the hy-
hypotheses of the corollary of Theorem 63, whence the result. ?
Before stating the principal result of this section, we need to define two
subsets of Km; recall that under Hypotheses (HI), (H2), and (H3), that Z =
{Za)i<a<m is a given m-tuple of semimartingales and that f(x) = {fa(x)) is
an n x m matrix of C°° functions. Let
P = {z e Km : H(x) = x + f(x)z is a diffeomorphism of K™}
X ={z?Rm : H{x) = x + f(x)z is injective in K™}.
Clearly PCI.
Theorem 65. Let Z and f be as given in Hypotheses (HI), (H2), and (H3),
and let X be the solution of
Xt=x+ f f(X,-)dZa. (*)
Jo
The flow of X is a.s. a diffeomorphism of K™ (resp. trajectories of X from
different initial points a.s. never meet) for all t if and only if all the jumps of
Z belong to P (resp. all the jumps of Z belong to X).
Proof. Recall the processes Х^,.(х,ш) and X™+ (x,u>) defined in Theorem 57,
and the linkage operator W (x) = x + f(x)AZx:i, defined immediately preced-
preceding Theorem 57. By hypothesis the linkage operators H^(x) are clearly diffeo-
diffeomorphisms of K™ (resp. injective), and by Theorems 62 and 64, Hadamard's
conditions are satisfied (Theorem 59), and therefore the functions x н->
Xj,.(x,(jj) and x н-> X™+1(x,w) are diffeomorphisms of K™ if e > 0 is taken
small enough in the definition of the stopping times (Tj)j>i, which it is always
possible to do. Therefore by Theorem 58 the flow ip : x —> Xt(x,w) is a.s. a
diffeomorphism of K™ for each t > 0.
The necessity is perhaps the more surprising part of the theorem. First
observe that by Hadamard's Theorem (Theorem 59) the set P contains a
neighborhood of the origin. Indeed, if z is small enough and x is large enough
then ||/(x)z|| < ||ж||/2 since / is Lipschitz, which implies that ||Д"(х)|| > ||х||/2
and thus condition (i) of Hadamard's Theorem is satisfied. On the other hand
H'(x) — I + f'(x)Z is invertible for all x for ]|z|| small enough because f'(x)
is bounded; therefore condition (ii) of Hadamard's Theorem is satisfied. Since
f(x) is C°° (Hypothesis (H3)), we conclude that P contains a neighborhood
of the origin.

338 V Stochastic Differential Equations
To prove necessity, set
A = {u : 3 5 > 0 with AZs{u) G Vе},
Г2 = {uj : 3 5 > 0 with AZs{uj) e Iе}.
Suppose -P(A) > 0. Since V contains a neighborhood of the origin, there
exists an e > 0 such that all the jumps of Z less than e are in V. We also take
? so small that all the functions x н-> X^ (x) are diffeomorphisms as soon as
the linkage operators Hk are, all к < i.
Since the jumps of Z smaller than e are in V, the jumps of Z that are in
Vе must take place at the times Tj. Let
A> = {u : AZTi e P, all i < j, and AZTj e X>c}.
Since -P(-Ti) > 0, there must exist a j such that P(Aj) > 0. Then for w G Aj,
x t-> X7v_(a:,a;) is a diffeomorphism, but H^(x,uj) is not a diffeomorphism.
Let u)q G Л^ and io = Tj(u)o). Then а; н-> Xt0(x,u)o) is not a diffeomorphism,
and therefore
P{ui : 3 t such that Z —> Xt(x,w) is not a diffeomorphism} > 0,
and we are done. The proof of the necessity of the jumps belonging to I to
have injectivity is analogous. D
Corollary. Let Z and / be as given in Hypotheses (HI), (H2), and (H3), and
let X be the solution of
Xt = x+ f{Xs_)dZs.
Jo
Then different trajectories of X can meet only at the jumps of Z.
Proof. We saw in the proof of Theorem 65 that two trajectories can intersect
only at the times Tj that slice the semimartingales Za into pieces of H°°
norm less than e. If the Za do not jump at Tj0 for some jo, however, and
paths of X intersect there, one need only slightly alter the construction of Tj0
(cf., the proof of Theorem 5, where the times Tj were constructed), so that
Tj0 is not included in another sequence that ?-slices {Za)i<a<m, to achieve
a contradiction. (Note that if, however, (Za)i<a<m has a large jump at Tj0,
then it cannot be altered.) D
11 Eclectic Useful Results on Stochastic Differential
Equations
We begin this collection of mostly technical results concerning stochastic dif-
differential equations with some useful moment estimates. And we begin the
moment estimates with preliminary estimates for stochastic integrals. The
first result is trivial, and the second is almost as simple.

11 Eclectic Useful Results on Stochastic Differential Equations 339
Lemma. For any predictable (matrix-valued) process H and for any p > 1,
and for 0 < t < 1, we have
-E{sup
s<t
f Hsds } < f E{\Hs\p}ds.
Jo Jo
Proof. Since 0 < t < 1, we have that ds on [0,1] is a probability measure,
and since f(x) = xp is convex for p > 1, we can use Jensen's inequality. The
result then follows from Fubini's Theorem. ?
Lemma. For any predictable (matrix-valued) process H and multidimen-
multidimensional Brownian motion B, and for any p > 2, and for 0 < t < 1, there exists
a constant cp depending only on p such that
up / HsdBs }<cp f E{\Hs\p}ds.
<t [Jo Jo
E{sup / HsdBs
s<t \Jo
Proof. The proof for the case p = 2 is simply Doob's inequality; we give it for
the one dimensional case.
?{sup( ГHsdBsJ} < 4E{\H-B,H-B]t} - 4E{ f H2sds} = 4 / E{H2s}ds,
s<t Jo Jo Jo
where the last equality is by Fubini's Theorem. For the case p > 2, we use
Burkholder's inequality (see Theorem 48 of Chap. IV,
?{sup
s<t
f HsdBs } < cpE{{H -B,H- Bft'2} <cp f E{\Hs\p}ds
Jo Jo
where the last inequality follows from Jensen's inequality (recall that p > 2
so that f(x) = xpl2 is a convex function, and since 0 < t < 1 we have used
that ds on [0,1] is a probability measure) and Fubini's Theorem. ?
Theorem 66. Let Z be a d-dimensional Levy process with E{\\ Zt \\lp} < со
forO < t < 1, H a predictable (matrix-valued) process and p > 2. Then then-
exists a finite constant Kv such that for 0 < t < 1 we have
If HsdZs }<KP f E{\Hs\p}ds.
\Jo Jo
E{sup
s<t \JO
Proof We give the proof for the one dimensional case.18 Since the Levy pro-
process Z can be decomposed Zt — bt + aBt + Mt, where M is a purely dis-
discontinuous martingale (that is, M is orthogonal in the L2 sense to the stable
subspace generated by continuous L2 martingales), it is enough to prove the
inequality separately for Zt = bt, and Zt = crBt, and Zt = Mt. The preceding
two lemmas prove the first two cases, so we give the proof of only the third.
18 An alternative (and simpler) proof using random measures can be found in [105].

340 V Stochastic Differential Equations
In the computations below, the constants Cp and the functions Kp(-) vary
from line to line. Choose the rational number к such that 2k < p < 2k+l,
Applying the Burkholder-Gundy inequalities19 for p > 2 we have
E{
f
Jo
HsdMs
} < (CP)pE{
Set
/ \Hs\2d[M,M]t
Jo
= / \x\2vM{dx) < со.
Since [M, M] is also a Levy process, we have that [M, M]t — olm^ is also a
martingale. Therefore the inequality above becomes
E{
f
Jo
} <CPE{
\Hs\2d([M,M]s-aMs)
p/2
f\Hs\4s
Jo
p/2
}¦
We apply a Burkholder-Gundy inequality again to the first term on the right
side above to obtain
E{
H8dMs
¦K,
,(«)[/
Hs\4s}.
We continue recursively to get
E{\ HsdMs
s<t
)E{ f
\Hs\4s}.
Next we use the fact that, for any sequence a such that ||a||;« is finite,
\\a\\p < ЦоЦ/, for 1 < q < 2. As 1 < p2~k < 2 we get
s<t
s<t
s<t
whence
19 The Davis inequality is for the (important) case p = 1 only.

11 Eclectic Useful Results on Stochastic Differential Equations 341
s<t
s<t
Note that J2s<t \AMS\P is an increasing, adapted, cadlag process, and its
(non-random) compensator is At = t j \x\pi/M(dx), which is finite by hypoth-
hypothesis. Since \H\P is a predictable process,
Jo
r\p - As
r<s
is a martingale with zero expectation. Therefore we have
E{\ J* HsdMs\p} <{CP J \x\puM{dx) + Kp(t) f^(J i
¦E{f\Hs\pds}
Jo
It remains to show that, for any 1 < г < к,
(J \xf uM{dx)f2~l < (J \x\2yM{dx)Y'2 + J \x\piyM(dx).
Let Am := / \x\2VM{dx), so that
liM{dx) :- -—\x\2vM{dx)
лм
is a probability measure. Denote 2* by q. One has to show
*Mq(J N9~ Wc*r))p/e < \%2 + XmJ \x\p-2m(dx).
If
the preceding inequality is obvious. On the other hand, if
then it is sufficient to prove that
X^9~\J\x\9-2fiM{dx)r/9 <J\x\p-2m(dx).
But the bound on Am and Jensen's inequality give the result.
?

342 V Stochastic Differential Equations
Theorem 66 can be used to prove many different results concerning solu-
solutions of stochastic differential equations driven by Levy processes. We give two
examples. (These types of results are true more extensively (for example in n
dimensions) with similar proofs; see for example [105].) Our first example is a
moment estimate. As a convenience, we continue to work on the time interval
[0,1].
Theorem 67. Let a be continuously differentiable with a bounded derivative,
and let Z be a Levy process with Levy measure v such that J,x,>1 \x\pis(dx) <
oo, which is equivalent to E{\Zt\p} < oo for p > 2 and for all t, 0 < t < 1.
Let Xх denote the unique solution of the stochastic differential equation
dXt = a{Xt_)dZu Xo = x. (*)
Then we have
E{suP\Xx\p}<K(l + \x\p)
s
where К is of course a positive constant.
Before proving this theorem we need to recall Gronwall's inequality.20
Theorem 68 (Gronwall's Inequality). Let a be a function from K+ to
itself, and suppose
a(s) < c + к a(r)dr < oo
Jo
forO < 5 < t. Thena(t) < cekt. Moreover if с = 0 thena vanishes identically.
The proof is simple and usually taught in elementary differential equations
courses.
Proof of Theorem 67. This is a consequence of Theorem 66 and Gronwall's
inequality. ?
The next corollary follows easily from Theorem 67.
Theorem 69. Let a and Z be as in Theorem, 67. Let g be continuous and for
q > 0 define
|| g \\q= mi{a > 0 : \g(x)\ < a(l + \x\*)}.
Forq = 0 we let \\ g ||o be the L°° norm. If Pt denotes the transition semigroup
of the Markov process Xх of equation (*), then we have for q > 0
|| Ptg \\q< К || g ||g .
20 Extensions of this classic version of Gronwall's inequality axe in the exercises for
this chapter. See also [182].

11 Eclectic Useful Results on Stochastic Differential Equations 343
We now turn to our second example, which consists of moment estimates
on the flows of the solution Xх of equation (*). We first establish some nota-
notation. For a Levy process Z with Levy measure v and with finite p-th moment
for p > 2, we decompose it as Zt = bt + cBt + Mt where В is a standard Brow-
nian motion and M is a martingale independent from В and 1? orthogonal
to all continuous martingales. We then define
which is finite since by assumption i?{|Zt|p} < oo and p > 2. Here we consider
the multidimensional case: both Z and the solution Xх of equation (*) are
assumed to be d-dimensional, where d > I.21 We let X* ' represent a A;-th
derivative of the flow of Xх.
Theorem 70. For к > 1 assume that a is differentiable к times and that
all partial derivatives of a of order greater than or equal to one are bounded.
Assume also that Z has moments of order kp with p > 2. With the nota-
notation defined right before this theorem, we have that there exists a constant
K(k,p,a,T]p) such that for 0 < s <1,
W\P} < K.
Proof. We recall from Theorem 39 that Xх together with XX)^ constitute
the unique solution of the system of equations
as long as к > 2, and with a slightly simpler formulation if At = 1. (Note
that this is the same equation (D) as on page 305.) We can write the second
equation above in a slightly more abstract way
f
i=2Jo
if к > 2; and if к = 1 the equation is the simpler
21 One could also allow the dimensions of Z and Xх to be different, but we do not
address that case here.

344 V Stochastic Differential Equations
f
Jo
We next observe that the components of Fk^x^,... ,x^k г+1)) are sums of
terms of the form
6*5
(**)
, and an "empty" product
k-i+1 ocj
3 = 1 r=l
and where x^'1 is the Z-th component of a;
equals 1.
We thus want now to prove that
E{sup
for all i — 2,..., N and for some constant К = К(к,р, а, щр) (in the remain-
remainder of the proof К = К(к,р,сг,г)кр) varies from line to line). And of course,
it is enough to prove that if G is any monomial as in (**), then
We will use Theorem 66 and the fact that W is bounded. Note that Theo-
Theorem 66 implies that
s<t
[SHsdZs\P'}<K(p,r,p) f E{\Hs\P'}ds
Jo JO
for 2 < p' < p, where the constant K{p,r)p) depends only on p, r)p, and the
dimensions of H and Z. For this we see that the left side of the previous
equation is smaller than
k+l-i
k+l-i
К Е{
by Holder's inequality, since ^ jatj = k. The recurrence assumption yields
that each expectation above is smaller than some constant K(p, k, a, T]kp), and
we obtain the required inequality. D
We now turn to the positivity of solutions of stochastic differential equa-
equations. This issue arises often in applications (for example in mathematical
finance) when one wants to model a dynamic phenomenon where for reasons
relating to the application, the solution should always remain positive. We
begin with a very simple result, already in widespread use.

11 Eclectic Useful Results on Stochastic Differential Equations 345
Theorem 71. Let Z be a continuous semimartingale, a a continuous func-
function, and suppose there exists a (path-by-path) unique, non-exploding solution
to the equation
Xt=X0+ I a(Xs)XsdZ
f e{Xs)Xs
Jo
with Xo > 0 almost surely. Let T = ini{t > 0 : Xt = 0}. Then P(T < oo) = 0.
In words, if the coefficient of the equation is of the form xa(x) with cr(x)
continuous on [0, oo), then the solution X stays strictly positive in finite time
if it begins with a strictly positive initial condition. We also remark that in
the applied literature the equation is often written in the form
Note that if a(x) is any function, one can then write a(x) = x(p*p-), but
we then need the new function ^^p- to be continuous, well-defined for all
nonnegative x, and we also need for a unique solution to exist that has no
explosions. This of course amounts to a restriction on the function a. Since
xa(x) being Lipschitz continuous is a sufficient condition to have a unique
and non-exploding solution, we can require in turn for a to be both bounded
and Lipschitz, which of course implies that xcr(x) is itself Lipschitz.
Proof of Theorem 71. Define Tn = mi{t > 0 : Xt - \jn or Xt = Xo V n},
and note that P(Tn > 0) = 1 because P(Xq > 0) = 1 and Z is continuous.
Using Ito's formula up to time Tn we have
ln(\XTn\) = ln(X0)+ a(Xs)dZs-- a(XsJd[Z,Z}s,
Jo z Jo
and since a is assumed continuous, it is bounded on the compact set [1/n, n],
say by a constant с Since the stopping times Tn | T we see that on the event
{T < oo} the left side of the above equation tends to oo while the right side
remains finite, a contradiction. Therefore P(T < oo) = 0. ?
Theorem 71 has an analogue when Z is no longer assumed to be continuous.
Theorem 72. Let Z be a semimartingale and let a be a bounded, continuous
function such that there exists a unique non-exploding solution of
Xt = Xq +
Jo
with Xq > 0 almost surely. If in addition we have \AZS\ < ,Ут,е almost surely
for all s > 0 for some г with 0 < г < 1, then the solution X stays positive
a.s. for all t > 0.

346 V Stochastic Differential Equations
Proof. || a 11oo refers to the L°° norm of a. The proof is similar to the proof
of Theorem 71. Using Ito's formula gives
- \ f
* Jo
s<t
and let us consider the last term, the summation. We have
|Xs|-ln|Xs_| -a(Xs_)AZs}
s<t
= -У
where |i?s| < 1 by the Mean Value Theorem, applied и>-Ъу-ш, which implies
the above is
s<t
which is a convergent series for each finite time t. The only ways X can be
zero or negative is to cross 0 while continuous, which cannot happen by an
argument analogous to the proof of Theorem 71, or to jump to 0 or across into
negative values. Assume X has not yet reached (—oo, 0] at time 5—. Then we
have
\AXS\ < Xs.\a(Xs_)AZs\ < Xa.(l - e) < Xs_,
which implies that X cannot cross into (—oo,0] by jumping. Hence we have
that X must be positive on [0, oo). ?
We now turn to something completely different. In Sect. 6, Theorem 32,
we showed that an equation of the form
d Л
with Z a vector of independent Levy processes, had a (unique) solution which
was strong Markov. We now show what amounts to a converse, which is per-
perhaps surprising. That is, if one wants a solution of a stochastic differential
equation to be time homogeneous Markov, one essentially must have that the
driving semimartingale is a Levy process!

Bibliographic Notes 347
Theorem 73. Let (fi, T, F, P) be a filtered probability space with Z a cadlag
semimartingale defined on the space. Let f be a Borel function which is never
0 and is such that for every x el the equation
dXt = f{Xt-)dZt, Xo = x
has unique (strong) solution, which we denote Xх. If each of the processes Xх
is a time homogeneous Markov process with the same transition semigroup,
then Z is a Levy process.
For the proof of this theorem, we assume the reader is familiar with the
abstract theory of Markov processes, as can be found (for example) in [215].
Proof. Let fi' be the function space of cadlag functions defined on M+ with
Xt(w) = w(t), the projection operator. Let ?' be the canonical filtration and
let F't) be the shift operators. If P'x denotes the law of Xх, then we have that
(Sl',F',6't,X',P?) is a Markov process in the Dynkin (or Blumenthal-Getoor)
sense. Since / is never zero, we can write
f
Jo
f
Jo
We can therefore define on $1', relative to each law Px, the stochastic integral
z't= f f(X's_)-4x's,
Jo
and we have that the law of Z' under Px is the law of the process Z — Zo.
However Z' is also an additive functional, and hence the Markov property
implies
Z[)\T'} = E'x{g(Z's) о o't\Ft}
where we have also used that the law of Z' under Px is the law of the process
Z — Zo. This in turn implies that Z't+S — Z't is P'x independent of T't, and
hence also independent from Z'r for all r <t, and it also implies that the law
of Z'tJrS — Z't is the same as the law of Zs — Zq. Using the identity of the laws of
Z' under Px with that of the process Z — Zq once again, the result follows. ?
Bibliographic Notes
The extension of the i?p norm from martingales to semimartingales was im-
implicit in Protter [197] and first formally proposed by Emery [64]. A compre-
comprehensive account of this important norm for semimartingales can be found in

348 V Stochastic Differential Equations
Dellacherie-Meyer [46]. Emery's inequalities (Theorem 3) were first established
in Emery [64], and later extended by Meyer [174].
Existence and uniqueness of solutions of stochastic differential equations
driven by general semimartingales was first established by Doleans-Dade [51]
and Protter [198], building on the result for continuous semimartingales in
Protter [197]. Before this Kazamaki [123] published a preliminary result, and
of course the literature on stochastic differential equations driven by Brownian
motion and Lebesgue measure, as well as Poisson processes, was extensive. See,
for example, the book of Gihman-Skorohod [81] in this regard. These results
were improved and simplified by Doleans-Dade-Meyer [54] and Emery [65];
our approach is inspired by Emery [65]. Metiver-Pellaumail [161] have an
alternative approach. See also Metivier [158]. Other treatments can be found
in Doleans-Dade [52] and Jacod [103]. The approach of L. Schwartz [214] is
nicely presented in [44].
The stability theory is due to Protter [199], Emery [65], and also to
Metivier-Pellaumail [162]. The semimartingale topology is due to Emery [66]
and Metivier-Pellaumail [162], while a pedagogic treatment is in Dellacherie-
Meyer [46].
The generalization of Fisk-Stratonovich integrals to semimartingales is
due to Meyer [171]. The treatment here of Fisk-Stratonovich differential
equations is new. The idea of quadratic variation is due to Wiener [230].
Theorem 18, which is a random Ito's formula, appears in this form for
the first time. It has an antecedent in Doss-Lenglart [58], and for a very
general version (containing some quite interesting consequences), see Sznit-
man [222]. Theorem 19 generalizes a result of Meyer [171], and Theorem 22
extends a result of Doss-Lenglart [58]. Theorem 24 and its corollary is from
Ito [99]. Theorem 25 is inspired by the work of Doss [57] (see also Ikeda-
Watanabe [92] and Sussman [221]). The treatment of approximations of the
Fisk-Stratonovich integrals was inspired by Yor [238]. For an interesting ap-
application see Rootzen [211]. For more examples of solutions of stochastic dif-
differential equations with formulas, see [128].
The results of Sect. 6 are taken from Protter [196] and Cinlar-Jacod-
Protter-Sharpe [34]. A comprehensive pedagogic treatment when the Markov
solutions are diffusions can be found in either Stroock-Varadhan [220] or
Rogers-Williams [209, 210].
Work on flows of stochastic differential equations goes back to 1961 and the
work of Blagovescenskii-Freidlin [18] who considered the Brownian case. For
recent work on flows of stochastic differential equations, see Kunita [131, 133],
Ikeda-Watanabe [92] and the references therein. There are also flows results for
the Brownian case in Gihman-Skorohod [81], but they are L2 rather than al-
almost sure results. Much of our treatment is inspired by the work of Meyer [177]
and that of Uppman [224, 225] for the continuous case, however results are
taken from other articles as well. For example, the example following The-
Theorem 38 is due to Leandre [140], while the proof of Theorem 41, the non-
confluence of solutions in the continuous case, is due to Emery [68]; an alter-

Exercises for Chapter V 349
native proof is in Uppman [225]. For the general (right continuous) case, we
follow the work of Leandre [141]. A similar result was obtained by Fujiwara-
Kunita [78].
Our presentation on moment estimates follows [205] and [105], although it
dates back to [16]. A perhaps simpler treatment using random measures can
be found in [105]. See also [106]. The results on when solutions of stochastic
differential equations are always positive is well known, at least in the Brown-
ian case, but a reference is not easily found in the literature. The general case
may be new. Theorem 73 is from [108].
Exercises for Chapter V
Exercise 1. Let X be a solution of the stochastic differential equation
dXt = X?dBt + Xfdt
with initial condition Xo ф 0. Show that X is defined on [0, T), where T =
inf{t > 0 : Bt = 1/-Xo}. Moreover show that X is the unique solution on
[0,T). Finally show that X explodes at T.
Exercise 2. Let cr(x) = \f\x~\- Note that a is Lipschitz continuous everywhere
except at x = 0. Let X be a solution of the equation
dXt = a(Xt)dBt + ^dt
with initial condition Xo. Find an explicit solution of this equation and de-
determine where it is defined.
Exercise 3. Let X be the unique solution of the equation
dXt = a(Xt)dBt + b(Xt)dt
with Xo given, and a and b both Lipschitz continuous. Let the generator of
the (time homogeneous) Markov process X be given by
Af(x)= lim */(*)-/(*>
J v ; t-»o,t>o t
where Ptf(x) equals the transition function22 Ps(Xo,f) when Xo = x almost
surely. A is called the infinitesimal generator of X. Use Ito's formula to
show that if / has two bounded continuous derivatives (written / e C2), then
the limit exists and A is denned. In this case / is said to be in the domain of
the generator A.
This notation is defined in Sect. 5 of Chap. I on page 35.

350 Exercises for Chapter V
Exercise 4. (Continuation of Exercise 3.) With the notation and terminology
of Exercise 3, show that the infinitesimal generator of X is a linear partial
differential operator for functions / G C\ given by
Exercise 5. Let / 6 C|, let a and b be Lipschitz continuous, let X be the
unique solution of the stochastic integral equation
Xt=X0+ f a(Xs)dBs + f b(Xs)ds, (*)
Jo Jo
and let A denote its infinitesimal generator. If Xo = x, where x ? M, prove
Dynkin's expectation formula for Markov processes in this case:
E{f(Xt) - /(Xo) - f Af(Xs)ds} = 0. (**)
Jo
Exercise 6. As a continuation of Exercise 5, show that
(Ptf)(y) ~ f(y) - [\PsAf)(y)ds = 0
Jo
for у e M, and that limsj0 PsAf = Af for (for example) all / in C°° with
compact support.
Exercise 7. As in Exercise 5, extend Dynkin's expectation formula by show-
showing that it still holds if one replaces in equation (**) the integration upper
limit t with a bounded stopping time T, and use this to verify Dynkin's for-
formula in this case. That is, show that if / e C\ and a and b are both Lipschitz,
and X is a solution of equation (*) of Exercise 5, then
fT
f(Xr) — f(Xo) — / Af(Xs)ds is a local martingale.
Jo
Exercise 8. Let Zbea Levy process with Levy measure v and let X satisfy
the equation
Xt=x+ [ f(Xe.)dZe
Jo
where / is Lipschitz continuous. The process X is taken to be d-dimensional,
and Z is n-dimensional, so / takes its values in Rd x Mn. We use (column)
vector and matrix notation. The initial value is some given x ? M.d. Let Px
denote the corresponding law of X starting at x. Show that if g € C°° with
compact support on Rd, then

Exercises for Chapter V 351
Ag(x) =\ ? (^
J
u{dy) {g(x + f(x)y) - g(x) - Vg(x)f(x)),
where Vy is a row vector and b is the drift coefficient of Z, * denotes transpose,
and A is of course the infinitesimal generator of X.
Exercise 9. In the setting of Exercise 8 show that
Co) - / Ag(X,
Jo
g{Xt) - g(X0) — I Ag(Xs)ds is a local martingale
Jo
for each law Px. Show further that it is a local martingale for initial laws
of the form P^, where P^ is given by: for Л с П measurable, Р^(Л) =
JRd Px(A)^(dx) for any probability law /i on Md.
Exercise 10. Let Z be an arbitrary semimartingale, and let To = 0 and
(Tn)n>i denote the increasing sequence of stopping times of the jump times of
Z when the corresponding jump size is larger than or equal to 1 in magnitude.
Define en = sign(AZ<rn) with во = 1- For a semimartingale Y define Un(Y)t
to be the solution of the exponential equation
for Tn < t < Tn+i. Show that the (unique) solution of the equation
*\Xt.\dZ.
о
is given by Xt = ?„>о X?l[TntTn+l)(t), where X? = XTnUn(enZ)t, for Tn <
t < Tn+i.
Exercise 11. Show that if M is an L2 martingale with AMS > — 1, and
(M, M)oo is bounded, then ?(MHO > 0 a.s. and it is square integrable.
(See [147].)
Exercise 12. Let M be a martingale which is of integrable variation. Suppose
also that the compensator of the process X^s<t I^-Wsl is bounded. Show that
the local martingale ?{M) is also of integrable variation.
Exercise 13. Let Z be a semimartingale with decomposition Z = L + V
where ? is a local martingale with ALS ф -1 for all finite s, and V has paths
of finite variation on compacts. Let
Show that ?{Z) = ?{L)?{U). (See [157].)

352 Exercises for Chapter V
* Exercise 14 (Memin's criterion for exponential martingales). Let
M be a local martingale, let Jt = Y2a<t Д-^Л{|ДМа|>А}! an<l let ^t — Jt~ Jt
and Nt — Mt - Lt. (Both L and N are of course also local martingales.) Let
At = [M,M]c+
0<s<t 0<s<t
and assume that the compensator of A is bounded. Show that this implies
that ?{M) is a uniformly integrable martingale. {Note: This is related to
Kazamaki's and Novikov's criteria, but without the assumption of path con-
continuity. Also see [156].)
Exercise 15 (Gronwall's inequality). . Let Cj be a cadlag increasing pro-
process and suppose 0 < At < а + /0 As~dCs for t > 0. Show that At < aeCi,
each t > 0. (Note: The classical Gronwall inequality is usually stated with
Ct =t. Also see [85].)
The next five exercises outline the Metivier-Pellaumail method for showing
the existence and uniqueness of solutions of stochastic differential equations.
* Exercise 16. Let M be an L2 martingale. Show that for any predictable
stopping time T, E{(E{AMT|J"T-}J} < ?{(M,M)T_}. Show also that for
any totally inaccessible stopping time T, we have Е{(Е{^Мт\Тт-}J} <
E{(M,M)t}- Conclude that for an arbitrary stopping time T we have
Е{{Е{&Мт\?т-}?} < Е{{М,М)т-}.
Exercise 17 (Metivier-Pellaumail inequality). Let M be an L2 mar-
martingale and let M*_ = sups<t \MS\. Show that for any stopping time T we
have
(Hint: Use Exercise 16).
Exercise 18. Let M be a locally square integrable local martingale. Show
that for any stopping time T we have
E{sup(Ms - M0J} < 4?'{(M,M)T_ + [M, M]T_}.
s<T
Exercise 19 (extended Gronwall's inequality). Let A and С be two
increasing processes with A adapted and E{AOO} < oo, and С such that
0 < Coo < к everywhere for some constant k. Suppose that for every stopping
time T and for some positive constant a we have
T-
Then ElAoo} < aek. (Hint: Use Lebesgue's change of time lemma, together
with Gronwall's inequality of Exercise 15. Also see [110, page 576].)

Exercises for Chapter V 353
Exercise 20. Use Exercises 16 through 19 as needed to show, via a Picard
iteration method, that a unique solution exists to the stochastic differential
equation
dXt = f(X,-)dZa
where / is Lipschitz continuous and Z is an arbitrary semimartingale.
Exercise 21. Show that there are an infinite number of solutions (on the
same probability space) to the equation
= 3 / xhs + 3 / Xi dBs, Xo = 0,
Jo Jo
where Б is a standard Brownian motion. (See [121, page 293].)
*Exercise 22. Let a satisfy a weakening of the Lipschitz condition of the
form \o{y) — &(x)\ < k(\x — y\) for all i, j ? I, with J^ ,K$du = oo and к
increasing with к@) = 0. (Such a condition is a variant of a standard condition
for uniqueness of solutions in the theory of ordinary differential equations; see
for example [35, page 60]. Let b be Lipschitz continuous. Show that there
exists a unique solution of the equation
dXt = b(Xt)dt + a{Xt)dBt
where В is standard Brownian motion. (See [232].)
Exercise 23. Let a and b be Lipschitz continuous and let X be the unique
solution of equation (*) of Exercise 5. Show that 2xb(x) + cr(xJ < с + k\x\2,
where с and k are positive constants. Use Ito's formula on X2 to show the
following moment estimate: E{X2} < (E{X^} + ct)ekt. (See [135].)
*Exercise 24 (Euler method of approximation). Let a and b be con-
continuous and such that a unique path-by-path solution X of equation (*) of
Exercise 5 exists. Let (?fc)fc>i be i.i.d. with mean zero and variance r2. Let
Xo be independent of (?&)*:>i an<i ^ Xk be defined inductively by
Ь(Хк)
n
Define Xn(t) = X[nt] where [nt] denotes the integer part of nt. Let Bn(t) =
7Й ЕЙ &> Vn(t) = If1, and finally observe that
Xn(t)=X{0)+ f a(Xn(s-))dBn(s) + f b{Xn{s-))dVn(s).
Jo Jo
By Donsker's Theorem we know that (Bn,Vn) converges weakly (in distri-
distribution) to (tB, V), where В is a standard Brownian motion and V(t) = t.
Show that Bn is a martingale, and that Xn converges weakly to X. (See [136]
and [137].)

354 Exercises for Chapter V
*Exercise 25. Let Zbea Levy process which is a square integrable martin-
martingale. Let / be continuously differentiable with a bounded derivative and let
Xх be the unique solution of
Xx = x + f f{Xx_)dZs.
Jo
/о
Show that Xх is also a square integrable martingale.
*Exercise 26. In the framework of Exercise 25, let X'tx be the solution of
ft
Y x _ I , / fi/ ух \ у x jy
Jo
Show that X x is also a square integrable martingale.
**Exercise 27. In the framework of Exercises 25 and 26, for each measurable
function g with at most linear growth, set
Ptg(x) = E{g(Xx)} and Qt9(x) = E{g(Xf)X'tx}.
Assume that / is infinitely differentiable, and that g is twice differentiable,
bounded and both of its first two derivatives are bounded. Show that the
function (t,x) i-> Ptg(x) is twice differentiable in x and once differentiable in
t, that all the partial derivatives are continuous in (t,x), and further that
д " ' ~ Qtg'{x).

VI
Expansion of Filtrations
1 Introduction
By an expansion of the filtration, we mean that we enlarge the filtration
(•^t)t>o to get another filtration (Ht)t>o such that the new filtration satisfies
the usual hypotheses and Tt С Ht, each t > 0. There are three questions we
wish to address: A) when does a specific, given semimartingale remain a semi-
martingale in the enlarged filtration; B) when do all semimartingales remain
semimartingales in the enlarged filtration; C) what is a new decomposition of
the semimartingale for the new filtration.
The subject of the expansion of filtrations began with a seminal paper
of K. Ito in 1976 (published in [100] in 1978), when he showed that if В
is a standard Brownian motion, then one can expand the natural filtration
(Jrt)t>o of В by adding the ст-algebra generated by the random variable B\
to all Tt of the filtration, including of course To- He showed that В remains
a semimartingale for the expanded filtration, he calculated its decomposition
explicitly, and he showed that one has the intuitive formula
f HsdBs = f BxHsdBa
Jo Jo
where the integral on the left is computed with the original filtration, and
the integral on the right is computed using the expanded filtration. Obviously
such a result is of interest only for 0 < t < 1. We will establish this formula
more generally for Levy processes in Sect. 2.
The second advance for the theory of the expansion of filtrations was the
1978 paper of M. Barlow [7] where he considered the problem that if L is a
positive random variable, and one expands the filtration in a minimal way to
make L a stopping time, what conditions ensure that semimartingales remain
semimartingales for the expanded filtration? This type of question is called
progressive expansion and it is the topic of Sect. 3.

356 VI Expansion of Filtrations
2 Initial Expansions
Throughout this section we assume given an underlying filtered probability
space (ft,T, (Tt)t><a-,P) which satisfies the usual hypotheses. As in previous
chapters, for convenience we denote the filtration [Tt)t><a by the symbol F.
The most elementary result on the expansion of filtrations is due to Jacod
and was established in Chap. II (Theorem 5). We recall it here.
Theorem 1 (Jacod's Countable Expansion). Let A be a collection of
events in T such that if Aa, Ар е A then Aa П Ар = ф, а ф C. Let 7it be the
filtration generated by Tt and A. Then every ((^)г>о,Р) semimartingale is
an ((Ht)t>o,P) semimartingale also.
We also record a trivial observation as a second elementary theorem.
Theorem 2. Let X be a semimartingale with decomposition X = M + A
and let Q be a a-algebra independent of the local martingale term M. Let Ш
denote the filtration obtained by expanding F with the one a-algebra Q (that
is, Ht = JiV Q, each t > 0 and Ht = Ht+). Then X is апШ semimartingale
with the same decomposition.
Proof. Since the local martingale M remains a local martingale under H, the
theorem follows. D
We now turn to Levy processes and an extension of Ito's first theorem. Let
Z be a given Levy process on our underlying space, and define H = (Ht)t>o
to be the smallest filtration satisfying the usual hypotheses, such that Z\ is
Ho measurable and Tt С Ht for all t > 0.
Theorem 3 (Ito's Theorem for Levy Processes). The Levy process Z
is an Ш semimartingale. If moreover E{\Zt\} < oo, all t > 0, then the process
f
t/\\ 17 17
CIS
1 -s
is an Ш martingale on [0, oo).
Proof. We begin by assuming E{Z?} < oo, each t > 0. Without loss of gener-
generality we can further assume E{Zt} = 0. Since Z has independent increments,
we know Z is an F martingale. Let 0<s<t<lbe rationale with s = j/n
and t = k/n. We set
Then Zx - Zs = YSZj Yi and Zt-Za = Ylilj Yi- The random variables
are i.i.d. and integrable. Therefore

2 Initial Expansions 357
fc-l n-l , . n-1
E{Zt — ZS\Z\ — Zs] —
The independence of the increments of Z yields E{Zt — ZS\HS} — E{Zt —
ZS\ZX - Zs}; therefore E{Zt - ZS\HS) = j5f (^i - Zs) for all rationals, 0 <
s < t < 1. Since Z is an F martingale, the random variables (Zt)o<t<i are
uniformly integrable, and since the paths of Z are right continuous, we deduce
E{Zt - Zs\ns} = jE^{Zi - Za) for all reals, 0 < s < t < 1. By Fubini's
Theorem for conditional expectations the above gives
E{Mt - MS\HS} = E{Zt - ZS\HS} - I --^E{Zl - Zu\Hs}du
Ju
= 0.
There is a potential problem at t = 1 because of the possibility of an explo-
explosion. Indeed this is typical of initial enlargements. However if we can show
E{f0 \Z^ ds} < oo this will suffice to rule out explosions. By the sta-
tionarity and independence of the increments of Z we have E{\Z\ — Zs\} <
E{(Zi — ZsJ}% < a(l — s)i for some constant a and for all s, 0 < s < 1.
Therefore E{f^ |z^lfa|rfs} < af* ::^^ds < oo. Note that if t > 1 then
Tt — Wt, and it follows that M is a martingale. Since Zt = Mt + JQ '~^? ds
we have that Z is a semimartingale.
Next suppose only that -E{|Zt|} < oo, t > 0 instead of Z being in L2 as
we assumed earlier. We define
jl ^~* Л 7 1 A T^ \~* Л 7 1
0<s<t 0<s<t
(Since Z has cadlag paths a.s. for each u> there are at most a finite number of
jumps bigger than a fixed size on each compact time set; hence each Jl is finite
a.s.) By the results of Chap. I on Levy processes we have that Yt = Zt — J^ + J2
is a Levy process with bounded jumps, and hence it is square integrable.
Additionally, the processes Y, J1, and J2 are jointly independent.
Combining these facts with the preceding proof yields
Nt=Yt-
— s
is an H martingale, where H is the smallest right continuous filtration ob-
obtained by expanding F with (Yx,Jl,J2). Note that H С Н'. The same argu-
argument, plus the observation that since J* does not jump at time t = 1 and is

358 VI Expansion of Filtrations
therefore constant in a (random) neighborhood of 1, which in turn yields that
Jo ' \Za ds < oo a.s., shows that (Ji~~f0 \ZSS ds)t>o is a local martingale for
f I 117*7*1
H . Moreover it is a martingale for H as soon as E{f0 \ZS ds} < oo. But
the function t i—> E{ Jt*} = atf for all i by the stationarity of the increments.
Hence
/•1 I Ji Ji /¦! ji ji f
E{ \J^Lds} = m ±_Lids}l = l
Jo i ~~s Jo l ~ s Jo
1 1 - s J
11 _ „
is
ds
о 1~s
— Bj <C 00.
Since У, J1, and J2 are all independent, we conclude that M is an H' martin-
martingale. Since M is adapted to H, by Strieker's Theorem it is also an H martingale,
and thus Z is an H semimartingale.
Finally we drop all integrability assumptions on Z. We let
Jt = У~] ^^s^-{\az3\>i} and also Xt = Zt — j\.
0<s<t
Then X is also a Levy process, and since X has bounded jumps it is in Lp
for all p > 1, and in particular ?{X2} < oo, each t > 0. Let Ш(Х\) denote
F expanded by the adjunction of the random variable Xx. Let К = H(Xi) V
M(Jl), the filtration generated by H(Xi) and H(J11). Then H(Zi) с К. But
X is a semimartingale on Ш(Х\), and since J1 is independent of X we have
by Theorem 2 that X is а (К, Р) semimartingale. Therefore by Strieker's
Theorem, X is an (H, P) semimartingale; and since J\ is a finite variation
process adapted to H we have that Z is an (H, P) semimartingale as well. ?
The most important example of the above theorem is that of Brownian
motion, which was Ito's original formula. In this case let H = H(J3i), and we
have as a special case the H decomposition of Brownian motion:
d / d f ^1 — Bs f Bi — Bs f Bi —
Bt = (Bt- / — ds) + / — ds = Pt+ —
Jo 1-s Jo 1 - s Jo 1 -
B B f B — Bs
s
Note that the martingale /3 in the decomposition is continuous and has
[j3, /3]t — t, which by Levy's theorem gives that /3 is also a Brownian mo-
motion. A simple calculation gives Ito's original formula, for a process H which
is F predictable:
( [ B\~Bs
HsdBs = Si / HsdPs+Вг [ Н3Щ—
Jo Jo L ~~
= / B1Hsdp3+ f ВгНэЦ^
Jo Jo l ~
= [ BxHadBa
Jo
^ds
s

2 Initial Expansions 359
where since the random variable B\ is Ho measurable, it can be moved inside
the stochastic integral. We can extend this theorem with a simple iteration;
we omit the fairly obvious proof.
Corollary. Let Z be a given Levy process with respect to a filtration F, and
let 0 = to < t\ < ¦ ¦ ¦ < tn < oo. Let H denote the smallest filtration satis-
satisfying the usual hypotheses containing F and such that the random variables
Ztl,...,Ztn are all Ho measurable. Then Z is an H semimartingale. If we
have a countable sequence 0 = to < t\ < ¦ • ¦ < tn < ¦ ¦ ¦, we let r = supn tn,
with H the corresponding filtration. Then Z is an H semimartingale on [0, r).
We next give a general criterion (Theorem 5) to have a local martingale
remain a semimartingale in an expanded filtration. (Note that a finite varia-
variation process automatically remains one in the larger filtration, so the whole
issue is what happens to the local martingales.) We then combine this theo-
theorem with a lemma due to Jeulin to show how one can expand the Brownian
filtration. Before we begin let us recall that a process X is locally integrable
if there exist a sequence of stopping times (Tn)n>i increasing to сю a.s. such
that Е{\Хтп1{тп>0}\} < °° f°r eacn n. Of course, if Xq = 0 this reduces to
the condition -Е{|-Хт„|} < сю for each n.
Theorem 4. Let M be an? local martingale and suppose M is a semimartin-
semimartingale in an expanded filtration Ш. Then M is a special semimartingale in H.
Proof. First recall that any local martingale is a special semimartingale. In
particular the process Mt* = sups<t \MS\ is locally integrable (see Theorem 33
of Chap. Ill), and this of course remains locally integrable in the expanded
filtration H, since stopping times remain stopping times in an expanded fil-
filtration. Since M is an H semimartingale by hypothesis, it is special because
Mt* is locally integrable (see Theorem 34 of Chap. III). ?
Theorem 5. Let M be an? local martingale, and let H be predictable such
that f0 Hgd[M,M]s is locally integrable. Suppose Ш is an expansion of? such
that M is an Ш semimartingale. Then M is a special semimartingale in Ш
and let M = N + A denote its canonical decomposition. The stochastic inte-
integral process (Jo HsdMs)t>o is an Ш semimartingale if and only if the process
(f0 HsdAs)t>o exists as a path-by-path Lebesgue-Stieltjes integral a.s.
Proof. First assume that -E{/0°° H%d[M, M]s} < сю, which implies that Я • M
(where H ¦ M denotes the stochastic integral process (Jo HsdMs)t>o) is a
square integrable martingale, and hence by Theorem 4 it is a special semi-
semimartingale in the H filtration. Let M = N + A be the canonical H de-
decomposition of M, and it follows that H ¦ M = H • N + H ¦ A is the
canonical H decomposition of H ¦ M. By the lemma following Theorem 23
of Chap. IV we have E{f™ Hfd[N,N]s} < E{f™ H*d[M,M]s} < сю and
E{$™H2sd[A,A]s} < ?{/0°°H2sd[M,M\s) < сю. This allows us to conclude
that H is (W2,M) integrable, calculated in the H filtration.

360 VI Expansion of Filtrations
To remove the assumption E{f?° H%d[M, M]s} < oo, we only need to
recall that H ¦ M is assumed to be locally square integrable, and thus take M
stopped at a stopping time Tn that makes H ¦ M square integrable, and we
are reduced to the previous case. ?
Theorem 6 (Jeulin's Lemma). Let R be a positive, measurable stochas-
stochastic process. Suppose for almost all s, Rs is independent of Ts; and the law
(or distribution) of Rs is fj, which is independent of s, with //({0}) = 0 and
Jo°° xfi(dx) < oo. If a is a positive predictable process with fQ asds < oo a.s.
for each t, then the two sets below are equal almost surely:
POO POO
{ / Rsasds < oo} = { / asds < сю} a.s.
Jo Jo
Proof. We first show that {/0°° Rsasds < сю} С {/0°° asds < сю} a.s. Let A
be an event with P(A) > 0, and let J = 1a, Jt = E{J\!Ft}, the cadlag version
of the martingale. Let j = inft Jt. Then j > 0 on {J = 1}. We have a.s.
POO
E{lARt\Tt}= E{lAl{Rt>u}\Tt}du. (*)
Jo
Consider next
E{lAl{Rt>u}\Ft} = E{(lA - l{Rt<u))+\Ft) > (E{(lA - 1{Дг<и})|^})+
by Jensen's inequality, and this is equal to (E{lA\!Ft} - м@> и])+> where fj, is
the law of R4. Continuing with (*) we have
> Г(Е{1А\Ъ} - M@, u])+du = Ф{Е{1А\Ъ}) =
Jo
where Ф(х) = J^°(x — (j,@, u])+du. Note that Ф is increasing and continuous
on [0,1], and Ф > 0 on @,1] because //({0}) = 0.
Choose An = {/0°° Rsasds < n} for the event A in the foregoing. Then
POO /»OO
00 > nP(An) > E{lAn / Rsasds) = E{ / lAnRsasds}
Jo Jo
= E{ E{lAnRs\Fs}asds}>E{ 0(Je)aads},
Jo Jo
which implies that /0°° Ф(Js)asds < 00 a.s. But
POO /»0O
/ Ф(Js)asds > Ф{э) I asds,
Jo Jo
POO
OO >
and therefore Jo°° asds < 00 a.s. on {J = 1}. That is, Jo°° asds < 00 a.s. on
An. Letting n tend to 00 through the integers gives {/0°° Rsasds < 00} С

2 Initial Expansions 361
{/o°° asds < 00} a.s. We next show the inverse inclusion. Let Tn = inf{? > 0 :
/„ asds > n}. Then Tn is a stopping time, and n > E{f0 " asds}. Moreover
E{ f
Jo
Rsasds} = L
Jo
= E{ f " E{E{Rs\fs}\FsATJasds} = E{ f " E{Rs}asds}
Jo Jo
= aE{ / asds} <
Jo
an < сю
where a is the expectation of Rs, which is finite and constant by hypothesis.
Therefore Jo " Rsasds < сю a.s. Let ш be in {Jo asds < сю}. Then there exists
an n (depending on us) such that Tn(u>) = сю. Therefore we have the inclusion
{/0°° Rsasds < сю} Э {/0°° asds < сю} a.s. and the proof is complete. ?
We are now ready to study the Brownian case. The next theorem gives
the main result.
Theorem 7. Let M be a local martingale defined on the standard space of
canonical Brownian motion. Let H be the minimal expanded filtration con-
containing B\ and satisfying the usual hypotheses. Then M is апШ semimartin-
gale if and only if the integral Jo ,*_s\d\M,B]s\ < сю a.s. In this case
^t — /0 B\Zfs d[M,B]s is an Ш local martingale.
Proof. By the Martingale Representation Theorem we have that every F lo-
local martingale M has a representation Mt = Mo + /0 HsdBs, where H is
predictable and f0 H^ds < сю a.s., each t > 0. By Theorem 5 we know
that M is an H semimartingale if and only if J* \HS\ \Zf ds is finite
a.s., 0 < t < 1. We take as = J^L, and Rs = l{s<1} l^~_^"l. Then
/o \Ha\^B\Zf'^ds = /0 asRsds, which is finite only if /0 asds < сю a.s. by
Jeulin's Lemma. Thus it is finite only if JQ -L=d=ds < сю a.s. But
,J3]S= /" -jl==d[\H\-B,B
Jo
and this completes the proof. ?
As an example, let 1/2 < a < 1, and define

362 VI Expansion of Filtrations
Then H is trivially predictable and also /0 H^ds < oo. However Jo Hs i—rfs
is divergent. Therefore M — H ¦ В is an F local martingale which is not an H
semimartingale, by Theorem 7, where of course Ш = M(Bi). Thus we conclude
that not all F local martingales (and hence a fortiori not all semimartingales)
remain semimartingales in the H filtration.
We now turn to a general criterion that allows the expansion of filtration
such that all semimartingales remain semimartingales in the expanded filtra-
filtration. It is due to Jacod, and it is Theorem 10. The idea is surprisingly simple:
recall that for a cadlag adapted process X to be a semimartingale, if Hn is a
sequence of simple predictable processes tending uniformly in (t, us) to zero,
then we must have also that the stochastic integrals Hn ¦ X tend to zero in
probability. If we expand the filtration by adding a a-algebra generated by
a random variable L to the F filtration at time 0 (that is, cr{L} is added to
fo), then we obtain more simple predictable processes, and it is harder for
X to stay a semimartingale. We will find a simple condition on the random
variable L which ensures that this condition is not violated. This approach is
inherently simpler than trying to show there is a new decomposition in the
expanded filtration.
We assume that L is an (E, ?)-valued random variable, where E is a stan-
standard Borel space1 and ? are its Borel sets, and we let M(L) denote the smallest
filtration satisfying the usual hypotheses and containing both L and the orig-
original filtration F. When there is no possibility of confusion, we will write H
in place of H(L). Note that if Y e Н$ = Tt V a{L}, then Y can be written
Y(lo) = G(lo,L(lo)), where (ui,x) >—> G(co,x) is an Tt®? measurable function.
We next recall two standard theorems from elementary probability theory.
Theorem 8. Let Xn be a sequence of real-valued random variables. Then Xn
converges to 0 in probability if and only if limn^oo E{mm(l, |Xn|)} = 0.
A proof of Theorem 8 can be found in textbooks on probability (see for
example [109]).
We write 1Л \Xn\ for min(l, \Xn\). Also, given a random variable L, we let
Qt(u>,dx) denote the regular conditional distribution of L with respect
to Tt, each t > 0. That is, for any Л € ? fixed, Qt(-,A) is a version of
E{l{L€A}\^t}, and for any fixed w, Qt(u>, dx) is a probability on ?. A second
standard elementary result is the following.
Theorem 9. Let L be a random variable with values in a standard Borel
space. Then there exists a regular conditional distribution Qt(co,dx) which is
a version of
For a proof of Theorem 9 the reader can see, for example, Breiman [23,
page 79].
1 (E, ?¦) is a standard Borel space if there is a set Г € Б, where Б are the Borel
subsets of R, and an injective mapping ф : E —> Г such that ф is ? measurable and
ф~г is Б measurable. Note that (Rn, Bn) are standard Borel spaces, 1 < n < oo.

2 Initial Expansions 363
Theorem 10 (Jacod's Criterion). Let L be a random variable with values
in a standard Borel space (E,?), and let Qt(co,dx) denote the regular condi-
conditional distribution of L given Tt, each t > 0. Suppose that for each t there
exists a positive a-finite measure r\t on (E,?) such that Qt(u>,dx) <gc rjt(dx)
a.s. Then every F semimartingale X is also an M(L) semimartingale.
Proof. Without loss of generality we assume Qt(u>,dx) <?i rjt{dx) surely. Then
by Doob's Theorem on the disintegration of measures there exists an ? ® Ft
measurable function qt(x,co) such that Qt(co,dx) = qt(x,co)r]t(dx). Moreover
since E{JEQt(;dx)} = E{E{l{Y€E}\Ft}} = P(Y € E) = 1, we have
E{ f Qt(;dx)} = E{[ qt(x,w)Tit(dx)} = [ E{qt{x,u)}rit(dx) = 1.
Je Je Je
Hence for almost all x (under r]t{dx)), we have qt{x, •) € Lx{dP).
Let X be an F semimartingale, and suppose that X is not an H(L) semi-
semimartingale. Then there must exist а и > 0 and an e > 0, and a sequence Hn
of simple predictable processes for the H filtration, tending uniformly to 0 but
such that infn E{1 A \Hn ¦ X\} > s. Let us suppose that tn < u, and
n-l
i=0
with Jf € Tu Vct{L}. Hence Jf has the form д^{ш,Ь{ш)), where (u>,x) t->
gi(w,x) is Tti ®? measurable. Since Hn is tending uniformly to 0, we can
take without loss \Hn\ < 1/n, and thus we can also assume that |<^| < 1/n.
We write
n-l
i=0
and therefore (х,ш) ^ Щ'х(ш) and (х,ш) ^ (Яп-Х • X)t(w) are each ? <g> Tu
measurable, 0 < t < u. Moreover one has clearly Hn-X = Hn<L-X. Combining
the preceding, we have
A \Hn -XU\} = E{[(IA \Hn'x ¦ XU\)QU(-,dx)}
Je
A \Hn'x ¦ Xu\)qu(-, dx)Vu(dx)}
= f E{{1 A \Hn<* ¦ Xu\)qu(-,
Je
where we have used Fubini's Theorem to obtain the last equality. However the
function hn(x) = E{{1 A \Hn'x ¦ Xu\)qu(-,x)} < E{qu(-,x)} € Ll{d-qu), and
since hn is non-negative, we have

364 VI Expansion of Filtrations
lim
im E{\ A \Hn ¦ Xu\) = lim / E{{1 A \Hn ¦ Xu\)qu(; xft^dx)
—изо n—изо Jv?
= / lim E{{1 A\Hn ¦ X^q^-
JEn->oo
by Lebesgue's Dominated Convergence Theorem. However qu{-,x) € Ll(dP)
for a.a. x (under (dr]u)), and if we define dR = cqu(-, x)dP to be another proba-
probability, then convergence in P-probability implies convergence in Д-probability,
since R<.P. Therefore lim^^ ER{A A \Hn>x ¦ Xu\)} = 0 as well, which im-
implies
0 = lim -ER{A A \Hn'x ¦ Xu\)}
п—юо с
= EP{(lA\Hn>x-Xu\)qu(;x)}
for a.a. x (under (dr]u)) such that qu(-,%) & Ll{dP). Therefore the limit of
the integrand in (*) is zero for a.a. x (under (dj]u)), and we conclude
lim ?{AЛ|Яп-Хи|)}=0,
n—»oo
which is a contradiction. Hence X must be a semimartingale for the filtration
H°, where H° = Tt V a{L}. Let X — M + A be a decomposition of X under
H°. Since H° need not be right continuous, the local martingale M need
not be right continuous. However if we define Mt = Mt if t is rational; and
liniux^ueQ Mu if t is not rational; then Mt is a right continuous martingale
for the filtration H where Tit — f]u>t ^u- Letting At = Xt —Mt, we have that
Xt = Mt+At is an H decomposition of X, and thus X is an H semimartingale.
?
A simple but useful refinement of Jacod's Theorem is the following where
we are able to replace the family of measures щ be a single measure 77.
Theorem 11. Let L be a random variable with values in a standard Borel
space (E,?), and let Qt(u>,dx) denote the regular conditional distribution of L
given Tt, each t > 0. Then there exists for each t a positive a-finite measure
rjt on (E, 8) such that Qt(u),dx) <C T]t(dx) a.s. if and only if there exists one
positive a-finite measure rj(dx) such that Qt(co,dx) <gc rj(dx) for all со, each
t > 0. In this case r\ can be taken to be the distribution of L.
Proof. It suffices to show that the existence of r\t for each t > 0 implies
the existence of 77 with the right properties; we will show that the dis-
distribution measure of L is such an 77. As in the proof of Theorem 10 let
(x,lo) н-> qt(x,u>) be ?¦ ® Tt measurable such that Qt(i0,dx) = qt(x,io)r]t(dx).
Let at(x) = E{qt(x,u>)}, and define
I 0, otherwise.

2 Initial Expansions 365
Note that at{x) = 0 implies qt(x, •) = 0 a.s. Hence, qt(x,ui) = rt(x,co)at(x)
a.s.; whence rt(:r,u>)at(:r)?7t(d:r) is also a version of Qt(u>, dx).
Let r/ be the law of L. Then for every positive ? measurable function g we
have
Jg{x)r,{dx) = E{g(L)} = E{Jg(x)Qt(-,dx)}
g{x)qt{x,-)r)t{dx)}
i
= f g(x)E{qt(x,-)}r]t(dx)
Je
= / g(x)at(x)r}t(dx)
Je
from which we conclude that at(x)r]t(dx) = r](dx). Hence, Qt(co,dx) =
rt(u>,x)r](dx), and the theorem is proved. D
We are now able to re-prove some of the previous theorems, which can be
seen as corollaries of Theorem 11.
Corollary 1 (Independence). Let L be independent of the filtration F.
Then every F semimartingale is also an M(L) semimartingale.
Proof. Since L is independent of !Ft, E{g(L)\Jrt\ = E{g(L)} for any bounded,
Borel function g. Therefore
E{g(L)\Ft) = [ Qt(u,,dx)g(x) = [ f](dx)g(x) = E{g(L)},
Je Je
from which we deduce Qt(w, dx) = r]{dx), and in particular Qt(w, dx) < rj(dx)
a.s., and the result follows from Theorem 11. ?
Corollary 2 (Countably-valued random variables). Let L be a random
variable taking on only a countable number of values. Then every F semi-
semimartingale is also an M(L) semimartingale.
Proof. Let L take on the values ai, a%, аз,.... The distribution of L is given
by r](dx) = YliLi P{L — ai)?ai(dx), where eai(dx) denotes the point mass at
O!j. With the notation of Theorem 11, we have that the regular conditional
distribution of L given Tt, denoted Qt(w,dx), has density with respect to r\
given by
The result now follows from Theorem 11. D

366 VI Expansion of Filtrations
Corollary 3 (Jacod's Countable Expansion). Let Л = (A1;A2,...) be
a sequence of events such that А* П A,: = 0, i ф j, all in T, and such that
USi Ai = fl. Let H be the filtration generated by F and Л, and satisfying
the usual hypotheses. Then every F semimartingale is an H semimartingale.
Proof. Define L = Y^li 2~*1л<- Then И = M{L) and we need only to apply
the preceding corollary. ?
Next we consider several examples.
Example (Ito's example). We first consider the original example of Ito,
where in the standard Brownian case we expand the natural filtration F with
a{Bi}. We let H denote M(Bi). We have
= E{g{Bl -Bt + Bt)\Tt) = Jg(x
where r]t(dx) is the law of B\ - Bt and where we have used that Bi — Bt is in-
independent of Tt- Note that T]t(dx) is a Gaussian distribution with mean 0 and
variance A — t) and thus has a density with respect to Lebesgue measure. Since
Lebesgue measure is translation invariant this implies that Qt(u>,dx) <§C dx
a.s., each t < 1. However at time 1 we have E{g(B\) \!F\} = g{B\), which yields
Qi(u>, dx) — e{Bi(ui)}{dx), which is a.s. singular with respect to Lebesgue mea-
measure. We conclude that any F semimartingale is also an H(J3i) semimartingale,
for 0 < t < 1, but not necessarily including 1. This agrees with Theorem 7
which implies that there exist local martingales in F which are not semimartin-
galesinll(Si).
Our next example shows how Jacod's criterion can be used to show a
somewhat general, yet specific result on the expansion of filtrations.
Example (Gaussian expansions). Let F again be the standard Brownian
filtration satisfying the usual hypotheses, with В a standard Brownian motion.
Let V = Jo°° g(s)dBs, where Jo°° g(sJds < сю, д a deterministic function. Let
a = inf {t > 0 : Jt°° g(sJds = 0}. If h is bounded Borel, then as in the previous
example
E{h(V)\Ft} = E{h( f g(s)dBs + f g(s)dBs)\Tt}
Jo Jt
= j h(f g{s)dBs + x)r]t{dx),
where rjt is the law of the Gaussian random variable /t°° g(s)dBs. If a = сю,
then r]t is non-degenerate for each t, and r\t of course has a density with
respect to Lebesgue measure. Since Lebesgue measure is translation invariant,
we conclude that the regular conditional distribution of Qt(u, dx) of V given
Tt also has a density, because

2 Initial Expansions 367
Qt(w,h) = E{h{V)\Tt} = Jh(J g(s)dBs+x)r,t(dx).
Hence by Theorem 10 we conclude that every F semimartingale is an M(V)
semimartingale.
Example (expansion via the end of a stochastic differential equa-
equation). Let В be a standard Brownian motion and let X be the unique solution
of the stochastic differential equation
pt pt
Xt=X0+ / cr{Xs)dBs + / b{Xs)ds
Jo Jo
where a and b are Lipschitz. In addition, assume a and b are chosen so that
for h Borel and bounded,
E{h{Xx)\Tt} = Jh(x)w(l-t,Xt,x)dx
where тгA — t,u,x) is a deterministic function.2 Thus Qt(u>,dx) — тгA —
t,Xt(oj),x)dx, and Qt(ui,dx) is a.s. absolutely continuous with respect to
Lebesgue measure if t < 1. Hence if we expand the Brownian filtration F
by initially adding X\, we have by Theorem 10 that every F semimartingale
is an H(.Xi) semimartingale, for 0 < t < 1.
The mirror of initial expansions is that of filtration shrinkage. This has
not been studied to any serious extent. We include one result (Theorem 12
below), which can be thought of as a strengthening of Strieker's Theorem,
from Chap. II. Recall that if X is a semimartingale for a filtration H, then it
is also a semimartingale for any subfiltration G, provided X is adapted to G,
by Strieker's Theorem. But what if a subfiltration F is so small that X is not
adapted to it? This is the problem we address. We will deal with the optional
projection Z of X onto F.
Definition. Let H = (Ht)t>o be a bounded measurable process. It can be
shown that there exists a unique optional process °H, also bounded, such that
for any stopping time T one has
E{HT1{T<OO}} = E{°HT1{T<OO}}.
The process °H is called the optional projection of H.
We remark that °Ht — Ht a.s. for each fixed time t, but the null set
depends on t. Therefore were we simply to write as a process (E{Ht\J-t})t>o-,
instead of °H, it would not be uniquely determined almost surely. This is why
we use the optional projection. The uniqueness follows from Meyer's section
theorems, which are not treated in this book, so we ask the reader to accept
it on faith.
2 Sufficient conditions are known for this to be true. These conditions involve Xo
having a nice density, and requirements on the differentiability of the coefficients.
See, for example, [190] and [193].

368 VI Expansion of Filtrations
Definition. Let H = (#t)t>0 be a bounded measurable process. It can be
shown that there exists a predictable process PH, also bounded, such that for
any predictable stopping time T one has
Е{Нт1{т<оо}} = Е{рНт1{т«х>}}-
The process PH is called the predictable projection of H.
It follows that for the optional projection, for each stopping time T we
have
°#T = E{HT\FT} a.s. on {T < 00}
whereas for the predictable projection we have that
PHT = E{HT\rT-} a.s. on {T < 00}
for any predictable stopping time T. For fixed times t > 0 we have then
of course °Ht = E{Ht\Tt) a.s., and one may wonder why we don't simply
use the "process" {E{Ht\Ft})t>o instead of the more complicated object °H.
The reason is that this is defined only almost surely for each t, and we have
an uncountable number of null sets; therefore it does not uniquely define a
process. (Related to this observation, note in contrast that in some cases
(E{Ht\Ft})t>0 might exist even when °H does not). We begin our treatment
with two simple lemmas.
Lemma. Let F С G, and let X be a G martingale but not necessarily adapted
to F. Let Z denote the optional projection of X onto F. Then Z is an F
martingale.
Proof. Let s < t. Then
Zs = E{XS\TS} = E{E{Xt\gs}\Ts} = E{E{Xt\gt}\Ts} = E{Zt\Ts}
and the result follows. D
The Azema martingale, a projection of Brownian motion onto a subfil-
tration to which it is not adapted, is an example of the above lemma. The
projection of an increasing process, however, is a submartingale but not an
increasing process.
Lemma. Let fcG, and let X be a G supermartingale but not necessarily
adapted to F. Let Z denote the optional projection of X onto F. Then Z is
an F supermartingale.
Proof. Let s < t. Then
Zs = E{XS\TS} > E{E{Xt\gs}\Ts} - E{Xt\Ts} = E{E{Xt\Ft}\Fs)
= E{Zt\Ts),
and the result follows. ?

3 Progressive Expansions 369
The limitation of the two preceding lemmas is the need to require integra-
bility of the random variables Xt for each t > 0. We can weaken this condition
by a localization procedure.
Definition. We say that a G semimartingale X starting at 0 is an F spe-
special, G semimartingale if there is a sequence (Tn)n>i of F stopping times
increasing a.s. to сю, and such that the stopped processes XT™ can be written
in the form XT" = Mn + An where Mn is a G martingale with Mg = 0
and where An has integrable variation over each [0, t], each t > 0, and with
Ao=0.
Theorem 12 (Filtration Shrinkage). Let G be a given filtration and let F
be a subfiltration of Q. Let X be an? special, G semimartingale. Then the F
optional projection of X, called Z, exists, and it is a special semimartingale
for the F filtration.
Proof. Without loss of generality we can assume Tn < n for each n > 1. We
set To = 0 and let Xn = XT" -XT~-\ and TV" = MT--MT—1 with iV° = 0.
For each n there are two increasing processes Cn and Dn, each starting at 0,
with Xn — Nn + Cn — Dn, and moreover we can choose this decomposition
such that the following holds:
?{sup \Nt\} < oo,
t
and where t < Tn_i implies Ct™ = D™ = JVf = 0, and t > Tn implies
ct ~ стп = Dt ~ DTn = Nt - NTn = °- The integrability condition implies
that the F optional projections of Cn, Dn, and Nn all exist and have cadlag
versions. By the previous two lemmas the optional projection of iVn is an F
martingale, and those of Cn and Dn are F submartingales. Therefore letting
°Xn, °Nn, °Cn, and °Dn denote the respective F optional projections of Xn,
Nn, Cn, and Dn, we have that °Xn =° Nn+°Cn-°Dn exists and is a special
F semimartingale.
Since Tn-i and Tn are F stopping times, we have that also the F optional
projections °iVn, °Cn, and °Dn and hence °Xn are all null over the stochastic
interval [0,Tn_i] and constant over (Tn,oo). Then J2n>i °^" is a c&d\hg
version of °X = Z and thus Z is a special F semimartingale. D
3 Progressive Expansions
We consider the case where we add a random variable gradually to a filtration
in order to create a minimal expanded filtration allowing it to be a stopping
time. Note that if the initial filtration F = {Tt)t>o is given by Tt = {0,^}
for all t, then G = (Q)t>o given by Qt = a{L Л s;s < i] is the smallest
expansion of F making L a stopping time. Note that Qt = cr{L Л t} as well.
Let L be a strictly positive random variable. Let Л = {(t,w) : t < L(oj)}.

370 VI Expansion of Filtrations
Then L = sup{t : (t,w) s Л}. In this sense every positive random variable
is the end of a random set. Instead however let us begin with a random set
Л с М+ x Q and define L to be the end of the set A. That is,
L(<jj) — sup{i : (t,ui) e Л}
where we use the (unusual) convention that sup@) = 0-, where {0—} is an
extra isolated point added to the non-negative reals [0, oo] and which can be
thought of as 0— < 0. We also define J- = -^o- The purpose of {0-} is to
distinguish between the events {w : Л(и>) = 0} and {w : Л(и>) — {0}}, each of
which could potentially be added to the expanded filtration.
The smallest filtration expanding F and making the random variable L a
stopping time is G° defined by Q® = Tt V a{L Л t}; but G° is not necessarily
right continuous. Thus the smallest expanded filtration making L a stopping
time and satisfying the usual hypotheses is G given by Qt = f)u>tGu- Nev-
Nevertheless, it turns out that the mathematics is more elegant if we consider
expansions slightly more rich than the minimal ones. In order to distinguish
progressive expansions from initial ones, we will change the notation for the
expanded filtrations. Beginning with our usual filtered probability space sat-
satisfying the usual hypotheses (tt, T, F, P)m where of course F denotes the fil-
filtration (^")t>0) and a random variable L which is the end of a random set, we
define the expanded filtration to be FL and is given by
Г e FtL <=> {Г e T and 3 Ft e Tt : Г П {L > t} = Гг П {L > t}}.
This filtration is easily seen to satisfy the usual hypotheses, and also it makes
L into a stopping time. Thus GcF1. There are two useful key properties the
filtration FL enjoys.
Lemma. If Я is a predictable process for FL then there exists a process
J which is predictable for F such that H — J on [0,L]. Moreover, if T is
any stopping time for FL then there exists an F stopping time 5 such that
S AL = T AL a.s.
Proof. Let Г be an event in T^. Then events of the form (t, oo) x Г form a
generating set for V(?L), the predictable sets for FL. Let Hs — 1{D>Оо)хг}(^)
and then take J to be Js = l(t,oo)xrt(s)- The first result follows by an appli-
application of the Monotone Class Theorem. For the stopping time T, note that it
suffices to take H = l[o,T], and let J be the F predictable process guaranteed
by the first half of this lemma, and take 5 = inf {t: Jt = 0}. ?
We next define a measure ць on [0, сю] х п by
HL(J) = E{JLl{L>0_}}
for any positive, measurable process J. For such a measure ць there exists an
increasing process 1{*>l} which is null at 0— but which can jump at both 0

3 Progressive Expansions 371
and +00. We will denote AL = (A^L>o, the (predictable) compensator
of l{t>L} for the filtration F. Therefore if J is an F predictable bounded
process we have
E{JLl{L>0_}} = E{ [ JsdALs).
J[0,oo]
We now define what will prove to be a process fundamental to our analy-
analysis. The process Z defined below was first used in this type of analysis by
J. Azema [3]. Recall that if H is a (bounded, or integrable) ?L process, then
its optional projection °H onto the filtration F exists. We define
Note that l{L>t} 1S decreasing, hence by the lemma preceding Theorem 12
we have that Z is an F supermartingale. We next prove a needed technical
result.
Theorem 13. The set {t : 0 < t < 00, Zt- = 0} is contained in the set (L, 00]
and is negligible for the measure dAL.
Proof. Let T(w) = inf{i > 0 : Zt{w) = 0, or Zt-(w) = 0 for t > 0}. Then
it is a classic result for supermartingales that for almost all u> the function
t 1—» Zt(oj) is null on \T(oj), 00]. (This result is often referred to as "a non-
negative supermartingale sticks at zero.") Thus we can write {Z = 0} as the
stochastic interval [T, 00], and on [0, T) we have Z > 0, Z~ > 0. We have
E{A^A^} = P(T <L)= E{ZTl{T<oo}} = 0, hence dAL is carried by [0,T].
Note that since dl{L>ty is carried by the graph of L, written [L], we have
L <T, and hence we have Z > 0, Z- > 0 on [0, L). Next observe that the set
{Z- = 0} is predictable, hence 0 = E{l{Zt_=o}^l{L>t}} = E{l{Zt=OydAf}
and hence {Z- = 0} is negligible for dAL. Note that this further implies that
P{ZL- > 0) = 1, and again that {Z- = 0} С {L, 00]. ?
We can now give a description of martingales for the filtration FL, as
long as we restrict our attention to processes stopped at the time L. What
happens after L is more delicate. For an integrable process J we let p J denote
its predictable projection.
Theorem 14. Let Y be a random variable with E{\Y\} < 00. A right con-
continuous version of the martingale Yt = E{Y\J-^} is given by the formula
At
Moreover the left continuous version Y- is given by
1

372 VI Expansion of Filtrations
Proof. Let OL denote the optional cr-algebra on Ш+ x П, corresponding to
the filtration FL. On [0,L), OL coincides with the trace of О on [0,1,). (By
О we mean of course the optional cr-algebra on R+ x Q corresponding to the
underlying nitration F.) Moreover on [L,oo), OL coincides with the trace of
the cr-algebra B(M+)®^roo on [L, oo). The analogous description of VL holds,
with [0, L) replaced by @, L], and with [L, oo) replaced with (L, oo). It is then
simple to check that the formulas give the bona fide conditional expectations,
and also the right continuity is easily checked on [0, L) and [L, oo) separately.
The second statement follows since Z- > 0 on @, L] by Theorem 13. D
We now make a simplifying assumption for the rest of this paragraph. This
assumption is often satisfied in the cases of interesting examples, and it allows
us to avoid having to introduce the dual optional projection of the measure
Simplifying assumption to hold for the rest of this paragraph. We
assume L avoids all F stopping times. That is, P(L = T) = 0 for all F
stopping times T.
Definition. The martingale ML given by MtL = A\ + Zt is called the fun-
fundamental L martingale.
Note that it is trivial to check that ML is in fact a martingale, since AL
is the compensator of 1 — Z. Note also that M^ = A^, since Z^ = 0. Last,
note that it is easy to check that ML is a square integrable martingale.
Theorem 15. Let X be a square integrable martingale for the F filtration.
Then (XtAi)t>o is a semimartingale for the filtration ?L. Moreover X4Al —
Jo Y^d{X,ML)s is a martingale in the ?L filtration.
Proof. Let С be the (non-adapted) increasing process Ct = 1{j>l}- Since С
has only one jump at time L we have E{Xl} = E{f?° XsdCs}. Since X is a
martingale it jumps only at stopping times, hence it does not jump at L, and
using that AL is predictable and hence natural we get
/¦OO /-OO
E{XL} = E{ Xs-dCs} = E{ Xs-dALs} = ^
Jo Jo
M^} - E{[X, MLU} = E{(X,
Suppose that Я is a predictable process for FL, and J is a predictable process
for F which vanishes on {Z_ = 0} and is such that J = H on @,L]. We
are assured such a process J exists by the lemma preceding Theorem 13.
Suppose first that H has the simple form H = /ilDiOO) for bounded h ? J-^.
If j is an Tt random variable equal to h on {t < L}, then we can take
J = jl(t,oo) and we obtain H ¦ Xoo = h(Xb — Xt)l{t<Ly. In this way we can
define stochastic integrals for non-adapted simple processes. We have then
E{(H ¦ X)oo} = E{(J ¦ X)l}, and using our previous calculation, since J ¦ X
is another square integrable martingale, we get

3 Progressive Expansions 373
/>OO
E{(H ¦ X)oo} = E{(J- X, ML)OO} = E{ / Jsd(X, ML)S}.
Jo
Since (X, ML) is F predictable, we can replace H by g°'L] because it has
the same predictable projection on the support of d(X, ML). This yields
/>oo pL ту
E{ HsdXs} = E{ ^d(X,ML)s}. (**)
J0 Jo ^s-
Last if we take the bounded ?L predictable process H to be a stochastic
interval [0,T Л L], where T is an FL stopping time, we obtain E{Xtal -
/0TAL ^zd(X,ML)s} = 0, which implies by Theorem 21 of Chap. I that
Xt/\l - /о Л Y^d{X,ML)s is a martingale. П
We do not need the assumption that X is a square integrable martingale,
which we made for convenience. In fact the conclusion of the theorem holds
even if X is only assumed to be a local martingale. We get our main result as
a corollary to Theorem 15.
Corollary. Let X be a semimartingale for the F filtration. Then (XtAi,)t>0
is a semimartingale for the filtration ?L.
Proof. If X is a semimartingale then it has a decomposition X = M + D.
The local martingale term M can be decomposed into X = V + N, where
V and N are both local martingales, but V has paths of bounded variation
on compact time sets, and N has bounded jumps. (This is the Fundamental
Theorem of Local Martingales, Theorem 25 of Chap. III.) Clearly V and D
remain finite variation processes in the expanded filtration ?L, and since M
has bounded jumps it is locally bounded, hence locally square integrable, and
since every F stopping time remains a stopping time for the FL filtration, the
corollary follows from Theorem 15. ?
We need to add a restriction on the random variable L in order to study
the evolution of semimartingales in the expanded filtration after the time L.
Definition. A random variable L is called honest if for every t < oo there
exists an Tt measurable random variable Lt such that L = Lt on {L < t}.
Note that in particular if L is honest then it is J-qq measurable. Also, any
stopping time is honest, since then we can take L = L Л t which is of course
Tt measurable by the stopping time property.
Example. Let X be a bounded cadlag adapted processes, and let Xt+* =
sups<tX+ and X+* = supsX+. Then L = inf{s : X+* = X+*}, is honest
because on the set {L < t) one has L — inf{s : Xs = X^*}, which is Tt
measurable.
Theorem 16. L is an honest time if and only if there exists an optional set
Л С [0, oo] x п such that L(uj) — sup{i < oo : (t,w) G A}.

374 VI Expansion of Filtrations
This is often described verbally by saying "L is honest if it is the end of
an optional set."
Proof. The end of an optional set is always an honest random variable. Indeed,
on {L < t}, the random variable L coincides with the end of the set Лп ([О, t] x
fi), which is Tt measurable.
For the converse we suppose L is honest. Let (Lt)t>o be an F adapted
process such that L = Lt on {L < t}. Since we can replace Lt with Lt At
we can assume without loss of generality that Lt < t. There is also no loss of
generality to assume that Lt is increasing with t j since we can further replace
Lt with sups<4Z,s. Last, it is also no loss, now that it is increasing, to take
it right continuous. We thus have that the process (Lt)t>o is optional for the
filtration F. Last, L is now the end of the optional set {(t,u>) : Lt(w) = t}. D
When L is honest we can give a simple and elegant description of the
filtration TL.
Theorem 17. Let L be an honest time. Define
gt = {r.r= (An{L >t})U(Bn{L <t}) for some A,B eft}
Then G = (Gt)t>o constitutes a filtration satisfying the usual hypotheses.
Moreover L is a G stopping time. A process U is predictable for G if and
only if it has a representation of the form
where H and К are F predictable processes.
Proof. Let s<t and take Я e Qs, of the form (An{L> s}) U (В П {L < s})
with А, В G Ts. We will show that H G Gt, which shows that the collection G
is filtering to the right.3 Since L is an honest time, there must exist D e Tt
such that {L < s} = D П {L < t}. Therefore
H П {L < t} = [(А П Dc) U(Bfl D)} n{L< t},
with [(А П Dc) U (J5 П D)\ G Tt. The fact that each Qt is a cr-algebra, and also
that <G is right continuous, we leave to the reader. Note that {L < t} e Gt
which implies that L is a G stopping time, as we observed at the start of the
proof of Theorem 16. For the last part of the theorem, let U = (Ut)t>o be a
cadlag process adapted to the <G filtration. Let Ht and Kt be Tt measurable
random variables such that
Ut = #t on {L > t} and Ut = Kt on {L < t}.
Next set Ht = lim infs/t Hs;Kt = liminfsy4 Js for all t > 0. Suppose that s
is a rational number. Take t = 0 and we have the equality
That is, if s < t, then Qs С Gt-

3 Progressive Expansions 375
U = Н1[0,ц + Kl{LiOo].
One can extend this result to predictable processes by the Monotone Class
Theorem. D
Theorem 18. Let X be a square integrable martingale for the F filtration.
Then X is a semimartingale for the G filtration if L is an honest time. More-
Moreover X has a G decomposition
etl\L i
JO %s-
+{ / —d(X, ML)S - l{t>L} / —d(X, ML)S}.
Jo "s- Jl l ~ ^s-
Before beginning the proof of the theorem, we establish a lemma we will
need in the proof. It is a small extension of the local behavior of the stochastic
integral established in Chap. IV.
Lemma (Local behavior of the stochastic integral at random times).
Let X and Y be two semimartingales and let H G L(X) and J G L(Y). Let
U and V be two positive random variables with U < V. (U and V are not
assumed to be stopping times.) Define
A = {w : Ht(w) = JtH and Xt(u) - Xu{oj) = Yt(w) - Yu(w),
for all te [U(w),V(w))}.
Let Wt = H ¦ Xt and Zt = J ¦ Yt. Then a.s. on Л, Wt{u)) - Wu{uj) = Zt(w) -
Zv{uj) for all t G [U(oj),V(oj)).
Proof. We know by the Corollary to Theorem 26 of Chap. IV that the con-
conclusion of the lemma is true when U and V are stopping times. Let (u, v) be
rationale with 0 < и < v, and let
Auv - {u> : U(uj) < и < v < V(ui)} П A.
Then also Wt - Wu — Zt — Zu, all t G [u, v), a.s. on Auv, since u, v are fixed
times and hence a fortiori stopping times. Next let
A = iu> : P| {U(w) <u<v< V(u>), and Wt(u>) - Wu{w) = Zt{w) - Zu{w)
for all t e [u,v)} >.
The intersection in the definition of A is countable, so null sets do not accu-
accumulate. Finally let и decrease to U{uj) and then v increase to V(oj), which
gives the result. G

376 VI Expansion of Filtrations
Proof of Theorem 18. We first observe that without loss of generality we can
assume Xo = 0. Let H be a bounded F predictable process. We define stochas-
stochastic integrals at the random time L by
/ H3dX3 = (# • X)L, / Hs
Jo Jl
where H¦ X = (H • X4L>o is the F stochastic integral process. That is, we use
the usual definition of the stochastic integral, sampling it at the random time
L. When H is a simple predictable process, these are reasonable definitions.
Moreover we know from the lemma that if H and J are both bounded F pre-
predictable processes and if also H — J on (L, oo], then JL HsdXs — J^° JsdXs
a.s., so the definition is well-defined.
Since X is an F martingale in L2 with Xo = 0, we have E{fo°° HsdXs} = 0.
Applying the equalities (*) on page 372 to H ¦ X we have
E{ HsdXs} = E{ Hsd(X,ML)s} = E{ -^-d{X,ML)s} (f)
JO Jo Jo ^s-
where the second equality uses (**) on page 373. Recalling E{ JQ°° HsdXs} = 0
and combining this with the above gives (where we have changed the name
of the process H to К for clarity slightly later in the proof)
/>OO />OO
E{ KsdXs} = -E{ Ksd(X,ML)s}
Jl Jo
We next replace К with К\^г<ц to obtain
/>OO />OO
E{ KsdXs} = -E{ Ksl{Zs_<1}d(X,ML)s}
Jl Jo
К
because the predictable projection of j^|—1(l,oo)(s) is l{zs_<i}- Finally we
combine equalities (f) and (|), and define U = Hl^0 ц + Kl^oo] to get
E{ / HsdXs + / KsdXs}
Jo Jl
= E{ Usd{ d(X,ML)s-l{t>L} / z —d(X,ML)s}
Jo Jo As- JL L ~ ^s-
Up to this point we have not used the hypothesis that L is an honest time.
Now we do. Let us take U to be a simple G predictable process, and using
Theorem 17 we have

4 Time Reversal 377
where H and К are F predictable processes (not necessarily simple pre-
predictable, however). If we further take U bounded by 1 and take the supre-
mum over all such simple predictable processes of the elementary G stochas-
stochastic integrals /0 UsdXs, we get that the variation of X, Var(X) as defined in
Chap. Ill, is less than or equal the expected total variation of
=d{ d(X,ML)s + l{t>L} -
JO ^s- JL l
p / 1
dat=d{ d(X,ML)s + l{t>L} - —d(X,ML)s).
Thus the process XisaG quasimartingale, and hence also a semimartingale.
We also note that by the preceding we have that Е{Хт — ат} — 0 for all G
stopping times T, and thus it is a martingale for the G filtration. D
The next theorem is really a corollary of Theorem 18.
Theorem 19. Let X be a semimartingale for the F filtration. Then X is a
semimartingale for the G filtration if L is an honest time.
Proof. Let X = M + A be a decomposition of X in the F filtration, where M is
a local martingale and Л is a finite variation process. Since A remains a finite
variation process in the larger G filtration, we need only concern ourselves
with M. By the Fundamental Theorem of Local Martingales (Theorem 25
on page 125), we know that we can write M = N + V where N is a local
martingale with bounded jumps, and V is a martingale with paths of finite
variation on compacts. Again, V remains a semimartingale in the G filtration
since it is of finite variation, and further N can be locally stopped with F
stopping times Tn tending to oo a.s. to be a square integrable martingale for
each Tn. Then by Theorem 18 each XTn is also a G semimartingale. Since X
is locally a G semimartingale, it is an actual G semimartingale. D
4 Time Reversal
In this section we apply the results on initial expansions of filtration to some
elementary issues regrading time reversal of semimartingales, stochastic inte-
integrals, and stochastic differential equations.
Definition. Let Y = (Yt)o<t<i be a cadlag process. The time reversal of
Y on [0,1] is defined to be
Y =
Note that time as a superscript denotes reversal. Let F denote the forward
filtration, and correspondingly let F = (Jrt)o<t<i denote a backward filtration.
As an example, if В is a standard Brownian motion, J-^ = cr{Bs; s < t}, and

378 VI Expansion of Filtrations
Ft = rf V Af, where N are the P-null sets of T\, while F* = 1% VЫ is the
analogous backward filtration for B. Our first two theorems are obvious, and
we omit the proofs.
Theorem 20. Let В be a standard Brownian motion on [0,1]. Then the time
reversal В of В is also a Brownian motion with respect to its natural filtration.
Theorem 21. Let Z be a Levy process on [0,1]. Then Z is also a Levy process,
with the same law as —Z, with respect to its natural filtration.
In what follows let us assume that we are given a forward filtration F =
№)o<t<i and a backward filtration G = (?7*)o<t<i- We further assume both
filtrations satisfy the usual hypotheses.
Definition. A cadlag process Y is called an (F, G) reversible semimartin-
gale if Y is an F semimartingale on [0,1] and Y is a <G semimartingale on
[0,1).
Note the small lack of symmetry: for the time reversed process we do not
include the final time 1. This is due to the occurrence of singularities at the
terminal time 1, and including it as a requirement would exclude interesting
examples.
Let rt = (to, ...,tk) denote a partition of [0, t] with t0 = 0, tk = t, 0 < t <
1, and let
STt(H,Y) = H0Y0 + Y,(Hti+1 - Hti)(Yu+1 - Yti).
Definition. Let H, Y be two cadlag processes. The quadratic covariation
of H and Y, denoted [H, Y], is denned to be
when this limit exists in probability as mesh(rt) —> 0, and is such that [H, Y]
is cadlag, adapted, and of finite variation a.s.
Theorem 22. Let Y be an (F,<G) reversible semimartingale, and let H be
a cadlag process such that Ht ? Tt, and Ht ? Ql~l,Q < t < 1. Suppose the
quadratic variation [H, Y] exists. Then the two processes [H,Y] and Xt =
/0* Hs-dYs are (F, <&) reversible semimartingales. Moreover
~ r*
Jo
Proof. Since Ht e Я1~г,0 < t < 1, we have that #i_s e c^-d-a) - g^ an(j
thus it is adapted and left continuous, so the stochastic integral JQ Hi_sdYs
is well-defined. We choose and fix t, 0 < t < 1. Let r be a partition of
[1 — t, 1], r = (si,...,sn)) chosen such that ДЯ8( = 0 and AYSi = 0 a.s.,

4 Time Reversal 379
i = 2,..., n — 1. Note that one can always choose the partition points {s^} in
this manner because
{
Jo
= 0,
since for each u>, s н-» Ys(u>) has only countably many jumps, and hence
/o ^-{\AYs\>o}ds = 0 a.s. But then JQ Р(]ДУ8| > 0)ds = 0, which in turn
implies that Р(|ДУ8| > 0) = 0 for almost all s in [0,1]. We now define three
new processes:
n-2
n-2
Let тп be a sequence of partitions of [0,1] with limn^oo mesh(rn) = 0. Then
lim CT" =
n—>oo
Since Y is (F, G) reversible by hypothesis, we know that CT is <& adapted.
Hence [H, У] is G adapted and moreover since it has paths of finite variation
by hypothesis, it is a semimartingale.
Since H is cadlag we can approximate the stochastic integral with partial
sums, and thus
lim ATn =
n—>oo
Since У8,+1_ - У8,_ = -(У1- - У1-^), we have
ft
lim BTn = I .
n—^oo I n
J U
Combining ATn, 5Tti , and CTn, we get

380 VI Expansion of Filtrations
n-2
* + Х>в1+1(У,1+1-У,4)-
г=1
n-1
n-2
г=1
Since we chose our partitions rn with the property that AYSi — 0 a.s. for each
partition point Sj, the above simplifies to
t + ДЯ1(У,„_1 - У1_),
which tends to 0 since si decreases tol — t, and sn_i increases to 1. Therefore
0 = lim A7"" + lim B7"" + lim C
n—»oo n—>oo n—»oo
7"" = -X* + / H^s
Уд
This establishes the desired formula, and since we have already seen that
[H, Y]t is a semimartingale, we have that X is also a reversible semimartingale
as a consequence of the formula. D
We remark that in the case where У is a G semimartingale on [0,1] (and
not only on [0,1)), the same proof shows that X is a G semimartingale on
[0,1].
Example (reversibility for Levy process integrals). Let Z be a Levy
process on [0,1], and let H = (iit)o<t<i be the filtration generated by Z and
expanded by the addition of Z1 = Zo - Z\-. Then Z is an H semimartingale
on [0,1] by Theorem 21. Let Ht = f(Zt) with / such that the quadratic
covariation [H,Z] exists. Clearly Ht = f(Zt) G Tt- Since
W1-* D ct{ZA_u)_ - Z{1_v)_; 0 < u,v < 1} V a{Zx},
we have that Zt = Zi_A_t) € W1"*, since Zt = Zt_ a.s. Therefore Я4 G W1"*.
Using Theorem 22 we conclude that Xt = J* f(Zs_)dZs is an (F, H) reversible
semimartingale.
Example (reversibility for Brownian integrals). Let В be a standard
Brownian motion on [0,1]. Let Щ denote Brownian local time at the level
x and let ц be a signed measure on R. Let F be the minimal nitration of В
satisfying the usual hypotheses, and let (HI*)o<t<i be the minimal filtration
generated by B, expanded by adding B1 = —Bi, and satisfying the usual
hypotheses. Finally let / be a cadlag function on R of finite variation on
compacts with primitive F (that is, F+(x) = f(x)), and define

4 Time Reversal 381
Ut= f f(Bs)dBs + f L^(d
Jo Jr
We will show U is an (F, Й) reversible semimartingale.
The hypotheses on / imply that its primitive F is the difference of two
convex functions, and therefore Mt — F(Bt) - F(B0) is an F semimartingale.
Moreover M* — F(B\-t) —F(B{), and since we already know that Bi-t is an
H semimartingale by Theorem 3, by the convexity of F we conclude that M is
also an H semimartingale. Therefore /0 f(Bs)dBs is an (F, H) reversible semi-
semimartingale as soon as | JR Lfrj(da) is one, where tj is the signed measure 'sec-
'second derivative' of F. Finally, what we want to show is that At = \ /R L^fj,(da)
is an (F,H) reversible semimartingale, for any signed measure /x. This will
imply that U is an (F, Й) reversible semimartingale.
Since A is continuous with paths of finite variation on [0,1], all we really
need to show is that A* e Ti*, each t. But Л* = Ai-t - A\ = /R(?i_t -
L^)fj,(da), and
- L\ = lim(-i / l[e>e+e)(B,)de)
1 f1
= lim(— / l[a_Bl)a_Bl+?)(Bs - Bjds)
l[a-Bl,a-B1+e)(Bu)du)
[
where Af is the local time at level x of the standard Brownian motion B.
Therefore Л* = - jR(A^~Bl)fj,(da). Since A% is H adapted, and since B\ e
7i°, we conclude A is H adapted, and therefore U is an (F, Й) reversible
semimartingale.
We next consider the time reversal of stochastic differential equations,
which is perhaps more interesting than the previous two examples. Let В be
a standard Brownian motion and let X be the unique solution of the stochastic
differential equation
i-t i-t
Xt = X0+ / <i(Xs)dBs + / b(Xs)ds
Jo Jo
where a and b are Lipschitz, and moreover, a and b are chosen so that for h
Borel and bounded,
J
= / /i(z)ttA ~t,Xt,x)dx
where тгA — t,u, x) is a deterministic function. As in the example expansion
via the end of a stochastic differential equation treated earlier on page 367,

382 VI Expansion of Filtrations
we know (as a consequence of Theorem 10) that if F is the natural, completed
Brownian nitration, we can expand F with Xi to get H : Tit = f]u>t -^и V
a{Xi}, and then all F semimartingales remain H semimartingales on [0,1).
We fix (t,w) and define ф : R -> R by ф(х) = X(t,w,x) where Xt =
X(t,u),x) is the unique solution of
Xt = x-
for iel. Recall from Chap. V that ф is called the flow of the solution of
the stochastic differential equation. Again we have seen in Chap. V that
under our hypotheses on a and b the flow ф is injective. For 0 < s < t < 1
define the function ф$^ to be the flow of the equation
X.,t = x
f a{Xs<u)dBu + f b(XStU)du.
J s Js
It now follows from the uniqueness of solutions that Xt = фа,г(Ха), and
in particular X\ = фг,г(Х^, and therefore ф^\(Х\) = Xt, where ф^\ is of
course the inverse function of фа$. Since the solution Xa>t of equation (*) is
Q — cr{Bv — Bu; s < u,v < t} measurable, we have ф±г\ G ^rl~*, where
P = a{Bs;0 < s < t}V Я where Я are the null sets of T. Let H* =
C\u>t-^U V a{Xi\ and we have В is an И = Gi*)o<t<i semimartingale, and
that Xt = ф^\(Х{) is ~Н1~г measurable. Therefore Xt G Tt and also at the
same time Xt G Til~i. Finally note that since X is a semimartingale and a
is C1, the quadratic covariation [a(X),B] exists and is of finite variation, and
we are in a position to apply Theorem 22.
Theorem 23. Let В be a standard Brownian motion and let X be the unique
solution of the stochastic differential equation
Xt = Xo + [ <j{Xs)dBs + f b(Xs)ds
Jo Jo
for 0 < t < 1, where a and b are Lipschitz, and moreover, a and b are chosen
so that for h Borel and bounded,
= fh(xOT(l-t,Xt,x)dx
where тгA — t,u,x) is a deterministic function. Let H be given by Tit —
[\u>tTu Va{X{\. Then В is an (F,H) reversible semimartingale, and Xf =
Xi-t satisfies the following backward stochastic differential equation
Yt = Zi+ f a{Ys)dBs+ f a'(Ys)a{Ys)ds+ f b{Ys)ds,
Jo Jo Jo
for 0 < t < 1. In particular, X is an (F, Й) reversible semimartingale.

Bibliographic Notes 383
Proof. We first note that В is an (F, Й) reversible semimartingale as we saw
in the example on page 367. We have that [<r(X),B] = fQ cr'(Xs)cr(Xs)ds,
which is clearly of finite variation, since a' is continuous, and thus a'(X) has
paths bounded by a random constant on [0,1]. In the discussion preceding this
theorem we further established that cr(Xt) G 7Y1-t, and of course cr(Xt) ? Tf
Therefore by Theorem 22 we have
b(Xs)ds.
* = Xi-t -Xi= f a(Xi-a)dB
J
i-t
Observe that [a{X),B\t = J1_ta'(Xs)a(Xs)ds. Use the change of variable
и = 1 — s in the preceding integral and also in the term /lt b(Xs)ds to get
f
Jo
b(Xi-a)ds,
Jo
and the proof is complete. D
Bibliographic Notes
The theory of expansion of filtrations began with the work of K. Ito [100] for
initial expansions, and with the works of M. Barlow [7] and M. Yor [239] for
progressive expansions. Our treatment has benefited from some private notes
P. A. Meyer shared with the author, as well as the pedagogic treatment found
in [44]. A comprehensive treatment, including most of the early important
results, is in the book of Th. Jeulin [114]. Excellent later summaries of main
results, including many not covered here, can be found in the lecture notes
volume edited by Th. Jeulin and M. Yor [117] and also in the little book by
M. Yor [247].
Theorem 1 is due to J. Jacod, but we first learned of it through a paper of
P. A. Meyer [175], while Theorem 3 (Ito's Theorem for Levy processes) was
established in [107], with the help of T. Kurtz. Theorems 6 and 7 are of course
due to Th. Jeulin (see page 44 of [114]); see also M. Yor [245]. Theorem 10,
Jacod's criterion, and Theorem 11 are taken from [104]; Jacod's criterion is
the best general result on initial expansions of filtrations. The small result on
filtration shrinkage is new and is due to J. Jacod and P. Protter.
Two seminal papers on progressive expansions after M. Barlow's original
paper are those of Th. Jeulin and M. Yor [115] and [116]. The idea of using a
slightly larger filtration than the minimal one when one progressively expands,
dates to M. Yor [240], and the idea to use the process Zt —° l{L>t} originates
with J. Azema [3]. The lemma on local behavior of stochastic integrals at
random times is due to C. Dellacherie and P. A. Meyer [46], and the idea of
the proof of Theorem 18 is ascribed to C. Dellacherie in [44]. The results on
time reversal are inspired by the work of J. Jacod and P. Protter [107] and

384 Exercises for Chapter VI
also E. Pardoux [190]. Related results can be found in J. Picard [193]. More
recently time reversal has been used to extend Ito's formula for Brownian
motion to functions which are more general than just being convex, although
we did not present these results in this book. The interested reader can consult
for example H. Follmer, P. Protter and A.N. Shiryaev [77] and also H. Follmer
and P. Protter [76] for the multidimensional case. The case for diffusions is
treated by X. Bardina and M. Jolis [6].
Exercises for Chapter VI
Exercise 1. Let (?l,F,W,B,P) be a standard Brownian motion. Expand F
by the initial addition of a{B\}. Let M be the local martingale in the formula
Bt=B0+Mt + "\ "sds.
Jo
Bj-B,
1-s
Show that the Brownian motion M = (Mf)o<t<i ls independent of cr{Bi}.
Exercise 2. Show that the processes Jl denned in the proof of Ito's Theorem
for Levy processes (Theorem 3) are compound Poisson processes. Let N%
denote the Poisson process comprised of the arrival times of Jl, and let Gl
be the natural completed filtration of Nl. Further, show the following three
formulae (where we suppress the i superscripts) hold.
(a) {}^
(b) E{Jt\Jl,g1}=tJl.
(c) ??{Ji|J1}=tJi.
Exercise 3. In Exercise 1 assume Bo = x and condition on the event {Bi —
y}. Show that В is then a Brownian bridge beginning at x and ending
at y. (See Exercise 9 below for more results concerning filtration expansions
and Brownian bridges. Brownian bridges are also discussed in Exercise 24 of
Chap. II, and on page 299.)
Exercise 4. Let (ft, J7, F, Z, P) be a standard Poisson process. Expand F by
the initial addition on a{Z\\. Let M be the local martingale in the formula
= Mt+
i;
Show that M = (MtH<t<i is a time changed compensated Poisson process.
Exercise 5. Let В be standard Brownian motion. Show there exists an T\
measurable random variable L such that if G is the filtration F expanded
initially with L, then Qt — T\ for 0 < t < 1. Deduce that В is not a semi-
martingale for the expanded filtration G. Show further that L can be taken
to be of the form L = jQ f(s)dBs where / is non-random.

Exercises for Chapter VI 385
Exercises 6 through 9 are linked, with the climax being Exercise 9.
Exercise 6. Let В denote standard Brownian motion on its canonical path
space on continuous functions with domain R+ and range R, with Bt(u>) =
u)(t). F is the usual minimal filtration completed (and thus rendered right
continuous a fortiori). Define У/7 = P{Ba ? и\Тг}. Show that
where
.-{u-xf
= g(u,t,Bt) and g(u,t,x) =
for t < a. Show also that Y^ = l^B^eU} f°r t > a.
Exercise 7. (Continuation of Exercise 6.) Show that Уои_ = О except on the
null set {Ba = u} and infer that (У4")о<«о is a positive martingale which is
not uniformly integrable on [0, a].
Exercise 8. (Continuation of Exercises 6 and 7.) Show that
where
дд и-х -(и-a;J
-^-(u,s,x) = . = exp{~ f-j
dxy ' ^2n(a -1) X 2(a-t) S
for t < a.
Exercise 9. (Continuation of Exercises 6, 7, and 8.) Show that one can define
a probability Pu on C([0, a],M) such that Pu(Ba = u) = 1 with Pu absolutely
continuous with respect to P on Tt for t < a, and singular with respect to
P on Ta. (Hint: Sketch of procedure. Let Па = C([0, a),R), be the space of
continuous functions mapping [0, a] into R, and define M" = Ytu/Yt° on Tt for
(of course) t < a. Let Pu denote the (unique) probability on па = C([0, a), R)
given on Tt by dPu = M^dP, and confirm that
is a Brownian motion under Pu. Next show that
AZ=[-Ld[B,M»]. verifies M? ЩЩ "'*«
Finally show that

386 Exercises for Chapter VI
Bt - —Bo --u = (a-t) Г —d&
a a 'Jo a-s
under Pu, and thus (a -1) JQ (a - s)~ldf3s, which is defined only on [0, a), has
the same distribution under Pu as the Brownian bridge, whence
/"* 1
lim(a-t) / dCs =0
t^a Jo a- S
a.s., and deduce the desired result.)
Exercise 10 (expansion by a natural exponential random variable).
Let F be a filtration satisfying the usual hypotheses, and let Г be a totally
inaccessible stopping time, with P{T < oo) = 1. Let A = (^4t)t>o be the
compensator of 1{*>т} and let M be the martingale Mt = l{t>x} — A*. Note
that At = Aqo- Let G be the filtration obtained by initial expansion of F
with <r{Aoo}- For a bounded, non-random, Borel measurable function / let
М{ = /0* f(As)dMs which becomes in G,
{ =
ft
)l{t>T} - /
Jo
f
M{ = f(AT)l{t>T} - / f(As)dAs =
Jo
where F(t) = Jo f(s)ds, since A is continuous. Show that E{f(At)} =
E{F(At)}, and since / is arbitrary deduce that the distribution of the random
variable At is exponential of parameter 1.
Exercise 11. Let F, Г, A, and G be as in Exercise 10. Show that if X is an
F martingale with Хт- = Хт a.s., then X is also a G martingale.
Exercise 12. Show that every stopping time for a filtration satisfying the
usual hypotheses is the end of a predictable set, and is thus an honest time.
Exercise 13. Let В = (Bt)t>o be a standard three dimensional Brownian
motion. Let L = sup{t : \\Bt\\ < 1}, the last exit time from the unit ball.
Show that L is an honest time.
Exercise 14. Let ML be the fundamental L martingale of a progressive ex-
expansion of the underlying filtration F using the non-negative random variable
L.
(a) Show that for any F square integrable martingale X we have EXl =
EX^M^.
(b) Show that ML is the only F^ square integrable martingale with this
property for all F square integrable martingales.
Exercise 15. Give an example of a filtration F and a non-negative random
variable L and an F semimartingale X such that the post L process Yt =
Xt — XtAi = XtvL — Xt is not a semimartingale. [Hint: Use the standard
Brownian nitrations and let

Exercises for Chapter VI 387
/ f(s)dBs = \ I f(s)dBs}
o z Jo
where / is the non-random function of Exercise 5.)
Exercise 16. Let (п,Т,?,Р) be a filtered probability space satisfying the
usual hypotheses, let L be a non-negative random variable, let Goo — Foo V
cr{L}, and let G be given by
Qt = {Ae дж\ 3At eTt,An{t<L} = Atn{t< L}}.
Let X be a F semimartingale. Give a simple proof (due to M. Yor [239])
that Xtl{t<L,} (and hence X^l) 1S a semimartingale. (Hint: Show that it
is enough to consider X a uniformly integrable martingale, hence enough
to consider X a quasimartingale, and finally enough to consider X a positive
supermartingale. Let Zt = °1{l>*}, the F optional projection of 1{l>*}> which
is a supermartingale. Recall from Theorem 13 that P(Zl- > 0) = 1. Show
that
vz Xt
xt =
is a G positive supermartingale. Next show that XZ is an F special semi-
semimartingale, and let XZ — M + A be its canonical decomposition. Observe
that
Xtl{t<L} = —^—l{t<L} = ^ l{t<L) = Щ +At,
and argue that Mz is a G semimartingale as before, and that Az is the
product of the two G semimartingales A and (Zt)~1l[t<Ly, and hence it is
also a G semimartingale.)
Exercise 17. Let Z be a Levy process on [0, l] and let
Xt =
I a(Xs-)dZs + f b[Xs-)ds.
Jo Jo
Assume that a and b are smooth, and also assume (which is not always true!)
that the flows of X are injective. Find a stochastic differential equation solved
by Xi-t.
Exercise 18. Let (?l,Jr, F, B,P) be a standard Brownian space with В, а
standard Brownian motion. Let Z denote the random zero set of B, which is
a closed, perfect, nowhere dense set. Let {(Ln, Rn)}n>i denote the random
intervals contiguous to Z. Without loss of generality assume that the graphs
of Ln, denoted [Ln], are disjoint random sets. P. A. Meyer's conjecture was
that one could expand F to a filtration G in such a way that В would still
be a semimartingale and each Ln would be a G stopping time. Show this
is false (argument due to M. Barlow [8]) by taking Yt = \Bt\ and showing
(computed in the G filtration) that /0 l{Ys>0}dYs = Yt, which implies that
/0 l{ys=o}dYs — 0, which is a contradiction, since as a consequence of Tanaka's
formula /0 l^Ya=o}dYs — L°, the Brownian local time at 0.

References
1. H. Ahn and P. Protter. A remark on stochastic integration. In Seminaire
de Probabilites XXVIII, volume 1583 of Lecture Notes in Mathematics, pages
312-315. Springer-Verlag, Berlin, 1994.
2. L. Arnold. Stochastic Differential Equations: Theory and Applications. Wiley,
New York, 1973.
3. J. Azema. Quelques applications de la theorie generate des processus. Invent.
Math., 18:293-336, 1972.
4. J. Azema. Sur les fermes aleatoires. In Seminaire de Probabilites XIX, volume
1123 of Lecture Notes in Mathematics, pages 397-495. Springer-Verlag, Berlin,
1985.
5. J. Azema and M. Yor. Etude d'une martingale remarquable. In Seminaire
de Probabilites XXIII, volume 1372 of Lecture Notes in Mathematics, pages
88-130. Springer-Verlag, Berlin, 1989.
6. X. Bardina and M. Jolis. An extension of Ito's formula for elliptic diffusion
processes. Stochastic Processes and their Appi, 69:83-109, 1997.
7. M. T. Barlow. Study of a filtration expanded to include an honest time. Z.
Wahrscheinlichkeitstheorie verw. Gebiete, 44:307-323, 1978.
8. M. T. Barlow. On the left end points of Brownian excursions. In Seminaire
de Probabilites XIII, volume 721 of Lecture Notes in Mathematics, page 646.
Springer-Verlag, Berlin, 1979.
9. M. T. Barlow, S. D. Jacka, and M. Yor. Inequalities for a pair of processes
stopped at a random time. Proc. London Math. Soc., 52:142-172, 1986.
10. R. F. Bass. Probabilistic Techniques in Analysis. Springer-Verlag, New York,
1995.
11. R. F. Bass. The Doob-Meyer decomposition revisited. Canad. Math. Bull,
39:138-150, 1996.
12. J. Bertoin. Levy Processes. Cambridge University Press, Cambridge, 1996.
13. K. Bichteler. Stochastic integrators. Bull. Amer. Math. Soc, 1:761-765, 1979.
14. K. Bichteler. Stochastic integration and Lp-theory of semimartingales. Ann.
Probab., 9:49-89, 1981.
15. K. Bichteler. Stochastic Integration With Jumps. Cambridge University Press,
Cambridge, 2002.
16. K. Bichteler, J.-B. Gravereaux, and J. Jacod. Malliavin Calculus for Processes
With Jumps. Gordon and Breach, 1987.

390 References
17. P. Billingsley. Convergence of Probability Measures. Wiley, New York, 1968.
18. Yu. N. Blagovescenskii and M. I. Preidlin. Certain properties of diffusion pro-
processes depending on a parameter. Sov. Math., 2:633-636, 1961. Originally
published in Russian in Dokl. Akad. Nauk SSSR, 138:508-511, 1961.
19. R. M. Blumenthal and R. K. Getoor. Markov Processes and Potential Theory.
Academic Press, New York, 1968.
20. T. Bojdecki. Teoria General de Procesos e Integracion Estocastica. Universidad
Nacional Autonoma de Mexico, 1988. Segundo edicion.
21. N. Bouleau. Formules de changement de variables. Ann. Inst. H. Poincare
Probab. Statist., 20:133-145, 1984.
22. N. Bouleau and M. Yor. Sur la variation quadratique des temps locaux de
certaines semimartingales. C. R. Acad. Sci. Paris Sir. I Math., 292:491-494,
1981.
23. L. Breiman. Probability. SIAM, Philadelphia, 1992.
24. P. Bremaud. Point Processes and Queues: Martingale Dynamics. Springer-
Verlag, New York, 1981.
25. J. L. Bretagnolle. Processus a accroissements independents. In Ecole d'Ete de
Probabilites: Processus Stochastiques., volume 307 of Lecture Notes in Mathe-
Mathematics, pages 1-26. Springer-Verlag, Berlin, 1973.
26. B. Bru and M. Yor. La vie de W. Doeblin et le Pli cachete 11 668. La Lettre
de L'Academie des sciences, 2:16-17, 2001.
27. B. Bru and M. Yor. Comments on the life and mathematical legacy of Wolfgang
Doeblin. Finance Stoch., 6:3-47, 2002.
28. Mireille Chaleyat-Maurel and T. Jeulin. Caracterisation d'une classe de semi-
semimartingales. In Grossissements de filtrations: exemples et applications, volume
1118 of Lecture Notes in Mathematics, pages 59-109. Springer-Verlag, Berlin,
1979.
29. C. S. Chou. Caracterisation d'une classe de semimartingales. In Seminaire de
Probabilites XIII, volume 721 of Lecture Notes in Mathematics, pages 250-252.
Springer-Verlag, Berlin, 1979.
30. C. S. Chou and P. A. Meyer. Sur la representation des martingales comme
integrales stochastiques dans les processus ponctuels. In Seminaire de Prob-
Probabilites IX, volume 465 of Lecture Notes in Mathematics, pages 226-236.
Springer-Verlag, Berlin, 1975.
31. C. S. Chou, P. A. Meyer, and C. Strieker. Sur les integrales stochastiques de
processus previsibles non bornes. In Seminaire de Probabilites XIV, volume
784 of Lecture Notes in Mathematics, pages 128-139. Springer-Verlag, Berlin,
1980.
32. K. L. Chung and R. J. Williams. Introduction to Stochastic Integration.
Birkhauser, Boston, second edition, 1990.
33. E. (Jinlar. Introduction to Stochastic Processes. Prentice Hall, Englewood
Cliffs, 1975.
34. E. Qinlar, J. Jacod, P. Protter, and M. Sharpe. Semimartingales and Markov
processes. Z. Wahrscheinlichkeitstheorie verw. Gebiete, 54:161-220, 1980.
35. E. A. Coddington and N. Levinson. Theory of Ordinary Differential Equations.
McGraw-Hill, New York, 1955.
36. P. Courrege. Integrate stochastique par rapport a une martingale de carre
integrable. In Seminaire de Thiorie du Potentiel dirige par M. Brelot, G. Cho-
quet et J. Deny, 7e annee A962/1963), Paris, 1963. Faculte des Sciences de
Paris.

References 391
37. К. Е. Dambis. On the decomposition of continuous submartingales. Theory
Probab. AppL, 10:401-410, 1965.
38. F. Delbaen and W. Schachermayer. The fundamental theorem of asset pricing
for unbounded stochastic processes. Math. Ann., 312:215-250, 1998.
39. F. Delbaen and M. Yor. Passport options. Preprint, ETH, 2001.
40. С Dellacherie. Un exemple de la theorie generate des processus. In Seminaire
de Probabilites IV, volume 124 of Lecture Notes in Mathematics, pages 60-70.
Springer-Verlag, Berlin, 1970.
41. С Dellacherie. Capacites et processus stochastiques. Springer-Verlag, Berlin,
1972.
42. C. Dellacherie. Un survol de la theorie de 1'integrale stochastique. Stochastic
Process. AppL, 10:115-144, 1980.
43. С Dellacherie. Mesurabilite des debuts et theoreme de section. In Seminaire de
Probabilites XV, volume 850 of Lecture Notes in Mathematics, pages 351-360.
Springer-Verlag, Berlin, 1981.
44. C. Dellacherie, B. Maisonneuve, and P. A. Meyer. Probabilites et Potentiel.
Chapitres XVII a XXIV; Processus de Markov (fin); Complements de Calcul
Stochastique. Hermann, Paris, 1992.
45. C. Dellacherie and P. A. Meyer. Probabilities and Potential, volume 29 of
North-Holland Mathematics Studies. North-Holland, Amsterdam, 1978.
46. C. Dellacherie and P. A. Meyer. Probabilities and Potential B, volume 72 of
North-Holland Mathematics Studies. North-Holland, Amsterdam, 1982.
47. С Dellacherie, P. A. Meyer, and M. Yor. Sur certaines proprietes des espaces de
Banach H1 et BMO. In Seminaire de Probabilites XII, volume 649 of Lecture
Notes in Mathematics, pages 98-113. Springer-Verlag, Berlin, 1978.
48. C. Doleans-Dade. Integrates stochastiques dependant d'un parametre. Publ.
Inst. Stat. Univ. Paris, 16:23-34, 1967.
49. С Doleans-Dade. Quelques applications de la formule de changement de vari-
variables pour les semimartingales. Z. Wahrscheinlichkeitstheorie verw. Gebiete,
16:181-194, 1970.
50. С Doleans-Dade. Une martingale uniformement integrable mais non locale-
ment de carre integrable. In Seminaire de Probabilites V, volume 191 of Lecture
Notes in Mathematics, pages 138-140. Springer-Verlag, Berlin, 1971.
51. С Doleans-Dade. On the existence and unicity of solutions of stochastic dif-
differential equations. Z. Wahrscheinlichkeitstheorie verw. Gebiete, 36:93-101,
1976.
52. C. Doleans-Dade. Stochastic processes and stochastic differential equations. In
Stochastic Differential Equations: I ciclo 1978, Palazzone della Scuola Normale
Superiore, Cortona (Centre Internazionale Matematico Estivo), pages 5-75.
Liguori Editore, Napoli, 1981.
53. С Doleans-Dade and P. A. Meyer. Integrates stochastiques par rapport aux
martingales locales. In Seminaire de Probabilites IV, volume 124 of Lecture
Notes in Mathematics, pages 77-107. Springer-Verlag, 1970.
54. С Doleans-Dade and P. A. Meyer. Equations differentielles stochastiques. In
Seminaire de Probabilites XI, volume 581 of Lecture Notes in Mathematics,
pages 376-382. Springer-Verlag, Berlin, 1977.
55. J. L. Doob. Stochastic Processes. Wiley, 1953.
56. J. L. Doob. Classical Potential Theory and Its Probabilistic Counterpart.
Springer-Verlag, Berlin, 1984.

392 References
57. H. Doss. Liens entre equations differentielles stochastiques et ordinaires. Ann.
Inst. H. Poincare Probab. Statist, 13:99-125, 1977.
58. H. Doss and E. Lenglart. Sur l'existence, l'unicite et le comportement asymp-
totique des solutions d'equations differentielles stochastiques. Ann. Inst. H.
Poincare Probab. Statist, 14:189-214, 1978.
59. L. Dubins and G. Schwarz. On continuous martingales. Proc. Natl. Acad. Sci.
USA, 53:913-916, 1965.
60. D. Duffie and С Huang. Multiperiod security markets with differential infor-
information. J. Math. Econom., 15:283-303, 1986.
61. D. A. Edwards. A note on stochastic integrators. Math. Proc. Cambridge
Philos. Soc, 107:395-400, 1990.
62. N. Eisenbaum. Integration with respect to local time. Potential Anal., 13:303-
328, 2000.
63. R. J. ElUott. Stochastic Calculus and Applications. Springer-Verlag, Berlin,
1982.
64. M. Emery. Stabilite des solutions des equations differentielles stochastiques;
applications aux integrales multiplicatives stochastiques. Z. Wahrscheinlich-
keitstheorie verw. Gebiete, 41:241-262, 1978.
65. M. Emery. Equations differentielles stochastiques lipschitziennes: etude de la
stabilite. In Seminaire de Probabilites XIII, volume 721 of Lecture Notes in
Mathematics, pages 281-293. Springer-Verlag, Berlin, 1979.
66. M. Emery. Une topologie sur l'espace des semimartingales. In Seminaire de
Probabilites XIII, volume 721 of Lecture Notes in Mathematics, pages 260-280.
Springer-Verlag, Berlin, 1979.
67. M. Emery. Compensation de processus a variation finie non localement
integrables. In Seminaire de Probabilites XIV, volume 784 of Lecture Notes
in Mathematics, pages 152-160. Springer-Verlag, Berlin, 1980.
68. M. Emery. Non confluence des solutions d'une equation stochastique lips-
chitzienne. In Seminaire de Probabilites XV, volume 850 of Lecture Notes in
Mathematics, pages 587-589. Springer-Verlag, Berlin, 1981.
69. M. Emery. On the Azema martingales. In Seminaire de Probabilites XXIII,
volume 1372 of Lecture Notes in Mathematics, pages 66-87. Springer-Verlag,
Berlin, 1989.
70. M. Emery and M. Yor, editors. Seminaire de Probabilites 1967-1980: A Se-
Selection in Martingale Theory, volume 1771 of Lecture Notes in Mathematics.
Springer-Verlag, Berlin, 2002.
71. S. Ethier and T. G. Kurtz. Markov Processes: Characterization and Conver-
Convergence. Wiley, New York, 1986.
72. W. Feller. An Introduction to Probability Theory and Its Applications. Volume
II. Wiley, New York, second edition, 1971.
73. D. L. Fisk. Quasimartingales. Trans. Amer. Math. Soc, 120:369-389, 1965.
74. T. R. Fleming and D. P. Harrington. Counting Processes and Survival Analysis.
John Wiley, New York, 1991.
75. H. Follmer. Calcul d'lto sans probabilites. In Seminaire de Probabilites XV,
volume 850 of Lecture Notes in Mathematics, pages 144-150. Springer-Verlag,
Berlin, 1981.
76. H. Follmer and P. Protter. An extension of Ito's formula in n dimensions.
Probability Theory and Related Fields, 116:1-20, 2000.
77. H. Follmer, P. Protter, and A. N. Shiryaev. Quadratic variation and an exten-
extension of Ito's formula. Bernoulli, 1:149-169, 1995.

References 393
78. Т. Pujiwara and H. Kunita. Stochastic differential equations of jump type and
Levy processes in diffeomorphisms group. J. Math. Kyoto Univ., 25:71-106,
1985.
79. R. K. Getoor and M. Sharpe. Conformed martingales. Invent. Math., 16:271-
308, 1972.
80. R. Ghomrasni and G. Peskir. Local time-space calculus and extensions of Ito's
formula. Preprint, Aarhus, 2002.
81. 1.1. Gihman and A. V. Skorokhod. Stochastic Differential Equations. Springer-
Verlag, Berlin, 1972.
82. R. D. Gill and S. Johansen. A survey of product-integration with a view
towards applications in survival analysis. Ann. Statist., 18:1501-1555, 1990.
83. I. V. Girsanov. On transforming a certain class of stochastic processes by
absolutely continuous substitutions of measures. Theory Probab. Appi, 5:285-
301, 1960.
84. M. Greenberg. Lectures on Algebraic Topology. Benjamin, New York, 1967.
85. Т. Н. Gronwall. Note on the derivatives with respect to a parameter of the
solutions of a system of differential eqautions. Ann. of Math. (&), 20:292-296,
1919.
86. A. Gut. An introduction to the theory of asymptotic martingales. In Amarts
and Set Function Processes, volume 1042 of Lecture Notes in Mathematics,
pages 1-49. Springer-Verlag, Berlin, 1983.
87. S. W. He, J. G. Wang, and J. A. Yan. Semimartingale Theory and Stochastic
Calculus. CRC Press, Boca Raton, 1992.
88. T. Hida. Brownian Motion. Springer-Verlag, Berlin, 1980.
89. G. A. Hunt. Markoff processes and potentials I. Illinois J. Math., 1:44-93,
1957.
90. G. A. Hunt. Markoff processes and potentials II. Illinois J. Math., 1:316-369,
1957.
91. G. A. Hunt. Markoff processes and potentials III. Illinois J. Math., 2:151-213,
1958.
92. N. Ikeda and S. Watanabe. Stochastic Differential Equations and Diffusion
Processes. North-Holland, Amsterdam, 1981.
93. K. ltd. Stochastic integral. Proc. Imp. Acad. Tokyo, 20:519-524, 1944.
94. K. ltd. On stochastic integral equations. Proc. Japan Acad., 22:32-35, 1946.
95. K. Ito. Stochastic differential equations in a differentiable manifold. Nagoya
Math. J., 1:35-47, 1950.
96. K. Ito. Multiple Wiener integral. J. Math. Soc. Japan, 3:157-169, 1951.
97. K. Ito. On a formula concerning stochastic differentials. Nagoya Math. J.,
3:55-65, 1951.
98. K. Ito. On Stochastic Differential Equations. Memoirs of the American Math-
Mathematical Society, 4, 1951.
99. K. Ito. Stochastic differentials. Appl. Math. Optim., 1:374-381, 1974.
100. K. Ito. Extension of stochastic integrals. In Proceedings of International Sym-
Symposium on Stochastic Differential Equations, pages 95-109, New York, 1978.
Wiley.
101. K. Ito and M. Nisio. On stationary solutions of a stochastic differential equa-
equation. J. Math. Kyoto Univ., 4:1-75, 1964.
102. K. Ito and S. Watanabe. Transformation of Markov processes by multiplicative
functionals. Ann. Inst. Fourier, 15:15-30, 1965.

394 References
103. J. Jacod. Calcul Stochastique et Problemes de Martingales, volume 714 of
Lecture Notes in Mathematics. Springer-Verlag, Berlin, 1979.
104. J. Jacod. Grossissement initial, hypothese (H1), et theoreme de Girsanov. In
Grossissements de filtrations: exemples et applications, volume 1118 of Lecture
Notes in Mathematics, pages 15-35. Springer-Verlag, Berlin, 1985.
105. J. Jacod, T. G. Kurtz, S. Meleard, and P. Protter. The approximate Euler
method for Levy driven stochastic differential equations. Preprint, 2003.
106. J. Jacod, S. Meleard, and P. Protter. Explicit form and robustness of martin-
martingale representations. Ann. Probab., 28:1747-1780, 2000.
107. J. Jacod and P. Protter. Time reversal on Levy processes. Ann. Probab.,
16:620-641, 1988.
108. J. Jacod and P. Protter. Une remarque sur les equations differentielles stochas-
tiques a solutions Markoviennes. In Seminaire de Probabilites XXV, volume
1485 of Lecture Notes in Mathematics, pages 138-139. Springer-Verlag, Berlin,
1991.
109. J. Jacod and P. Protter. Probability Essentials. Springer-Verlag, Heidelberg,
second edition, 2003.
110. J. Jacod and A. N. Shiryaev. Limit Theorems for Stochastic Processes.
Springer-Verlag, New York, second edition, 2003.
111. J. Jacod and M. Yor. Etude des solutions extremales et representation integrale
des solutions pour certains problemes de martingales. Z. Wahrscheinlichkeits-
theorie verw. Gebiete, 38:83-125, 1977.
112. Y. Le Jan. Temps d'arret stricts et martingales de sauts. Z. Wahrscheinlich-
keitstheorie verw. Gebiete, 44:213-225, 1978.
113. S. Janson and M. J. Wichura. Invariance principles for stochastic area and
related stochastic integrals. Stochastic Process. Appl., 16:71-84, 1983.
114. T. Jeulin. Semi-Martingales et Grossissement d'une Filtration, volume 833 of
Lecture Notes in Mathematics. Springer-Verlag, Berlin, 1980.
115. T. Jeulin and M. Yor. Grossissement d'une filtration et semi-martingales:
formules explicites. In Seminaire de Probabilites XII, volume 649 of Lecture
Notes in Mathematics, pages 78-97. Springer-Verlag, Berlin, 1978.
116. T. Jeulin and M. Yor. Inegalite de hardy, semimartingales, et faux amis. In
Seminaire de Probabilites XIII, volume 721 of Lecture Notes in Mathematics,
pages 332-359. Springer-Verlag, Berlin, 1979.
117. T. Jeulin and M. Yor. Recapitulatif. In Grossissements de filtrations: exemples
et applications, volume 1118 of Lecture Notes in Mathematics, pages 305-315.
Springer-Verlag, Berlin, 1985.
118. G. Johnson and L. L. Helms. Class D supermartingales. Bull. Amer. Math.
Soc, 69:59-62, 1963.
119. T. Kailath, A. Segall, and M. Zakai. Pubini-type theorems for stochastic inte-
integrals. Sankhya Ser. A, 40:138-143, 1978.
120. G. Kallianpur and C. Striebel. Stochastic differential equations occuring in
the estimation of continuous parameter stochastic processes. Theory Probab.
Appl, 14:567-594, 1969.
121. I. Karatzas and S. Shreve. Brownian Motion and Stochastic Calculus. Springer-
Verlag, New York, second edition, 1991.
122. A. F. Karr. Point Processes and Their Statistical Inference. Marcel Dekker,
New York, second edition, 1991.

References 395
123. N. Kazamaki. Note on a stochastic integral equation. In Seminaire de
Probabilites VI, volume 258 of Lecture Notes in Mathematics, pages 105-108.
Springer-Verlag, Berlin, 1972.
124. N. Kazamaki. On a problem of Girsanov. Tohoku Math. J. B), 29:597-600,
1977.
125. N. Kazamaki. Continuous Exponential Martingales and BMO, volume 1579 of
Lecture Notes in Mathematics. Springer-Verlag, Berlin, 1994.
126. J. F. C. Kingman. An intrinsic description of local time. J. London Math.
Soc, 6:725-731, 1973.
127. J. F. C. Kingman and S. J. Taylor. Introduction to Measure and Probability.
Cambridge University Press, Cambridge, 1966.
128. P. E. Kloeden and E. Platen. Numerical Solution of Stochastic Differential
Equations. Springer-Verlag, Berlin, 1992.
129. F. B. Knight. Calculating the compensator: method and example. In Seminar
on Stochastic Processes, 1990, pages 241-252, Boston, 1991. Birkhauser.
130. P. E. Корр. Martingales and Stochastic Integrals. Cambridge University Press,
Cambridge, 1984.
131. H. Kunita. On the decomposition of solutions of stochastic differential equa-
equations. In Stochastic Integrals, volume 851 of Lecture Notes in Mathematics,
pages 213-255. Springer-Verlag, Berlin, 1981.
132. H. Kunita. Stochastic differential equations and stochastic flow of diffeomor-
phisms. In Ecole d'Ete de Probabilites de Saint-Flour ХП-1982, volume 1097
of Lecture Notes in Mathematics, pages 143-303. Springer-Verlag, Berlin, 1984.
133. H. Kunita. Lectures on stochastic flows and applications, volume 78 of Tata
Institute of Fundamental Research Lectures on Mathematics and Physics.
Springer-Verlag, Berlin, 1986.
134. H. Kunita and S. Watanabe. On square integrable martingales. Nagoya Math.
J., 30:209-245, 1967.
135. T. G. Kurtz. Lectures on Stochastic Analysis. Preprint, 2001.
136. T. G. Kurtz and P. Protter. Weak limit theorems for stochastic integrals and
stochastic differential equations. Ann. Probab., 19:1035-1070, 1991.
137. T. G. Kurtz and P. Protter. Wong-Zakai corrections, random evolutions, and
simulation schemes for SDEs. In Stochastic Analysis: Liber Amicorum for
Moshe Zakai, pages 331-346. Academic Press, 1991.
138. A. U. Kussmaul. Stochastic Integration and Generalized Martingales. Pitman,
London, 1977.
139. G. Last and A. Brandt. Marked Point Processes on the Real Line. Springer-
Verlag, New York, 1995.
140. R. Leandre. Un exemple en theorie des flots stochastiques. In Seminaire de
Probabilites XVII, volume 986 of Lecture Notes in Mathematics, pages 158-161.
Springer-Verlag, Berlin, 1983.
141. R. Leandre. Flot d'une equation differentielle stochastique avec semimartin-
gale directrice discontinue. In Seminaire de Probabilites XIX, volume 1123 of
Lecture Notes in Mathematics, pages 271-274. Springer-Verlag, Berlin, 1985.
142. E. Lenglart. Transformation des martingales locales par changement absol-
ument continu de probabilites. Z. Wahrscheinlichkeitstheorie verw. Gebiete,
39:65-70, 1977.
143. E. Lenglart. Une caracterisation des processus previsibles. In Seminaire de
Probabilites XI, volume 581 of Lecture Notes in Mathematics, pages 415-417.
Springer-Verlag, Berlin, 1977.

396 References
144. E. Lenglart. Appendice a l'expose precedent: inegalites de semimartingales. In
Seminaire de Probabilites XIV, volume 784 of Lecture Notes in Mathematics,
pages 49-52. Springer-Verlag, Berlin, 1980.
145. E. Lenglart. Semi-martingales et integrates stochastiques en temps continu.
Rev. CETHEDEC, 75:91-160, 1983.
146. E. Lenglart, D. Lepingle, and M. Prattle. Presentation unifiee de certaines
inegalites de la theorie des martingales. In Seminaire de Probabilites XIV,
volume 784 of Lecture Notes in Mathematics, pages 26-48. Springer-Verlag,
Berlin, 1980.
147. D. Lepingle and J. Memin. Sur l'integrabilite uniforme des martingales expo-
nentielles. Z. Wahrscheinlichkeitstheorie verw. Gebiete, 42:175-203, 1978.
148. G. Letta. Martingales et integration stochastique. Scuola Normale Superiore,
Pisa, 1984.
149. P. Levy. Processus stochastiques et mouvement Brownien. Gauthier-Villars,
Paris, 1948.
150. P. Levy. Wiener's random function, and other Laplacian random functions. In
Proceedings of the Second Berkeley Symposium on Mathematical Statistics and
Probability, pages 171-187, Berkeley, 1951. University of California Press.
151. B. Maisonneuve. Une mise au point sur les martingales continues definies sur
un intervalle stochastique. In Seminaire de Probabilites XI, volume 581 of
Lecture Notes in Mathematics, pages 435-445. Springer-Verlag, Berlin, 1977.
152. G. Maruyama. On the transition probability functions of the Markov process.
Nat. Sci. Rep. Ochanomizu Univ., 5:10-20, 1954.
153. H. P. McKean. Stochastic Integrals. Academic Press, New York, 1969.
154. E. J. McShane. Stochastic Calculus and Stochastic Models. Academic Press,
New York, 1974.
155. E. J. McShane. Stochastic differential equations. J. Multivariate Anal., 5:121-
177, 1975.
156. J. Memin. Decompositions multiplicatives de semimartingales exponentielles
et applications. In Seminaire de Probabilites XII, volume 649 of Lecture Notes
in Mathematics, pages 35-46. Springer-Verlag, Berlin, 1978.
157. J. Memin and A. N. Shiryaev. Un critere previsible pour l'uniforme integrabilite
des semimartingales exponentielles. In Seminaire de Probabilites XIII, volume
721 of Lecture Notes in Mathematics, pages 147-161. Springer-Verlag, Berlin,
1979.
158. M. Metivier. Semimartingales: A Course on Stochastic Processes. Walter de
Gruyter, Berlin and New York, 1982.
159. M. Metivier and J. Pellaumail. On Doleans-Follmer's measure for quasimartin-
gales. Illinois J. Math., 19:491-504, 1975.
160. M. Metivier and J. Pellaumail. Mesures stochastiques a valeurs dans les espaces
Lo. Z. Wahrscheinlichkeitstheorie verw. Gebiete, 40:101-114, 1977.
161. M. Metivier and J. Pellaumail. On a stopped Doob's inequality and general
stochastic equations. Ann. Probab., 8:96-114, 1980.
162. M. Metivier and J. Pellaumail. Stochastic Integration. Academic Press, New
York, 1980.
163. P. A. Meyer. A decomposition theorem for supermartingales. Illinois J. Math.,
6:193-205, 1962.
164. P. A. Meyer. Decomposition of supermartingales: the uniqueness theorem.
Illinois J. Math., 7:1-17, 1963.

References 397
165. P. A. Meyer. Probability and Potentials. Blaisdell, Waltham, 1966.
166. P. A. Meyer. Integrates stochastiques I. In Seminaire de Probabilites I, vol-
volume 39 of Lecture Notes in Mathematics, pages 72-94. Springer-Verlag, Berlin,
1967.
167. P. A. Meyer. Integrates stochastiques II. In Seminaire de Probabilites I, vol-
volume 39 of Lecture Notes in Mathematics, pages 95-117. Springer-Verlag, Berlin,
1967.
168. P. A. Meyer. Integrates stochastiques III. In Seminaire de Probabilites I,
volume 39 of Lecture Notes in Mathematics, pages 118-141. Springer-Verlag,
Berlin, 1967.
169. P. A. Meyer. Integrates stochastiques IV. In Seminaire de Probabilites I,
volume 39 of Lecture Notes in Mathematics, pages 142-162. Springer-Verlag,
Berlin, 1967.
170. P. A. Meyer. Le dual de "Я1" est "BMO" (cas continu). In Seminaire de
Probabuites VII, volume 321 of Lecture Notes in Mathematics, pages 136-145.
Springer-Verlag, Berlin, 1973.
171. P. A. Meyer. Un cours sur les integrates stochastiques. In Seminaire de Proba-
Probabilites X, volume 511 of Lecture Notes in Mathematics, pages 246-400. Springer-
Verlag, Berlin, 1976.
172. P. A. Meyer. Le theoreme fondamental sur les martingales locales. In Seminaire
de Probabilites XI, volume 581 of Lecture Notes in Mathematics, pages 463-464.
Springer-Verlag, Berlin, 1977.
173. P. A. Meyer. Sur un theoreme de C. Strieker. In Seminaire de Probabilites XI,
volume 581 of Lecture Notes in Mathematics, pages 482-489. Springer-Verlag,
Berlin, 1977.
174. P. A. Meyer. Inegalites de normes pour les integrates stochastiques. In
Seminaire de Probabilites XII, volume 649 of Lecture Notes in Mathematics,
pages 757-762. Springer-Verlag, Berlin, 1978.
175. P. A. Meyer. Sur un theoreme de J. Jacod. In Seminaire de Probabilites XII,
volume 649 of Lecture Notes in Mathematics, pages 57-60. Springer-Verlag,
Berlin, 1978.
176. P. A. Meyer. Caracterisation des semimartingales, d'apres Dellacherie. In
Seminaire de Probabilites XIII, volume 721 of Lecture Notes in Mathematics,
pages 620-623. Springer-Verlag, Berlin, 1979.
177. P. A. Meyer. Flot d'une equation differentielle stochastique. In Seminaire de
Probabilites XV, volume 850 of Lecture Notes in Mathematics, pages 103-117.
Springer-Verlag, Berlin, 1981.
178. P. A. Meyer. Geometrie differentielle stochastique. In Colloque en I'honneur
de Laurent Schwartz (Ecole Polytechnique, 30 mai-3 juin 1983). Vol. 1, volume
131 of Asterisque, pages 107-114. Societe Mathematique de Prance, Paris, 1985.
179. P. A. Meyer. Construction de solutions d'equation de structure. In Seminaire
de Probabilites XXIII, volume 1372 of Lecture Notes in Mathematics, pages
142-145. Springer-Verlag, Berlin, 1989.
180. T. Mikosch. Elementary Stochastic Calculus with Finance in View. World
Scientific, Singapore, 1998.
181. P. W. Millar. Stochastic integrals and processes with stationary independent
increments. In Proceedings of the Sixth Berkeley Symposium on Mathematical
Statistics and Probability. Vol. Ill: Probability theory, pages 307-331, Berkeley,
1972. University of California Press.

398 References
182. D. S. Mitrinovic, J. E. Pecaric, and A. M. Fink. Inequalities Involving Functions
and Their Integrals and Derivatives. Kluwer, Dordrecht, 1991.
183. J. R. Munkries. Topology: A First Course. Prentice-Hall, Englewood Cliffs,
1975.
184. A. A. Novikov. On an identity for stochastic integrals. Theory Probab. Appl,
17:717-720, 1972.
185. C. O'Cinneide and P. Protter. An elementary approach to naturality, pre-
predictability, and the fundamental theorem of local martingales. Stoch. Models,
17:449-458, 2001.
186. B. 0ksendal. Stochastic Differential Equations: An Introduction with Applica-
Applications. Springer-Verlag, New York, sixth edition, 2003.
187. S. Orey. .F-processes. In Proceedings of the Fifth Berkeley Symposium on
Mathematical Statistics and Probability. Vol. II: Contributions to probability
theory. Part 1, pages 301-313, Berkeley, 1965. University of California Press.
188. Y. Ouknine. Generalisation d'un lemme de S. Nakao et applications. Stochas-
tics, 23:149-157, 1988.
189. Y. Ouknine. Temps local du produit et du sup de deux semimartingales. In
Seminaire de Probabilites XXIV, volume 1426 of Lecture Notes in Mathematics,
pages 477-479. Springer-Verlag, Berlin, 1990.
190. E. Pardoux. Grossissement d'une filtration et retournement du temps d'une
diffusion. In Seminaire de Probabilites XX, volume 1204 of Lecture Notes in
Mathematics, pages 48-55. Springer-Verlag, Berlin, 1986.
191. E. Pardoux. Book Review of Stochastic Integration and Differential Equations,
first edition, by P. Protter. Stoch. Stoch. Rep., 42:237-240, 1993.
192. J. Pellaumail. Sur I'integrale stochastique et la decomposition de Doob-Meyer,
volume 9 of Asterisque. Societe Mathematique de Prance, Paris, 1973.
193. J. Picaxd. Une classe de processus stable par retournement du temps. In
Seminaire de Probabilites XX, volume 1204 of Lecture Notes in Mathematics,
pages 56-67. Springer-Verlag, Berlin, 1986.
194. Z. R. Pop-Stojanovic. On McShane's belated stochastic integral. SIAM J.
Appl. Math., 22:87-92, 1972.
195. M. H. Protter and C. B. Morrey. A First Course in Real Analysis. Springer-
Verlag, Berlin, 1977.
196. P. Protter. Markov solutions of stochastic differential equations. Z. Wahr-
scheinlichkeitstheorie verw. Gebiete, 41:39-58, 1977.
197. P. Protter. On the existence, uniqueness, convergence, and explosions of so-
solutions of systems of stochastic integral equations. Ann. Probab., 5:243-261,
1977.
198. P. Protter. Right-continuous solutions of systems of stochastic integral equa-
equations. J. Multivariate Anal, 7:204-214, 1977.
199. P. Protter. Hp stability of solutions of stochastic differential equations. Z.
Wahrscheinlichkeitstheorie verw. Gebiete, 44:337-352, 1978.
200. P. Protter. A comparison of stochastic integrals. Ann. Probab., 7:176-189,
1979.
201. P. Protter. Semimartingales and stochastic differential equations. Technical
Report 85-25, Purdue University Department of Statistics, 1985.
202. P. Protter. Stochastic integration without tears. Stochastics, 16:295-325, 1986.
203. P. Protter. A partial introduction to financial asset pricing theory. Stochastic
Process. Appl., 91:169-203, 2001.

References 399
204. P. Protter and M. Sharpe. Martingales with given absolute value. Ann. Probab.,
7:1056-1058, 1979.
205. P. Protter and D. Talay. The Euler scheme for Levy driven stochastic differ-
differential equations. Ann. Probab., 25:393-423, 1997.
206. К. М. Rao. On decomposition theorems of Meyer. Math. Scand., 24:66-78,
1969.
207. К. М. Rao. Quasimartingales. Math. Scand., 24:79-92, 1969.
208. D. Revuz and M. Yor. Continuous Martingales and Brownian Motion.
Springer-Verlag, New York, third edition, 1999.
209. L. C. G. Rogers and D. Williams. Diffusions, Markov Processes, and Martin-
Martingales. Volume 1: Foundations. Cambridge University Press, Cambridge, second
edition, 2000.
210. L. C. G. Rogers and D. Williams. Diffusions, Markov Processes, and Martin-
Martingales. Volume 2: ltd Calculus. Cambridge University Press, Cambridge, second
edition, 2000.
211. H. Rootzen. Limit distributions for the error in approximations of stochastic
integrals. Ann. Probab., 8:241-251, 1980.
212. G. Samorodnitsky and M. S. Taqqu. Stable Non-Gaussian Random Processes:
Stochastic Models with Infinite Variance. Chapman and Hall, New York, 1994.
213. L. Schwartz. Semimartingales and their stochastic calculus on manifolds.
Presses de l'Universite de Montreal, Montreal, 1984.
214. L. Schwartz. Convergence de la serie de picard pour les eds. In Seminaire
de Probabilites XXIII, volume 1372 of Lecture Notes in Mathematics, pages
343-354. Springer-Verlag, Berlin, 1989.
215. M. Sharpe. General Theory of Markov Processes. Academic Press, New York,
1988.
216. J. M. Steele. Stochastic Calculus and Financial Applications. Springer-Verlag,
New York, 2001.
217. R. L. Stratonovich. A new representation for stochastic integrals. SIAM J.
Control, 4:362-371, 1966.
218. C. Strieker. Quasimartingales, martingales locales, semimartingales, et filtra-
tions naturelles. Z. Wahrscheinlichkeitstheorie verm. Gebiete, 3:55-64, 1977.
219. C. Strieker and M. Yor. Calcul stochastique dependant d'un parametre. Z.
Wahrscheinlichkeitstheorie verw. Gebiete, 45:109-134, 1978.
220. D. W. Stroock and S. R. S. Varadhan. Multidimensional Diffusion Processes.
Springer-Verlag, Berlin, 1979.
221. H. Sussman. On the gap between deterministic and stochastic ordinary differ-
differential equations. Ann. Probab., 6:19-41, 1978.
222. A. S. Sznitman. Martingales dependant d'un parametre: une formule d'lto. Z.
Wahrscheinlichkeitstheorie verw. Gebiete, 60:41-70, 1982.
223. F. Treves. Topological Vector Spaces, Distributions and Kernels. Academic
Press, New York, 1967.
224. A. Uppman. Deux applications des semi-martingales exponentielles aux
equations differentielles stochastiques. С R. Acad. Sci. Paris Sir. A-B,
290:A661-A664, 1980.
225. A. Uppman. Sur le fiot d'une equation differentielle stochastique. In Seminaire
de Probabilites XVI, volume 920 of Lecture Notes in Mathematics, pages 268-
284. Springer-Verlag, Berlin, 1982.
226. H. von Weizsacker and G. Winkler. Stochastic Integrals. Vieweg, Braunschweig,
1990.

400 References
227. J. Walsh. A diffusion with a discontinuous local time. In Temps locaux: Exposes
du seminaire J. Azema, M. Yor A976-1977), volume 52-53 of Aster is que, pages
37-45. Societe Mathematique de France, Paris, 1978.
228. N. Wiener. Differential space. J. Math. Phys., 2:131-174, 1923.
229. N. Wiener. The Dirichlet problem. J. Math. Phys., 3:127-146, 1924.
230. N. Wiener. The quadratic variation of a function and its Fourier coefficients.
J. Math. Phys., 3:72-94, 1924.
231. E. Wong. Stochastic Processes in Information and Dynamical Systems. Mc-
Graw Hill, New York, 1971.
232. T. Yamada. Sur une construction des solutions d'equations differentielles
stochastiques dans le cas non-lipschitzien. In Seminaire de Probabilites XII,
volume 649 of Lecture Notes in Mathematics, pages 114-131. Springer-Verlag,
Berlin, 1978.
233. J. A. Yan. Caracterisation d'une classe d'ensembles convexes de L1 ou H1. In
Seminaire de Probabilites XIV, volume 784 of Lecture Notes in Mathematics,
pages 220-222. Springer-Verlag, Berlin, 1980.
234. Ch. Yoeurp. Decomposition des martingales locales et formules exponentielles.
In Seminaire de Probabilites X, volume 511 of Lecture Notes in Mathematics,
pages 432-480. Springer-Verlag, Berlin, 1976.
235. Ch. Yoeurp. Solution explicite de l'equation Zt = l+/0* \Zs-\dXs. In Seminaire
de Probabilites XIII, volume 721 of Lecture Notes in Mathematics, pages 614-
619. Springer-Verlag, Berlin, 1979.
236. Ch. Yoeurp and M. Yor. Espace orthogonal a une semimartingale: applications.
Unpublished, 1977.
237. M. Yor. Sur les integrates stochastiques optionnelles et une suite remarquable
de formules exponentielles. In Seminaire de Probabilites X, volume 511 of
Lecture Notes in Mathematics, pages 481-500. Springer-Verlag, Berlin, 1976.
238. M. Yor. Sur quelques approximations d'integrales stochastiques. In Seminaire
de Probabilites XI, volume 581 oi Lecture Notes in Mathematics, pages 518-528.
Springer-Verlag, Berlin, 1977.
239. M. Yor. Grossissement d'une filtration et semi-martingales: theoremes gener-
aux. In Seminaire de Probabilites XII, volume 649 of Lecture Notes in Mathe-
Mathematics, pages 61-69. Springer-Verlag, Berlin, 1978.
240. M. Yor. Grossissement d'une filtration et semi-martingales: theoremes gener-
aux. In Seminaire de Probabilites XII, volume 649 of Lecture Notes in Mathe-
Mathematics, pages 61-69. Springer-Verlag, Berlin, 1978.
241. M. Yor. Inegalites entre processus minces et applications. C. R. Acad. Sci.
Paris Sir. A-B, 286:A799-A801, 1978.
242. M. Yor. Rappels et preliminaires generaux. Asterisque, 52-53:17-22, 1978.
243. M. Yor. Sur la continuite des temps locaux associes a certaines semi-
martingales. In Temps Locaux, volume 52 and 53 of Asterisque, pages 23-36.
Societe Mathematique de France, Paris, 1978.
244. M. Yor. Un exemple de processus qui n'est pas une semi-martingale. In Temps
Locaux, volume 52 and 53 of Asterisque, pages 219-222. Societe Mathematique
de France, Paris, 1978.
245. M. Yor. Application d'un lemme de Jeulin au grossissement de la filtration
brownienne. In Seminaire de Probabilites XIV, volume 784 of Lecture Notes in
Mathematics, pages 189-199. Springer-Verlag, Berlin, 1980.

References 401
246. M. Yor. Remarques sur une formule de Paul Levy. In Seminaire de Probabilites
XIV, volume 784 of Lecture Notes in Mathematics, pages 343-346. Springer-
Verlag, Berlin, 1980.
247. M. Yor. Some Aspects of Brownian Motion. Part II: Some Recent Martingale
Problems. Birkhauser, Berlin, Germany, 1997.
248. W. Zheng. Une remarque sur une meme integrale stochastique calculee dans
deux nitrations. In Seminaire de Probabilites XVIII, volume 1059 of Lecture
Notes in Mathematics, pages 172-178. Springer-Verlag, Berlin, 1984.

Symbol Index
AL 371 Vn 220,301
|| Л ||p 245 dx(H,J) 155
J™\dAs\ 101,245 ?(X) 85
AX,AX 180 SR{Z) 320
Л 154 SH{Z) 322
Л 118 ?f-s(?) 280
\A\, (\A\t)t>o 40,101 (|? •/)(*, w,x) 279
|Л|оо 101 №L>о 355
Е{\А\х}<оо 111 F-Л 40
?{|Л|т„}<оо 111 F 3,356
bL 56,155 FL 370
ЪО 102 1РМ 36
hV 102,154 TT 5
BMO 195 -7? 16
A 252 Tt- 105
D 56,244 ^ 293,309
Dn 250,301 F 377
Ducp 57 F-S acceptable 279

404 Symbol Index
F-S integral 82 1(M) 181,185
FV 39,101 Jx 58
9t 228 jp(N,A) 245
(H2,X)integrable 163 L(X) 163
HA 40 La{X)t 212,230
HX 58,60,156 Ц 212
HQX 60,165,170 L 56,101,102,153,155,244
HG-X 61 L(G) 61
He-X 175 UcP 57
№У] 378 Lo 52M6
ЖР 244 (M,N) 178
/Zf.dX, 58 Мь 372
J0°°H3dX3 58,60 мт- 172
J*Hs°dXs 82,271 M2 178,186
j*HsdXs 60,156 Л^2(Л) 182
/О'+Я3ЙХ3 60,78 \\М\\Вмо 195
H(L) 362 iV 154
H 357 С Ю2
H2 154,244,193 OL 372
W2(P) 169 -P" 36
W 193 V 102,154,206
H°° 314 P 293,309
(HI) 302,328 <2~P 131
(H2) 302,328 Qt{u,dx) 362
(H3) 329 Q 4,214
/x 52,144 Rn 19

Symbol Index 405
M+ 19 \\X\\sp 244
S 51 \\X\\H2 244
S(H) 53 (X,X) 70,122
Su 52,56 (Xе, Xе) 70
Sucp 57 {X- < a} 214
S(a) 248 °Xn 369
5(Л) 179 xVy 211
Tx 265 xAy 211
Гл 104 ж+ 211
[T] 104 ж" 211
иер 57 У 64,267
Var(X) 116 Ya+ 267
Varr(X) 116 Y- 64,159
Х(ст„) 269 У' 377
X* 11,244 [Z,Z] 84
Х+* 373 Zta 221
Х+* 373 (Z : Я)* 319
Xе 70,221 J*(dZs)Hs 319
Хт 10,178 П 293,309
Хт" 54,172 a-slice 248
Хт" 369 4 305
Xf 293,301 ф 301,382
[Х,Х] 66 <р 311
[Х,Х]С 70,70,216,271 тгп 17
[Х,У] 66 <t{L\L°°) 150
[Xе, Xе] 70 an 64
AXt 25,60,158 С,{х,ш) 303
\\X\\hp 245 < 131

Subject Index
cc-sliceable 248
BMO see bounded mean oscillation
F optional projection 369
F special, G semimartingale 369
F-S see Fisk-Stratonovich
FV see finite variation process
U2 norm 154
W norm 193
HF
(pre)locally in 247
converge (pre)locally in 260
norm 245
gp
(pre) locally in 247
converge (pre) locally in 260
norm 244
ucp see uniform convergence on
compacts in probability
absolutely continuous 131
Absolutely Continuous Compensators
Theorem 192
absolutely continuous space 191
accessible stopping time 103
adapted process
decomposable 55
definition of 3
space L of with caglad paths 155
with bounded caglad paths 56
with cadlag paths 56
with caglad paths 56
angle bracket 123
announcing sequence for stopping time
103
approximation of the compensator by
Laplacians 150
arcsine law 228
associated jump process 27
asymptotic martingale (AMART) 218
autonomous 250,293
Azema's martingale 204
and last exit from zero 228
definition of 228
local time of 230
Backwards Convergence Theorem 9
Banach-Steinhaus Theorem 43
Barlow's Theorem 240
Bichteler-Dellacherie Theorem 144
Bouleau-Yor formula 226, 227
bounded functional Lipschitz 257
bounded jumps 25
bounded mean oscillation
BMO norm 195
BMO space of martingales 195
Duality Theorem 199
bracket process 66
Brownian bridge 97, 299,384
Brownian motion
absolute value of 217
arcsine law for last exit from zero
228
Azema's martingale 228
Brownian bridge see Brownian
bridge
completed natural filtration right
continuous 19

408 Subject Index
conditional quadratic variation of
123
continuous modification 17
continuous paths 221
definition of 16
Fisk-Stratonovich exponential of
280
Fisk-Stratonovich integral for 288
geometric 86
Girsanov-Meyer Theorem and 141
Ito integral for 78
Levy's characterization of 88
Levy's Theorem (martingale
characterization) 86,87
last exit from zero 228
last exit time 46
local time of 217, 225
martingale characterization (Levy's
Theorem) 86,87
martingale representation for 187
martingales as time change of 88,
187
maximum process 23
on the sphere 281
Ornstein-Uhlenbeck 298
quadratic variation of 17, 67, 123
reflection principle 23, 228
reversibility for integrals 380
semimartingale 55
standard 17
starting at ж 17
stochastic area formula 89
stochastic exponential for see
geometric Brownian motion
stochastic integral exists 173
strong Markov property for 23
Tanaka's formula 217
tied down or pinned see Brownian
bridge
time change of 88
unbounded variation 19
white noise 141, 243
Burkholder's inequality 222
Burkholder-Davis-Gundy inequalities
193
cadlag 4
caglad 4
canonical decomposition
129,154
censored data 121
centering function 32
change of time 88,190
exercises in Chap. II 98
Lebesgue's formula 190
change of variables see Ito's formula,
see Meyer-Ito formula
Change of Variables Theorem
for continuous FV processes 41
for right continuous FV processes
78
classical semimartingale 102,127,144
Class D 106
closed martingale 8
Comparison Theorem 324
compensated Poisson process 31,42,
65
compensator 118
Absolutely Continuous Compensators
Theorem 192
Knight's compensator calculation
method 151
compensator of l{t>z,} for F 371
compound Poisson process 33
conditional quadratic variation 70,
122
Brownian motion 123
polarization identity 123
continuous local martingale part of
semimartingale 70,221
continuous martingale part 191
converge (pre) locally in H_p, S_p 260
convex functions
and Meyer-Ito formula 214
of semimartingale 210
countably-valued random variables
corollary 365
counting process 12
explosion time 13
without explosions 13
crude hazard rate 122
decomposable adapted process 55,101
diffeomorphism of Rn 319
diffusion 243,291,297
examples 298
rotation invariant 282
diffusion coefficient 298
discrete Laplacian approximations 150

Subject Index 409
Doleans-Dade exponential see
stochastic exponential
Dominated Convergence Theorem for
stochastic integrals
in Яг 267
in ucp 174
Doob class see Class D
Doob decomposition 105
Doob's maximal quadratic inequality
11
Doob's Optional Sampling Theorem 9
Doob-Meyer Decomposition Theorem
case without Class D 115
general case 111
totally inaccessible jumps 106
drift coefficient 143, 298
removal of 137
Duality Theorem 199
Dynkin's expectation formula 350
Dynkin's formula 56,350
Einstein convention 305
Emery's example of stochastic integral
behaving badly 176
Emery's inequality 246
Emery's structure equation see
structure equation
Emery-Perkins Theorem 240
enveloping sequence for stopping time
103
equivalent probability measures 131
Euler method of approximation 353
evanescent 158
example
Emery's example of stochastic
integral behaving badly 176
expansion via end of SDE 367
Gaussian expansions 366
hazard rates and censored data 121
Ito's example 366
reversibility for Brownian integrals
380
reversibility for Levy process integrals
380
example of local martingale that is not
martingale 37,74
example of process that is not
semimartingale 217
example of stochastic integral that is
not local martingale 176
examples of diffusions 298
Existence of Solutions of Structure
Equation Theorem 201
expanded filtration 370
expansion by a natural exponential r.v.
386
expansion via end of SDE 367
explosion time 13, 254,303
extended Gronwall's inequality 352
extremal point 183
Fefferman's inequality 195
strengthened 195
Feller process 35
filtration
countably-valued random variables
corollary 365
definition of 3
expansion by a natural exponential
r.v. 386
filtration shrinkage 367
Filtration Shrinkage Theorem 369
Gaussian expansions 366
independence corollary 365
Ito's example 366
natural 16
progressive expansion 355, 369
quasi left continuous 148,189
right continuous 3
filtration shrinkage 367
Filtration Shrinkage Theorem 369
finite quadratic variation 271
finite variation process
definition of 39,101
integrable variation 111
Fisk-Stratonovich
acceptable 278
approximation as limit of sums 284
integral 82,271
integral for Brownian motion 288
Integration by Parts Theorem 278
Ito's circle 82
Ito's formula 277
stochastic exponential 280
flow
of SDE 301
of solution of SDE 382

410 Subject Index
strongly injective 311
weakly injective 311
Fubini's Theorem for stochastic
integrals 207,208
function space 301
functional Lipschitz 250
fundamental L martingale 372
fundamental sequence 37
Fundamental Theorem of Local
Martingales 125
Gamma process 33
Gaussian expansions 366
generalized Ito's formula 271
generator of stable subspace 179
geometric Brownian motion 86
Girsanov-Meyer Theorem 132
predictable version 133
graph of stopping time 104
Gronwall's inequality 342, 352
extended 352
Hadamard's Theorem 330
Hahn-Banach Theorem 199
hazard rates 121
hitting time 4
Holder continuous 99
honest random variable 373
Hunt process 36
Hypothesis A 221
increasing process 39
independence corollary 365
independent increments
of Levy process 20
of Poisson process 13
index of stable law 34
indistinguishable 3,60
infinitesimal generator 349
injective flow see strongly injective
flow
integrable variation process 111
Integration by Parts Theorem
for Fisk-Stratonovich integrals 278
for semimartingales 68, 83
intrinsic Levy process 20
Ito integral see stochastic integral
Ito's circle 82
Ito's formula
for an n-tuple of semimartingales 81
for complex semimartingales 83, 84
for continuous semimartingales 81
for Fisk-Stratonovich integrals 277
for semimartingales 78
generalized 271
Ito-Meyer formula see Meyer-Ito
formula
Ito's example 366
Ito's Theorem for Levy processes 356
Jacod's Countable Expansion 366
Jacod's Countable Expansion Theorem
53, 356
Jacod's criterion 363
Jacod-Yor Theorem on martingale
representation 199
Jensen's Inequality 11
Jeulin's Lemma 360
Kazamaki's criterion 139
Knight's compensator calculation
method 151
Kolmogorov's continuity criterion 220
Kolmogorov's Lemma 218
Kronecker's delta 305
Kunita-Watanabe inequality 69
Levy Decomposition Theorem 31
Levy measure 26
Levy process
associated jump process 27
bounded jumps 25
cadlag version 25
centering function 32
definition of 20
has finite moments of all orders 25
intrinsic 20
is a semimartingale 55
Ito's Theorem for 356
Levy measure 26
reflection principle 49
reversibility for integrals 380
strong Markov property for 23
symmetric 49
Levy's arcsine law 228
Levy's stochastic area formula 89
Levy's stochastic area process 89
Levy's Theorem characterizing
Brownian motion 86,87

Subject Index 411
Levy-Khintchine formula 31
last exit from zero 228
Le Jan's Theorem 124
Lebesgue's change of time formula 190
Lenglart's Inclusion Theorem 177
Lenglart-Girsanov Theorem 135
linkage operators 329
Lipschitz
bounded functional 257
definition of 250, 293
functional 250
locally 251,254,303
process 250,311
random 250
local behavior of stochastic integral
62,165,170
local behavior of the stochastic integral
at random times 375
local convergence in H_p, S_p 260
local martingale
BMO 195
cadlag martingale is 37
compensator of 118
condition to be a martingale 38, 73
continuous part of semimartingale
221
decomposable 126
definition of 37
example that is not a martingale
37,74
fundamental sequence for 37
Fundamental Theorem 125
intervals of constancy 71, 75
not locally square integrable 126
not preserved under shrinkage of
filtration 126
pre-stopped 172
preserved by stochastic integration
see stochastic integral, preserves
reducing stopping time for 37
time change of Brownian motion 88
local property 38,162
local time
and Bouleau-Yor formula 227
and change of variables formula
see Meyer-Ito formula, see
Meyer-Tanaka formula
and delta functions 217
continuous in t 213
definition of L$ 212
discontinuous, example of 225
occupation time density 216, 225
of Azema's martingale 230
of Brownian motion 217
of semimartingale 212
regularity in space 224
support of 214
locally bounded process 164
locally in Kp,gp 247
locally integrable process 359
locally integrable variation process
111
locally Lipschitz 251, 254,303
locally special semimartingale 149
Memin's criterion for exponential
martingales 352
Metivier-Pellaumail inequality 352
Metivier-Pellaumail method 352
Markov process
definition of 34
diffusion 243, 291, 297
Dynkin's expectation formula 350
Dynkin's formula 56
equivalence of definitions 34
infinitesimal generator 349
simple 291
strong 292
time homogeneous 35, 292
transition function 35, 292
transition semigroup 292
martingale
L2 projection 9
W norm 193
BMO 195
asymptotic (AMART) 218
Azema's 204,228
Backwards Convergence Theorem 9
Burkholder's inequality 222
cadlag modification 8
closed 8
continuous martingale part 191
definition of 7
Doob's inequalities 11
fundamental L martingale 372
in L2 178
Jacod-Yor Theorem 199
local 37

412 Subject Index
Martingale Convergence Theorem 8
martingale representation 203
maximal quadratic inequality 11
measures M2(A) 182
not locally square integrable 126
Optional Sampling Theorem 9
orthogonal 179
predictable representation property
182
purely discontinuous part 191
quasimartingale 116
sigma martingale 233
space M2 of 178
square integrable 73, 74
strongly orthogonal 179
submartingale 7
supermartingale 7
uniformly integrable 8
with exactly one jump 124
Martingale Convergence Theorem 8
martingale representation 203
mathematical finance theory 137
maximal quadratic inequality 11
Meyer's Theorem 104
Meyer-Ito formula 214
Meyer-Tanaka formula 216
modification 3
Monotone Class Theorem 7
monotone vector space 7
closed under uniform convergence 7
multiplicative collection of functions 7
natural filtration 16
natural process 111
net hazard rate 122
Novikov's criterion 140
oblique bracket 123
occupation time density 216, 225
optional <7-algebra 102,372
optional projection 367, 369
Optional Sampling Theorem 9
Ornstein-Uhlenbeck process 298
orthogonal martingales 179
Ouknine's formula 240
partition 116
path space 142
path-by-path continuous part
70
Perkins-Emery Theorem 240
Picard iteration 255
pinned Brownian motion 299
Pitman's Theorem 240
Poisson process
arrival rate 15
compensated 31,42, 65,118
compound 33
definition of 13
independent increments 13
intensity 15
stationary increments 13
polarization identity 66,123, 271
potential 105,150
pre-stopped local martingale 172
predictable cr-algebra 102, 154
predictable compensator of l{t>z,} for F
371
predictable process
predictable <7-algebra 102
simple 51
predictable projection 368
predictable representation property
182
predictable stopping time 103
predictably measurable process 102,
105
prelocal convergence in i?p, S_p 260
prelocally in i?p, §? 247
preserved by stochastic integration see
stochastic integral, preserves
process Lipschitz 250,311
process stopped at T 10, 37
progressive expansion 355, 369
projections
optional projection 367
predictable projection 368
property holds locally 38,162, 247
property holds prelocally 162, 247
purely discontinuous part of martingale
191
quadratic covariation 66, 378
polarization identity 66,271
quadratic covariation process 270
quadratic pure jump 71
quadratic variation
conditional 122
continuous part of 70, 271

Subject Index 413
covariation 66
finite 271
of a semimartingale 66
polarization identity 66, 271
quasi left continuous filtration 148,
189
quasimartingale 116
random Lipschitz 250
random partition
definition of 64
tending to the identity 64
Rao's Theorem 118
Ray-Knight Theorem 225
reduced by stopping time 37
reflection principle
for Brownian motion 23, 228
for Levy processes 49
for stochastic area process 92
regular conditional distribution 362
regular supermartingale 150
representation property 182
reversibility for Brownian integrals
380
reversibility for Levy process integrals
380
reversible semimartingale 378
Riemann-Stieltjes integral 41
right continuous filtration 3
right stochastic exponential 319, 320
right stochastic integral 319
rotation invariant diffusion 282
sample paths of stochastic process 4
scaling property 238
semimartingale
(F, G) reversible 378
(H2,X) integrable 163
H2 norm 154
absolute value of see Meyer-Tanaka
formula
bracket process 66
canonical decomposition 129
classical 102,127,144
continuous local martingale part 70,
221
convex function of 210
definition of 52
equivalent norm 244
example of process that is not 217
Integration by Parts Theorem 68,
83
local martingale as 127
local time of 212
locally special 149
preserved by stochastic integration
see stochastic integral, preserves
quadratic pure jump 71
quadratic variation of 66
quasimartingale as 127
space H2 of 154
special 129,154
stochastic exponential of 85
submartingale as 127
supermartingale as 127
topology 264
total 52
semimartingale topology 264
separable measurable space 189
sharp bracket 123
sigma martingale 233
preserved by stochastic integration
see stochastic integral, preserves
sign function 212
signal detection 140
simple Markov process 291
simple predictable process 51
Skorohod topology 220
Skorohod's Lemma 239
smallest filtration making r.v. a
stopping time 119,370
special semimartingale 129,154
canonical decomposition 129
square integrable martingale 178
preserved by stochastic integration
see stochastic integral, preserves
stable law 34
index 34
stable process
defintion of 33
scaling properties 34
symmetric 34
stable subspace
generated 179
of M2 178
stable under stopping 178
standard Borel space 362
stationary Gaussian process 299

414 Subject Index
stationary increments
of Levy process 20
of Poisson process 13
statistical communication theory 140
Stieltjes integral see Riemann-Stieltjes
integral
stochastic area formula 89
stochastic area process
definition of 89
density 91
Levy's formula 89
properties 92
reflection principle 92
stochastic differential equation 255,
321
flow of 301
flow of solution 382
weak solution 201, 240
weak uniqueness 201
stochastic exponential
appproximation of 269
as a diffusion 298
definition of 85
Fisk-Stratonovich 280
for Brownian motion see geometric
Brownian motion
for semimartingale 85
invertibility of 335
right 319,320
uniqueness of 255
stochastic integral
X integrable, L(X) 163
Associativity Theorem 62, 159, 165
behavior of jumps 60,158,165
does not preserve FV processes in
general 241
Dominated Convergence Theorem in
Ш 267
Dominated Convergence Theorem in
ucp VTA
example that is not local martingale
176
for L 59
for bV 156
for S 58
for V 163
for Brownian motion 78
Fubini's Theorem 207, 208
local behavior at random times 375
local behavior of 62,165,170
preserves
FV processes, integrands in L 63
continuous local martingales 173
local martingales 128,171,174
locally square integrable local
martingales 63, 171
semimartingales 63
sigma martingales 234
square integrable martingales 159,
171
right 319
said to exist 163
stochastic process
adapted 3
decomposable 55,101
definition of 3
finite variation 39,101
increasing 39
indistinguishable 3,60
locally bounded 164
locally integrable 359
modification of 3
natural 111
potential 105
sample paths of 4
stopped 10,37
strong Markov 36
stopped process 10, 37
stopping time
cr-algebra 5,105
accessible 103
announcing sequence 103
change of time 98
definition of 3
enveloping sequence 103
explosion time 13, 254, 303
fundamental sequence of 37
graph 104
hitting time 4
hitting time of Borel set is 5
predictable 103
reducing 37
totally inaccessible 103
Stratonovich integral see Fisk-
Stratonovich, integral
Stratonovich Integration by Parts
Theorem 278
strengthened Fefferman inequality 195

Subject Index 415
Strieker's Theorem 53
strong Markov process 36, 292
strong Markov property 36, 292
for Brownian motion 23
for Levy process 23
strongly injective flow 311
strongly orthogonal martingales 179
structure equation 200, 238
Existence of Solutions Theorem 201
submartingale 7
supermartingale 7
regular 150
symmetric Levy process 49
symmetric stable process 34
Tanaka's formula 217
tied down Brownian motion 299
time change 190
time change of Brownian motion 88
time homogeneous Markov process
292
time reversal 377
total semimartingale 52
total variation process 40
totally inaccessible stopping time 103
transition function for Markov process
292
transition semigroup 292
truncation operators 265
uniform convergence on compacts in
probability 57
uniformly integrable martingale 8
usual hypotheses 3, 291
variation 116
variation along т
116
weak solution of SDE 201, 240
weak uniqueness of SDE 201
weakly injective flow 311
white noise 140, 243
Wiener measure 141
Wiener process 137,140

Applications of Mathematics (continued from page и>
51 Asmussen, Applied Probability and Queues Bnd ed. 2003)
52 Robert, Stochastic Networks and Queues B003)
53 Glasserman, Monte Carlo Methods in Financial Engineering B003)