336 Notation
Notation Description
F A probability distribution representation
F
E
Empirical probability distribution representation
F
N
Normal probability distribution representation,
parameterized by mean and variance
F
C,m
m
-categorical probability distribution representation
F
Q,m
m-quantile probability distribution representation
Π
F
A projection onto the probability distribution repre-
sentation F
bzc, dze
Floor and ceiling operations, mapping
z ∈R
to the
nearest integer that is less or equal (floor), or greater
or equal (ceiling), than z
d
A probability metric, typically used for the purposes
of contraction analysis
L
τ
(θ)
Quantile regression loss function for target thresh-
old τ ∈(0, 1) and location estimate θ ∈R
{u}
An indicator function that takes the value 1 when
u
is true and 0 otherwise; also {·}
J(π) Objective function for a control problem
G
Greedy policy operator, produces a policy that is
greedy with respect to a given action-value function
T
G
, T
G
The Bellman and distributional Bellman optimality
operators derived from greedy selection rule G
J
ρ
(π)
A risk-sensitive control objective function, with risk
measure ρ
ψ A statistical functional or sketch
ξ
π
Steady-state distribution under policy π
φ(x)
State representation for state
x
, a mapping
φ
:
X→
R
n
Π
φ,ξ
Projection onto the linear subspace generated by
φ
with state weighting ξ
M (R)
Space of signed probability measures over the reals
ξ,2
Weighted Cramér distance over return-distribution
functions, with state weighting given by ξ
Π
φ,ξ,
2
Projection onto the linear subspace generated by
φ
,
minimizing the
ξ,2
distance
L Loss function
H
κ
The Huber loss with threshold κ ≥0
Draft version.