A (potentially) history dependent scheduler.
Equations
- One or more equations did not get rendered due to their size.
Instances For
class
MDP.Scheduler.Markovian
{State : Type u_1}
{Act : Type u_2}
{M : MDP State Act}
(๐ฎ : ๐[M])
:
- intro : ๐ฎ.IsMarkovian
Instances
A Markovian
(historyless) scheduler.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
Instances For
Equations
theorem
MDP.MScheduler.toScheduler_injective
{State : Type u_1}
{Act : Type u_2}
{M : MDP State Act}
:
noncomputable def
MDP.Scheduler.specialize
{State : Type u_1}
{Act : Type u_2}
{M : MDP State Act}
[DecidableEq State]
(๐ฎ : ๐[M])
(s : State)
(s' : โ(M.succs_univ s))
:
Specialize a scheduler such that all scheduled paths are considered with a given state as the immediately predecessor.
Equations
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
@[simp]
theorem
MDP.Scheduler.specialize_apply
{State : Type u_1}
{Act : Type u_2}
{M : MDP State Act}
[DecidableEq State]
{๐ฎ : ๐[M]}
{s : State}
{s' : โ(M.succs_univ s)}
{ฯ : M.Path}
:
(๐ฎ.specialize s s') ฯ = if h : ฯ[0] = โs' then ๐ฎ (ฯ.prepend โจs, โฏโฉ) else M.default_act ฯ.last
@[simp]
theorem
MDP.Scheduler.specialize_default_on
{State : Type u_1}
{Act : Type u_2}
{M : MDP State Act}
[DecidableEq State]
{๐ฎ : ๐[M]}
{s : State}
{ฯ : M.Path}
{s' : โ(M.succs_univ s)}
(h : ยฌฯ[0] = โs')
:
theorem
MDP.MScheduler.toScheduler_specialize
{State : Type u_1}
{Act : Type u_2}
{M : MDP State Act}
[DecidableEq State]
{s : State}
{s' : โ(M.succs_univ s)}
(โ : ๐[M])
: