AICurious Logo

What is: Dutch Eligibility Trace?

Year2000
Data SourceCC BY-SA - https://paperswithcode.com

A Dutch Eligibility Trace is a type of eligibility trace where the trace increments grow less quickly than the accumulative eligibility trace (helping avoid large variance updates). For the memory vector e_tRb0\textbf{e}\_{t} \in \mathbb{R}^{b} \geq \textbf{0}:

e_0=0\mathbf{e\_{0}} = \textbf{0}

e_t=γλe_t1+(1αγλe_t1Tϕ_t)ϕ_t\textbf{e}\_{t} = \gamma\lambda\textbf{e}\_{t-1} + \left(1-\alpha\gamma\lambda\textbf{e}\_{t-1}^{T}\phi\_{t}\right)\phi\_{t}