The double-ended queue
Familiarize yourself the with Double-Ended Queue data structure which you will use as replay buffer.
The Double Ended Queue, or deque
is a limited capacity queue which 'forgets' its oldest items as new items are added over time. This provides a well suited abstraction to support the replay buffer.
Cet exercice fait partie du cours
Deep Reinforcement Learning in Python
Exercice interactif pratique
Essayez cet exercice en complétant cet exemple de code.
from collections import deque
buffer = deque(range(10))
print('Buffer initialized as:', buffer)
# Append 10 to the right of the buffer
buffer.____
print('Buffer after appending:', buffer)