Fig. 2
From: Evaluation of linear relaxations in Ad Network optimization for online marketing

Perception and action cycle — The agent observes the state, applies an action, receives a reward, and observes the new state
From: Evaluation of linear relaxations in Ad Network optimization for online marketing
Perception and action cycle — The agent observes the state, applies an action, receives a reward, and observes the new state