Fig. 2From: Evaluation of linear relaxations in Ad Network optimization for online marketingPerception and action cycle — The agent observes the state, applies an action, receives a reward, and observes the new stateBack to article page