TUMER, KAGAN; AGOGINO, ADRIAN - In: Advances in Complex Systems (ACS) 12 (2009) 04, pp. 475-492
local rewards for a class of problems where the mapping from the agent actions to system reward functions can be decomposed … performance of the entire system. In this paper, we show how this problem can be solved in general for a large class of reward … functions whose analytical form may be unknown (hence "black box" reward). This method combines the salient features of global …