If I don't want to explicitly create a reward function, what options do I have?

Emma_Smith · November 20, 2024, 10:55am

If I don't want to explicitly create a reward function, what options do I have (ones that are compatible with your courses in Deep RL*)?

Akshay_Choudhary · November 21, 2024, 4:23pm

Hi Emma,

You can explore Imitation Learning, where the agent learns a policy directly from historical data by mimicking the behavior of an expert or other source recorded in that data, as demonstrated in the Imitative Market Maker (IMM) framework. Here is a research paper for your reference. Another option is Inverse Reinforcement Learning (IRL), which infers the underlying reward function from observed expert behavior, so the agent can then be trained to replicate successful strategies. You can go through this paper for more insight into that approach.
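To make the imitation learning idea concrete, here is a rough sketch of its simplest form, behavioral cloning: treat the logged (state, action) pairs as a supervised dataset and fit a policy to them, with no reward function anywhere. Everything in it is an assumption for illustration only: the toy two-feature state, the made-up expert rule, and the helper names `make_expert_data`, `fit_bc_policy`, and `act`. It is not the IMM method from the paper and not code from the course.

```python
import numpy as np

def make_expert_data(n=500, seed=0):
    # Hypothetical "historical data": the expert takes action 1
    # whenever the two state features sum to a positive number.
    rng = np.random.default_rng(seed)
    states = rng.normal(size=(n, 2))
    actions = (states.sum(axis=1) > 0).astype(float)
    return states, actions

def fit_bc_policy(states, actions, lr=0.5, steps=500):
    # Behavioral cloning as plain logistic regression:
    # minimize cross-entropy between policy output and expert action.
    w = np.zeros(states.shape[1])
    b = 0.0
    for _ in range(steps):
        logits = states @ w + b
        probs = 1.0 / (1.0 + np.exp(-logits))
        err = probs - actions  # gradient of the loss w.r.t. the logits
        w -= lr * (states.T @ err) / len(actions)
        b -= lr * err.mean()
    return w, b

def act(w, b, state):
    # Greedy action of the cloned policy for a single state.
    return int(state @ w + b > 0)

states, actions = make_expert_data()
w, b = fit_bc_policy(states, actions)
agreement = np.mean([act(w, b, s) == a for s, a in zip(states, actions)])
print(f"agreement with expert on logged data: {agreement:.2f}")
```

In a real setup the states and actions would come from your own logs, and the linear model would usually be replaced by a neural network. IRL goes one step further: it first estimates a reward function from the same kind of data and then runs a standard RL algorithm against that learned reward.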

Hope this helps!