Pessimistic Q-learning model
Frameworks: Reinforcement Learning
Disciplines: Clinical Psychology
Programming language: Python
The model adapts standard Q-learning: the assumption that the agent will always take the reward-maximizing action is replaced by a weighting scheme under which the agent might also take the reward-minimizing action. The pessimistic Q-learning model is used to model characteristics of anxious behavior.
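The weighting scheme described above can be sketched as a tabular update rule. This is a minimal illustration, not the model's reference implementation. It assumes the weighting is a convex combination of the best and worst next-state action values in the learning target. The weight w, the learning rate alpha, the discount gamma, and both function names are hypothetical choices made for this example.

```python
def pessimistic_backup(q_next, w):
    """Blend the best and worst next-state action values.

    w = 1.0 gives the standard Q-learning backup (max only);
    w = 0.0 is fully pessimistic (min only). Assumed form, for illustration.
    """
    return w * max(q_next) + (1.0 - w) * min(q_next)


def pessimistic_q_update(Q, s, a, r, s_next,
                         alpha=0.1, gamma=0.9, w=0.5, terminal=False):
    """Apply one in-place update to Q[s][a] and return the new value.

    Q is a list of lists indexed as Q[state][action].
    """
    # No bootstrapping from a terminal transition.
    future = 0.0 if terminal else pessimistic_backup(Q[s_next], w)
    target = r + gamma * future
    Q[s][a] += alpha * (target - Q[s][a])
    return Q[s][a]


# Two states, two actions; state 1 has one good and one bad action.
Q = [[0.0, 0.0], [1.0, -1.0]]
new_value = pessimistic_q_update(Q, s=0, a=0, r=0.0, s_next=1,
                                 alpha=1.0, gamma=1.0, w=0.5)
print(new_value)
```

Under this assumed form, lowering w makes the agent value a state more by its worst available outcome, which is one way to express the avoidance typical of anxious behavior. Setting w = 1.0 recovers ordinary Q-learning.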