In Chapter 1, I study the experimentation dynamics of a decision maker (DM) in a two-armed bandit setup ([5]), where the DM holds ambiguous beliefs about the distribution of the return process of one arm and is certain about that of the other. The DM has multiplier preferences à la [27], so I frame the decision-making environment as a two-player differential game against nature in continuous time. I characterize the DM’s value function and her optimal experimentation strategy, which turns out to follow a cut-off rule with respect to her belief process. The belief threshold for exploring the ambiguous arm is found in closed form and is shown to be increasing in the ambiguity aversion index. I then study the effect of pr...