THESIS
2023
1 online resource (xvii, 94 pages) : color illustrations
Abstract
In this thesis, we present two applications of reinforcement learning models in
the financial domain. The first study focuses on the dynamic hedging of financial
derivatives, where a reinforcement learning algorithm is designed to minimize the
variance of the delta hedging process. In contrast to previous research in this
area, we apply uncertainty estimation technology to measure the uncertainty of
the agent’s decision, which can further reduce unnecessary wear and tear in the
hedging process and control model overconfidence that may lead to significant
losses. Numerical experiments show the superiority of our strategy in Monte
Carlo simulations and SP 500 option data. The second study is about optimal
order execution, where a large order is split into several small orders to maximize...[
Read more ]
In this thesis, we present two applications of reinforcement learning models in
the financial domain. The first study focuses on the dynamic hedging of financial
derivatives, where a reinforcement learning algorithm is designed to minimize the
variance of the delta hedging process. In contrast to previous research in this
area, we apply uncertainty estimation technology to measure the uncertainty of
the agent’s decision, which can further reduce unnecessary wear and tear in the
hedging process and control model overconfidence that may lead to significant
losses. Numerical experiments show the superiority of our strategy in Monte
Carlo simulations and SP 500 option data. The second study is about optimal
order execution, where a large order is split into several small orders to maximize
the implementation shortfall. Based on the diversity of cryptocurrency
exchanges, we attempt to extract cross-exchange signals by aligning data from
multiple exchanges for the first time. Unlike most previous studies that focused
on using single-exchange information, we discuss the impact of cross-exchange
signals on the agent’s decision-making in the optimal execution problem. Experimental
results show that cross-exchange signals can provide additional information
for the optimal execution of cryptocurrency to facilitate the optimal
execution process.
Keywords: optimal dynamic hedging, uncertainty estimation, optimal execution,
cross-exchange, reinforcement learning
Post a Comment