Reference no: EM132408683
MULTIAGENT REINFORCEMENT LEARNING LEAGUE OF OPTIONS TRADING MODELS
-MAKE SURE THE CODE IS WELL COMMENTED AND ADD A CONCLUSION WITH YOUR RESULTS.
-> Start with two agents: one option seller and one option buyer. Then add more agents, i.e. make the setup truly multi-agent.
-> Please use a simulation-based setup to generate your own data, i.e. Monte Carlo simulation. Stick to just Black-Scholes for the entire assignment, so the underlying stock follows a geometric Brownian motion (GBM). This way, you know the theoretical value of the option and the hedging strategy in a frictionless world. However, remember that price impact on the stock price will be ad hoc in this setup.
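As a starting point, the data-generation step above can be sketched as follows. This is a minimal sketch (function names and parameter choices are my own, not prescribed by the brief): exact log-normal GBM paths plus the closed-form Black-Scholes call price that serves as the frictionless benchmark.

```python
import math
import numpy as np

def simulate_gbm_paths(s0, mu, sigma, T, n_steps, n_paths, seed=0):
    """Simulate GBM paths S_t with the exact log-normal scheme:
    log S_{t+dt} = log S_t + (mu - sigma^2/2) dt + sigma sqrt(dt) Z."""
    rng = np.random.default_rng(seed)
    dt = T / n_steps
    z = rng.standard_normal((n_paths, n_steps))
    log_returns = (mu - 0.5 * sigma**2) * dt + sigma * math.sqrt(dt) * z
    log_paths = np.cumsum(log_returns, axis=1)
    # Prepend the initial price so each path has n_steps + 1 points.
    return s0 * np.exp(np.hstack([np.zeros((n_paths, 1)), log_paths]))

def bs_call_price(s, k, r, sigma, tau):
    """Black-Scholes European call price; the hedge ratio (delta) is N(d1)."""
    if tau <= 0:
        return max(s - k, 0.0)
    d1 = (math.log(s / k) + (r + 0.5 * sigma**2) * tau) / (sigma * math.sqrt(tau))
    d2 = d1 - sigma * math.sqrt(tau)
    N = lambda x: 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))  # standard normal CDF
    return s * N(d1) - k * math.exp(-r * tau) * N(d2)
```

Because the data come from GBM, the learned hedging policies can be compared directly against the Black-Scholes delta hedge on the same simulated paths.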
-> Defining the right incentives (utility functions) for your agents will be key. The seller makes money by selling and hedging the option. The buyer MUST have some external willingness to buy an option, up to a certain reservation price.
-> The seller should probably have some risk aversion; otherwise you may end up with an agent that does not hedge effectively because it focuses only on the "average" P&L, which does not take large losses into account.
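One possible way to encode the two incentives above (this is an illustrative assumption, not the required design) is a reservation-price reward for the buyer and a mean-variance objective for the seller, where the dispersion penalty is what pushes the seller to hedge:

```python
import numpy as np

def buyer_reward(option_payoff, price_paid, willingness_to_pay):
    """Buyer only trades if the quoted price is at or below their external
    reservation price; the reward is then realized payoff minus price."""
    if price_paid > willingness_to_pay:
        return 0.0  # no trade takes place
    return option_payoff - price_paid

def seller_utility(pnl_samples, risk_aversion=1.0):
    """Mean-variance utility: E[PnL] - lambda * Std[PnL].
    An unhedged seller has a wide P&L distribution, so the penalty term
    makes hedging strictly preferable even at equal average P&L."""
    pnl = np.asarray(pnl_samples, dtype=float)
    return pnl.mean() - risk_aversion * pnl.std()
```

Under this utility, two strategies with the same mean P&L are ranked by their volatility, which is exactly the behaviour the bullet above asks for.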
-> Check the JP Morgan deep-hedging slides/paper. There, it is crucial that the agent optimizes the α-CVaR. That approach is policy-based, hence different from Q-learning.
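The α-CVaR mentioned above is just the average of the worst (1 - α) fraction of losses, which can be estimated empirically from simulated P&L. A minimal sketch (the discretization choice `ceil` is my own assumption):

```python
import numpy as np

def cvar(losses, alpha=0.95):
    """Empirical alpha-CVaR: the mean of the worst (1 - alpha) fraction of
    losses (losses are positive numbers; larger means worse)."""
    losses = np.sort(np.asarray(losses, dtype=float))
    n = len(losses)
    k = max(1, int(np.ceil((1.0 - alpha) * n)))  # number of tail samples
    return losses[-k:].mean()
```

Using the negative of this tail average as the training objective makes the agent focus on avoiding large losses rather than on the mean P&L alone.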
-> Start introducing frictions such as transaction costs only once your "simplest" setup starts giving reasonable results.
So, BRIEFLY,
Make data -> do Q-learning on it -> get results -> do Fitted Q-Iteration -> get results -> policy-based approach -> get results -> make the cumulative-reward plots -> CLEARLY show all results
1) Q-learning
2) Fitted Q-Iteration
3) Policy based/JP Morgan..deep hedging
4) Cumulative rewards plot, or any other better way(s) to compare their efficiency
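For step 1 of the pipeline above, the core tabular update is standard and worth getting right before moving to Fitted Q-Iteration. A minimal sketch (the toy two-state example is my own, only there to show the update converging):

```python
import numpy as np

def q_learning_update(Q, s, a, r, s_next, done, lr=0.1, gamma=0.99):
    """One tabular Q-learning step:
    Q(s,a) <- Q(s,a) + lr * (r + gamma * max_a' Q(s',a') - Q(s,a)),
    with no bootstrap term on terminal transitions."""
    target = r if done else r + gamma * Q[s_next].max()
    Q[s, a] += lr * (target - Q[s, a])

# Toy check: from state 0, action 1 pays 1.0 and terminates, so Q[0, 1]
# should converge to 1.0 after repeated updates.
Q = np.zeros((2, 2))
for _ in range(300):
    q_learning_update(Q, s=0, a=1, r=1.0, s_next=1, done=True)
```

In the hedging problem, the state would be a discretization of (stock price, time to maturity, current hedge position) and the action a change in the hedge, with the reward derived from the risk-adjusted P&L; the same update rule applies unchanged.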
Attachment:- MULTIAGENT REINFORCEMENT LEARNING LEAGUE.rar