ebook img

Quantitative Trading: Algorithms, Analytics, Data, Models, Optimization PDF

363 Pages·2016·11.78 MB·English
by  Xin Guo
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Quantitative Trading: Algorithms, Analytics, Data, Models, Optimization

QUANTITATIVE TRADING Algorithms, Analytics, Data, Models, Optimization Contents Preface List of Figures List of Tables 1 Introduction 1.1 Evolution of trading infrastructure 1.2 Quantitative strategies and time-scales 1.3 Statistical arbitrage and debates about EMH 1.4 Quantitative funds, mutual funds, hedge funds 1.5 Data, analytics, models, optimization, algorithms 1.6 Interdisciplinary nature of the subject and how the book can be used 1.7 Supplements and problems 2 Statistical Models and Methods for Quantitative Trading 2.1 Stylized facts on stock price data 2.1.1 Time series of low-frequency returns 2.1.2 Discrete price changes in high-frequency data 2.2 Brownian motion models for speculative prices 2.3 MPT as a “walking shoe” down Wall Street 2.4 Statistical underpinnings of MPT 2.4.1 Multifactor pricing models 2.4.2 Bayes, shrinkage, and Black-Litterman estimators 2.4.3 Bootstrapping and the resampled frontier 2.5 A new approach incorporating parameter uncertainty 2.5.1 Solution of the optimization problem 2.5.2 Computation of the optimal weight vector 2.5.3 Bootstrap estimate of performance and NPEB 2.6 From random walks to martingales that match stylized facts 2.6.1 From Gaussian to Paretian random walks 2.6.2 Random walks with optional sampling times 2.6.3 From random walks to ARIMA, GARCH 2.7 Neo-MPT involving martingale regression models 2.7.1 Incorporating time series effects in NPEB 2.7.2 Optimizing information ratios along efficient frontier 2.7.3 An empirical study of neo-MPT 2.8 Statistical arbitrage and strategies beyond EMH 2.8.1 Technical rules and the statistical background 2.8.2 Time series, momentum, and pairs trading strategies 2.8.3 Contrarian strategies, behavioral finance, and investors’ cognitive biases 2.8.4 From value investing to global macro strategies 2.8.5 In-sample and out-of-sample evaluation 2.9 Supplements and problems 3 Active Portfolio Management and Investment Strategies 3.1 Active alpha and beta in portfolio management 3.1.1 Sources of alpha 3.1.2 Exotic beta beyond active alpha 3.1.3 A new approach to active portfolio optimization 3.2 Transaction costs, and long-short constraints 3.2.1 Cost of transactions and its components 3.2.2 Long-short and other portfolio constraints 3.3 Multiperiod portfolio management 3.3.1 The Samuelson-Merton theory 3.3.2 Incorporating transaction costs into Merton’s problem 3.3.3 Multiperiod capital growth and volatility pumping 3.3.4 Multiperiod mean-variance portfolio rebalancing 3.3.5 Dynamic mean-variance portfolio optimization 3.3.6 Dynamic portfolio selection 3.4 Supplementary notes and comments 3.5 Exercises 4 Econometrics of Transactions in Electronic Platforms 4.1 Transactions and transactions data 4.2 Models for high-frequency data 4.2.1 Roll’s model of bid-ask bounce 4.2.2 Market microstructure model with additive noise 4.3 Estimation of integrated variance of X t 4.3.1 Sparse sampling methods 4.3.2 Averaging method over subsamples 4.3.3 Method of two time-scales 4.3.4 Method of kernel smoothing: Realized kernels 4.3.5 Method of pre-averaging 4.3.6 From MLE of volatility parameter to QMLE of [X] T 4.4 Estimation of covariation of multiple assets 4.4.1 Asynchronicity and the Epps effect 4.4.2 Synchronization procedures 4.4.3 QMLE for covariance and correlation estimation 4.4.4 Multivariate realized kernels and two-scale estimators 4.5 Fourier methods 4.5.1 Fourier estimator of [X] and spot volatility T 4.5.2 Statistical properties of Fourier estimators 4.5.3 Fourier estimators of spot co-volatilities 4.6 Other econometric models involving TAQ 4.6.1 ACD models of inter-transaction durations 4.6.2 Self-exciting point process models 4.6.3 Decomposition of D and generalized linear models i 4.6.4 McCulloch and Tsay’s decomposition 4.6.5 Joint modeling of point process and its marks 4.6.6 Realized GARCH and other predictive models 4.6.7 Jumps in efficient price process and power variation 4.7 Supplementary notes and comments 4.8 Exercises 5 Limit Order Book: Data Analytics and Dynamic Models 5.1 From market data to limit order book (LOB) 5.2 Stylized facts of LOB data 5.2.1 Book price adjustment 5.2.2 Volume imbalance and other indicators 5.3 Fitting a multivariate point process to LOB data 5.3.1 Marketable orders as a multivariate point process 5.3.2 Empirical illustration 5.4 LOB data analytics via machine learning 5.5 Queueing models of LOB dynamics 5.5.1 Diffusion limits of the level-1 reduced-form model 5.5.2 Fluid limit of order positions 5.5.3 LOB-based queue-reactive model 5.6 Supplements and problems 6 Optimal Execution and Placement 6.1 Optimal execution with a single asset 6.1.1 Dynamic programming solution of problem (6.2) 6.1.2 Continuous-time models and calculus of variations 6.1.3 Myth: Optimality of deterministic strategies 6.2 Multiplicative price impact model 6.2.1 The model and stochastic control problem 6.2.2 HJB equation for the finite-horizon case 6.2.3 Infinite-horizon case T = ∞ 6.2.4 Price manipulation and transient price impact 6.3 Optimal execution using the LOB shape 6.3.1 Cost minimization 6.3.2 Optimal strategy for Model 1 6.3.3 Optimal strategy for Model 2 6.3.4 Closed-form solution for block-shaped LOBs 6.4 Optimal execution for portfolios 6.5 Optimal placement 6.5.1 Markov random walk model with mean reversion 6.5.2 Continuous-time Markov chain model 6.6 Supplements and problems 7 Market Making and Smart Order Routing 7.1 Ho and Stoll’s model and the Avellanedo-Stoikov policy 7.2 Solution to the HJB equation and subsequent extensions 7.3 Impulse control involving limit and market orders 7.3.1 Impulse control for the market maker 7.3.2 Control formulation 7.4 Smart order routing and dark pools 7.5 Optimal order splitting among exchanges in SOR 7.5.1 The cost function and optimization problem 7.5.2 Optimal order placement across K exchanges 7.5.3 A stochastic approximation method 7.6 Censored exploration-exploitation for dark pools 7.6.1 The SOR problem and a greedy algorithm 7.6.2 Modified Kaplan-Meier estimate ˄T i 7.6.3 Exploration, exploitation, and optimal allocation 7.7 Stochastic Lagrangian optimization in dark pools 7.7.1 Lagrangian approach via stochastic approximation 7.7.2 Convergence of Lagrangian recursion to optimizer 7.8 Supplementary notes and comments 7.9 Exercises 8 Informatics, Regulation and Risk Management 8.1 Some quantitative strategies 8.2 Exchange infrastructure 8.2.1 Order gateway 8.2.2 Matching engine 8.2.3 Market data dissemination 8.2.4 Order fee structure 8.2.5 Colocation service 8.2.6 Clearing and settlement 8.3 Strategy informatics and infrastructure 8.3.1 Market data handling 8.3.2 Alpha engine 8.3.3 Order management 8.3.4 Order type and order qualifier 8.4 Exchange rules and regulations 8.4.1 SIP and Reg NMS 8.4.2 Regulation SHO 8.4.3 Other exchange-specific rules 8.4.4 Circuit breaker 8.4.5 Market manipulation 8.5 Risk management 8.5.1 Operational risk 8.5.2 Strategy risk 8.6 Supplementary notes and comments 8.7 Exercises A Martingale Theory A.1 Discrete-time martingales A.2 Continuous-time martingales B Markov Chain and Related Topics B.1 Generator Q of CTMC B.2 Potential theory for Markov chains B.3 Markov decision theory C Doubly Stochastic Self-Exciting Point Processes C.1 Martingale theory and compensators of multivariate counting processes C.2 Doubly stochastic point process models C.3 Likelihood inference in point process models C.4 Simulation of doubly stochastic SEPP D Weak Convergence and Limit Theorems D.1 Donsker’s theorem and its extensions D.2 Queuing system and limit theorems Bibliography Index Preface After the tumultuous period marked by the 2007-2008 Financial Crisis and the Great Recession of 2009, the financial industry has entered a new era. Quantitative strategies, together with statistical models and methods, knowledge representation and data analytics, and algorithms and informatics for their development and implementation, are of increasing importance in this new era. The onset of this era is marked by two “revolutions” that have transformed modern life and business. One is technological, dubbed “the FinTech revolution” for financial services by the May 9, 2015, issue of The Economist which says: “In the years since the crash of 2007-08, policymakers have concentrated on making finance safer.... Away from the regulator spotlight, another revolution is under way.. .. From payments to wealth management, from peerto-peer lending to crowdfunding, a new generation of startups is taking aim at the heart of the industry – and a pot of revenues that Goldman Sachs estimates is worth $4.7 trillion. Like other disrupters from Silicon Valley, fintech firms are growing fast.” The other is called “big data revolution”. In August 2014, the UN Secretary General commissioned an Independent Advisory Group to make recommendations on “bringing about a data revolution” in sustainable development. The October 2012 issue of Harvard Business Review features an article on “Big Data: The Management Revolution”. On August 20, 2015, the Premier of the People’s Republic of China asked different government departments to share their data and implement a big data action plan. Soon afterward, on September 5, 2015, the country’s State Council issued an action plan to develop and promote big data applications in economic planning, finance, homeland security, transportation, agriculture, environment, and health care. To respond to the opportunities and challenges of this new era and the big data and FinTech revolutions that have fascinated their students, the two academics (Guo and Lai) on the author team, who happen to be teaching students in the greater Silicon Valley, developed and taught new courses in the Financial Engineering/Mathematics Curriculum at Berkeley and Stanford in the past three years and exchanged their course material. They also invited practitioners from industry, in particular the other two co-authors (Shek and Wong), to give guest lectures and seminars for these courses. This informal collaboration quickly blossomed into an intense concerted effort to write up the material into the present book that can be used not only to teach these courses more effectively but also to give short courses and training programs elsewhere, as we have done at Shanghai Advanced Institute of Finance, Fudan University Tsinghua University, Chinese University of Hong Kong, Hong Kong University of Science & Technology, National University of Singapore, National Taiwan University, and Seoul National University. A prerequisite or co- requisite of these courses is a course at the level of STATS 240 (Statistical Methods in Finance) at Stanford, which covers the first six chapters of Lai and Xing (2008). We will therefore make ample references to the relevant sections of these six chapters, summarizing their main results without repeating the details. The website for this book can be found at http://lait.web.stanford.edu/quantstratbook/. The datasets for the exercises and examples can be downloaded from the website. We want to highlight in the book an interdisciplinary approach to the development and implementation of algorithmic trading and quantitative strategies. The interdisciplinary approach, which involves computer science and engineering, finance and economics, mathematics and statistics, law and regulation, is reflected not only in the research activities of the recently established Financial and Risk Modeling Institute (FARM) at Stanford, but also in the course offerings of Berkeley’s Financial Engineering and Stanford’s Financial Mathematics that has currently been transformed to the broader Mathematical and Computational Finance program to reflect the greater emphasis on data science, statistical modeling, advanced programming and high performance computing. Besides the interdisciplinary approach, another distinctive feature of the book is the effort to bridge the gap between academic research/education and the financial industry, which is also one of the missions of FARM. Different parts of the book can be used in short thematic courses for practitioners, which are currently being developed at FARM. Acknowledgments We want to express our gratitude to Cindy Kirby for her excellent editing and timely help in preparing the final manuscript. The first two authors thank their current and former Ph.D. students: Joon Seok Lee and Renyuan Xu at Berkeley, and Pengfei Gao, Yuming Kuang, Ka Wai Tsang, Milan Shen, Nan Bai, Vibhav Bukkapatanam, Abhay Subramanian, Zhen Wei, Zehao Chen, Viktor Spivakovsky, and Tiong-Wee Lim at Stanford for their research and teaching assistance, as well as students of IEOR 222 from 2011 to 2016 and IEOR 230X in Spring 2015 at UC Berkeley and Keith Sollers from UC Davis. They also acknowledge grant support by the National Science Foundation, under DMS 1008795 at Berkeley and DMS 1407828 at Stanford, for research projects related to the book. In addition, the first author would like to thank her collaborators Adrien de Larrard, Isaac Mao, Zhao Ruan and Lingjiong Zhu in research on algorithmic trading, funding support from the endowment of the Coleman Fung Chair Professorship, and the NASDAQ OMX education group for generous data and financial support. She also wants to thank her colleague Prof. Terry Hendershott who co-taught with her a high-frequency finance course at the Haas Business School. The last author wants to thank Prof. Myron Scholes for his valuable help and advice and Ted Givens for the excellent book cover design, while the second author wants to thank his colleague Prof. Joseph Grundfest of Stanford Law School for insightful discussions on regulatory issues in high-frequency trading. Department of Industrial Engineering and Operations Research, University of California at Berkeley Xin Guo Department of Statistics, Stanford University Tze Leung Lai Tower Research Capital, LLC Howard Shek Samuel Po-Shing 5Lattice Securities Limited Wong List of Figures 1.1 Hand signals for trading in an open outcry system. (Used with permission of CME.) 1.2 Stock ticker manufactured by Western Union Telegraph Company in the 1870s and now an exhibit at the Computer History Museum in Mountain View, California. Originally, only transacted prices and abbreviated stock symbols were printed on the ticker tape; after the 1930s, traded volume was also printed. (Photo credit: Wikimedia Commons/Don DeBold.) 2.1 Time series plot of the tick-by-tick transaction prices P of Stock Code 388 on Oct 10, t 2014. 2.2 Distribution of tick-by-tick transaction prices (top panel) and of price differences in the morning session (bottom panel) 2.3 ACF of transaction price differences of 388 in the morning session of Oct 10, 2014. Dashed lines represent rejection boundaries of 5%-level tests of zero ACF at the indicated lag. 2.4 Time series of P (top panel) and ∆ (bottom panel) for Pfizer closing prices from January t t 1, 2005, to December 31, 2014 2.5 Normal QQ-plots of (top panel) and of (bottom panel) with and defined in (2.18). 2.6 QQ-plots of ∆ of normal model (top panel) and symmetric stable distribution (bottom t panel). 4.1 Log-likelihood function of a simulated data set of size n = 1000 from model (4.21), which is equivalent to (4.69), in the top panel, and using the MA(1) parameterization (4.70) in the bottom panel. 5.1 Snapshots showing the evolution of a ten-level deep limit order book just before a trade has taken place (gray lines) and just after (black lines) for British Petroleum PLC (BP). Dotted lines are for the best bid and ask prices. Solid line is the average or mid price. Bars are scaled by maximum queue size across the whole book and represented in two color tones of gray to help identify changes in the order book just before and after a trade has taken place. 5.2 Probability of order completion within 5 seconds from submission for BP on June 25, 2010. Squares are relative frequencies based on empirical data and the solid curve is based on fitting a power-law function suggested by Bouchaud et al. (2002) 5.3 Time series for the difference in probability weighted cumulative volume for BP on June 25, 2010. 5.4 Conditional intensity of bid and ask side market orders following an order submitted on the bid side of the market, estimated with bin size ranging from 30 to 500 milliseconds, using

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.