Predicting Stock Liquidity by Using Ensemble Data Mining Methods (앙상블 데이터마이닝 기법을 이용한 주식유동성 예측성과에 관한 실증연구)

배은찬; Kun-Chang Lee (이건창)

Journal of The Korea Society of Computer and Information
Abbr : JKSCI
2016, 21(6), pp.9~19
Publisher : The Korean Society Of Computer And Information
Research Area : Engineering > Computer Science

배은찬 ¹, Kun-Chang Lee ¹

¹Sungkyunkwan University

Accredited

ABSTRACT

In the finance literature, stock liquidity, which indicates how readily stocks can be converted to cash in the market, has received considerable attention from both academics and practitioners. There are several reasons. First, stock liquidity is known to significantly affect asset pricing. Second, macroeconomic announcements influence liquidity in the stock market. Stock liquidity therefore affects the decisions of both investors and managers. Although there is a large body of finance literature on stock liquidity, to our knowledge no study has investigated it as a decision-making problem. Most stock liquidity studies have taken a limited view, asking, for example, how much liquidity influences stock prices or which variables are significantly associated with it. This paper instead posits that stock liquidity can be framed as a serious decision-making problem and handled with data mining techniques that estimate its future level with statistical validity. To this end, we collected financial data on manufacturing companies listed on the KRX (Korea Exchange) for the period 2010 to 2013. We started the sample in 2010 to avoid the aftershocks of the 2008 financial crisis. We used the Fn-GuidPro system to gather a total of 5,700 financial data records. Stock liquidity was computed following the procedure proposed by Amihud (2002), which is known to be the measure that best captures the relationship with daily returns. We applied five data mining techniques (classifiers): Bayesian networks, support vector machines (SVM), decision trees, neural networks, and ensemble methods. The Bayesian networks comprise GBN (General Bayesian Network), NBN (Naive Bayesian Network), and TAN (Tree-Augmented NBN). The decision trees are CART and C4.5. Regression results served as the benchmark. The ensemble methods are of two types, integration of two classifiers and integration of three classifiers, and in both cases the classifiers are combined by voting. Among the single classifiers, CART performed best at 48.2%, compared with 37.18% for regression. Among the ensemble methods, integrating TAN, CART, and SVM performed best at 49.25%. An additional analysis by industry showed that relatively stable industries, such as electronic appliances, wholesale and retail, wood, and leather, bags, and shoes, achieved better performance, exceeding 50%.
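The two computational ingredients named in the abstract, the Amihud (2002) illiquidity measure and voting-based integration of classifiers, can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names, the sample data, the class labels ("low", "mid", "high"), the handling of zero-volume days, and the tie-breaking rule are all assumptions made for illustration; the abstract does not say how the liquidity measure was turned into prediction classes or how voting ties were resolved.

```python
from collections import Counter


def amihud_illiquidity(returns, traded_values):
    """Amihud (2002) ratio: mean over days of |daily return| / daily traded value.

    Days with zero traded value are skipped (an assumption of this sketch).
    """
    ratios = [abs(r) / v for r, v in zip(returns, traded_values) if v > 0]
    if not ratios:
        raise ValueError("no trading days with positive traded value")
    return sum(ratios) / len(ratios)


def majority_vote(predictions):
    """Combine per-classifier label lists by plurality vote, sample by sample.

    Ties go to the label predicted by the earliest classifier in the list
    (an assumption; the abstract does not state a tie-breaking rule).
    """
    combined = []
    for labels in zip(*predictions):
        counts = Counter(labels)
        best = max(counts.values())
        combined.append(next(l for l in labels if counts[l] == best))
    return combined


# Hypothetical daily returns and traded values for one stock.
daily_returns = [0.01, -0.02, 0.005]
daily_values = [1_000_000, 2_000_000, 500_000]
illiq = amihud_illiquidity(daily_returns, daily_values)

# Hypothetical class predictions from three already-trained classifiers.
tan_pred = ["low", "high", "mid", "low"]
cart_pred = ["low", "mid", "mid", "high"]
svm_pred = ["high", "mid", "low", "mid"]
ensemble_pred = majority_vote([tan_pred, cart_pred, svm_pred])
```

A higher ratio means a larger price move per unit of traded value, that is, lower liquidity. With three voters and more than two classes, a three-way tie is possible (as in the last sample above), which is why the sketch needs an explicit tie-breaking rule.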

KEYWORDS

Stock liquidity, Data mining, Ensemble methods, Decision making

Citation status

[journal] 이형철 / 2014 / The Relation between Asset Liquidity and Stock Liquidity / 대한경영학회지 / 대한경영학회 27(10) : 1691~1710

[report] S. W. Hwang / 2015 / Outlook for Korea's stock and bond markets / Korea Capital Market Institute

[journal] K. Mazouz / 2014 / Index revisions, systematic liquidity risk and the cost of equity capital / Journal of International Financial Markets, Institutions and Money 33 : 283~298

[journal] M. L. Lipson / 2009 / Liquidity and capital structure / Journal of Financial Markets 12(4) : 611~644

[journal] 고혁진 / 2009 / The Empirical Analysis on the Relation between Volatility of Liquidity and Return / 대한경영학회지 / 대한경영학회 22(5) : 2873~2893

[journal] Y. Amihud / 1986 / Liquidity and stock returns / Financial Analysts Journal 42(3) : 43~48

[journal] A. S. Turnbull / 2010 / In search of liquidity : The block broker's choice of where to trade cross-listed stocks / Journal of Economics and Business 62(1) : 20~34

[journal] L. Kryzanowski / 2009 / Liquidity minimization and cross-listing choice : Evidence based on Canadian shares cross-listed on U. S. venues / Journal of International Financial Markets, Institutions and Money 19(3) : 550~564

[journal] R. Gopalan / 2012 / Asset liquidity and stock liquidity / Journal of Financial and Quantitative Analysis 47(2) : 333~364

[journal] 조경식 / 2013 / A Study on the Effects of Block Ownership on Trading Activity and Market Liquidity in Korean Stock Market / 대한경영학회지 / 대한경영학회 26(1) : 131~148

[book] J. Pearl / 1988 / Probabilistic Reasoning in Intelligent Systems:Networks of Plausible Inference / Morgan Kaufmann

[journal] B. Yet / 2013 / Decision support system for Warfarin therapy management using Bayesian networks / Decision Support Systems 55(2) : 488~498

[journal] Y. Zuo / 2012 / Stock price forecast using Bayesian network / Expert Systems with Applications 39(8) : 6729~6737

[journal] F. Zheng / 2012 / Subsumption resolution : an efficient and effective technique for semi-naive Bayesian learning / Machine Learning 87(1) : 93~125

[journal] G. I. Webb / 2012 / Learning by extrapolation from marginal to full-multivariate probability distributions : decreasingly naive Bayesian classification / Machine Learning 86(2) : 233~272

[journal] B. Park / 2015 / Using machine learning algorithms for housing price prediction : The case of Fairfax County, Virginia housing data / Expert Systems with Applications 42(6) : 2928~2934

[journal] L. Bouchaala / 2010 / Improving algorithms for structure learning in Bayesian Networks using a new implicit score / Expert Systems with Applications 37(7) : 5470~5475

[journal] R. O. Duda / 2007 / Pattern classification / Journal of Classification 24(2) : 305~307

[book] J. Quinlan / 1993 / C4.5: Programs for Machine Learning / Morgan Kaufmann

[journal] S. Lee / 2013 / Using data envelopment analysis and decision trees for efficiency analysis and recommendation of B2C controls / Decision Support Systems 49(4) : 486~497

[journal] L. Rutkowski / 2014 / The CART decision tree for mining data streams / Information Sciences 266(10) : 1~15

[confproc] Y. Lin / 2013 / An SVM-based Approach for Stock Market Trend Prediction / Proceedings of International Joint Conference on Neural Networks : 1~7

[journal] J. A. Suykens / 1999 / Least squares support vector machine classifiers / Neural Processing Letters 9(3) : 293~300

[journal] L. Zhou / 2010 / Least squares support vector machines ensemble models for credit scoring / Expert Systems with Applications 37(1) : 127~133

[book] M. T. Hagan / 1996 / Neural network design / Pws Pub

[journal] H. C. W. Lau / 2013 / A demand forecast model using a combination of surrogate data analysis and optimal neural network approach / Decision Support Systems 54(3) : 1404~1416

[journal] P. Hájek / 2011 / Municipal credit rating modelling by neural networks / Decision Support Systems 51(1) : 108~118

[journal] T. G. Dietterich / 2002 / Ensemble learning / The Handbook of Brain Theory and Neural Networks 2 : 110~125

[journal] 이건창 / 2007 / A Study on the Classification Properties of Firms to be Subject to Accounting Disclosure Reviews and Investigations: Comparison of Bayesian Network, C5.0, and Ensemble Prediction Methods / 경영학연구 / 한국경영학회 36(3) : 705~737

[journal] L. I. Kuncheva / 2010 / Classifier ensembles for fMRI data analysis : an experiment / Magnetic Resonance Imaging 28(4) : 583~593

[journal] E. Fersini / 2014 / Sentiment analysis : Bayesian Ensemble Learning / Decision Support Systems 68 : 26~38

[journal] J. K. Bae / 2010 / An integrated approach to predict corporate bankruptcy with voting algorithms and neural networks / Korean Business Review 3(2) : 79~101

[journal] 양철원 / 2012 / Comparisons of Liquidity Measures in the Korean Stock Market / 재무연구 / 한국재무학회 25(1) : 37~88

[journal] P. M. Dechow / 1995 / Detecting earnings management / The Accounting Review 70(2) : 193~225

[book] J. Han / 2012 / Data Mining: Concepts and Techniques / Morgan Kaufmann

[journal] 조규수 / 2013 / Influence of Overseas Construction Business on Construction Companies’ Financial Stability / 한국건설관리학회 논문집 / 한국건설관리학회 14(1) : 43~51

[journal] 김갑종 / 2008 / A Study on the Characteristics of Asymmetric Volatility by Industry in Korean Stock Market / 대한경영학회지 / 대한경영학회 21(6) : 2947~2964

This paper was written with support from the National Research Foundation of Korea.

Journal of The Korea Society of Computer and Information 2024 KCI Impact Factor : 0.81