Does machine learning prediction dampen the information asymmetry for non-local investors?

    Jinwoo Jung; Jihwan Kim; Changha Jin


In this study, we examine how accurately machine learning methods estimate commercial real estate transaction prices. We apply Random Forest (RF), Gradient Boosting Machine (GBM), Support Vector Machine (SVM), and Deep Neural Network (DNN) models and compare their relative prediction accuracy. The data consist of 19,640 office property transactions provided by CoStar, covering the 2004–2017 period for 10 major U.S. Consolidated Metropolitan Statistical Areas (CMSAs). We apply each machine learning method and compare performance to identify a critical determinant model for each office market. Furthermore, we use partial dependence (PD) plots to verify the impact of the research variables on predicted commercial office property values. In general, we expect the machine learning results to provide a set of critical determinants of commercial office prices with more predictive power, overcoming the limitations of traditional valuation models. The results for the 10 CMSAs will provide critical implications for out-of-state investors seeking to understand regional commercial real estate markets.
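
The two ideas in the abstract, ranking models by prediction accuracy and reading a partial dependence curve, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the four toy transactions, the feature names `size` and `age`, the two hand-written stand-in models (not the RF, GBM, SVM, or DNN models of the study), and the choice of RMSE as the accuracy measure, which the abstract does not specify.

```python
from math import sqrt
from statistics import mean


def rmse(actual, predicted):
    """Root mean squared error between observed and predicted prices."""
    return sqrt(mean((a - p) ** 2 for a, p in zip(actual, predicted)))


def partial_dependence(predict, rows, feature, grid):
    """Average prediction when `feature` is forced to each grid value
    while all other features keep their observed values."""
    curve = []
    for value in grid:
        modified = [{**row, feature: value} for row in rows]
        curve.append(mean(predict(row) for row in modified))
    return curve


# Hypothetical toy data: size in thousand sq ft, age in years, price in $ million.
rows = [
    {"size": 10.0, "age": 5.0},
    {"size": 20.0, "age": 30.0},
    {"size": 40.0, "age": 10.0},
    {"size": 80.0, "age": 20.0},
]
prices = [2.0, 3.0, 8.0, 14.0]


# Stand-in "models": any callable mapping a feature row to a predicted price.
def linear_model(row):
    return 0.2 * row["size"] - 0.05 * row["age"]


def step_model(row):
    return 12.0 if row["size"] > 30 else 2.5


models = {"linear": linear_model, "step": step_model}

# Score every model on the same data and pick the lowest error.
scores = {
    name: rmse(prices, [model(row) for row in rows])
    for name, model in models.items()
}
best = min(scores, key=scores.get)

# Partial dependence of the best model's prediction on building size.
pd_curve = partial_dependence(models[best], rows, "size", [10.0, 20.0])

print(scores, best, pd_curve)
```

In a real comparison the scores would be computed on held-out transactions rather than on the data used to fit the models, and the stand-in callables would be replaced by fitted estimators.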

Keywords: machine learning, office price, commercial real estate, prediction accuracy, information asymmetry, non-local investors

How to Cite
Jung, J., Kim, J., & Jin, C. (2022). Does machine learning prediction dampen the information asymmetry for non-local investors? International Journal of Strategic Property Management, 26(5), 345–361.
Published in Issue
Nov 14, 2022

This work is licensed under a Creative Commons Attribution 4.0 International License.

