File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Book Chapter: Predicting Housing Prices in Hong Kong Based on AI Interpreted Sentiment in Social Media, Health and Sustainability Factors: A White-box AutoML Research Agenda

TitlePredicting Housing Prices in Hong Kong Based on AI Interpreted Sentiment in Social Media, Health and Sustainability Factors: A White-box AutoML Research Agenda
Authors
KeywordsAutomated valuation
AutoML
Health
Housing price
Prediction
Sentiment
Sustainability
Issue Date15-May-2022
PublisherSpringer Nature
AbstractIn recent years, automation via deep learning/machine learning has flourished. As such, automated valuation will likely become a major task in valuation over the coming years. While most studies in industry/academia utilisemultivariate regression for housing price predictions, we aim to introduce explainable AutoML models (H2O AutoML and Google Cloud Platform (GCP AutoML Tables) for housing price predictions, which will be the first of its kind. The objectives are as three folds: 1. predict housing prices based on the following factors: (i) sentiment, as reflected in (ia) animal spirits (market sentiment) interpreted and constructed using social listening tools with artificial intelligence, and (ib) a sentiment index constructed based on Google searches; (ii) health-related factors, including COVID-19 and radio base stations; (iii) sustainability factors, i.e., the urban heat island effect and polychlorinated dibenzo-p-dioxins (PCDD); (iv) the number of years left on a land lease; and (v) the impact of precarious social events on housing prices. 2. Study the impact of these factors and compare their relative importance on housing price predictions. 3. Predict housing prices via (i) H2O AutoML and (ii) GCP AutoML Tables and compare their results. The first stage includes a systematic literature review and content analysis of big data housing price research for the past ten years in Science Direct, Emerald, Sage and Taylor, and Francis, to uncover the current state-of-the-art research in housing prices and revise/update the variables to be included in our prediction model. Fiftyfive highest housing price transaction records in Hong Kong will be used for the housing price predictions. Housing estate name, price, date of transaction, gross floor area, building completion date, and floor level will be downloaded from the Economic Property Research Center (EPRC).We will then reformat the data so that it can be fitted to AutoML. Second, we will obtain the sentiment of 55 housing estates and the “Hong Kong’s housing market”, as reflected in social media, by using keywords to search on a Natural Language Processing platform. Third, PCDD will be obtained from the Environmental Protection Department, while radio base station locations will be obtained from the Office of the Communications Authority. The number of years left on the housing estates’ land lease will be obtained from the Land Registry Department, with dummy variables being assigned in the prediction models to represent the precarious protest period and the number of COVID-19 cases obtained from Statista to study their impact on housing prices.Moreover, the remote sensing data of housing estates obtained from Landsat 8 (an American Earth observation satellite) can be downloaded through the United States Geological Survey (USGS). The land surface temperature, computed by ENVI 5.3 software, will be compared with nearby areas of vegetation to compute the extra heat caused by the “urban heat island effect”. Big data collection will allow us to utilise H2O AutoML and GCP Cloud AutoML for housing price predictions.We will apply H2O AutoML to select the best machine learning algorithms. Unlike traditional machine learning algorithms, AutoML minimises parameter adjustments and automatically tunes all parameters according to embedded algorithms. The results will then be compared with the results from GCP Cloud AutoML. We will also select the model with the highest efficacy, which will reveal the relative importance of factors and show the coefficients of the factors on housing price predictions.
Persistent Identifierhttp://hdl.handle.net/10722/338376
ISBN

 

DC FieldValueLanguage
dc.contributor.authorLi, RYM-
dc.contributor.authorChau, KW-
dc.date.accessioned2024-03-11T10:28:23Z-
dc.date.available2024-03-11T10:28:23Z-
dc.date.issued2022-05-15-
dc.identifier.isbn9789811907364-
dc.identifier.urihttp://hdl.handle.net/10722/338376-
dc.description.abstractIn recent years, automation via deep learning/machine learning has flourished. As such, automated valuation will likely become a major task in valuation over the coming years. While most studies in industry/academia utilisemultivariate regression for housing price predictions, we aim to introduce explainable AutoML models (H2O AutoML and Google Cloud Platform (GCP AutoML Tables) for housing price predictions, which will be the first of its kind. The objectives are as three folds: 1. predict housing prices based on the following factors: (i) sentiment, as reflected in (ia) animal spirits (market sentiment) interpreted and constructed using social listening tools with artificial intelligence, and (ib) a sentiment index constructed based on Google searches; (ii) health-related factors, including COVID-19 and radio base stations; (iii) sustainability factors, i.e., the urban heat island effect and polychlorinated dibenzo-p-dioxins (PCDD); (iv) the number of years left on a land lease; and (v) the impact of precarious social events on housing prices. 2. Study the impact of these factors and compare their relative importance on housing price predictions. 3. Predict housing prices via (i) H2O AutoML and (ii) GCP AutoML Tables and compare their results. The first stage includes a systematic literature review and content analysis of big data housing price research for the past ten years in Science Direct, Emerald, Sage and Taylor, and Francis, to uncover the current state-of-the-art research in housing prices and revise/update the variables to be included in our prediction model. Fiftyfive highest housing price transaction records in Hong Kong will be used for the housing price predictions. Housing estate name, price, date of transaction, gross floor area, building completion date, and floor level will be downloaded from the Economic Property Research Center (EPRC).We will then reformat the data so that it can be fitted to AutoML. Second, we will obtain the sentiment of 55 housing estates and the “Hong Kong’s housing market”, as reflected in social media, by using keywords to search on a Natural Language Processing platform. Third, PCDD will be obtained from the Environmental Protection Department, while radio base station locations will be obtained from the Office of the Communications Authority. The number of years left on the housing estates’ land lease will be obtained from the Land Registry Department, with dummy variables being assigned in the prediction models to represent the precarious protest period and the number of COVID-19 cases obtained from Statista to study their impact on housing prices.Moreover, the remote sensing data of housing estates obtained from Landsat 8 (an American Earth observation satellite) can be downloaded through the United States Geological Survey (USGS). The land surface temperature, computed by ENVI 5.3 software, will be compared with nearby areas of vegetation to compute the extra heat caused by the “urban heat island effect”. Big data collection will allow us to utilise H2O AutoML and GCP Cloud AutoML for housing price predictions.We will apply H2O AutoML to select the best machine learning algorithms. Unlike traditional machine learning algorithms, AutoML minimises parameter adjustments and automatically tunes all parameters according to embedded algorithms. The results will then be compared with the results from GCP Cloud AutoML. We will also select the model with the highest efficacy, which will reveal the relative importance of factors and show the coefficients of the factors on housing price predictions.-
dc.languageeng-
dc.publisherSpringer Nature-
dc.relation.ispartofCurrent State of Art in Artificial Intelligence and Ubiquitous Cities-
dc.subjectAutomated valuation-
dc.subjectAutoML-
dc.subjectHealth-
dc.subjectHousing price-
dc.subjectPrediction-
dc.subjectSentiment-
dc.subjectSustainability-
dc.titlePredicting Housing Prices in Hong Kong Based on AI Interpreted Sentiment in Social Media, Health and Sustainability Factors: A White-box AutoML Research Agenda-
dc.typeBook_Chapter-
dc.identifier.doi10.1007/978-981-19-0737-1_9-
dc.identifier.scopuseid_2-s2.0-85161158896-
dc.identifier.spage135-
dc.identifier.epage154-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats