File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1007/978-981-19-0737-1_9
- Scopus: eid_2-s2.0-85161158896
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Book Chapter: Predicting Housing Prices in Hong Kong Based on AI Interpreted Sentiment in Social Media, Health and Sustainability Factors: A White-box AutoML Research Agenda
Title | Predicting Housing Prices in Hong Kong Based on AI Interpreted Sentiment in Social Media, Health and Sustainability Factors: A White-box AutoML Research Agenda |
---|---|
Authors | |
Keywords | Automated valuation AutoML Health Housing price Prediction Sentiment Sustainability |
Issue Date | 15-May-2022 |
Publisher | Springer Nature |
Abstract | In recent years, automation via deep learning/machine learning has flourished. As such, automated valuation will likely become a major task in valuation over the coming years. While most studies in industry/academia utilisemultivariate regression for housing price predictions, we aim to introduce explainable AutoML models (H2O AutoML and Google Cloud Platform (GCP AutoML Tables) for housing price predictions, which will be the first of its kind. The objectives are as three folds: 1. predict housing prices based on the following factors: (i) sentiment, as reflected in (ia) animal spirits (market sentiment) interpreted and constructed using social listening tools with artificial intelligence, and (ib) a sentiment index constructed based on Google searches; (ii) health-related factors, including COVID-19 and radio base stations; (iii) sustainability factors, i.e., the urban heat island effect and polychlorinated dibenzo-p-dioxins (PCDD); (iv) the number of years left on a land lease; and (v) the impact of precarious social events on housing prices. 2. Study the impact of these factors and compare their relative importance on housing price predictions. 3. Predict housing prices via (i) H2O AutoML and (ii) GCP AutoML Tables and compare their results. The first stage includes a systematic literature review and content analysis of big data housing price research for the past ten years in Science Direct, Emerald, Sage and Taylor, and Francis, to uncover the current state-of-the-art research in housing prices and revise/update the variables to be included in our prediction model. Fiftyfive highest housing price transaction records in Hong Kong will be used for the housing price predictions. Housing estate name, price, date of transaction, gross floor area, building completion date, and floor level will be downloaded from the Economic Property Research Center (EPRC).We will then reformat the data so that it can be fitted to AutoML. Second, we will obtain the sentiment of 55 housing estates and the “Hong Kong’s housing market”, as reflected in social media, by using keywords to search on a Natural Language Processing platform. Third, PCDD will be obtained from the Environmental Protection Department, while radio base station locations will be obtained from the Office of the Communications Authority. The number of years left on the housing estates’ land lease will be obtained from the Land Registry Department, with dummy variables being assigned in the prediction models to represent the precarious protest period and the number of COVID-19 cases obtained from Statista to study their impact on housing prices.Moreover, the remote sensing data of housing estates obtained from Landsat 8 (an American Earth observation satellite) can be downloaded through the United States Geological Survey (USGS). The land surface temperature, computed by ENVI 5.3 software, will be compared with nearby areas of vegetation to compute the extra heat caused by the “urban heat island effect”. Big data collection will allow us to utilise H2O AutoML and GCP Cloud AutoML for housing price predictions.We will apply H2O AutoML to select the best machine learning algorithms. Unlike traditional machine learning algorithms, AutoML minimises parameter adjustments and automatically tunes all parameters according to embedded algorithms. The results will then be compared with the results from GCP Cloud AutoML. We will also select the model with the highest efficacy, which will reveal the relative importance of factors and show the coefficients of the factors on housing price predictions. |
Persistent Identifier | http://hdl.handle.net/10722/338376 |
ISBN |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Li, RYM | - |
dc.contributor.author | Chau, KW | - |
dc.date.accessioned | 2024-03-11T10:28:23Z | - |
dc.date.available | 2024-03-11T10:28:23Z | - |
dc.date.issued | 2022-05-15 | - |
dc.identifier.isbn | 9789811907364 | - |
dc.identifier.uri | http://hdl.handle.net/10722/338376 | - |
dc.description.abstract | In recent years, automation via deep learning/machine learning has flourished. As such, automated valuation will likely become a major task in valuation over the coming years. While most studies in industry/academia utilisemultivariate regression for housing price predictions, we aim to introduce explainable AutoML models (H2O AutoML and Google Cloud Platform (GCP AutoML Tables) for housing price predictions, which will be the first of its kind. The objectives are as three folds: 1. predict housing prices based on the following factors: (i) sentiment, as reflected in (ia) animal spirits (market sentiment) interpreted and constructed using social listening tools with artificial intelligence, and (ib) a sentiment index constructed based on Google searches; (ii) health-related factors, including COVID-19 and radio base stations; (iii) sustainability factors, i.e., the urban heat island effect and polychlorinated dibenzo-p-dioxins (PCDD); (iv) the number of years left on a land lease; and (v) the impact of precarious social events on housing prices. 2. Study the impact of these factors and compare their relative importance on housing price predictions. 3. Predict housing prices via (i) H2O AutoML and (ii) GCP AutoML Tables and compare their results. The first stage includes a systematic literature review and content analysis of big data housing price research for the past ten years in Science Direct, Emerald, Sage and Taylor, and Francis, to uncover the current state-of-the-art research in housing prices and revise/update the variables to be included in our prediction model. Fiftyfive highest housing price transaction records in Hong Kong will be used for the housing price predictions. Housing estate name, price, date of transaction, gross floor area, building completion date, and floor level will be downloaded from the Economic Property Research Center (EPRC).We will then reformat the data so that it can be fitted to AutoML. Second, we will obtain the sentiment of 55 housing estates and the “Hong Kong’s housing market”, as reflected in social media, by using keywords to search on a Natural Language Processing platform. Third, PCDD will be obtained from the Environmental Protection Department, while radio base station locations will be obtained from the Office of the Communications Authority. The number of years left on the housing estates’ land lease will be obtained from the Land Registry Department, with dummy variables being assigned in the prediction models to represent the precarious protest period and the number of COVID-19 cases obtained from Statista to study their impact on housing prices.Moreover, the remote sensing data of housing estates obtained from Landsat 8 (an American Earth observation satellite) can be downloaded through the United States Geological Survey (USGS). The land surface temperature, computed by ENVI 5.3 software, will be compared with nearby areas of vegetation to compute the extra heat caused by the “urban heat island effect”. Big data collection will allow us to utilise H2O AutoML and GCP Cloud AutoML for housing price predictions.We will apply H2O AutoML to select the best machine learning algorithms. Unlike traditional machine learning algorithms, AutoML minimises parameter adjustments and automatically tunes all parameters according to embedded algorithms. The results will then be compared with the results from GCP Cloud AutoML. We will also select the model with the highest efficacy, which will reveal the relative importance of factors and show the coefficients of the factors on housing price predictions. | - |
dc.language | eng | - |
dc.publisher | Springer Nature | - |
dc.relation.ispartof | Current State of Art in Artificial Intelligence and Ubiquitous Cities | - |
dc.subject | Automated valuation | - |
dc.subject | AutoML | - |
dc.subject | Health | - |
dc.subject | Housing price | - |
dc.subject | Prediction | - |
dc.subject | Sentiment | - |
dc.subject | Sustainability | - |
dc.title | Predicting Housing Prices in Hong Kong Based on AI Interpreted Sentiment in Social Media, Health and Sustainability Factors: A White-box AutoML Research Agenda | - |
dc.type | Book_Chapter | - |
dc.identifier.doi | 10.1007/978-981-19-0737-1_9 | - |
dc.identifier.scopus | eid_2-s2.0-85161158896 | - |
dc.identifier.spage | 135 | - |
dc.identifier.epage | 154 | - |