Comparison of Machine Learning Algorithms for House Price Prediction using Real Time Data

Swarali M. Pathak; Archana K. Chaudhari

doi:10.5281/zenodo.18493713

Volume 10, Issue 12 (December 2021)


Open Access
Authors : Swarali M. Pathak , Archana K. Chaudhari
Paper ID : IJERTV10IS120154
Volume & Issue : Volume 10, Issue 12 (December 2021)
Published (First Online): 27-12-2021
ISSN (Online) : 2278-0181
Publisher Name : IJERT
License: This work is licensed under a Creative Commons Attribution 4.0 International License


Swarali M. Pathak, Prof. Archana K. Chaudhari Department of Instrumentation and Control Engineering Vishwakarma Institute of Technology, Pune

Abstract: Housing prices are a crucial reflection of the economy, and property values are of great interest to buyers as well as sellers. Real estate is one of the least transparent industries in our ecosystem. The main aim of this research project is to predict house prices from real-time factors. This paper makes evaluations based on some basic parameters that are considered while determining the price of a house. To carry out the research on real-time data, housing data for Pune city was collected manually. The project uses regression techniques, since the outcome variable is continuous. We implemented different regression models and compared them to determine the most effective model for the given problem statement. The goal of this research project is to create an effective machine learning model that can accurately estimate the price of a house from the given features, and to deploy that model as a website so that it reaches individual users.

Keywords: Linear Regression, Machine Learning, Random Forest, Real Estate, Real-time Data, Support Vector Regressor.

I. INTRODUCTION

In recent years, machine learning has proven able to solve real-world problems using a variety of algorithms. It plays a major role in advances in medical imaging, spam and fraud detection, the automobile industry, security alerts and business analysis. In this paper, we use machine learning algorithms to perform predictive analysis of house prices and to provide an overview of real estate business and property demand. Data is the most important part of the analysis of any problem: it provides detailed information in a format that machines can process. Real estate prices change frequently, depending on certain parameters. In 2020, the average property price in Pune was around Rs 6,573 per sq ft, as per listings on Housing.com. For a real estate business, data is the most important source for analysis and prediction. It is always an advantage to know how a quantity is likely to vary in the near future, so that business managers can act accordingly and avoid losses, and this requires an accurate predictive model. Likewise, reliable predictions for houses in the housing market are needed to provide appropriate price estimates and help real estate managers anticipate price movements. Buying a house is a lifetime goal for most individuals, yet many people make serious mistakes when buying property. One common mistake is buying a property that is priced well above its worth.

Various methods have been used for price prediction. This project aims to predict real estate prices using machine learning techniques with the help of real-time data on houses in Pune, India. The goal of this statistical analysis is to help us understand the relationship between house features and how these variables can be used to predict the house price. It compares regression algorithms to find the best-fitting model for predicting the house price, which helps people avoid such mistakes. The results show that this approach yields lower error and higher accuracy than the individual algorithms applied. The goal of this project is to build a machine learning model that can accurately estimate the worth of a house given its features.

LITERATURE SURVEY
1. Real Estate Price Prediction with Regression and Classification: In this paper, house prices are predicted using explanatory variables that cover many aspects of residential houses. House prices are predicted with various regression techniques including Lasso, Ridge, SVM regression and Random Forest. According to this paper, the best-performing regression model is SVR with a Gaussian kernel, with an RMSE of 0.5271; however, visualization for SVR was difficult because of its high dimensionality. According to its analysis, living area in square feet, roof material and neighborhood have the greatest statistical significance in predicting a house's sale price. [Hujia Yu, Jiafu Wu ({hujiay, jiafuwu}@stanford.edu), CS 229 Autumn 2016 Project Final Report].
2. A SVR based forecasting approach for real estate price prediction: The support vector machine (SVM) has been successfully applied to classification, clustering and forecasting. This study proposes support vector regression (SVR) to forecast real estate prices in China. The aim of the paper was to examine the feasibility of SVR in real estate price prediction. The experimental results were evaluated using the mean absolute error (MAE), the mean absolute percentage error (MAPE) and the root mean squared error (RMSE), and they showed the SVR-based approach to be an efficient tool for forecasting real estate prices. [Hong Zhao, Rong-Qiu Chen, Wei Xu, Da-Ying Li, published in 2009 International Conference on Machine Learning and Cybernetics].
3. Using machine learning algorithms for housing price prediction: The case of Fairfax County, Virginia housing data: This study used machine learning to develop housing price prediction models. It analyzes the housing data of 5359 townhouses in Fairfax County, VA. 10-fold cross-validation was applied to C4.5, RIPPER, Bayesian, and AdaBoost. [Byeonghwa Park, Jae Kwon Bae, Department of Business Statistics, Hannam University, 70 Hannam-ro, Daedeok-gu, Republic of Korea].
4. House Price Prediction Using Machine Learning and Neural Networks: This paper aims to make evaluations based on every basic parameter that is considered while determining the price. The model uses various regression techniques, and the result is not determined by one technique alone; rather, it is the weighted mean of several techniques, in order to give the most accurate results. The results showed that this approach yields lower error and higher accuracy than the individual algorithms applied. [Ayush Varma, Abhijit Sarma, Sagar Doshi, Rohini Nair, Computer Engineering Department, KJ Somaiya College Of Engineering, Mumbai, published in 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT)].
5. Valuation Of House Prices Using Predictive Techniques: This paper uses machine learning algorithms to predict house prices. Logistic Regression, Support Vector Regression, Lasso Regression and Decision Tree are employed to build a predictive model on housing data for 3000 properties. Logistic Regression, SVM, Lasso Regression and Decision Tree show R-squared values of 0.98, 0.96, 0.81 and 0.99 respectively. Further comparisons of these algorithms are based on parameters such as MAE, MSE, RMSE and accuracy. [International Journal of Advances in Electronics and Computer Science, ISSN: 2393-2835, Volume-5, Issue-6, Jun.-2018].
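
The error metrics named in the surveyed papers (MAE, MAPE, RMSE and R-squared) follow standard definitions. The sketch below computes them in plain Python; the function names and the sample prices are illustrative assumptions and do not come from any of the cited datasets.

```python
import math


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def mape(y_true, y_pred):
    """Mean absolute percentage error, in percent (assumes no true value is zero)."""
    return 100.0 * sum(abs((t - p) / t) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Made-up prices for illustration only.
actual = [50.0, 80.0, 100.0, 120.0]
predicted = [55.0, 76.0, 100.0, 110.0]

print("MAE :", mae(actual, predicted))
print("MAPE:", mape(actual, predicted))
print("RMSE:", rmse(actual, predicted))
print("R^2 :", r_squared(actual, predicted))
```

MAE and RMSE are in the same units as the price, while MAPE and R-squared are unit-free, which is why papers often report both kinds.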
METHODOLOGY
IMPLEMENTATION: The flow of implementation is as follows:

Fig 5. Flow of the Project
Once the implementation is done, the model predicts the price of a property (house) in a particular location. We deploy the model using the Flask framework and create a UI in which the user enters the desired values and the model predicts the output. This is made possible by Flask, a Python package for creating web applications and APIs. To build the web application and link the model to it, we first export our model into pickle and JSON files and design the web page using HTML, CSS and JavaScript. With this, the model is ready to be displayed and to make predictions on the web application.

Fig 14. Deployment of Model using Flask
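
The deployment steps described above (export to pickle and JSON, then serve predictions through Flask) can be sketched as follows. This is a minimal illustration and not the authors' code: the column names, the coefficient values, the `/predict` route and the use of a plain dict in place of the trained estimator are all assumptions made so that the example is self-contained.

```python
import json
import pickle

from flask import Flask, jsonify, request

# --- Export step (normally run once, after training) ---
# Hypothetical stand-in for the trained model: a dict of linear coefficients.
columns = ["sqft", "location_a", "location_b"]          # assumed feature columns
model = {"coef": [0.05, 10.0, 20.0], "intercept": 5.0}  # made-up values
model_blob = pickle.dumps(model)                        # would be written to a .pickle file
columns_blob = json.dumps({"data_columns": columns})    # would be written to a .json file

# --- Serving step: load the artifacts back ---
loaded_model = pickle.loads(model_blob)
loaded_columns = json.loads(columns_blob)["data_columns"]


def build_features(sqft, location):
    """Build a feature vector: area first, then a one-hot flag for the location."""
    x = [0.0] * len(loaded_columns)
    x[0] = float(sqft)
    key = "location_" + location.lower()
    if key in loaded_columns:  # unknown locations leave all flags at zero
        x[loaded_columns.index(key)] = 1.0
    return x


def predict_price(sqft, location):
    """Apply the stand-in linear model to one input row."""
    x = build_features(sqft, location)
    return loaded_model["intercept"] + sum(
        c * v for c, v in zip(loaded_model["coef"], x)
    )


app = Flask(__name__)


@app.route("/predict", methods=["POST"])
def predict():
    """Read JSON such as {"sqft": 1000, "location": "A"} and return an estimate."""
    data = request.get_json()
    price = predict_price(data["sqft"], data["location"])
    return jsonify({"estimated_price": round(price, 2)})
```

With these made-up coefficients, posting `{"sqft": 1000, "location": "A"}` to `/predict` returns an `estimated_price` of 65.0. In a real deployment the pickle file would hold the fitted estimator, and the web page's JavaScript would send the form values to this endpoint.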
RESULTS AND DISCUSSIONS

Cross-validation of the different algorithms proved to be a suitable method for finding the best-fitting algorithm for the model. Linear Regression gives precise estimates of house prices, and its estimates remain accurate across different locations. According to the confusion matrix as well, Linear Regression gives nearly accurate predictions. Linear Regression fits our dataset best and gives the highest accuracy, 85.64%. Support Vector Regression gives an accuracy of 62.81%, and Decision Tree gives the lowest accuracy, 56.02%.

Fig 15. Linear Regression Output Predictions for Different locations
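
A cross-validated comparison of the three regressors discussed above could look like the following sketch with scikit-learn. Several things here are assumptions: the data is synthetic (it is not the Pune dataset), the split strategy (`ShuffleSplit` with five splits) is chosen only for illustration, and the reported score is R-squared, which is what scikit-learn regressors return by default and is assumed here to correspond to the "accuracy" figures.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import ShuffleSplit, cross_val_score
from sklearn.svm import SVR
from sklearn.tree import DecisionTreeRegressor

# Synthetic stand-in data: price grows linearly with area, plus a per-location offset.
rng = np.random.default_rng(0)
n = 300
sqft = rng.uniform(400, 2500, n)
location = rng.integers(0, 3, n)                   # three hypothetical locations
X = np.column_stack([sqft, np.eye(3)[location]])   # area + one-hot location
y = 0.06 * sqft + np.array([10.0, 25.0, 40.0])[location] + rng.normal(0, 5, n)

models = {
    "Linear Regression": LinearRegression(),
    "Decision Tree": DecisionTreeRegressor(random_state=0),
    "Support Vector Regression": SVR(),
}

# Five random 80/20 train/test splits; the same splits are reused for every model.
cv = ShuffleSplit(n_splits=5, test_size=0.2, random_state=0)

scores = {}
for name, model in models.items():
    # For regressors, cross_val_score uses the estimator's R^2 score by default.
    scores[name] = cross_val_score(model, X, y, cv=cv).mean()

for name, score in scores.items():
    print(f"{name}: mean R^2 = {score:.3f}")
```

Because the synthetic target is linear by construction, Linear Regression scores highest here; the ranking on real data depends on the dataset and on preprocessing such as feature scaling, which SVR in particular is sensitive to.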

The model has also shown that location and square-foot area play an important role in deciding the price of a property. This is helpful information for sellers and buyers, who can act accordingly. The GUI provides easy access to the model, improving its accessibility.

Fig 16. Final Output Prediction with UI
FUTURE SCOPE
- In the future, the GUI can be made more attractive and interactive. It can also be turned into a real estate sale website where sellers list the details of houses for sale and buyers contact them based on the details given on the website.
- To simplify things for the user, a recommender system can suggest real estate properties based on the predicted price.
- The current dataset includes only a few locations in Pune city; expanding it to other cities and states of India is a future goal.
- To make the system even more informative and user-friendly, Google Maps can be included. This would show neighborhood amenities such as hospitals and schools within 1 km of the given location. These can also be used in making predictions, since the presence of such amenities increases the price of a property.
CONCLUSION

In this research paper, we used machine learning algorithms to predict house prices. We followed a step-by-step procedure to analyze the dataset and find the correlations between its parameters. The manually collected real-time dataset contains 1635 entries along with their independent variables. We analyzed and pre-processed this dataset before performing Exploratory Data Analysis. The resulting feature set was given as input to the machine learning algorithms, and the performance of each model was calculated and compared on the basis of its accuracy score. We found that Linear Regression fits our dataset best and gives the highest accuracy, 85.64%. Support Vector Regression gives an accuracy of 62.81%, and Decision Tree gives the lowest accuracy, 56.02%. We thus conclude that the regression techniques we implemented show how well each algorithm fits the given problem statement of house price prediction.
REFERENCES

Maharshi Modi, Ayush Sharma, Dr. P. Madhavan, "Applied Research On House Price Prediction Using Diverse Machine Learning Techniques," International Journal of Scientific & Technology Research, Volume 9, Issue 04, April 2020.
G. Naga Satish, Ch. V. Raghavendran, M.D. Sugnana Rao, Ch. Srinivasulu, "House Price Prediction Using Machine Learning," International Journal of Innovative Technology and Exploring Engineering (IJITEE), ISSN: 2278-3075, Volume-8, Issue-9, July 2019.
Dr. M. Thamarai, Dr. S P. Malarvizhi, "House Price Prediction Modeling Using Machine Learning," I.J. Information Engineering and Electronic Business, 2020, 2, 15-20.
Neelam Shinde, Kiran Gawande, "Valuation Of House Prices Using Predictive Technique," International Journal of Advances in Electronics and Computer Science, ISSN: 2393-2835, Volume-5, Issue-6, Jun.-2018.
Ayush Varma, Abhijit Sarma, Sagar Doshi, Rohini Nair, Computer Engineering Department, KJ Somaiya College Of Engineering, Mumbai, "House Price Prediction Using Machine Learning and Neural Networks," 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT).
Hong Zhao, Rong-Qiu Chen, Wei Xu, Da-Ying Li, "A SVR based forecasting approach for real estate price prediction," 2009 International Conference on Machine Learning and Cybernetics.
Uysal, İ., Güvenir, H. A., "An overview of regression techniques for knowledge discovery," The Knowledge Engineering Review, Vol. 14:4, 1999, 319-340.
Bhuriya, Dinesh, et al., "Stock market predication using a linear regression," Electronics, Communication and Aerospace Technology (ICECA), 2017 International Conference of, Vol. 2, IEEE, 2017.
Limsombunchai, "House price prediction: hedonic price model vs. artificial neural network," New Zealand Agricultural and Resource Economics Society Conference, 2004.
S. C. Bourassa, E. Cantoni, and M. Hoesli, "Predicting house prices with spatial dependence: a comparison of alternative methods," Journal of Real Estate Research, vol. 32, no. 2, pp. 139-160, 2010.
Li, Li, and Kai-Hsuan Chu, "Prediction of real estate price variation based on economic parameters," Applied System Innovation (ICASI), 2017 International Conference on, IEEE, 2017.
Pedregosa, Fabian, et al., "Scikit-learn: Machine learning in Python," Journal of Machine Learning Research 12.Oct (2011): 2825-2830.
