Business Scenario: Before a bank offers a home loan, it is critical to conduct a home appraisal. The appraisal confirms the validity of the sales price of the property for the bank.
You have been asked to build and train a regression model that estimates the price of a home, based on several factors including: square feet of home, square feet of lot, number of bedrooms, number of bathrooms, location, and so on. You have been provided data for the following variables:
Variable Data
| Variable | Description |
|---|
| ID | Unique ID for each customer. |
| Date | Sale date of the estate. This variable is to be excluded for this regression model. |
| PRICE | The property's sales price in dollars. Price is the variable that you are trying to predict. |
| BEDROOMS | Number of bedrooms above basement level. |
| BATHROOMS | Number of bathrooms above basement level. |
| SQFT_LIVI | Above ground living area in square feet. |
| SQFT_LOT | Square foot measurement for all floors. |
| FLOORS | Number of floors. |
| WATERFRONT | Is there a waterfront on the property? Yes = 1/No=0 |
| Condition | Overall condition rating. |
| GRADE | Overall grade. |
| YR_BUILT | Original property construction date (year). |
| ZIPCODE | House location |
Task Flow: In this practice exercise, you will:
- Build a regression model.
- Train the regression model.
- Verify the output from the regression model.