From 895e515042d17e6b05118695364d1920ba1f54bf Mon Sep 17 00:00:00 2001
From: sandeep chauhan <64914145+sandeep92134@users.noreply.github.com>
Date: Mon, 18 Jan 2021 19:16:52 +0530
Subject: [PATCH 1/2] Created using Colaboratory

---
 ..._of_the_Median_Values_of_Our_Dataset.ipynb | 356 ++++++++++++++++++
 1 file changed, 356 insertions(+)
 create mode 100644 module 11/Exercise_144_Using_Linear_Regression_to_Predict_the_Accuracy_of_the_Median_Values_of_Our_Dataset.ipynb
diff --git a/module 11/Exercise_144_Using_Linear_Regression_to_Predict_the_Accuracy_of_the_Median_Values_of_Our_Dataset.ipynb b/module 11/Exercise_144_Using_Linear_Regression_to_Predict_the_Accuracy_of_the_Median_Values_of_Our_Dataset.ipynb
new file mode 100644
index 0000000..cae21e9
--- /dev/null
+++ b/module 11/Exercise_144_Using_Linear_Regression_to_Predict_the_Accuracy_of_the_Median_Values_of_Our_Dataset.ipynb	
@@ -0,0 +1,356 @@
+{
+  "nbformat": 4,
+  "nbformat_minor": 0,
+  "metadata": {
+    "colab": {
+      "name": "Exercise 144: Using Linear Regression to Predict the Accuracy of the Median Values of Our Dataset",
+      "provenance": [],
+      "authorship_tag": "ABX9TyOarmSrG4xeAOWn1sZdPz4j",
+      "include_colab_link": true
+    },
+    "kernelspec": {
+      "name": "python3",
+      "display_name": "Python 3"
+    }
+  },
+  "cells": [
+    {
+      "cell_type": "markdown",
+      "metadata": {
+        "id": "view-in-github",
+        "colab_type": "text"
+      },
+      "source": [
+        "<a href=\"https://colab.research.google.com/github/sandeep92134/The-Python-Workshop/blob/master/module%2011/Exercise_144_Using_Linear_Regression_to_Predict_the_Accuracy_of_the_Median_Values_of_Our_Dataset.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
+      ]
+    },
+    {
+      "cell_type": "markdown",
+      "metadata": {
+        "id": "WB7mxEAsryNC"
+      },
+      "source": [
+        "The goal of this exercise is to build a machine learning model using linear regression. Your model will predict the median value of Boston houses and, based on this, we will come to a conclusion about whether the value is optimal or not.\r\n",
+        "\r\n",
+        "This exercise will be performed on a Jupyter Notebook.\r\n",
+        "1. Open a new notebook file.\r\n",
+        "2. Now, **import** all the necessary libraries, as shown in the following code snippet:\r\n",
+        "```\r\n",
+        " import pandas as pd\r\n",
+        " import numpy as np\r\n",
+        " from sklearn.linear_model import LinearRegression\r\n",
+        " from sklearn.metrics import mean_squared_error\r\n",
+        " from sklearn.model_selection import train_test_split\r\n",
+        "```\r\n",
+        "Now that we have imported the libraries, we will load the data.\r\n",
+        "3. Load the dataset and view the DataFrames to look at the first five rows:\r\n",
+        "```\r\n",
+        " # load data\r\n",
+        " housing_df = pd.read_csv('HousingData.csv')\r\n",
+        " housing_df.head()\r\n",
+        "```\r\n",
+        "Recall that, as mentioned in Chapter 10, Data Analytics with pandas and NumPy, **housing_df = pd.read_cs('HousingData.csv')** will read the **CSV** file in parentheses and store it in a **DataFrame** called housing_df. Then, **housing_df.head()** will display the first five rows of the housing_df **DataFrame** by default.\r\n",
+        "4. Next, enter the following code to clean the dataset of null values using **.dropna()**:\r\n",
+        "```\r\n",
+        " # drop null values\r\n",
+        " housing_df = housing_df.dropna()\r\n",
+        "```\r\n",
+        "5. Now, declare the X and y variables, where you use X for the **predictor** columns and y for the **target** column:\r\n",
+        "```\r\n",
+        " # declare X and y\r\n",
+        " X = housing_df.iloc[:,:-1]\r\n",
+        " y = housing_df.iloc[:, -1]\r\n",
+        "```\r\n",
+        "6. Now we build the actual linear regression model.\r\n",
+        "7. Now, find how accurate the model is. Here, we can test it on unseen data:\r\n",
+        "8. We can now test the prediction by comparing the predicted **y-values**, which is **y_pred**, to the actual **y-values**, which is **y_test**:\r\n",
+        "\r\n",
+        "\r\n",
+        "\r\n"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "metadata": {
+        "id": "gj9xHtcjqLWu"
+      },
+      "source": [
+        "import pandas as pd\r\n",
+        "import numpy as np\r\n",
+        "from sklearn.linear_model import LinearRegression\r\n",
+        "from sklearn.metrics import mean_squared_error\r\n",
+        "from sklearn.model_selection import train_test_split"
+      ],
+      "execution_count": 1,
+      "outputs": []
+    },
+    {
+      "cell_type": "code",
+      "metadata": {
+        "colab": {
+          "base_uri": "https://localhost:8080/",
+          "height": 204
+        },
+        "id": "t8UNBeDQuPyM",
+        "outputId": "aaf99d0a-7a2f-40b5-fe71-a45eada1f058"
+      },
+      "source": [
+        "# load data\r\n",
+        "housing_df = pd.read_csv('HousingData.csv')\r\n",
+        "housing_df.head()"
+      ],
+      "execution_count": 2,
+      "outputs": [
+        {
+          "output_type": "execute_result",
+          "data": {
+            "text/html": [
+              "<div>\n",
+              "<style scoped>\n",
+              "    .dataframe tbody tr th:only-of-type {\n",
+              "        vertical-align: middle;\n",
+              "    }\n",
+              "\n",
+              "    .dataframe tbody tr th {\n",
+              "        vertical-align: top;\n",
+              "    }\n",
+              "\n",
+              "    .dataframe thead th {\n",
+              "        text-align: right;\n",
+              "    }\n",
+              "</style>\n",
+              "<table border=\"1\" class=\"dataframe\">\n",
+              "  <thead>\n",
+              "    <tr style=\"text-align: right;\">\n",
+              "      <th></th>\n",
+              "      <th>CRIM</th>\n",
+              "      <th>ZN</th>\n",
+              "      <th>INDUS</th>\n",
+              "      <th>CHAS</th>\n",
+              "      <th>NOX</th>\n",
+              "      <th>RM</th>\n",
+              "      <th>AGE</th>\n",
+              "      <th>DIS</th>\n",
+              "      <th>RAD</th>\n",
+              "      <th>TAX</th>\n",
+              "      <th>PTRATIO</th>\n",
+              "      <th>B</th>\n",
+              "      <th>LSTAT</th>\n",
+              "      <th>MEDV</th>\n",
+              "    </tr>\n",
+              "  </thead>\n",
+              "  <tbody>\n",
+              "    <tr>\n",
+              "      <th>0</th>\n",
+              "      <td>0.00632</td>\n",
+              "      <td>18.0</td>\n",
+              "      <td>2.31</td>\n",
+              "      <td>0.0</td>\n",
+              "      <td>0.538</td>\n",
+              "      <td>6.575</td>\n",
+              "      <td>65.2</td>\n",
+              "      <td>4.0900</td>\n",
+              "      <td>1</td>\n",
+              "      <td>296</td>\n",
+              "      <td>15.3</td>\n",
+              "      <td>396.90</td>\n",
+              "      <td>4.98</td>\n",
+              "      <td>24.0</td>\n",
+              "    </tr>\n",
+              "    <tr>\n",
+              "      <th>1</th>\n",
+              "      <td>0.02731</td>\n",
+              "      <td>0.0</td>\n",
+              "      <td>7.07</td>\n",
+              "      <td>0.0</td>\n",
+              "      <td>0.469</td>\n",
+              "      <td>6.421</td>\n",
+              "      <td>78.9</td>\n",
+              "      <td>4.9671</td>\n",
+              "      <td>2</td>\n",
+              "      <td>242</td>\n",
+              "      <td>17.8</td>\n",
+              "      <td>396.90</td>\n",
+              "      <td>9.14</td>\n",
+              "      <td>21.6</td>\n",
+              "    </tr>\n",
+              "    <tr>\n",
+              "      <th>2</th>\n",
+              "      <td>0.02729</td>\n",
+              "      <td>0.0</td>\n",
+              "      <td>7.07</td>\n",
+              "      <td>0.0</td>\n",
+              "      <td>0.469</td>\n",
+              "      <td>7.185</td>\n",
+              "      <td>61.1</td>\n",
+              "      <td>4.9671</td>\n",
+              "      <td>2</td>\n",
+              "      <td>242</td>\n",
+              "      <td>17.8</td>\n",
+              "      <td>392.83</td>\n",
+              "      <td>4.03</td>\n",
+              "      <td>34.7</td>\n",
+              "    </tr>\n",
+              "    <tr>\n",
+              "      <th>3</th>\n",
+              "      <td>0.03237</td>\n",
+              "      <td>0.0</td>\n",
+              "      <td>2.18</td>\n",
+              "      <td>0.0</td>\n",
+              "      <td>0.458</td>\n",
+              "      <td>6.998</td>\n",
+              "      <td>45.8</td>\n",
+              "      <td>6.0622</td>\n",
+              "      <td>3</td>\n",
+              "      <td>222</td>\n",
+              "      <td>18.7</td>\n",
+              "      <td>394.63</td>\n",
+              "      <td>2.94</td>\n",
+              "      <td>33.4</td>\n",
+              "    </tr>\n",
+              "    <tr>\n",
+              "      <th>4</th>\n",
+              "      <td>0.06905</td>\n",
+              "      <td>0.0</td>\n",
+              "      <td>2.18</td>\n",
+              "      <td>0.0</td>\n",
+              "      <td>0.458</td>\n",
+              "      <td>7.147</td>\n",
+              "      <td>54.2</td>\n",
+              "      <td>6.0622</td>\n",
+              "      <td>3</td>\n",
+              "      <td>222</td>\n",
+              "      <td>18.7</td>\n",
+              "      <td>396.90</td>\n",
+              "      <td>NaN</td>\n",
+              "      <td>36.2</td>\n",
+              "    </tr>\n",
+              "  </tbody>\n",
+              "</table>\n",
+              "</div>"
+            ],
+            "text/plain": [
+              "      CRIM    ZN  INDUS  CHAS    NOX  ...  TAX  PTRATIO       B  LSTAT  MEDV\n",
+              "0  0.00632  18.0   2.31   0.0  0.538  ...  296     15.3  396.90   4.98  24.0\n",
+              "1  0.02731   0.0   7.07   0.0  0.469  ...  242     17.8  396.90   9.14  21.6\n",
+              "2  0.02729   0.0   7.07   0.0  0.469  ...  242     17.8  392.83   4.03  34.7\n",
+              "3  0.03237   0.0   2.18   0.0  0.458  ...  222     18.7  394.63   2.94  33.4\n",
+              "4  0.06905   0.0   2.18   0.0  0.458  ...  222     18.7  396.90    NaN  36.2\n",
+              "\n",
+              "[5 rows x 14 columns]"
+            ]
+          },
+          "metadata": {
+            "tags": []
+          },
+          "execution_count": 2
+        }
+      ]
+    },
+    {
+      "cell_type": "code",
+      "metadata": {
+        "id": "gkhFUuT6uTUR"
+      },
+      "source": [
+        "# drop null values\r\n",
+        "housing_df = housing_df.dropna()\r\n"
+      ],
+      "execution_count": 3,
+      "outputs": []
+    },
+    {
+      "cell_type": "code",
+      "metadata": {
+        "id": "aHDHQkZGvTpx"
+      },
+      "source": [
+        "# declare X and y\r\n",
+        "X = housing_df.iloc[:,:-1]\r\n",
+        "y = housing_df.iloc[:, -1]"
+      ],
+      "execution_count": 4,
+      "outputs": []
+    },
+    {
+      "cell_type": "code",
+      "metadata": {
+        "id": "YmvQbsrpuyA7"
+      },
+      "source": [
+        "#Create training and test sets\r\n",
+        "X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.2)"
+      ],
+      "execution_count": 5,
+      "outputs": []
+    },
+    {
+      "cell_type": "code",
+      "metadata": {
+        "id": "ylYyCDbhvXvT"
+      },
+      "source": [
+        "#Create the regressor: reg\r\n",
+        "reg = LinearRegression()"
+      ],
+      "execution_count": 6,
+      "outputs": []
+    },
+    {
+      "cell_type": "code",
+      "metadata": {
+        "colab": {
+          "base_uri": "https://localhost:8080/"
+        },
+        "id": "MVOnE4E0u6eZ",
+        "outputId": "ab6cb5ef-f180-41b5-d424-2d8ff0ef13f5"
+      },
+      "source": [
+        "#Fit the regressor to the training data\r\n",
+        "reg.fit(X_train, y_train)"
+      ],
+      "execution_count": 7,
+      "outputs": [
+        {
+          "output_type": "execute_result",
+          "data": {
+            "text/plain": [
+              "LinearRegression(copy_X=True, fit_intercept=True, n_jobs=None, normalize=False)"
+            ]
+          },
+          "metadata": {
+            "tags": []
+          },
+          "execution_count": 7
+        }
+      ]
+    },
+    {
+      "cell_type": "code",
+      "metadata": {
+        "colab": {
+          "base_uri": "https://localhost:8080/"
+        },
+        "id": "VYIfTckQvACx",
+        "outputId": "dbaa661b-1b6c-4874-b359-c2c90a1558dd"
+      },
+      "source": [
+        "# Predict on the test data: y_pred\r\n",
+        "y_pred = reg.predict(X_test)\r\n",
+        "# Compute and print RMSE\r\n",
+        "rmse = np.sqrt(mean_squared_error(y_test, y_pred))\r\n",
+        "print(\"Root Mean Squared Error: {}\".format(rmse))"
+      ],
+      "execution_count": 8,
+      "outputs": [
+        {
+          "output_type": "stream",
+          "text": [
+            "Root Mean Squared Error: 4.035874116638531\n"
+          ],
+          "name": "stdout"
+        }
+      ]
+    }
+  ]
+}
\ No newline at end of file

From 4c9f35cce503975777714c0384dfd4073a4de024 Mon Sep 17 00:00:00 2001
From: sandeep chauhan <64914145+sandeep92134@users.noreply.github.com>
Date: Mon, 18 Jan 2021 20:29:22 +0530
Subject: [PATCH 2/2] Delete module 11 directory

---
 ..._of_the_Median_Values_of_Our_Dataset.ipynb | 356 ------------------
 1 file changed, 356 deletions(-)
 delete mode 100644 module 11/Exercise_144_Using_Linear_Regression_to_Predict_the_Accuracy_of_the_Median_Values_of_Our_Dataset.ipynb

diff --git a/module 11/Exercise_144_Using_Linear_Regression_to_Predict_the_Accuracy_of_the_Median_Values_of_Our_Dataset.ipynb b/module 11/Exercise_144_Using_Linear_Regression_to_Predict_the_Accuracy_of_the_Median_Values_of_Our_Dataset.ipynb
deleted file mode 100644
index cae21e9..0000000
--- a/module 11/Exercise_144_Using_Linear_Regression_to_Predict_the_Accuracy_of_the_Median_Values_of_Our_Dataset.ipynb	
+++ /dev/null
@@ -1,356 +0,0 @@
-{
-  "nbformat": 4,
-  "nbformat_minor": 0,
-  "metadata": {
-    "colab": {
-      "name": "Exercise 144: Using Linear Regression to Predict the Accuracy of the Median Values of Our Dataset",
-      "provenance": [],
-      "authorship_tag": "ABX9TyOarmSrG4xeAOWn1sZdPz4j",
-      "include_colab_link": true
-    },
-    "kernelspec": {
-      "name": "python3",
-      "display_name": "Python 3"
-    }
-  },
-  "cells": [
-    {
-      "cell_type": "markdown",
-      "metadata": {
-        "id": "view-in-github",
-        "colab_type": "text"
-      },
-      "source": [
-        "<a href=\"https://colab.research.google.com/github/sandeep92134/The-Python-Workshop/blob/master/module%2011/Exercise_144_Using_Linear_Regression_to_Predict_the_Accuracy_of_the_Median_Values_of_Our_Dataset.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
-      ]
-    },
-    {
-      "cell_type": "markdown",
-      "metadata": {
-        "id": "WB7mxEAsryNC"
-      },
-      "source": [
-        "The goal of this exercise is to build a machine learning model using linear regression. Your model will predict the median value of Boston houses and, based on this, we will come to a conclusion about whether the value is optimal or not.\r\n",
-        "\r\n",
-        "This exercise will be performed on a Jupyter Notebook.\r\n",
-        "1. Open a new notebook file.\r\n",
-        "2. Now, **import** all the necessary libraries, as shown in the following code snippet:\r\n",
-        "```\r\n",
-        " import pandas as pd\r\n",
-        " import numpy as np\r\n",
-        " from sklearn.linear_model import LinearRegression\r\n",
-        " from sklearn.metrics import mean_squared_error\r\n",
-        " from sklearn.model_selection import train_test_split\r\n",
-        "```\r\n",
-        "Now that we have imported the libraries, we will load the data.\r\n",
-        "3. Load the dataset and view the DataFrames to look at the first five rows:\r\n",
-        "```\r\n",
-        " # load data\r\n",
-        " housing_df = pd.read_csv('HousingData.csv')\r\n",
-        " housing_df.head()\r\n",
-        "```\r\n",
-        "Recall that, as mentioned in Chapter 10, Data Analytics with pandas and NumPy, **housing_df = pd.read_cs('HousingData.csv')** will read the **CSV** file in parentheses and store it in a **DataFrame** called housing_df. Then, **housing_df.head()** will display the first five rows of the housing_df **DataFrame** by default.\r\n",
-        "4. Next, enter the following code to clean the dataset of null values using **.dropna()**:\r\n",
-        "```\r\n",
-        " # drop null values\r\n",
-        " housing_df = housing_df.dropna()\r\n",
-        "```\r\n",
-        "5. Now, declare the X and y variables, where you use X for the **predictor** columns and y for the **target** column:\r\n",
-        "```\r\n",
-        " # declare X and y\r\n",
-        " X = housing_df.iloc[:,:-1]\r\n",
-        " y = housing_df.iloc[:, -1]\r\n",
-        "```\r\n",
-        "6. Now we build the actual linear regression model.\r\n",
-        "7. Now, find how accurate the model is. Here, we can test it on unseen data:\r\n",
-        "8. We can now test the prediction by comparing the predicted **y-values**, which is **y_pred**, to the actual **y-values**, which is **y_test**:\r\n",
-        "\r\n",
-        "\r\n",
-        "\r\n"
-      ]
-    },
-    {
-      "cell_type": "code",
-      "metadata": {
-        "id": "gj9xHtcjqLWu"
-      },
-      "source": [
-        "import pandas as pd\r\n",
-        "import numpy as np\r\n",
-        "from sklearn.linear_model import LinearRegression\r\n",
-        "from sklearn.metrics import mean_squared_error\r\n",
-        "from sklearn.model_selection import train_test_split"
-      ],
-      "execution_count": 1,
-      "outputs": []
-    },
-    {
-      "cell_type": "code",
-      "metadata": {
-        "colab": {
-          "base_uri": "https://localhost:8080/",
-          "height": 204
-        },
-        "id": "t8UNBeDQuPyM",
-        "outputId": "aaf99d0a-7a2f-40b5-fe71-a45eada1f058"
-      },
-      "source": [
-        "# load data\r\n",
-        "housing_df = pd.read_csv('HousingData.csv')\r\n",
-        "housing_df.head()"
-      ],
-      "execution_count": 2,
-      "outputs": [
-        {
-          "output_type": "execute_result",
-          "data": {
-            "text/html": [
-              "<div>\n",
-              "<style scoped>\n",
-              "    .dataframe tbody tr th:only-of-type {\n",
-              "        vertical-align: middle;\n",
-              "    }\n",
-              "\n",
-              "    .dataframe tbody tr th {\n",
-              "        vertical-align: top;\n",
-              "    }\n",
-              "\n",
-              "    .dataframe thead th {\n",
-              "        text-align: right;\n",
-              "    }\n",
-              "</style>\n",
-              "<table border=\"1\" class=\"dataframe\">\n",
-              "  <thead>\n",
-              "    <tr style=\"text-align: right;\">\n",
-              "      <th></th>\n",
-              "      <th>CRIM</th>\n",
-              "      <th>ZN</th>\n",
-              "      <th>INDUS</th>\n",
-              "      <th>CHAS</th>\n",
-              "      <th>NOX</th>\n",
-              "      <th>RM</th>\n",
-              "      <th>AGE</th>\n",
-              "      <th>DIS</th>\n",
-              "      <th>RAD</th>\n",
-              "      <th>TAX</th>\n",
-              "      <th>PTRATIO</th>\n",
-              "      <th>B</th>\n",
-              "      <th>LSTAT</th>\n",
-              "      <th>MEDV</th>\n",
-              "    </tr>\n",
-              "  </thead>\n",
-              "  <tbody>\n",
-              "    <tr>\n",
-              "      <th>0</th>\n",
-              "      <td>0.00632</td>\n",
-              "      <td>18.0</td>\n",
-              "      <td>2.31</td>\n",
-              "      <td>0.0</td>\n",
-              "      <td>0.538</td>\n",
-              "      <td>6.575</td>\n",
-              "      <td>65.2</td>\n",
-              "      <td>4.0900</td>\n",
-              "      <td>1</td>\n",
-              "      <td>296</td>\n",
-              "      <td>15.3</td>\n",
-              "      <td>396.90</td>\n",
-              "      <td>4.98</td>\n",
-              "      <td>24.0</td>\n",
-              "    </tr>\n",
-              "    <tr>\n",
-              "      <th>1</th>\n",
-              "      <td>0.02731</td>\n",
-              "      <td>0.0</td>\n",
-              "      <td>7.07</td>\n",
-              "      <td>0.0</td>\n",
-              "      <td>0.469</td>\n",
-              "      <td>6.421</td>\n",
-              "      <td>78.9</td>\n",
-              "      <td>4.9671</td>\n",
-              "      <td>2</td>\n",
-              "      <td>242</td>\n",
-              "      <td>17.8</td>\n",
-              "      <td>396.90</td>\n",
-              "      <td>9.14</td>\n",
-              "      <td>21.6</td>\n",
-              "    </tr>\n",
-              "    <tr>\n",
-              "      <th>2</th>\n",
-              "      <td>0.02729</td>\n",
-              "      <td>0.0</td>\n",
-              "      <td>7.07</td>\n",
-              "      <td>0.0</td>\n",
-              "      <td>0.469</td>\n",
-              "      <td>7.185</td>\n",
-              "      <td>61.1</td>\n",
-              "      <td>4.9671</td>\n",
-              "      <td>2</td>\n",
-              "      <td>242</td>\n",
-              "      <td>17.8</td>\n",
-              "      <td>392.83</td>\n",
-              "      <td>4.03</td>\n",
-              "      <td>34.7</td>\n",
-              "    </tr>\n",
-              "    <tr>\n",
-              "      <th>3</th>\n",
-              "      <td>0.03237</td>\n",
-              "      <td>0.0</td>\n",
-              "      <td>2.18</td>\n",
-              "      <td>0.0</td>\n",
-              "      <td>0.458</td>\n",
-              "      <td>6.998</td>\n",
-              "      <td>45.8</td>\n",
-              "      <td>6.0622</td>\n",
-              "      <td>3</td>\n",
-              "      <td>222</td>\n",
-              "      <td>18.7</td>\n",
-              "      <td>394.63</td>\n",
-              "      <td>2.94</td>\n",
-              "      <td>33.4</td>\n",
-              "    </tr>\n",
-              "    <tr>\n",
-              "      <th>4</th>\n",
-              "      <td>0.06905</td>\n",
-              "      <td>0.0</td>\n",
-              "      <td>2.18</td>\n",
-              "      <td>0.0</td>\n",
-              "      <td>0.458</td>\n",
-              "      <td>7.147</td>\n",
-              "      <td>54.2</td>\n",
-              "      <td>6.0622</td>\n",
-              "      <td>3</td>\n",
-              "      <td>222</td>\n",
-              "      <td>18.7</td>\n",
-              "      <td>396.90</td>\n",
-              "      <td>NaN</td>\n",
-              "      <td>36.2</td>\n",
-              "    </tr>\n",
-              "  </tbody>\n",
-              "</table>\n",
-              "</div>"
-            ],
-            "text/plain": [
-              "      CRIM    ZN  INDUS  CHAS    NOX  ...  TAX  PTRATIO       B  LSTAT  MEDV\n",
-              "0  0.00632  18.0   2.31   0.0  0.538  ...  296     15.3  396.90   4.98  24.0\n",
-              "1  0.02731   0.0   7.07   0.0  0.469  ...  242     17.8  396.90   9.14  21.6\n",
-              "2  0.02729   0.0   7.07   0.0  0.469  ...  242     17.8  392.83   4.03  34.7\n",
-              "3  0.03237   0.0   2.18   0.0  0.458  ...  222     18.7  394.63   2.94  33.4\n",
-              "4  0.06905   0.0   2.18   0.0  0.458  ...  222     18.7  396.90    NaN  36.2\n",
-              "\n",
-              "[5 rows x 14 columns]"
-            ]
-          },
-          "metadata": {
-            "tags": []
-          },
-          "execution_count": 2
-        }
-      ]
-    },
-    {
-      "cell_type": "code",
-      "metadata": {
-        "id": "gkhFUuT6uTUR"
-      },
-      "source": [
-        "# drop null values\r\n",
-        "housing_df = housing_df.dropna()\r\n"
-      ],
-      "execution_count": 3,
-      "outputs": []
-    },
-    {
-      "cell_type": "code",
-      "metadata": {
-        "id": "aHDHQkZGvTpx"
-      },
-      "source": [
-        "# declare X and y\r\n",
-        "X = housing_df.iloc[:,:-1]\r\n",
-        "y = housing_df.iloc[:, -1]"
-      ],
-      "execution_count": 4,
-      "outputs": []
-    },
-    {
-      "cell_type": "code",
-      "metadata": {
-        "id": "YmvQbsrpuyA7"
-      },
-      "source": [
-        "#Create training and test sets\r\n",
-        "X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.2)"
-      ],
-      "execution_count": 5,
-      "outputs": []
-    },
-    {
-      "cell_type": "code",
-      "metadata": {
-        "id": "ylYyCDbhvXvT"
-      },
-      "source": [
-        "#Create the regressor: reg\r\n",
-        "reg = LinearRegression()"
-      ],
-      "execution_count": 6,
-      "outputs": []
-    },
-    {
-      "cell_type": "code",
-      "metadata": {
-        "colab": {
-          "base_uri": "https://localhost:8080/"
-        },
-        "id": "MVOnE4E0u6eZ",
-        "outputId": "ab6cb5ef-f180-41b5-d424-2d8ff0ef13f5"
-      },
-      "source": [
-        "#Fit the regressor to the training data\r\n",
-        "reg.fit(X_train, y_train)"
-      ],
-      "execution_count": 7,
-      "outputs": [
-        {
-          "output_type": "execute_result",
-          "data": {
-            "text/plain": [
-              "LinearRegression(copy_X=True, fit_intercept=True, n_jobs=None, normalize=False)"
-            ]
-          },
-          "metadata": {
-            "tags": []
-          },
-          "execution_count": 7
-        }
-      ]
-    },
-    {
-      "cell_type": "code",
-      "metadata": {
-        "colab": {
-          "base_uri": "https://localhost:8080/"
-        },
-        "id": "VYIfTckQvACx",
-        "outputId": "dbaa661b-1b6c-4874-b359-c2c90a1558dd"
-      },
-      "source": [
-        "# Predict on the test data: y_pred\r\n",
-        "y_pred = reg.predict(X_test)\r\n",
-        "# Compute and print RMSE\r\n",
-        "rmse = np.sqrt(mean_squared_error(y_test, y_pred))\r\n",
-        "print(\"Root Mean Squared Error: {}\".format(rmse))"
-      ],
-      "execution_count": 8,
-      "outputs": [
-        {
-          "output_type": "stream",
-          "text": [
-            "Root Mean Squared Error: 4.035874116638531\n"
-          ],
-          "name": "stdout"
-        }
-      ]
-    }
-  ]
-}
\ No newline at end of file
