|
400 | 400 | "| 3 | A mean of `[0.485, 0.456, 0.406]` (values across each colour channel). | `torchvision.transforms.Normalize(mean=...)` to adjust the mean of our images. |\n", |
401 | 401 | "| 4 | A standard deviation of `[0.229, 0.224, 0.225]` (values across each colour channel). | `torchvision.transforms.Normalize(std=...)` to adjust the standard deviation of our images. | \n", |
402 | 402 | "\n", |
403 | | - "> **Note:** ^some pretrained models from `torchvision.models` in different sizes to `[3, 224, 224]`, for example, some might take them in `[3, 240, 240]`. For specific input image sizes, see the documentation.\n", |
| 403 | + "> **Note:** some pretrained models from `torchvision.models` take images in different sizes to `[3, 224, 224]`, for example, some might take them in `[3, 240, 240]`. For specific input image sizes, see the documentation.\n", |
404 | 404 | "\n", |
405 | 405 | "> **Question:** *Where did the mean and standard deviation values come from? Why do we need to do this?*\n", |
406 | 406 | ">\n", |
|
495 | 495 | "```\n", |
496 | 496 | "\n", |
497 | 497 | "Where,\n", |
498 | | - "* `EfficientNet_B0_Weights` is the model architecture weights we'd like to use (there are many differnt model architecture options in `torchvision.models`).\n", |
| 498 | + "* `EfficientNet_B0_Weights` is the model architecture weights we'd like to use (there are many different model architecture options in `torchvision.models`).\n", |
499 | 499 | "* `DEFAULT` means the *best available* weights (the best performance on ImageNet).\n", |
500 | 500 | " * **Note:** Depending on the model architecture you choose, you may also see other options such as `IMAGENET_V1` and `IMAGENET_V2` where generally the higher version number the better. Though if you want the best available, `DEFAULT` is the easiest option. See the [`torchvision.models` documentation](https://pytorch.org/vision/main/models.html) for more.\n", |
501 | 501 | " \n", |
|
530 | 530 | "id": "cebcdf20-4ab7-40ba-8691-9d9af8962dab", |
531 | 531 | "metadata": {}, |
532 | 532 | "source": [ |
533 | | - "And now to access the transforms assosciated with our `weights`, we can use the `transforms()` method.\n", |
| 533 | + "And now to access the transforms associated with our `weights`, we can use the `transforms()` method.\n", |
534 | 534 | "\n", |
535 | 535 | "This is essentially saying \"get the data transforms that were used to train the `EfficientNet_B0_Weights` on ImageNet\"." |
536 | 536 | ] |
|
657 | 657 | "\n", |
658 | 658 | "But if you've got unlimited compute power, as [*The Bitter Lesson*](http://www.incompleteideas.net/IncIdeas/BitterLesson.html) states, you'd likely take the biggest, most compute-hungry model you can.\n",
659 | 659 | "\n", |
660 | | - "Understanding this **performance vs. speed vs. size tradeoff** will come with time and practice.\n", |
| 660 | + "Understanding this **performance vs. speed vs. size** tradeoff will come with time and practice.\n", |
661 | 661 | "\n", |
662 | 662 | "For me, I've found a nice balance in the `efficientnet_bX` models. \n", |
663 | 663 | "\n", |
|
1267 | 1267 | "* **Same shape** - If our images are different shapes to what our model was trained on, we'll get shape errors.\n", |
1268 | 1268 | "* **Same datatype** - If our images are a different datatype (e.g. `torch.int8` vs. `torch.float32`) we'll get datatype errors.\n", |
1269 | 1269 | "* **Same device** - If our images are on a different device to our model, we'll get device errors.\n", |
1270 | | - "* **Same transformations** - If our model is trained on images that have been transformed in certain way (e.g. normalized with a specific mean and standard deviation) and we try and make preidctions on images transformed in a different way, these predictions may be off.\n", |
| 1270 | + "* **Same transformations** - If our model is trained on images that have been transformed in a certain way (e.g. normalized with a specific mean and standard deviation) and we try and make predictions on images transformed in a different way, these predictions may be off.\n", |
1271 | 1271 | "\n", |
1272 | 1272 | "> **Note:** These requirements go for all kinds of data if you're trying to make predictions with a trained model. Data you'd like to predict on should be in the same format as your model was trained on.\n", |
1273 | 1273 | "\n", |
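The same shape/datatype/device checklist above can be sketched as a small pre-flight check. A hypothetical `model` stands in for a real trained one:

```python
import torch

model = torch.nn.Linear(224 * 224 * 3, 3)  # hypothetical stand-in for a trained model
image = torch.rand(3, 224, 224)            # candidate image tensor

# Match the image to the model's own datatype and device before predicting
param = next(model.parameters())
image = image.to(dtype=param.dtype)    # same datatype as the model's weights
image = image.to(device=param.device)  # same device as the model

assert image.dtype == param.dtype
assert image.device == param.device
```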
|
1359 | 1359 | "\n", |
1360 | 1360 | "We can get a list of all the test image paths using `list(Path(test_dir).glob(\"*/*.jpg\"))`. The stars in the `glob()` method mean \"any file matching this pattern\", in other words, any file ending in `.jpg` (all of our images).\n",
1361 | 1361 | "\n", |
1362 | | - "And then we can randomly sample a number of these using Python's [`random.sample(populuation, k)`](https://docs.python.org/3/library/random.html#random.sample) where `population` is the sequence to sample and `k` is the number of samples to retrieve." |
| 1362 | + "And then we can randomly sample a number of these using Python's [`random.sample(population, k)`](https://docs.python.org/3/library/random.html#random.sample) where `population` is the sequence to sample and `k` is the number of samples to retrieve." |
1363 | 1363 | ] |
1364 | 1364 | }, |
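Put together, sampling test image paths might look like this sketch (the `test_dir` value is an assumption based on the dataset layout used earlier in the notebook):

```python
import random
from pathlib import Path

test_dir = Path("data/pizza_steak_sushi/test")  # assumed dataset location

# "*/*.jpg" matches any .jpg inside any class subfolder of test_dir
test_image_paths = list(test_dir.glob("*/*.jpg"))

# Randomly pick k paths (guard k so sample() never asks for more than exist)
k = 3
random_image_paths = random.sample(test_image_paths,
                                   k=min(k, len(test_image_paths)))
```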
1365 | 1365 | { |
|
1445 | 1445 | "\n", |
1446 | 1446 | "That's where the real fun of machine learning is!\n", |
1447 | 1447 | "\n", |
1448 | | - "Predicting on your own custom data, outisde of any training or test set.\n", |
| 1448 | + "Predicting on your own custom data, outside of any training or test set.\n", |
1449 | 1449 | "\n", |
1450 | 1450 | "To test our model on a custom image, let's import the old faithful `pizza-dad.jpeg` image (an image of my dad eating pizza).\n", |
1451 | 1451 | "\n", |
|
1521 | 1521 | "metadata": {}, |
1522 | 1522 | "source": [ |
1523 | 1523 | "## Main takeaways\n", |
1524 | | - "* **Transfer learning** often allows to you get good results with a relatively small amount of custom data.\n", |
| 1524 | + "* **Transfer learning** often allows you to get good results with a relatively small amount of custom data.\n", |
1525 | 1525 | "* Knowing the power of transfer learning, it's a good idea to ask at the start of every problem, \"does an existing well-performing model exist for my problem?\"\n", |
1526 | 1526 | "* When using a pretrained model, it's important that your custom data be formatted/preprocessed in the same way that the original model was trained on, otherwise you may get degraded performance.\n", |
1527 | 1527 | "* The same goes for predicting on custom data, ensure your custom data is in the same format as the data your model was trained on.\n", |
|
1560 | 1560 | " * You may want to try an EfficientNet with a higher number than our B0, perhaps `torchvision.models.efficientnet_b2()`?\n", |
1561 | 1561 | " \n", |
1562 | 1562 | "## Extra-curriculum\n", |
1563 | | - "* Look up what \"model fine-tuning\" is and spend 30-minutes researching different methods to perform it with PyTorch. How would we change our code to fine-tine? Tip: fine-tuning usually works best if you have *lots* of custom data, where as, feature extraction is typically better if you have less custom data.\n", |
1564 | | - "* Check out the new/upcoming [PyTorch multi-weights API](https://pytorch.org/blog/introducing-torchvision-new-multi-weight-support-api/) (still in beta at time of writing, May 2022), it's a new way to perform transfer learning in PyTorch. What changes to our code would need to made to use the new API?\n", |
| 1563 | + "* Look up what \"model fine-tuning\" is and spend 30 minutes researching different methods to perform it with PyTorch. How would we change our code to fine-tune? Tip: fine-tuning usually works best if you have *lots* of custom data, whereas feature extraction is typically better if you have less custom data.\n", |
| 1564 | + "* Check out the new/upcoming [PyTorch multi-weights API](https://pytorch.org/blog/introducing-torchvision-new-multi-weight-support-api/) (still in beta at time of writing, May 2022), it's a new way to perform transfer learning in PyTorch. What changes to our code would need to be made to use the new API?\n", |
1565 | 1565 | "* Try to create your own classifier on two classes of images, for example, you could collect 10 photos of your dog and your friend's dog and train a model to classify the two dogs. This would be a good way to practice creating a dataset as well as building a model on that dataset."
1566 | 1566 | ] |
1567 | 1567 | } |
|