Commit 7cd46d7

Merge pull request #19 from rfbatista/fix/fix-typo-extensive
fix: typo extensive
2 parents: 383cebc + 56df9f7

File tree: 1 file changed (+2, -2)

docs/llm-inference-basics/training-inference-differences.md

Lines changed: 2 additions & 2 deletions
```diff
@@ -23,10 +23,10 @@ Common techniques used in LLM training include:
 - **Reinforcement learning**: Allow the model to learn by trial and error, optimizing based on feedback or rewards.
 - **Self-supervised learning**: Learn by predicting missing or corrupted parts of the data, without explicit labels.
 
-Training is computationally intensive, often requiring extensive GPU or TPU clusters. While this initial cost can be very high, it is more or less a one-time expense. Once the model achieves desired accuracy, retraining is usually only necessary to update or improve the model periodically.
+Training is computationally intensive, often requiring expensive GPU or TPU clusters. While this initial cost can be very high, it is more or less a one-time expense. Once the model achieves desired accuracy, retraining is usually only necessary to update or improve the model periodically.
 
 ## Inference: Using the model in real-time
 
 LLM inference means applying the trained model to new data to make predictions. Unlike training, inference happens continuously and in real-time, responding immediately to user input or incoming data. It is the phase where the model is actively "in use." Better-trained and more finely-tuned models typically provide more accurate and useful inference.
 
-Inference compute needs are ongoing and can become very high, especially as user interactions and traffic grow. Each inference request consumes computational resources such as GPUs. While each inference step may be smaller than training in isolation, the cumulative demand over time can lead to significant operational expenses.
+Inference compute needs are ongoing and can become very high, especially as user interactions and traffic grow. Each inference request consumes computational resources such as GPUs. While each inference step may be smaller than training in isolation, the cumulative demand over time can lead to significant operational expenses.
```
