Wednesday, 19 February 2025

Supervised learning in Machine learning

In supervised learning, a model is trained using a labelled data. That means each data point in the training data set has a known output value.


For example, I want to build a machine learning model that predict the prices of a second hand mobile phone. Following independent variables can be used to predict the price of the mobile.

 

  1. brand: (Google, Apple, OnePlus, Samsung etc,)
  2. model: iphone 11, OnePlus7 etc.,
  3. Release year
  4. Ram of the phone
  5. storage capacity
  6. Front camera quality
  7. Back camera quality
  8. warranty status
  9. price of new version of the phone in market
  10. demand of the phone in market
  11. screen size
  12. Age of the phone
  13. Number of previous owners
  14. repair history
  15. features like 5G
  16. operating system : Android, IOS
  17. battery capacity
  18. damages if any
  19. current price in the market etc.,

 

Above independent variables are used to come up with a formula to predict the dependent variable ‘price’ of the phone. Same is depicted in the below image.

 


Supervised learning mainly used in following areas.

 

a. Regression: Predicting a continuous value based in input features. In other words, predicting a dependent variable based on one or more independent variables. Price prediction of second hand mobile phone is an example of Regression. . Following are some of the examples of regression.

 

1.   House price prediction

2.   Stock price prediction

3.   Weather forecasting

4.   Demand forecasting

5.   Website traffic prediction

6.   Predicting sales of a company

7.   Predicting success rate of cancer treatment

 

 

b. Classification: Predict the category of input data. Following are some of the examples of classification.

 

1.   Image classification: Classifying the images to different categories like cat, dog, tiger, human, hill, lake etc.,

2.   Email spam detection

3.   Language detection by the text

4.   Disease category detection (diabetes, kidney disease etc.,)

5.   Text content categorization like Business, politics, stock market, Entertainment etc.,

6.   Customers classification by the past purchase history

 

Key terms in Supervised learning

  1. Independent variables: Input data points used to predict the outcome
  2. Dependent variable : It is the one that we are trying to predict from given independent variables. For example, price of second hand mobile.
  3. Training data: Labelled dataset used to train the machine learning model.
  4. Test data: Separate labelled dataset used to evaluate the model's performance.
  5. Overfitting: Model performs very well on the training data, but not work well on the new data.
  6. Underfitting: model is unable to capture the underlying patterns in the training data and also performs poorly on new data.
  7. Loss function: A mathematical equation or function that quantifies the difference between the predicted values and the actual labels in the training data. The aim of the model is to reduce the loss during training.
  8. Hyperparameters: Parameters that are set before training. For example learning rate, number of hidden layers in the neural network, batch size, number of times the model will be trained on the training data etc.,


Previous                                                    Next                                                    Home

No comments:

Post a Comment