Paris-Saclay University

Introduction to Machine Learning

Supervised Learning - Ensemble Methods - Random Forests and Boosting

Introduction

Introduction to Ensemble Methods

  • Definition: Ensemble methods combine multiple models to produce a more robust and accurate prediction.
  • Key Principle: Aggregating diverse models reduces errors by compensating for individual weaknesses.
  • Types of Ensembles:
    • Bagging: Parallel approach using independent models (e.g., Random Forests).
    • Boosting: Sequential approach improving model performance iteratively (e.g., AdaBoost, Gradient Boosting).

Advantages of Ensemble Methods

  • Increased Accuracy: Combines the strengths of individual models to reduce overall error.
  • Robustness: More resistant to overfitting compared to standalone models.
  • Flexibility: Can combine diverse models (e.g., decision trees, linear models).
  • Feature Importance Insights: Random Forests highlight key drivers of predictions.
Figure: a single decision tree compared with a random forest ensemble.

Bagging - Random Forests

Random Forests: An Overview

  • Definition: A Random Forest is an ensemble of decision trees trained using the bagging method.
  • How It Works (see the sketch after this list):
    1. Generate multiple random samples (bootstrapping) from the original dataset.
    2. Train a decision tree on each bootstrap sample.
    3. Combine predictions:
      • Classification: Majority vote.
      • Regression: Average prediction.
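The workflow above can be sketched with scikit-learn. The dataset (the built-in breast-cancer data) and the number of trees are illustrative choices, not part of the course material:

```python
# Minimal Random Forest sketch with scikit-learn (illustrative dataset and settings).
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# 100 trees, each trained on a bootstrap sample of the training set;
# class predictions are combined by majority vote.
forest = RandomForestClassifier(n_estimators=100, random_state=0)
forest.fit(X_train, y_train)
print("Test accuracy:", forest.score(X_test, y_test))
```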

Feature Importance in Random Forests

  • Why Measure Feature Importance?
    • Identify key drivers of the model's predictions.
    • Improve interpretability of complex models.
    • Guide feature engineering efforts.
  • How It's Measured: Based on the total decrease in Gini impurity (or information gain) a feature contributes across all decision-tree splits, averaged over the trees of the forest (see the sketch below).
Figure: feature importance bar chart and feature correlation heatmap.
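A minimal sketch of reading impurity-based importances from a fitted forest; the dataset and hyperparameters are again illustrative:

```python
# Impurity-based feature importances of a fitted Random Forest (illustrative data).
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier

data = load_breast_cancer()
forest = RandomForestClassifier(n_estimators=100, random_state=0)
forest.fit(data.data, data.target)

# feature_importances_ aggregates each feature's impurity decrease over all
# splits, averaged across the trees of the forest (values sum to 1).
ranked = sorted(zip(data.feature_names, forest.feature_importances_),
                key=lambda pair: pair[1], reverse=True)
for name, score in ranked[:5]:
    print(f"{name}: {score:.3f}")
```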

Random Forests vs. Single Decision Trees

Aspect           | Single Decision Trees      | Random Forests
Accuracy         | Prone to overfitting       | Higher accuracy due to aggregation
Robustness       | Highly sensitive to noise  | More robust to noise
Interpretability | Simple and explainable     | Complex, harder to interpret
Scalability      | Faster for small datasets  | Handles high-dimensional data well
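A quick way to see the accuracy gap in practice is to cross-validate both models on the same data; the dataset and settings below are illustrative:

```python
# Comparing a single decision tree with a Random Forest by cross-validation.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
tree = DecisionTreeClassifier(random_state=0)
forest = RandomForestClassifier(n_estimators=100, random_state=0)

# 5-fold cross-validated accuracy for a single tree vs. an ensemble of trees.
print("Single tree  :", cross_val_score(tree, X, y, cv=5).mean().round(3))
print("Random forest:", cross_val_score(forest, X, y, cv=5).mean().round(3))
```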

Boosting

Boosting: An Introduction

  • Definition: Boosting is an ensemble technique that builds models sequentially, with each new model correcting errors of the previous ones.
  • Key Principle: Focus on "hard-to-learn" examples by adjusting their weights for subsequent models.
  • Comparison with Bagging:
    • Bagging: Trains models independently in parallel.
    • Boosting: Trains models sequentially, leveraging previous results.

How Boosting Works

  1. Start with an initial weak model (e.g., decision stump).
  2. Evaluate the model's performance and assign weights to examples:
    • Increase weights for misclassified examples.
    • Decrease weights for correctly classified examples.
  3. Train the next model using updated weights to focus on harder examples.
  4. Combine all models' predictions through weighted voting or averaging (illustrated in the sketch below).
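A hand-rolled sketch of this loop, using an AdaBoost-style weight update with decision stumps as the weak learners. It is a simplified illustration with an artificial dataset, not a production implementation:

```python
# Hand-rolled boosting loop illustrating the weight-update idea.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, random_state=0)
y_signed = np.where(y == 1, 1, -1)            # work with labels in {-1, +1}

n_rounds = 10
weights = np.full(len(X), 1 / len(X))         # step 1: equal example weights
stumps, alphas = [], []

for _ in range(n_rounds):
    stump = DecisionTreeClassifier(max_depth=1)        # weak learner (stump)
    stump.fit(X, y_signed, sample_weight=weights)
    pred = stump.predict(X)
    err = np.sum(weights[pred != y_signed])            # weighted error rate
    alpha = 0.5 * np.log((1 - err) / (err + 1e-10))    # weight of this model
    # steps 2-3: raise weights of misclassified examples, lower the others
    weights *= np.exp(-alpha * y_signed * pred)
    weights /= weights.sum()
    stumps.append(stump)
    alphas.append(alpha)

# step 4: weighted vote of all weak models
ensemble_pred = np.sign(sum(a * s.predict(X) for a, s in zip(alphas, stumps)))
print("Training accuracy:", np.mean(ensemble_pred == y_signed))
```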

AdaBoost (Adaptive Boosting)

  • Concept: Builds a strong classifier by combining many weak classifiers (e.g., decision stumps).
  • Steps:
    1. Initialize weights for all examples equally.
    2. Train a weak model and compute its error rate.
    3. Update example weights:
      • Increase weights for misclassified examples.
      • Decrease weights for correctly classified examples.
    4. Repeat for a predefined number of iterations.
  • Final Prediction: Weighted sum of all weak classifiers' outputs (see the example below).
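In practice AdaBoost is usually used through a library. A minimal scikit-learn sketch, with illustrative data and settings, relying on the default depth-1 decision stump as the weak learner:

```python
# AdaBoost with decision stumps via scikit-learn (illustrative settings).
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# By default the weak learner is a depth-1 decision tree (a stump);
# n_estimators is the number of boosting rounds, and learning_rate shrinks
# each classifier's contribution to the final weighted vote.
ada = AdaBoostClassifier(n_estimators=200, learning_rate=0.5, random_state=0)
ada.fit(X_train, y_train)
print("Test accuracy:", ada.score(X_test, y_test))
```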

Gradient Boosting

  • Concept: Builds models iteratively, minimizing residual errors (differences between true and predicted values).
  • Steps (sketched in code below):
    1. Train an initial model (e.g., decision tree).
    2. Compute the residual errors of the model.
    3. Train a new model to predict these residuals.
    4. Combine the original predictions with residual predictions.
    5. Repeat until residuals are sufficiently small.
  • Applications:
    • Regression tasks
    • Classification tasks
Figure: gradient boosting applied to a regression task.
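The residual-fitting loop can be written out by hand for the squared-loss regression case. This is a simplified sketch with illustrative settings; in practice one would use scikit-learn's GradientBoostingRegressor or a library such as XGBoost or LightGBM:

```python
# Hand-rolled gradient boosting for regression: each round fits a small tree
# to the current residuals and adds a damped version of its predictions.
import numpy as np
from sklearn.datasets import make_regression
from sklearn.tree import DecisionTreeRegressor

X, y = make_regression(n_samples=500, noise=10.0, random_state=0)

learning_rate = 0.1
prediction = np.full(len(y), y.mean())      # step 1: initial constant model
trees = []

for _ in range(100):
    residuals = y - prediction              # step 2: current residual errors
    tree = DecisionTreeRegressor(max_depth=3, random_state=0)
    tree.fit(X, residuals)                  # step 3: fit a tree to the residuals
    prediction += learning_rate * tree.predict(X)   # step 4: update predictions
    trees.append(tree)

print("Final training MSE:", np.mean((y - prediction) ** 2))
```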

Comparing AdaBoost and Gradient Boosting

  • Core Difference:
    • AdaBoost focuses on reweighting examples based on classification errors.
    • Gradient Boosting minimizes residual errors directly.
  • Training Process:
    • AdaBoost: Sequentially updates example weights.
    • Gradient Boosting: Fits residuals as a new prediction target.
  • Flexibility: Gradient Boosting is more versatile and widely used.
  • Examples:
    • AdaBoost: Effective with simple base models.
    • Gradient Boosting: Commonly used in frameworks like XGBoost and LightGBM.

Limitations and Practical Recommendations

Limitations of Bagging (Random Forests)

  • Computational Complexity:
    • Training multiple decision trees requires significant time and resources.
    • Predictions require aggregating outputs from all trees, which can be slow for large forests.
  • Overfitting Risks:
    • If hyperparameters (e.g., max depth, number of trees) are poorly tuned, overfitting can occur.
  • Interpretability:
    • While individual trees are interpretable, the ensemble as a whole is harder to explain.
  • Data Sensitivity:
    • Performance degrades when the training data contain substantial noise.

Limitations of Boosting (AdaBoost and Gradient Boosting)

  • Computational Complexity:
    • Sequential training is slower compared to parallel methods like bagging.
    • Boosting methods require more fine-tuning of hyperparameters (e.g., learning rate, number of iterations).
  • Overfitting Risks:
    • Boosting can overfit noisy data by focusing excessively on hard-to-learn examples.
  • Interpretability:
    • Boosted models, especially Gradient Boosting, are complex and difficult to interpret.
  • Imbalanced Datasets:
    • AdaBoost can struggle with imbalanced datasets, since the reweighting process may overemphasize a handful of hard (often minority-class) examples.

Practical Recommendations for Ensemble Methods

  • Start Simple: Use Random Forests as a baseline before exploring boosting techniques.
  • Optimize Hyperparameters: Leverage grid search or random search (see the example below) to fine-tune:
    • Number of trees
    • Learning rate (for boosting)
    • Max depth
  • Evaluate Properly: Use cross-validation or separate test sets to validate performance and avoid overfitting.
  • Combine with Feature Engineering: Ensemble methods perform better with well-engineered data.
  • Balance Trade-offs: Consider computational costs versus accuracy gains when choosing methods.
Figure: choosing between Random Forests and boosting methods.
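A minimal hyperparameter-tuning sketch with scikit-learn's GridSearchCV; the parameter grid and dataset are illustrative:

```python
# Grid search with cross-validation for a Random Forest (illustrative grid).
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

X, y = load_breast_cancer(return_X_y=True)

param_grid = {
    "n_estimators": [100, 300],
    "max_depth": [None, 5, 10],
}
search = GridSearchCV(RandomForestClassifier(random_state=0),
                      param_grid, cv=5, n_jobs=-1)
search.fit(X, y)
print("Best parameters:", search.best_params_)
print("Best CV accuracy:", round(search.best_score_, 3))
```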

Conclusion

Bagging vs. Boosting

Aspect              | Bagging                                    | Boosting
Core Approach       | Trains models independently, in parallel   | Trains models sequentially, leveraging errors
Error Reduction     | Reduces variance                           | Reduces bias
Model Combination   | Average or majority voting                 | Weighted sum or weighted voting
Risk of Overfitting | Lower with proper hyperparameters          | Higher if poorly tuned
Interpretability    | More interpretable (e.g., Random Forests)  | Less interpretable (e.g., Gradient Boosting)
Popular Algorithms  | Random Forests                             | AdaBoost, Gradient Boosting, XGBoost