ML Links

 

ML Links

Created Friday 29 January 2021

CLV

Heatmap

Corelation matrix

Feature selection



Pipeliling & Tranforming

Pipelines and composite estimators

How to Transform Target Variables for Regression in Python

Recursive Feature Elimination (RFE) for Feature Selection in Python

How to Calculate Feature Importance With Python


Linear Regression


Regression and Types

Residual plots

Correlation Matrix
Python correlation matrix tutorial - https://likegeeks.com/python-correlation-matrix/

Check the quality of the regression


Going Deeper into Regression Analysis with Assumptions, Plots & Solutions - https://www.analyticsvidhya.com/blog/2016/07/deeper-regression-analysis-assumptions-plots-solutions/

Homoscedasticity
- Residuals vs. predicting variables plots

Normality
- Histogram and Q-Q plot of normalized residuals - https://statisticsbyjim.com/regression/model-specification-variable-selection/

Multicollinearity
- Multicollinearity in Regression Analysis: Problems, Detection, and Solutions https://statisticsbyjim.com/regression/multicollinearity-in-regression-analysis/
- What is Multicollinearity? Here’s Everything You Need to Know - https://www.analyticsvidhya.com/blog/2020/03/what-is-multicollinearity/

Testing Linear Regression Assumptions in Python - https://jeffmacaluso.github.io/post/LinearRegressionAssumptions/

Normality tests


Ridge Regression
Complete Tutorial on Ridge and Lasso Regression - https://www.analyticsvidhya.com/blog/2016/01/ridge-lasso-regression-python-complete-tutorial/
A comprehensive beginners guide for Linear, Ridge and Lasso Regression in Python and R https://www.analyticsvidhya.com/blog/2017/06/a-comprehensive-guide-for-linear-ridge-and-lasso-regression/?


Polynomial Regression


Residual Plot


OLS


Regression Feature Importance

Use residual plots to check the assumptions of an OLS linear regression model. If you violate the assumptions, you risk producing results that you can’t trust. Residual plots display the residual values on the y-axis and fitted values, or another variable, on the x-axis. After you fit a regression model, it is crucial to check the residual plots. If your plots display unwanted patterns, you can’t trust the regression coefficients and other numeric results.

Residuals must be randomly plotted. If not, Unfortunately, some of the explanatory information has leaked over to the supposedly random error. There are a variety of reasons why a model can have this problem. The possibilities include a missing:

- Independent variable.
- Polynomial term to model a curve.
- Interaction term.
To fix the problem, you need to identify the missing information, variable, or higher-order term and include it in the model. After you correct the problem and refit the model, the residuals should look nice and random






Scenarios

Handling Zero R2



Visualization

- An Intuitive Guide to Data Visualization in Python https://www.analyticsvidhya.com/blog/2021/02/an-intuitive-guide-to-visualization-in-python/


Pipeline,

Hands-On Tutorial On Machine Learning Pipelines With Scikit-Learn - https://analyticsindiamag.com/hands-on-tutorial-on-machine-learning-pipelines-with-scikit-learn/


Cross Validation and Hyperparameter Tuning

Bagging Classifier



Decision Tree

Visualize a Decision Tree in 4 Ways - https://mljar.com/blog/visualize-decision-tree/


Standardization & Normalization


https://stats.stackexchange.com/questions/372521/in-cluster-analysis-should-i-scale-standardize-my-data-if-variables-are-in-the/372541#372541
https://stats.stackexchange.com/questions/77850/assign-weights-to-variables-in-cluster-analysis
https://www.analyticsvidhya.com/blog/2020/04/feature-scaling-machine-learning-normalization-standardization/
https://machinelearningmastery.com/how-to-improve-neural-network-stability-and-modeling-performance-with-data-scaling/
https://www.kdnuggets.com/2019/04/normalization-vs-standardization-quantitative-analysis.html
https://sebastianraschka.com/Articles/2014_about_feature_scaling.html
https://datascience.stackexchange.com/questions/45900/when-to-use-standard-scaler-and-when-normalizer
Scaling and Normalization
http://www.faqs.org/faqs/ai-faq/neural-nets/part2/section-16.html


KMeans
https://jakevdp.github.io/PythonDataScienceHandbook/05.11-k-means.html
https://stackabuse.com/k-means-clustering-with-scikit-learn/

Seaborn Plots
https://dev.to/thalesbruno/subplotting-with-matplotlib-and-seaborn-5ei8
https://seaborn.pydata.org/examples/grouped_barplot.html

Pandas Groupby - https://realpython.com/pandas-groupby/





Comments