Author: Prateek Sharma

How to use K-Nearest Neighbor (KNN) algorithm on a dataset?

By Prateek Sharma & Priya Chetty on July 16, 2018 2 Comments

K- Nearest Neighbor, popular as K-Nearest Neighbor (KNN), is an algorithm that helps to assess the properties of a new variable with the help of the properties of existing variables. KNN is applicable in classification as well as regression predictive problems.

analyse with SPSS, classification in supervised learning, Supervised learning, trend discovery

How to use an instrumental variable?

By Prateek Sharma & Priya Chetty on May 4, 2018 2 Comments

Instrumental variable is a third variable that estimates causal relationships in the regression analysis when an endogenous variable is present. Instrumental variables are useful when the independent variable in the regression model correlates with the error term in the model.

detection in supervised learning, Supervised learning

How to perform LASSO regression test?

By Prateek Sharma & Priya Chetty on April 3, 2018 1 Comment

In statistics, to increase the prediction accuracy and interpret-ability of the model, LASSO (Least Absolute Shrinkage and Selection Operator) is extremely popular. It is a regression procedure that involves selection and regularisation and was developed in 1989. Lasso regression is an extension of linear regression that uses shrinkage. The lasso imposes a constraint on the sum of the absolute values of the model parameters. Here the sum has a specific constant as an upper bound.

exploratory model analysis, regressions in supervised learning, Supervised learning

How to apply missing data imputation?

By Prateek Sharma & Priya Chetty on March 9, 2018 1 Comment

Missing data is one of the most common problems in almost all statistical analyses. If the data is not available for all the observations of variables in the model, then it is a case of ‘missing data’.

estimation in supervised learning, Supervised learning, trend analysis

Markov chain and its use in solving real world problems

By Prateek Sharma & Priya Chetty on February 27, 2018 2 Comments

Markov chain is one of the most important tests in order to deal with independent trials processes. There are two major principal theorems for these processes. The first one is the ‘Law of Large Numbers’ and the second one is the ‘Central Limit Theorem’.

estimation in supervised learning, Supervised learning

How to perform bootstrap and jackknife analysis?

By Prateek Sharma & Priya Chetty on February 26, 2018 1 Comment

Bootstrap and jackknife are superficially similar statistical techniques that involve re-sampling the data. They are nonparametric and specific resampling techniques that can estimate standard errors and confidence intervals of a population parameter.

estimation in supervised learning, Supervised learning