Non linear regression analysis in STATA and its interpretation

By Indra Giri & Priya Chetty on February 7, 2017

In the previous article on Linear Regression using STATA, a simple linear regression model was used to test the hypothesis. However the linear regression will not be effective if the relation between the dependent and independent variable is non linear. The non linear regression is used more in the real life as compared to the linear regression.

Log-transformation for non linear regression

The main purpose of logarithmic transformations is to handle situations when there is non-linear relationship between independent and dependent variables. Using the logarithmic model for one or more variables will make effective non-linear relationship and also preserve the linear model. These models also are used to analyse variables in order to transform a highly skewed model into a normal one. There are three different ways to incorporate log in the regression model.

Linear – log regression model

In the linear log regression analysis the independent variable is in log form whereas the dependent variable is kept normal. The linear log regression analysis can be written as:

eqn1

In this case the independent variable (X1) is transformed into log. One can transform the normal variable into log form using the following command:

gen lnX = log(X)  (if you want to create new variable with log transformation)  
replace X = log(X)     (if you want to replace the variable with its log form)

In case of linear log model the coefficient can be interpreted as follows:

If the independent variable is increased by 1% then the expected change in dependent variable is (β/100) units, holding all other things constant.

using log transformation of independent variable for non linear relatiohship
Regression results for level log regression

For example the linear log regression analysis was performed using the same data set in previous articles. Here price is dependent variable and lnmpg is the independent variable. The independent variable lnmpg is the log transformation of the mpg. The coefficient of lnmpg -5992.728 can be interpreted as follows:

With 1% increase in mpg the price of auto declines by (5992.72/100)  or 59.93 units, holding all other factors constant. 

Interpretation of all other components in the above table is similar to the linear regression, explained in previous article.

Log- linear regression model

As the name suggest in log linear regression model the dependent variable is in log form instead of the independent variable. The regression equation of the log-linear regression is as follows:

log level

The dependent variable can also be transformed into log form using the similar command given in the previous section.

The coefficient of independent variable in case of log-linear regression can be interpreted as:

If independent variable is increased by 1 unit, the dependent variable is expected to increase by (β*100)% holding all things constant.

log linear results
Regression results for log linear results

Using the same data set, the log linear regression has been performed. In this case lnprice is the dependent variable which is the log transformation of price and independent variable mpg is in its normal form.

The coefficient of mpg can be interpreted as:

If mpg is increased by one unit the price is expected to decrease by 3.32 %, holding all other factors constant.

Log-log model

In log-log model both the dependent and the independent variables are in log form. The regression equation of log-log regression is as follows:

log log

The coefficient of the log log model can be interpreted as:

If independent variable increase by 1% the dependent variable will increase/ decrease by β %, holding all other factors constant.

both the dependent and independent variables in log
Results for log log regression

In this case both the dependent variable and independent variable are in log form. The interpretation of coefficient  lnmpg is:

If mpg increase by 1% the price of auto decreases by 0.83%, holding all other factors constant.

The log log regression is also used to find the elasticity. This is because it shows one % change in dependent variable with change in 1% in the independent variable.

Discuss

1 thought on “Non linear regression analysis in STATA and its interpretation”