Omitted Variable Bias, Unobserved Heterogeneity, and Endogeneity

Sandhya Krishnan
2 min readOct 31, 2022

--

Photo by Andrea Piacquadio: https://www.pexels.com/photo/woman-holding-books-3768126/

Omitted variable bias occurs when a regression model omits one or more relevant variables, as a result, the variable gets reflected in the error term which leads to biased and counterintuitive estimates. It happens when a predictor is excluded and it gets reflected in the error term which leads to biased and counterintuitive estimates. It can be detected by checking the correlation between predictors.

Omitted Variable bias violates the linear regression assumption that “ Explanatory Variables must be exogenous “. That is error term should not contain any information about explanatory variables.

Omitted variables bias occurs as many individual characteristics are not observed.

For example, if I have a dataset to predict the Revenue of an Airline with predictor, Airline Name, Year, number of passengers traveling, Fuel price, and Load factor(the average capacity utilization of the fleet). But here the regression model does not consider inflight services and ground services provided by an airline to the customers which have an emotional impact on choosing the airline, management attitude, location, terminal of operation of the airline, airline policies, etc. These unconsidered factors are called unobserved heterogeneity.

When an explanatory variable is correlated with the error term it is referred to as endogeneity or the explanatory variable is endogenous.

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

--

--

No responses yet

Write a response