Term
|
Definition
How we use the theory and data from economics, business, and social sciences, along with statistics to answer "how much" questions. |
|
|
Term
|
Definition
Science of collecting, organizing, presenting, analyzing, and interpreting numerical data to assist in making more effective decisions |
|
|
Term
|
Definition
Some characteristic of a population of a sample. (values vary across the population) |
|
|
Term
|
Definition
1. A random variable 2. A discrete Variable 3. A continuous variable |
|
|
Term
|
Definition
Variable whose value is unknown until it is observed |
|
|
Term
|
Definition
when a random variable can assume only a finite number of distinct values |
|
|
Term
|
Definition
when a random variable can take an infinite number of possible values |
|
|
Term
Levels of measurement of Data |
|
Definition
Descriptive and inferential statistics influenced by the levels of data measurement
1. Nominal Data (Age, brand, sex) 2. Ordinal Data (Satisfaction Rating) |
|
|
Term
|
Definition
1. A measure of asymmetry in the data 2. A measure of Peakedness in the data |
|
|
Term
1. Arithmetic Mean 2. Geometric mean 3. Variance |
|
Definition
1. Measure of location and shows the central value of the data 2. Uses multiplication instead of addition to summarize data 3. Variant or dispersion, spread around the mean |
|
|
Term
|
Definition
Measures relative variability in the data (Standard Dev/MeanX100) |
|
|
Term
Joint Probability Density Function |
|
Definition
Probability of two events occurring simultaneously |
|
|
Term
|
Definition
Ability to obtain the probability distribution of an individual random variable |
|
|
Term
|
Definition
Probability of the outcome X=2 given that Y=1 has occured |
|
|
Term
The Chi Squared Distribution |
|
Definition
Formed by summing the squared values of the independent standard normal variables (as degrees of freedom get larger, distribution becomes more symmetrical) (Deg of Freedom value determines the shape of the chi squared distribution) |
|
|
Term
|
Definition
Variable formed by the ratio of two independent Chi squared random variables that have been divided by the degrees of freedom |
|
|
Term
|
Definition
For normal distributions, empirical rule tells us the proportion of the data we can expect surrounding the mean. (For any set of observations, the minimum proportion of the values that lie within "K" standard deviations of the mean. K is any constant greater than 1) |
|
|
Term
|
Definition
Assuming the population was normally distributed, sample means would also be normally distributed |
|
|
Term
Best Linear Unbiased Estimator (BLUE) |
|
Definition
Estimator is unbiased, linear, has the smallest variance as "N" increases |
|
|
Term
|
Definition
Range of values that may contain the true population mean |
|
|
Term
Width of the Confidence interval depends on 3 things |
|
Definition
1. Sample Size - Larger "N" means narrower 2. Level of Confidence - higher level returns wider interval 3. Variability - larger value of O returns a wider interval |
|
|
Term
5 Components of Hypothesis Testing |
|
Definition
1. Null hypothesis denoted as Ho 2. Alternative hypothesis denoted as H1 3. A test Statistic 4. A level of significance/rejection region 5. Decision/Conclusion about hypothesis |
|
|
Term
|
Definition
Computed based on knowledge of the relevant probability distribution for the variable under consideration |
|
|
Term
Level of Significance/Rejection region |
|
Definition
rules set to decide whether you accept or reject the null |
|
|
Term
One sample mean comparison test |
|
Definition
Test whether the sample mean value equals some mean value |
|
|
Term
|
Definition
Rejection region if the decision rule the researcher adopts for rejecting or not rejecting the null (set %) |
|
|
Term
|
Definition
Compare the computed "t" statistic w/ a critical value from the t tables w/ the relevant degrees of freedom and a particular level of significance |
|
|
Term
|
Definition
Null hypothesis is true and we decide to reject it |
|
|
Term
|
Definition
The null hypothesis is false and we decide not to reject it |
|
|
Term
1. Time Series form 2. Cross Section Form 3. Panel Data Form |
|
Definition
1. data collected over discrete periods of time 2. data collected over different units at 1 point in time 3. data collected over sample units over period of time |
|
|
Term
1. Quantitative Data 2. Qualitative Data |
|
Definition
1. Outcome expressed as numbers 2. outcome expressed as "either or" situations |
|
|
Term
|
Definition
Statistical process for estimating the relationships between variables -Implies Y depends on X -Line of best fit has smaller sum of squared residuals |
|
|
Term
Assumptions in regression model |
|
Definition
1. Regression model is correctly specified 2. regression model is linear in parameters 3. X is not random 4. Values of X vary across observations 5. Zero covariance between the error term and the X values |
|
|
Term
|
Definition
Null hypothesis w/ multiple conjectures, expressed w/ more than one equal sign -Full model is called restricted model |
|
|
Term
3 Types of Log linear Models |
|
Definition
1. Log-log Model- all variables are in log 2. Log-Linear model- Y variable in log, X variable not in log 3. Linear-Log model- Y variable not in log, X variable in log |
|
|
Term
|
Definition
Let you estimate separate intercepts and slopes for different demographic groups |
|
|
Term
|
Definition
Another type of specification that frequently occurs in economics |
|
|
Term
1. Multicollinearity 2. perfect Multicollinearity |
|
Definition
1. When two or more explanatory variables are very highly correlated w/ each other 2. Cannot estimate all coefficients, individual coefficients have high standard errors |
|
|
Term
|
Definition
Involves observations on economic units of varying sizes -As size of unit increases, more certainty associated with outcomes -If errors don't have constant variance they are heteroskedastic |
|
|
Term
|
Definition
One way to detect heteroskedasticity is to estimate your model using OLS, predict the residuals and examine graphs |
|
|
Term
The White Test and Breusch Pagan Test |
|
Definition
Tests for continuous changes in variance |
|
|
Term
|
Definition
Test for changes in the variance across discrete subgroups |
|
|
Term
5 Steps of the White Test |
|
Definition
1. Regress Y variable by your explanators using OLS 2. Compute OLS residuals 3. Regress residual variable by a constant, all explanators, the squares of explanators, and all possible interactions between explanators 4. Compute R2 form step 3 5. Compare nR2 to the critical value from the Chi squared distribution with P degrees of freedom and decide if you should reject null |
|
|
Term
|
Definition
Econometrician selects which explanators to include |
|
|
Term
Steps in Goldfeld Quantd Test |
|
Definition
1. Divide the N of observations into H group sizes 2. Choose two groups and form the null hypothesis 3. Estimate model for group 1 4. Estimate model for group 2 5. Relate the two groups 6. Compare G to the Critical value for the F statistic and if G is greater then reject the null |
|
|
Term
Generalized Least Squares |
|
Definition
Want to transform data so that it is homoskedastic, then we can apply OLS -if nR2 is larger than the critical value we reject the null |
|
|
Term
|
Definition
Refers to a sequence of observations over time |
|
|
Term
1. Static Model 2. Dynamic Model |
|
Definition
1. A change in income at time (t) will cause a change in consumption at time (t) only 2. Past Effects the present, a change in consumption not only at time (t), but also in following periods |
|
|
Term
Dynamic Models-Autoregressive Model |
|
Definition
Model where a variable is related to its past values |
|
|
Term
|
Definition
Third way dynamics can enter a regression relationship is the error term -Variable is said to be autocorrelated or serial correlated if it exhibits correlation over time |
|
|
Term
Testing for Serial Correlation |
|
Definition
a test of serial correlation would be if the null hypoth. equaled 0 and the Alt hypoth. didn't equal 0 - If we reject the null then we are saying serial correlation exists |
|
|
Term
|
Definition
1. Estimate the model 2. Obtain the estimated residuals from the regression 3. Use the residual to compute the DW test statistic -DW Test cannot accommodate time series model with lagged Y variables |
|
|
Term
2 Tests used for serial correlation |
|
Definition
1. Durbin Watson - Focuses on first order serial correlation 2. Breusch Godfrey- takes into account correlations among disturbances lagged more than once |
|
|
Term
|
Definition
Variance of error terms are not constant. Includes economic units of varying sizes |
|
|
Term
Consequences of OLS estimator under Heteroskedasticity |
|
Definition
OLS is unbiased OLS is no longer efficient Other linear estimators will have smaller variance Estimated standard errors are incorrect Confidence intervals will be incorrect |
|
|
Term
|
Definition
Measures the relationship between the probability that an event occurs and its determinants |
|
|
Term
|
Definition
checks to see if the time series variables have a unit root and if its data is stationary |
|
|
Term
|
Definition
A stochastic trend that is a systematic pattern that is unpredictable |
|
|
Term
2 ways of measuring success in Logit Models |
|
Definition
Count R2 as the measure of predicting if the even will occur. Use Pseudo R2 as Log-Likelihood of the model |
|
|
Term
|
Definition
Data obtained is not always convenient for presentation in a table in a regression |
|
|
Term
Serial correlation Solutions |
|
Definition
Newey-West procedure to correct estimated standard errors Transform the data and use GLS if errors are AR(1). Use Feasible GLS and estimate ρ using residuals |
|
|
Term
|
Definition
convert the nonstationary series to its stationary counterpart by taking first differences |
|
|
Term
|
Definition
either convert the nonstationary series to its stationary counterpart, or estimate a regression relationship that includes a trend variable |
|
|
Term
|
Definition
any differences across economic agents can be captured by shifts in the intercept term of a standard OLS regression |
|
|
Term
two-way fixed effects model |
|
Definition
equivalent dummy variable approach if there are time specific effects |
|
|
Term
|
Definition
Using OLS to estimate a model in which the dependent variable is a binary variable |
|
|
Term
|
Definition
Degree of similarity between a given time series and a lagged version of itself over successive time intervals |
|
|