Term
Interpret interest rates as required rates of return, discount rates, or opportunity costs. |
|
Definition
An interest rate can be interpreted as the rate of return required in equilibrium for a particular investment, the discount rate for calculating the present value of future cash flows, or as the opportunity cost of consuming now, rather than saving and investing. |
|
|
Term
Explain an interest rate as the sum of a real risk-free rate, and premiums that compensate investors for bearing distinct types of risk. |
|
Definition
The real risk-free rate is a theoretical rate on a single-period loan when there is no expectation of inflation. Nominal risk-free rate = real risk-free rate + expected inflation rate.
Securities may have several risks, and each increases the required rate of return. These include default risk, liquidity risk, and maturity risk.
The required rate of return on a security = real risk-free rate + expected inflation + default risk premium + liquidity premium + maturity risk premium. |
|
|
Term
Calculate and interpret the effective annual rate, given the stated annual interest rate and the frequency of compounding. |
|
Definition
The effective annual rate when there are m compounding periods = [image] Each dollar invested will grow to [image] in one year. |
|
|
Term
Solve time value of money problems for different frequencies of compounding. |
|
Definition
For non-annual time value of money problems, divide the stated annual interest rate by the number of compounding periods per year, m, and multiply the number of years by the number of compounding periods per year. |
|
|
Term
Calculate and interpret the future value (FV) and present value (PV) of a single sum of money, an ordinary annuity, an annuity due, a perpetuity (PV only), and a series of unequal cash flows. |
|
Definition
Future value: FV = PV(1 + I/Y)N
Present value: PV = FV / (1 + I/Y)N
An annuity is a series of equal cash flows that occurs at evenly spaced intervals over time. Ordinary annuity cash flows occur at the end of each time period. Annuity due cash flows occur at the beginning of each time period.
Perpetuities are annuities with infinite lives (perpetual annuities):
[image]
The present (future) value of any series of cash flows is equal to the sum of the present (future) values of the individual cash flows. |
|
|
Term
Demonstrate the use of a time line in modeling and solving time value of money problems. |
|
Definition
Constructing a time line showing future cash flows will help in solving many types of TVM problems. Cash flows occur at the end of the period depicted on the time line. The end of one period is the same as the beginning of the next period. For example, a cash flow at the beginning of Year 3 appears at time t = 2 on the time line. |
|
|
Term
Calculate and interpret the net present value (NPV) and the internal rate of return (IRR) of an investment. |
|
Definition
The NPV is the present value of a project’s future cash flows, discounted at the firm’s cost of capital, less the project’s cost. IRR is the discount rate that makes the NPV = 0 (equates the PV of the expected future cash flows to the project’s initial cost).
The NPV rule is to accept a project if NPV > 0; the IRR rule is to accept a project if IRR > required rate of return. For an independent (single) project, these rules produce the exact same decision. |
|
|
Term
Contrast the NPV rule to the IRR rule, and identify problems associated with the IRR rule. |
|
Definition
For mutually exclusive projects, IRR rankings and NPV rankings may differ due to differences in project size or in the timing of the cash flows. Choose the project with the higher NPV as long as it is positive.
A project may have multiple IRRs or no IRR. |
|
|
Term
Calculate and interpret a holding period return (total return). |
|
Definition
The holding period return (or yield) is the total return for holding an investment over a certain period of time and can be calculated as:
[image]
|
|
|
Term
Calculate and compare the money-weighted and time-weighted rates of return of a portfolio and evaluate the performance of portfolios based on these measures. |
|
Definition
The money-weighted rate of return is the IRR calculated using periodic cash flows into and out of an account and is the discount rate that makes the PV of cash inflows equal to the PV of cash outflows.
The time-weighted rate of return measures compound growth. It is the rate at which $1 compounds over a specified performance horizon.
If funds are added to a portfolio just before a period of poor performance, the money-weighted return will be lower than the time-weighted return. If funds are added just prior to a period of high returns, the money-weighted return will be higher than the time-weighted return.
The time-weighted return is the preferred measure of a manager’s ability to select investments. If the manager controls the money flows into and out of an account, the money-weighted return is the more appropriate performance measure. |
|
|
Term
Calculate and interpret the bank discount yield, holding period yield, effective annual yield, and money market yield for U.S. Treasury bills and other money market instruments. |
|
Definition
Given a $1,000 T-bill with 100 days to maturity and a discount of $10 (price of $990):
- Bank discount yield = [image]
- Holding period yield = [image]
- Effective annual yield = [image]
- Money market yield = [image]
|
|
|
Term
Convert among holding period yields, money market yields, effective annual yields, and bond equivalent yields. |
|
Definition
Given a money market security with n days to maturity:
- Holding period yield = [image]
- Money market yield = holding period yield × [image]
- Effective annual yield = (1 + holding period yield)365/n− 1
- Holding period yield = (1 + effective annual yield)n/365− 1
- Bond equivalent yield = [(1 + effective annual yield)½ − 1] × 2
|
|
|
Term
Distinguish between descriptive statistics and inferential statistics, between a population and a sample, and among the types of measurement scales. |
|
Definition
Descriptive statistics summarize the characteristics of a data set; inferential statistics are used to make probabilistic statements about a population based on a sample.
A population includes all members of a specified group, while a sample is a subset of the population used to draw inferences about the population.
Data may be measured using different scales.
- Nominal scale—data is put into categories that have no particular order.
- Ordinal scale—data is put into categories that can be ordered with respect to some characteristic.
- Interval scale—differences in data values are meaningful, but ratios, such as twice as much or twice as large, are not meaningful.
- Ratio scale—ratios of values, such as twice as much or half as large, are meaningful, and zero represents the complete absence of the characteristic being measured.
|
|
|
Term
Define a parameter, a sample statistic, and a frequency distribution. |
|
Definition
Any measurable characteristic of a population is called a parameter.
A characteristic of a sample is given by a sample statistic.
A frequency distribution groups observations into classes, or intervals. An interval is a range of values. |
|
|
Term
Calculate and interpret relative frequencies and cumulative relative frequencies, given a frequency distribution. |
|
Definition
Relative frequency is the percentage of total observations falling within an interval.
Cumulative relative frequency for an interval is the sum of the relative frequencies for all values less than or equal to that interval’s maximum value. |
|
|
Term
Describe the properties of a data set presented as a histogram or a frequency polygon. |
|
Definition
A histogram is a bar chart of data that has been grouped into a frequency distribution. A frequency polygon plots the midpoint of each interval on the horizontal axis and the absolute frequency for that interval on the vertical axis, and connects the midpoints with straight lines. The advantage of histograms and frequency polygons is that we can quickly see where most of the observations lie. |
|
|
Term
Calculate and interpret measures of central tendency, including the population mean, sample mean, arithmetic mean, weighted average or mean, geometric mean, harmonic mean, median, and mode. |
|
Definition
The arithmetic mean is the average. [image]. Population mean and sample mean are examples of arithmetic means.
The geometric mean is used to find a compound growth rate. [image]
The weighted mean weights each value according to its influence. [image]
The harmonic mean can be used to find an average purchase price, such as dollars per share for equal periodic investments.
[image]
The median is the midpoint of a data set when the data is arranged from largest to smallest.
The mode of a data set is the value that occurs most frequently. |
|
|
Term
Calculate and interpret quartiles, quintiles, deciles, and percentiles. |
|
Definition
Quantile is the general term for a value at or below which a stated proportion of the data in a distribution lies. Examples of quantiles include:
- Quartiles—the distribution is divided into quarters.
- Quintile—the distribution is divided into fifths.
- Decile—the distribution is divided into tenths.
- Percentile—the distribution is divided into hundredths (percents).
|
|
|
Term
Calculate and interpret 1) a range and a mean absolute deviation and 2) the variance and standard deviation of a population and of a sample. |
|
Definition
The range is the difference between the largest and smallest values in a data set.
Mean absolute deviation (MAD) is the average of the absolute values of the deviations from the arithmetic mean:
[image]
Variance is defined as the mean of the squared deviations from the arithmetic mean or from the expected value of a distribution.
- Population variance = [image]where μ = population mean and N = size.
- Sample variance = [image] where X = sample mean and n = sample size.
Standard deviation is the positive square root of the variance and is frequently used as a quantitative measure of risk. |
|
|
Term
Calculate and interpret the proportion of observations falling within a specified number of standard deviations of the mean using Chebyshev's inequality. |
|
Definition
Chebyshev’s inequality states that the proportion of the observations within k standard deviations of the mean is at least 1 − 1/k2 for all k > 1. It states that for any distribution, at least:
36% of observations lie within +/− 1.25 standard deviations of the mean. 56% of observations lie within +/− 1.5 standard deviations of the mean. 75% of observations lie within +/− 2 standard deviations of the mean. 89% of observations lie within +/− 3 standard deviations of the mean. 94% of observations lie within +/− 4 standard deviations of the mean. |
|
|
Term
Calculate and interpret the coefficient of variation and the Sharpe ratio |
|
Definition
The coefficient of variation for sample data, CV = [image], is the ratio of the standard deviation of the sample to its mean (expected value of the underlying distribution).
The Sharpe ratio measures excess return per unit of risk:
[image] |
|
|
Term
Explain skewness and the meaning of a positively or negatively skewed return distribution. |
|
Definition
Skewness describes the degree to which a distribution is not symmetric about its mean. A right-skewed distribution has positive skewness. A left-skewed distribution has negative skewness.
Sample skew with an absolute value greater than 0.5 is considered significantly different from zero. |
|
|
Term
Describe the relative locations of the mean, median, and mode for a unimodal, nonsymmetrical distribution. |
|
Definition
For a positively skewed, unimodal distribution, the mean is greater than the median, which is greater than the mode.
For a negatively skewed, unimodal distribution, the mean is less than the median, which is less than the mode. |
|
|
Term
Explain measures of sample skewness and kurtosis. |
|
Definition
Kurtosis measures the peakedness of a distribution and the probability of extreme outcomes (thickness of tails).
- Excess kurtosis is measured relative to a normal distribution, which has a kurtosis of 3.
- Positive values of excess kurtosis indicate a distribution that is leptokurtic (fat tails, more peaked) so that the probability of extreme outcomes is greater than for a normal distribution.
- Negative values of excess kurtosis indicate a platykurtic distribution (thin tails, less peaked).
- Excess kurtosis with an absolute value greater than 1 is considered significant.
|
|
|
Term
Explain the use of arithmetic and geometric means when analyzing investment returns. |
|
Definition
The arithmetic mean return is appropriate for forecasting single period returns in future periods, while the geometric mean is appropriate for forecasting future compound returns over multiple periods. |
|
|
Term
Define a random variable, an outcome, an event, mutually exclusive events, and exhaustive events. |
|
Definition
A random variable is an uncertain value determined by chance.
An outcome is the realization of a random variable.
An event is a set of one or more outcomes. Two events that cannot both occur are termed "mutually exclusive" and a set of events that includes all possible outcomes is an "exhaustive" set of events. |
|
|
Term
State the two defining properties of probability and distinguish among empirical, subjective, and a priori probabilities. |
|
Definition
The two properties of probability are:
- The sum of the probabilities of all possible mutually exclusive events is 1.
- The probability of any event cannot be greater than 1 or less than 0.
A priori probability measures predetermined probabilities based on well-defined inputs; empirical probability measures probability from observations or experiments; and subjective probability is an informed guess. |
|
|
Term
State the probability of an event in terms of odds for and against the event. |
|
Definition
Probabilities can be stated as odds that an event will or will not occur. If the probability of an event is A out of B trials (A/B), the "odds for" are A to (B − A) and the "odds against" are (B − A) to A. |
|
|
Term
Distinguish between unconditional and conditional probabilities. |
|
Definition
Unconditional probability (marginal probability) is the probability of an event occurring.
Conditional probability, P(A | B), is the probability of an event A occurring given that event B has occurred. |
|
|
Term
Explain the multiplication, addition, and total probability rules. |
|
Definition
The multiplication rule of probability is used to determine the joint probability of two events:
P(AB) = P(A | B) × P(B)
The addition rule of probability is used to determine the probability that at least one of two events will occur:
P(A or B) = P(A) + P(B) − P(AB)
The total probability rule is used to determine the unconditional probability of an event, given conditional probabilities:
P(A) = P(A | B1)P(B1) + P(A | B2)P(B2) +...+ P(A | BN)P(BN)
where B1, B2,...BN is a mutually exclusive and exhaustive set of outcomes.
|
|
|
Term
Calculate and interpret 1) the joint probability of two events, 2) the probability that at least one of two events will occur, given the probability of each and the joint probability of the two events, and 3) a joint probability of any number of independent events. |
|
Definition
The joint probability of two events, P(AB), is the probability that they will both occur. P(AB) = P(A | B) × P(B). For independent events, P(A | B) = P(A), so that P(AB) = P(A) × P(B).
The probability that at least one of two events will occur is P(A or B) = P(A) + P(B) − P(AB). For mutually exclusive events, P(A or B) = P(A) + P(B), since P(AB) = 0.
The joint probability of any number of independent events is the product of their individual probabilities. |
|
|
Term
Distinguish between dependent and independent events. |
|
Definition
The probability of an independent event is unaffected by the occurrence of other events, but the probability of a dependent event is changed by the occurrence of another event. Events A and B are independent if and only if:
P(A | B) = P(A), or equivalently, P(B | A) = P(B)
|
|
|
Term
Calculate and interpret an unconditional probability using the total probability rule. |
|
Definition
Using the total probability rule, the unconditional probability of A is the probability weighted sum of the conditional probabilities:
[image]
where Bi is a set of mutually exclusive and exhaustive events.
|
|
|
Term
Explain the use of conditional expectation in investment applications. |
|
Definition
Conditional expected values depend on the outcome of some other event.
Forecasts of expected values for a stock’s return, earnings, and dividends can be refined, using conditional expected values, when new information arrives that affects the expected outcome. |
|
|
Term
Explain the use of a tree diagram to represent an investment problem. |
|
Definition
A tree diagram shows the probabilities of two events and the conditional probabilities of two subsequent events.
[image] |
|
|
Term
Calculate and interpret covariance and correlation. |
|
Definition
Covariance measures the extent to which two random variables tend to be above and below their respective means for each joint realization. It can be calculated as:
[image]
Correlation is a standardized measure of association between two random variables; it ranges in value from –1 to +1 and is equal to [image] |
|
|
Term
Calculate and interpret the expected value, variance, and standard deviation of a random variable and of returns on a portfolio. |
|
Definition
The expected value of a random variable, E(X), equals [image]
The variance of a random variable, Var(X), equals [image]
Standard deviation: [image]
The expected returns and variance of a 2-asset portfolio are given by:
[image] |
|
|
Term
Calculate and interpret covariance given a joint probability function. |
|
Definition
Given the joint probabilities for Xi and Yi, i.e., P(XiYi), the covariance is calculated as:
[image] |
|
|
Term
Calculate and interpret an updated probability using Bayes' formula. |
|
Definition
Bayes’ formula for updating probabilities based on the occurrence of an event O is:
[image]
Equivalently, based on the tree diagram below, [image]
[image] |
|
|
Term
Identify the most appropriate method to solve a particular counting problem, and solve counting problems using the factorial, combination, and permutation notations. |
|
Definition
The number of ways to order n objects is n factorial, n! = n × (n − 1) × (n − 2) × ... × 1.
There are [image] ways to assign k different labels to n items, where ni is the number of items with the label i.
The number of ways to choose a subset of size r from a set of size n when order doesn’t matter is [image] combinations; when order matters, there are [image] permutations. |
|
|
Term
Define a probability distribution and distinguish between discrete and continuous random variables and their probability functions. |
|
Definition
A probability distribution lists all the possible outcomes of an experiment, along with their associated probabilities.
A discrete random variable has positive probabilities associated with a finite number of outcomes.
A continuous random variable has positive probabilities associated with a range of outcome values—the probability of any single value is zero. |
|
|
Term
Describe the set of possible outcomes of a specified discrete random variable. |
|
Definition
The set of possible outcomes of a specific discrete random variable is a finite set of values. An example is the number of days last week on which the value of a particular portfolio increased. For a discrete distribution, p(x) = 0 when x cannot occur, or p(x) > 0 if it can. |
|
|
Term
Interpret a cumulative distribution function. |
|
Definition
A cumulative distribution function (cdf) gives the probability that a random variable will be less than or equal to specific values. The cumulative distribution function for a random variable X may be expressed as F(x) = P(X ≤ x). |
|
|
Term
Calculate and interpret probabilities for a random variable, given its cumulative distribution function. |
|
Definition
Given the cumulative distribution function for a random variable, the probability that an outcome will be less than or equal to a specific value is represented by the area under the probability distribution to the left of that value. |
|
|
Term
Define a discrete uniform random variable, a Bernoulli random variable, and a binomial random variable. |
|
Definition
A discrete uniform distribution is one where there are n discrete, equally likely outcomes.
The binomial distribution is a probability distribution for a binomial (discrete) random variable that has two possible outcomes. |
|
|
Term
Calculate and interpret probabilities given the discrete uniform and the binomial distribution functions. |
|
Definition
For a discrete uniform distribution with n possible outcomes, the probability for each outcome equals 1/n.
For a binomial distribution, if the probability of success is p, the probability of x successes in n trials is:
p(x) = P(X = x) = [image] |
|
|
Term
Construct a binomial tree to describe stock price movement. |
|
Definition
A binomial tree illustrates the probabilities of all the possible values that a variable (such as a stock price) can take on, given the probability of an up-move and the magnitude of an up-move (the up-move factor).
[image]
With an initial stock price S = 50, U = 1.01, D = [image], and prob(U) = 0.6, the possible stock prices after two periods are:
[image] |
|
|
Term
Calculate and interpret tracking error. |
|
Definition
Tracking error is calculated as the total return on a portfolio minus the total return on a benchmark or index portfolio. |
|
|
Term
Define the continuous uniform distribution and calculate and interpret probabilities, given a continuous uniform distribution. |
|
Definition
A continuous uniform distribution is one where the probability of X occurring in a possible range is the length of the range relative to the total of all possible values. Letting a and b be the lower and upper limit of the uniform distribution, respectively, then for:
[image] |
|
|
Term
Explain the key properties of the normal distribution. |
|
Definition
The normal probability distribution and normal curve have the following characteristics:
- The normal curve is symmetrical and bell-shaped with a single peak at the exact center of the distribution.
- Mean = median = mode, and all are in the exact center of the distribution.
- The normal distribution can be completely defined by its mean and standard deviation because the skew is always zero and kurtosis is always 3.
|
|
|
Term
Distinguish between a univariate and a multivariate distribution, and explain the role of correlation in the multivariate normal distribution. |
|
Definition
Multivariate distributions describe the probabilities for more than one random variable, whereas a univariate distribution is for a single random variable.
The correlation(s) of a multivariate distribution describes the relation between the outcomes of its variables relative to their expected values. |
|
|
Term
Determine the probability that a normally distributed random variable lies inside a given interval. |
|
Definition
A confidence interval is a range within which we have a given level of confidence of finding a point estimate (e.g., the 90% confidence interval for X is X − 1.65s to X+ 1.65s).
Confidence intervals for any normally distributed random variable are:
- 90%: μ ± 1.65 standard deviations.
- 95%: μ ± 1.96 standard deviations.
- 99%: μ ± 2.58 standard deviations.
The probability that a normally distributed random variable X will be within A standard deviations of its mean, μ, [i.e., P(μ − Aσ ≤ X ≤ μ + Aσ)], is calculated as two times [1 − the cumulative left–hand tail probability, F(−A)], or two times {1 − the right-hand tail probability, [1 − F(A)]}, where F(A) is the cumulative standard normal probability of A. |
|
|
Term
Define the standard normal distribution, explain how to standardize a random variable, and calculate and interpret probabilities using the standard normal distribution. |
|
Definition
The standard normal probability distribution has a mean of 0 and a standard deviation of 1.
A normally distributed random variable X can be standardized as Z =[image] and Z will be normally distributed with mean = 0 and standard deviation 1.
The z-table is used to find the probability that X will be less than or equal to a given value.
- P(X < x) = F(x) = F[image]= F(z), which is found in the standard normal probability table.
- P(X > x) = 1 – P(X < x) = 1 – F(z)
|
|
|
Term
Define shortfall risk, calculate the safety-first ratio, and select an optimal portfolio using Roy's safety-first criterion. |
|
Definition
The safety-first ratio for portfolio P, based on a target return RT, is:
[image]
Shortfall risk is the probability that a portfolio’s value (or return) will fall below a specific value over a given period of time. Greater safety-first ratios are preferred and indicate a smaller shortfall probability. Roy’s safety-first criterion states that the optimal portfolio minimizes shortfall risk. |
|
|
Term
Explain the relationship between normal and lognormal distributions and why the lognormal distribution is used to model asset prices. |
|
Definition
If x is normally distributed, ex follows a lognormal distribution. A lognormal distribution is often used to model asset prices, since a lognormal random variable cannot be negative and can take on any positive value. |
|
|
Term
Distinguish between discretely and continuously compounded rates of return, and calculate and interpret a continuously compounded rate of return, given a specific holding period return. |
|
Definition
As we decrease the length of discrete compounding periods (e.g., from quarterly to monthly) the effective annual rate increases. As the length of the compounding period in discrete compounding gets shorter and shorter, the compounding becomes continuous, where the effective annual rate = ei – 1.
For a holding period return (HPR) over any period, the equivalent continuously compounded rate over the period is ln(1 + HPR). |
|
|
Term
Explain Monte Carlo simulation and describe its major applications and limitations. |
|
Definition
Monte Carlo simulation uses randomly generated values for risk factors, based on their assumed distributions, to produce a distribution of possible security values. Its limitations are that it is fairly complex and will provide answers that are no better than the assumptions used. |
|
|
Term
Compare Monte Carlo simulation and historical simulation. |
|
Definition
Historical simulation uses randomly selected past changes in risk factors to generate a distribution of possible security values, in contrast to Monte Carlo simulation, which uses randomly generated values. A limitation of historical simulation is that it cannot consider the effects of significant events that did not occur in the sample period. |
|
|
Term
Define simple random sampling and a sampling distribution. |
|
Definition
Simple random sampling is a method of selecting a sample in such a way that each item or person in the population being studied has the same probability of being included in the sample.
A sampling distribution is the distribution of all values that a sample statistic can take on when computed from samples of identical size randomly drawn from the same population. |
|
|
Term
|
Definition
Sampling error is the difference between a sample statistic and its corresponding population parameter (e.g., the sample mean minus the population mean). |
|
|
Term
Distinguish between simple random and stratified random sampling. |
|
Definition
Stratified random sampling involves randomly selecting samples proportionally from subgroups that are formed based on one or more distinguishing characteristics, so that the sample will have the same distribution of these characteristics as the overall population. |
|
|
Term
Distinguish between time-series and cross-sectional data. |
|
Definition
Time-series data consists of observations taken at specific and equally spaced points in time.
Cross-sectional data consists of observations taken at a single point in time. |
|
|
Term
Explain the central limit theorem and its importance. |
|
Definition
The central limit theorem states that for a population with a mean μ and a finite variance σ2, the sampling distribution of the sample mean of all possible samples of size n (for n ≥ 30) will be approximately normally distributed with a mean equal to μ and a variance equal to σ2 / n. |
|
|
Term
Calculate and interpret the standard error of the sample mean. |
|
Definition
The standard error of the sample mean is the standard deviation of the distribution of the sample means and is calculated as [image], where σ, the population standard deviation, is known, and as[image], where s, the sample standard deviation, is used because the population standard deviation is unknown. |
|
|
Term
Identify and describe desirable properties of an estimator. |
|
Definition
Desirable statistical properties of an estimator include unbiasedness (sign of estimation error is random), efficiency (lower sampling error than any other unbiased estimator), and consistency (variance of sampling error decreases with sample size). |
|
|
Term
Distinguish between a point estimate and a confidence interval estimate of a population parameter. |
|
Definition
Point estimates are single value estimates of population parameters. An estimator is a formula used to compute a point estimate.
Confidence intervals are ranges of values, within which the actual value of the parameter will lie with a given probability.
confidence interval = point estimate ± (reliability factor × standard error)
The reliability factor is a number that depends on the sampling distribution of the point estimate and the probability that the point estimate falls in the confidence interval. |
|
|
Term
Describe the properties of Student's t-distribution and calculate and interpret its degrees of freedom. |
|
Definition
The t-distribution is similar, but not identical, to the normal distribution in shape—it is defined by the degrees of freedom and has fatter tails compared to the normal distribution.
Degrees of freedom for the t-distribution are equal to n − 1. Student’s t–distribution is closer to the normal distribution when df is greater, and confidence intervals are narrower when df is greater. |
|
|
Term
Calculate and interpret a confidence interval for a population mean, given a normal distribution with 1) a known population variance, 2) an unknown population variance, or 3) an unknown variance and a large sample size. |
|
Definition
For a normally distributed population, a confidence interval for its mean can be constructed using a z-statistic when variance is known, and a t-statistic when the variance is unknown. The z-statistic is acceptable in the case of a normal population with an unknown variance if the sample size is large (30+).
In general, we have:
- [image]when the variance is known, and
- [image] when the variance is unknown and the sample standard deviation must be used.
|
|
|
Term
Describe the issues regarding selection of the appropriate sample size, data-mining bias, sample selection bias, survivorship bias, look-ahead bias, and time-period bias. |
|
Definition
Increasing the sample size will generally improve parameter estimates and narrow confidence intervals. The cost of more data must be weighed against these benefits, and adding data that is not generated by the same distribution will not necessarily improve accuracy or narrow confidence intervals.
Potential mistakes in the sampling method can bias results.
These biases include data mining (significant relationships that have occurred by chance), sample selection bias (selection is non-random), look-ahead bias (basing the test at a point in time on data not available at that time), survivorship bias (using only surviving mutual funds, hedge funds, etc.), and time-period bias (the relation does not hold over other time periods). |
|
|
Term
Define a hypothesis, describe the steps of hypothesis testing, describe and interpret the choice of the null and alternative hypotheses, and distinguish between one-tailed and two-tailed tests of hypotheses. |
|
Definition
The hypothesis testing process requires a statement of a null and an alternative hypothesis, the selection of the appropriate test statistic, specification of the significance level, a decision rule, the calculation of a sample statistic, a decision regarding the hypotheses based on the test, and a decision based on the test results.
The null hypothesis is what the researcher wants to reject.
The alternative hypothesis is what the researcher wants to prove, and it is accepted when the null hypothesis is rejected.
A two-tailed test results from a two-sided alternative hypothesis (e.g., Ha: μ ≠ μ0). A one-tailed test results from a one-sided alternative hypothesis (e.g., Ha: μ > μ0, or Ha: μ < μ0). |
|
|
Term
Explain a test statistic, Type I and Type II errors, a significance level, and how significance levels are used in hypothesis testing. |
|
Definition
The test statistic is the value that a decision about a hypothesis will be based on. For a test about the value of the mean of a distribution:
[image]
A Type I error is the rejection of the null hypothesis when it is actually true, while a Type II error is the failure to reject the null hypothesis when it is actually false.
The significance level can be interpreted as the probability that a test statistic will reject the null hypothesis by chance when it is actually true (i.e., the probability of a Type I error).
A significance level must be specified to select the critical values for the test. |
|
|
Term
Explain a decision rule, the power of a test, and the relation between confidence intervals and hypothesis tests. |
|
Definition
Hypothesis testing compares a computed test statistic to a critical value at a stated level of significance, which is the decision rule for the test.
The power of a test is the probability of rejecting the null when it is false. The power of a test = 1 − P(Type II error).
A hypothesis about a population parameter is rejected when the sample statistic lies outside a confidence interval around the hypothesized value for the chosen level of significance. |
|
|
Term
Distinguish between a statistical result and an economically meaningful result. |
|
Definition
Statistical significance does not necessarily imply economic significance. Even though a test statistic is significant statistically, the size of the gains to a strategy to exploit a statistically significant result may be absolutely small or simply not great enough to outweigh transactions costs. |
|
|
Term
Explain and interpret the p-value as it relates to hypothesis testing. |
|
Definition
The p-value for a hypothesis test is the smallest significance level for which the hypothesis would be rejected. For example, a p-value of 7% means the hypothesis can be rejected at the 10% significance level but cannot be rejected at the 5% significance level. |
|
|
Term
Identify the appropriate test statistic and interpret the results for a hypothesis test concerning the population mean of both large and small samples when the population is normally or approximately distributed and the variance is 1) known or 2) unknown. |
|
Definition
With unknown population variance, the t-statistic is used for tests about the mean of a normally distributed population: [image]. If the population variance is known, the appropriate test statistic is [image]for tests about the mean of a population. |
|
|
Term
Identify the appropriate test statistic and interpret the results for a hypothesis test concerning the equality of the population means of two at least approximately normally distributed populations, based on independent random samples with 1) equal or 2) unequal assumed variances. |
|
Definition
For two independent samples from two normally distributed populations, the difference in means can be tested with a t-statistic. When the two population variances are assumed to be equal, the denominator is based on the variance of the pooled samples, but when sample variances are assumed to be unequal, the denominator is based on a combination of the two samples’ variances. |
|
|
Term
Identify the appropriate test statistic and interpret the results for a hypothesis test concerning the mean difference of two normally distributed populations. |
|
Definition
A paired comparisons test is concerned with the mean of the differences between the paired observations of two dependent, normally distributed samples. A t-statistic, [image], where [image], and d is the average difference of the n paired observations, is used to test whether the means of two dependent normal variables are equal. Values outside the critical t-values lead us to reject equality. |
|
|
Term
Identify the appropriate test statistic and interpret the results for a hypothesis test concerning 1) the variance of a normally distributed population, and 2) the equality of the variances of two normally distributed populations based on two independent random samples. |
|
Definition
The test of a hypothesis about the population variance for a normally distributed population uses a chi-square test statistic:[image], where n is the sample size, s2 is the sample variance, and [image] is the hypothesized value for the population variance. Degrees of freedom are n − 1.
The test comparing two variances based on independent samples from two normally distributed populations uses an F-distributed test statistic: [image], where [image] is the variance of the first sample and [image] is the (smaller) variance of the second sample. |
|
|
Term
Distinguish between parametric and nonparametric tests and describe the situations in which the use of nonparametric tests may be appropriate. |
|
Definition
Parametric tests, like the t-test, F-test, and chi-square tests, make assumptions regarding the distribution of the population from which samples are drawn. Nonparametric tests either do not consider a particular population parameter or have few assumptions about the sampled population. Nonparametric tests are used when the assumptions of parametric tests can’t be supported or when the data are not suitable for parametric tests. |
|
|
Term
Explain the principles of technical analysis, its applications, and its underlying assumptions. |
|
Definition
Underlying all of technical analysis are the following assumptions:
- Prices are determined by investor supply and demand for assets.
- Supply and demand are driven by both rational and irrational behavior.
- While the causes of changes in supply and demand are difficult to determine, the actual shifts in supply and demand can be observed in market prices.
- Prices move in trends and exhibit patterns that can be identified and tend to repeat themselves over time.
|
|
|
Term
Describe the construction of and interpret different types of technical analysis charts. |
|
Definition
Technical analysts use charts to identify trends and patterns in prices over time. A line chart is a continuous line that connects closing prices for each period. Bar charts and candlestick charts show the open, high, low, and close for each period. Volume charts often accompany price charts. Point-and-figure charts indicate significant changes in the direction of price trends. |
|
|
Term
Explain the uses of trend, support, and resistance lines, and change in polarity. |
|
Definition
In an uptrend, prices are reaching higher highs and higher lows. An uptrend line is drawn below the prices on a chart by connecting the increasing lows with a straight line.
In a downtrend, prices are reaching lower lows and lower highs. A downtrend line is drawn above the prices on a chart by connecting the decreasing highs with a straight line.
Support and resistance are price levels or ranges at which buying or selling pressure is expected to limit price movement. Commonly identified support and resistance levels include trendlines and previous high and low prices.
The change in polarity principle is the idea that breached resistance levels become support levels and breached support levels become resistance levels. |
|
|
Term
Identify and interpret common chart patterns. |
|
Definition
Technical analysts look for recurring patterns in price charts.
Head-and-shoulders patterns, double tops, and triple tops are thought to be reversal patterns at the ends of uptrends. Inverse head-and-shoulders patterns, double bottoms, and triple bottoms are thought to be reversal patterns at the ends of downtrends.
Triangles, rectangles, flags, and pennants are thought to be continuation patterns, which indicate that the trend in which they appear is likely to go further in the same direction. |
|
|
Term
Describe common technical analysis indicators: price-based, momentum oscillators, sentiment, and flow of funds. |
|
Definition
Price-based indicators include moving averages, Bollinger bands, and momentum oscillators such as the Relative Strength Index, moving average convergence/divergence lines, rate-of-change oscillators, and stochastic oscillators. These indicators are commonly used to identify changes in price trends, as well as “overbought” markets that are likely to decrease in the near term and “oversold” markets that are likely to increase in the near term.
Sentiment indicators include opinion polls, the put/call ratio, the volatility index, margin debt, and the short interest ratio. Margin debt, the Arms index, the mutual fund cash position, new equity issuance, and secondary offerings are flow-of-funds indicators. Technical analysts often interpret these indicators from a “contrarian” perspective, becoming bearish when investor sentiment is too positive and bullish when investor sentiment is too negative. |
|
|
Term
Explain the use of cycles by technical analysts. |
|
Definition
Some technical analysts believe market prices move in cycles. Examples include the Kondratieff wave, which is a 54-year cycle, and a 4-year cycle related to U.S. presidential elections. |
|
|
Term
Describe the key tenets of Elliott Wave Theory and the importance of Fibonacci numbers. |
|
Definition
Elliott wave theory suggests that prices exhibit a pattern of five waves in the direction of a trend and three waves counter to the trend. Technical analysts who employ Elliott wave theory frequently use ratios of the numbers in the Fibonacci sequence to estimate price targets and identify potential support and resistance levels. |
|
|
Term
Describe intermarket analysis as it relates to technical analysis and asset allocation. |
|
Definition
Intermarket analysis examines the relationships among various asset markets such as stocks, bonds, commodities, and currencies. In the asset allocation process, relative strength analysis can be used to identify attractive asset classes and attractive sectors within these classes. |
|
|