Term 
        
        | What are the two big categories of metrics of data? |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | What are the sub-categories of discrete? |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
         | 
        
        
        Definition 
        
        | cannot be ranked. e.g., religion, political party affiliation, etc. |  
          | 
        
        
         | 
        
        
        Term 
         | 
        
        
        Definition 
        
        relates information about which cases have similar traits, but allows ranked judgments |  
          | 
        
        
         | 
        
        
        Term 
         | 
        
        
        Definition 
        
        | can be ranked and have equal unit distance across their entire range(e.g. income in dollars- each increase by one dollar means the same thing at every point along the range) |  
          | 
        
        
         | 
        
        
        Term 
        
        | What are two common graphs for categorical variables? |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | What is the only measure of central tendency for categorical variables? |  
          | 
        
        
        Definition 
        
        | The mode- the most frequently occurring category |  
          | 
        
        
         | 
        
        
        Term 
        
        | What are the two major descriptive goals with continuous variables? |  
          | 
        
        
        Definition 
        
        | central tendency, dispersion (spread) |  
          | 
        
        
         | 
        
        
        Term 
        
        | What are the two broad classes of descriptive statistics? |  
          | 
        
        
        Definition 
        
        | rank statistics, moment-based statistics |  
          | 
        
        
         | 
        
        
        Term 
        
        | What is the measure of central tendency for rank-based statistics? |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | What is the measure of dispersion for rank-based statistics? |  
          | 
        
        
        Definition 
        
        | The IQR: interquartile range |  
          | 
        
        
         | 
        
        
        Term 
        
        | What is the measure of central tendency for moment-based statistics? |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | Which type of data, rank or moment-based, is more sensitive to outliers? |  
          | 
        
        
        Definition 
        
        | Moment-based, because the mean only needs a little bit of an outlier to be thrown off, while the median doesn't break down with outliers |  
          | 
        
        
         | 
        
        
        Term 
        
        | What are the measures of spread for moment-based statistics? |  
          | 
        
        
        Definition 
        
        | The variance and the standard deviation |  
          | 
        
        
         | 
        
        
        Term 
        
        | What would happen to the variance if we had no variation in y at all? |  
          | 
        
        
        Definition 
        
        | Variance would equal zero- no spread around the mean. |  
          | 
        
        
         | 
        
        
        Term 
        
        | What happens to the variance as data are spread further from the mean? |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | What is the standard deviation? |  
          | 
        
        
        Definition 
        
        the average dierence between values of y and the mean of y |  
          | 
        
        
         | 
        
        
        Term 
        
        | What are the two key dimensions of dependent variables? |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | What are two types of research designs (hint: think spatial and temporal) |  
          | 
        
        
        Definition 
        
        | Cross-sectional, time series |  
          | 
        
        
         | 
        
        
        Term 
        
        | What is cross-sectional data design? |  
          | 
        
        
        Definition 
        
        | Looks at multiple units at one time |  
          | 
        
        
         | 
        
        
        Term 
        
        | What is temporal data design? |  
          | 
        
        
        Definition 
        
        | Looks at one unit over time |  
          | 
        
        
         | 
        
        
        Term 
        
        | Why might causality be clearer in a time series? |  
          | 
        
        
        Definition 
        
        | we can examine a phenomenon before and after some independent variable changes |  
          | 
        
        
         | 
        
        
        Term 
        
        | What kind of causal theories are common with the physical sciences? |  
          | 
        
        
        Definition 
        
        Deterministic theories: an increase in X by a certain amount will *always* cause an increase in Y of a certain amount |  
          | 
        
        
         | 
        
        
        Term 
        
        | What kind of causal theories are more common with humans? |  
          | 
        
        
        Definition 
        
        | Probabilistic: Increases in X cause increases in Y on average |  
          | 
        
        
         | 
        
        
        Term 
        
        | What is an ecological fallacy? |  
          | 
        
        
        Definition 
        
        inferring individual behavior from population averages |  
          | 
        
        
         | 
        
        
        Term 
        
        | What is the first hurdle? |  
          | 
        
        
        Definition 
        
        | Is there something connecting x and y- does it make sense that they might cause one another in a traceable way? |  
          | 
        
        
         | 
        
        
        Term 
        
        | What is the second hurdle? |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | What is the third hurdle? |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | What is the fourth hurdle? |  
          | 
        
        
        Definition 
        
        Can we eliminate any Z's that might relate to X and Y and cause Y ? |  
          | 
        
        
         | 
        
        
        Term 
        
        | Finish this phrase: "While correlation does not mean causation," |  
          | 
        
        
        Definition 
        
        | Correlation is necessary for causation to exist. |  
          | 
        
        
         | 
        
        
        Term 
         | 
        
        
        Definition 
        
        a research design in which the researcher both controls and randomly assigns values of the treatment (key independent variable) to participants |  
          | 
        
        
         | 
        
        
        Term 
        
        | What are the two key components of experiments? |  
          | 
        
        
        Definition 
        
        | control, random assignment |  
          | 
        
        
         | 
        
        
        Term 
         | 
        
        
        Definition 
        
        | The value(s) of the treatment (key independent variable) X are determined by the researcher and not by the participants or nature. |  
          | 
        
        
         | 
        
        
        Term 
        
        | What two groups comprise the experimental group? |  
          | 
        
        
        Definition 
        
        | Treatment group, control group |  
          | 
        
        
         | 
        
        
        Term 
        
        | What does randomization control for? |  
          | 
        
        
        Definition 
        
        | for every possible Z regardless of whether we can even list the possible Zs |  
          | 
        
        
         | 
        
        
        Term 
        
        | Experiments have good ____________ validity but bad ____________ validity. |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | What is internal validity? |  
          | 
        
        
        Definition 
        
        | The extent to which we can accurately state that the observed independent variable produced the observed effect |  
          | 
        
        
         | 
        
        
        Term 
         | 
        
        
        Definition 
        
        | relates to the generalizability of your findings |  
          | 
        
        
         | 
        
        
        Term 
        
        | What can help with the idea that experiments are low on external validity? |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | What is an observational study? |  
          | 
        
        
        Definition 
        
        | one in which the researcher does not have control over the quantities of the independent variable (or any variable) |  
          | 
        
        
         | 
        
        
        Term 
        
        | What is the population of interest in an experiment? |  
          | 
        
        
        Definition 
        
        | the set of units (people, countries, etc. . . ) that the researcher's theory relates to |  
          | 
        
        
         | 
        
        
        Term 
        
        | What are the two ways to get observational data? |  
          | 
        
        
        Definition 
        
        | Measure the entire population of interest, measure a sample of the population of interest |  
          | 
        
        
         | 
        
        
        Term 
        
        | What are the two types of data in an experiment? |  
          | 
        
        
        Definition 
        
        | Population data, sample data |  
          | 
        
        
         | 
        
        
        Term 
        
        | What is population data in an experiment? |  
          | 
        
        
        Definition 
        
        | data about every possible relevant case |  
          | 
        
        
         | 
        
        
        Term 
        
        | What is sample data in an experiment? |  
          | 
        
        
        Definition 
        
        | a dataset drawn from a subset of cases of some underlying population |  
          | 
        
        
         | 
        
        
        Term 
         | 
        
        
        Definition 
        
        Dened by the characteristic that every member of the population of interest has an equal probability of being selected for participation in the study. |  
          | 
        
        
         | 
        
        
        Term 
        
        | What is a convenience sample? |  
          | 
        
        
        Definition 
        
        | Use participants that are readily at hand |  
          | 
        
        
         | 
        
        
        Term 
        
        | What is the difference between random assignment and random sampling? |  
          | 
        
        
        Definition 
        
        Random assignment refers to the decision about whom to give a potential treatment. We randomly assign people to the treatment group from the larger experimental group.
  Random sampling refers to drawing at random of a sample to study. Usually done (or attempted) in survey research. |  
          | 
        
        
         | 
        
        
        Term 
        
        | Without experimental control and random assignment, crossing hurdles ___ and ___ of causal evaluation is difficult. |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | What kind of test do you perform if both variables are categorical? |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | What kind of test do you perform if both variables are continuous? |  
          | 
        
        
        Definition 
        
        | Correlation coefficient (Pearson's r) |  
          | 
        
        
         | 
        
        
        Term 
        
        | What kind of test do you perform if the dependent variable is continuous and the independent variable is categorical? |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | What kind of test do you perform if the dependent variable is categorical and the independent variable is continuous? |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | What do p values range between? |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | What does the p value show? |  
          | 
        
        
        Definition 
        
        the probability of randomly finding a relationship in the sample that does not exist in the population |  
          | 
        
        
         | 
        
        
        Term 
        
        | As the p value approaches ____ we get more confidence that there is a real relationship between the two variables in the population |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | The more data we have, the _____ our p values will be. |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | If a p value is less than ____, the relationship is said to be _______ ________. |  
          | 
        
        
        Definition 
        
        | 0.05, statistically significant |  
          | 
        
        
         | 
        
        
        Term 
        
        | What are the three steps to trying to interpret cross tabs? |  
          | 
        
        
        Definition 
        
        1. Figure out what defines the rows and columns 2. Figure out what each cell tells you 3. Look for general patterns |  
          | 
        
        
         | 
        
        
        Term 
        
        | When is Pearson's chi squared (x^2) statistic used?  Write it down. |  
          | 
        
        
        Definition 
        
        | Used to find the relationship between two categorical variables. |  
          | 
        
        
         | 
        
        
        Term 
        
        | How do you calculate the degrees of freedom for the chi-squared statistic? |  
          | 
        
        
        Definition 
        
        | df = (r - 1)(c - 1) where r is the number of rows in your table and c is the number of columns |  
          | 
        
        
         | 
        
        
        Term 
        
        | When do you use a t-test? Write down the formula. |  
          | 
        
        
        Definition 
        
        | When the dependent variable is continuous and the independent variable is discrete. |  
          | 
        
        
         | 
        
        
        Term 
        
        | What does the numerator of the t-test formula tell you? |  
          | 
        
        
        Definition 
        
        | the greater the dierence between the means, the higher the value of t will be |  
          | 
        
        
         | 
        
        
        Term 
        
        | The denominator of the t test requires the _____ ______ of the difference of the two means.  Write down how this is calculated |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | How do you calculate degrees of freedom for the t-test? |  
          | 
        
        
        Definition 
        
        | Subtract one from the smaller of the two n's |  
          | 
        
        
         | 
        
        
        Term 
        
        | When do you use Pearson's r (correlation coefficient?) |  
          | 
        
        
        Definition 
        
        | When both the independent and dependent variables are continuous |  
          | 
        
        
         | 
        
        
        Term 
        
        | What does covariance mean? |  
          | 
        
        
        Definition 
        
        | That the variables change together |  
          | 
        
        
         | 
        
        
        Term 
        
        | How do you calculate the covariance between two variables?  Write it down. |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | If ___ is systematically higher than its mean for the same observations in which ___ is higher than its mean, we'll get a ________ contribution to the covariance. |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | How is Pearson's r calculated?  Write it down. |  
          | 
        
        
        Definition 
         | 
        
        
         | 
        
        
        Term 
        
        | What t-statistic is used to determine whether two continuous variables have a higher correlation than we would expect at random?  Write it down.  How are degrees of freedom calculated? |  
          | 
        
        
        Definition 
         | 
        
        
         |