Material to be covered in Psyc 2023

The following is meant as an indication of the material I expect we will cover this semester. (Sometimes we do not get through it all.) Since this material all builds upon the material covered in Psyc 2013, some of this will be review. It will be assumed that you have mastered that material, however, whether reviewed by me in Psyc 2023 or not. I have tentatively linked topics to classes, however, we will go through this as quickly, or as slowly as necessary. Changes to the schedule will be noted in class.

Some Problems with probabilistic reasoning: (adapted from Stanovitch)
1) man who statistics (salience of individual cases)
2) insufficient use of probabilistic information (Baye's theorem)
3) cognitive illusions
4) failure use sample size information
5) tendency to explain chance events
6) gambler's fallacy (tendency to see independent events as dependent)
7) conjunction fallacy

Four factors to consider in evaluating external validity:
1) Population studied and how sampled
2) Operational Definitions
3) Parameter values (both independent and control variables)
4) Demand characteristics (both internal & external)
other threats to validity: statistical validity
Power: avoidance of type II (b error) depends upon:
1) increases as the probability of making type I error increases (trade off between the two)
2) increases as the magnitude of the hypothesized effect increases
3) increases as the size of the sample increases (with n=102, an r of ± .16 is significant at a =.05)

Simple Analytic experiments: involve control, manipulation & measurement: importance of operational definitions
"intro psyc" problem: a legitimate problem, but often overstated

Types of research methods in psychology (and their strengths and weaknesses) Categorizing experimental designs: terminology: bivalent, multivalent, between and within subject designs (repeated-measures design): strengths and weaknesses counterbalancing (if complete order can be an IV)
randomized treatment order (and/or block randomization)
latin square (steps in creating)

terminology: independent, dependent, intervening, control, and confounding (or "extraneous") variables

Inferential Statistics: used to assess the reliability (statistical significance) of the results [probability that observed differences between groups (or among groups) occurred by chance alone (i.e., H0)
Logic of inferential statistics: comparing the variability between groups with the variability within groups (e.g., t-test and F-ratio) measurement & sampling error, and natural variation
- influenced by: i) degree of control over environmental factors
ii) subject differences
iii) sample size - influenced by: i) strength of IV,
ii) level of treatment,
iii) sensitivity of the DV's operational definition

Errors: type I (a - alpha), type II (b - beta): how controlled
1 tailed vs 2-tailed tests (decide a priori)
Between and within subjects t-test (pooled and unpooled error calculations)
Many different inferential statistics: Each intended for specific conditions

Nonparametric statistics: e.g., Mann-Whitney U-test, sign test

Analysis of Variance (ANOVA)
ANOVA followed by t-tests only if the ANOVA is significant (importance of placebo controls)
ANOVA: partitions variance: i) subject variables, ii) experimental error, iii) value of the IV

Factorial Experiments Advantages: can address the complexities of the social sciences (can examine interactive effects among multiple factors): more ecologically valid & economical.

Factorial designs: #levels IV1 x #levels IV2 x #levels IV3àetc.

Developmental Research Designs methods with age or time as a variable

Three specific experimental designs.

Cohort: a group of individuals with common experiences (e.g., born the same time)

Habituation-Dishabituation techniques a paradigm technique in infancy research

Other Quasi-Analytic Designs: Ex-post-facto designs (after the fact)

WHY?: ethical reasons or an interest in organismic variables

Problems: not randomly assigned: inherent confounds in the populations studied
sampling problems (often a convenient sample):
dropouts in prospective studies
detection bias (equally likely to detect in both groups?)

Partial solutions:
Matching: 1) subject for subject (preferable but more difficult) or 2) distribution by distribution

Measuring: so will know if potential confounds (uncontrolled variables) are confounded, and to statistically control for these variables

Retrospective studies also have additional problems in that they rely on memory so the partial solutions are more difficult to employ successfully

DVs used in Ex-post-facto studies Problem with both in that absolute risks are hidden, both should be reported

Causality and ex-post-facto designs. No one quasi-analytic experiment will unambiguously show a causal relationship, however, with converging evidence from many such studies (5, 10 or 100?) can make causal statements (like "smoking causes cancer")

Subject Sampling

Time-series designs, small-n designs non-equivalent control group
replication within-subjects (e.g., A-B-A-B-A-B¼ designs)
generalizability can be indicated by having all subjects show the same pattern

Quasi-analytic experiments: Bivalent correlation designs
for correlations: 1) select population and subjects of interest; 2) measure two variables of interest; 3) calculate the extent to which the two variables are systematically related
Graph data (scatterplot): predictor (assumed causal or IV) variable on abscissa (X-axis) and criterion or DV on ordinate (Y-axis)
Pearson's product moment correlation coefficient (for Interval or ratio data) measures the direction and degree of association.
r is the mean of z-score crossproducts: r=S (ZxZy)/N, the extent to which deviations from the average on each measure are similar for each subject sampled

Linear regression: - looks at the correlation in terms of Predictability,
r2 is a measure of the variance in Y accounted for (or predicted by) X.
1-r2: coefficient of nondetermination (also called coefficient of alienation or error variance)
Linear regression finds the best fitting line: Y'=a+bx
(minimizes the sum of squared deviations, sum of deviations between predicted values of y' and actual observed values of y =0. these deviations are called residuals) cautions - assumes linear relations among variables, truncated ranges can reduce correlations or regressions, Pearson's r (based on means) is very sensitive to the presence of outliers, heteroscedasticity (rXY relationship may vary across levels of X), combining group data can influence the size of the correlation. So: examine scatterplots!!

Problems interpreting the results of this type of research: third variable problem and directionality (not always an issue), regression artifact (e.g., Rushton), floor and ceiling effects, look for converging evidence

Correlation versus ex-post facto design: similar and can convert one to the other [e.g., assign dummy coding to the categorical (nominal) variable and calculate a point-biserial correlation coefficient]
Interpretation problems are not related to the statistical choice, rather due to the design
Causation not a simple concept: want to have:
1) an association between variables that recurs in different contexts (replication, convergent evidence),
2) have a plausible explanation showing how the predictor variable could cause the criterion variable, and
3) have no equally plausible 3rd variable that could cause the variance in the criterion variable.

While correlation doesn't imply causation, causation does imply correlation

Simpson's Paradox: when two groups are classified on some attribute (as in many ex-post facto designs), then are separated into subcategories, the group with the higher incidence (or scores) overall can have the same or even lower incidence within every one of the subcategories. e.g., 1) salaries and economics degrees, 2) race and imposition of death penalty

Partial correlation rYX2× X1: allows you to examine the relationship between 2 variables with the effect of the third removed from both. Can be viewed as the average of the simple bivariate correlations across levels of the third, "nuisance" variable partialled out. variable.

Remember: can be other confounding variable not measured

Semipartial correlations (sometimes called Part correlations) rY(X2× X1):allows you to examine the relationship between 2 variables with the effect of the third removed from one.. (see later handout on multiple regression)

Multiple Correlation and Regression (see later handout)

Discrete trials designs - Psychophysics

method of limits (ascending & descending series)
staircase method: advantages in tracking changes in sensitivity, more efficient
method of constant stimuli Signal Detection Theory

a mathematical, theoretical system that recognizing that observers are not merely passive receivers of stimuli, are also engaged in process of deciding whether they are confident enough to say they detect a signal.

Scientific theories: types of theories, functions of theories
Evaluation on the basis of : parsimony, testability, precision
Confirming vs. disconfirming strategies (confirmational bias)