Psychological Testing: Chapter 3: A Statistics Refresher
This flashcard set covers foundational concepts in psychological and statistical measurement. It explains how statistical tools help analyze data, defines measurement and the rules behind assigning numbers, introduces the concept of scales, and outlines two main scale types: continuous and discrete.
Statistical Tools
Key Terms
Statistical Tools
Used to describe, make inferences from, and draw conclusions about numbers
Measurement
Act of assigning numbers or symbols to characteristics of things (people, events, etc) according to rules
Rules Used to Assign Numbers
Guidelines for representing the magnitude (or some other characteristic) of the object being measured
Scale
Set of numbers (or other symbols) whose properties model empirical properties of the objects to which the numbers are assigned
Ways to Categorize Scales
Continuous Scale
Discrete Scale
Continuous Scale
A scale used to measure a continuous variable; exists when it is theoretically possible to divide any of the values of the scale
Related Flashcard Decks
| Term | Definition |
|---|---|
Statistical Tools | Used to describe, make inferences from, and draw conclusions about numbers |
Measurement | Act of assigning numbers or symbols to characteristics of things (people, events, etc) according to rules |
Rules Used to Assign Numbers | Guidelines for representing the magnitude (or some other characteristic) of the object being measured |
Scale | Set of numbers (or other symbols) whose properties model empirical properties of the objects to which the numbers are assigned |
Ways to Categorize Scales | Continuous Scale Discrete Scale |
Continuous Scale | A scale used to measure a continuous variable; exists when it is theoretically possible to divide any of the values of the scale |
Discrete Scale | A scale used to measure a discrete variable; example, male or female |
Units into which a continuous scale will be divided | Depends on factors such as the purpose of the measurement and practicality |
Error | Refers to the collective influence of all of the factors on a test score or measurement beoynd those specifically measured by the test or measurement |
Sources of Error | A distracting thunderstorm Particular selection of test items the instructor chose to use for the test |
Measuring Scale | Continuous if used for psychological and educational assessment and therefore can be expected to contain this sort of error |
Four levels or Scales of Measurement | Nominal, Ordinal, Interval, and Ratio Scales; Within these, different levels or scales of measurement, assigned numbers convey different kinds of information; |
Statistical Manipulation | May or may not be appropriate, depending upon the leel or scale of measurement |
Nominal Scales | These scales involve classification or categorization based on one or more distinguishing characteristics, where all things measured must be placed into mutually exclusive and exhaustive categories |
Arithmetic Operations | That can be performed with norminal data include counting for the purpose of determining how many cases fall into each category and a resulting determination of proportion or percentages |
Ordinal Scales | Permits classification rank ordering on some characteristic is also permissible with ordinal scales; have no absolute zero point; notes how much greater one ranking is than another; limited statistical analysis |
Alfred Binet | Developer of the intelligence test that bears his name, believed strongly that the data derived from an intelligence test are ordinal in nature; Emphasized that what he tried to do in the test was not to measure people but merely to classify (and rank) people on the basis of their performance on the tasks. |
Rokeach Value Survey | Uses ordinal form of measurement |
Zero in a Survey | Without meaning in such a test because the number of units that separate one testtaker’s score from another’s is simply not known |
Interval Scales | Contain equal intervals between numbers; each unit on the scale is exactly equal to any other unit on the scale; contain no absolute zero point; it is possible to average a set of measure ments and obtain a meaningful result |
Ratio Scales | Has a true zero point; all mathematical operations can meaningfully be performed because there exist equal intervals between the numbers on the scale as well as a true or absolute zero point |
Ratio-Level Measurement | Employed in some types of tests and test items, especially those involving assessment of neurological functioning; Test of hand grip, timed test of perceptual-motor ability (completion of a puzzle); no testtaker can ever obtain a score of zero on an assembly task |
Measurement used in Psychology | Ordinal level of measurement; intelligence, aptitude, and personality tests are basically and strictly speaking, ordinal; these tests indicate with more or less accuracy not the amount of intelligence, aptitude, and personality traits of individuals, but rather the rank-order positions of individuals |
Distribution | Defined as a set of test scores arrayed for recording or study |
Raw Score | Straighforward, unmodified accounting of performance that is usually numberical; may reflect a simple tally, as in the number of items responded to correctly on an achievement test |
Frequency Distribution | All scores are listed alongside the number of times each score occurred; scores may be listed the frequency of occurrence of each score in one column and the score itself in the other column |
Simple Frequency Distribution | What a frequency distribution is referred to, to indicate that individual scores have been used and the data have not been grouped |
Grouped Frequency Distribution | Test-score intervals replace the actual test scores |
Class Intervals | Test-score intervals |
Graph | A diagram or chart composed of lines, points, bars, or other symbols that describe and illustrate data |
Good Graph | The place of a single score in relation to a distribution of test scores can be understood easily |
Types of Graphs | Histogram |
Histogram | Graph with vertical lines drawn at the true limits of each test score (or class interval), forming a series of contiguous rectangles; customary for the test scores to be placed along the graph's horizontal axis (x) and for numbers indicative of the frequency of occurrence to be placed along the graph's vertical axis |
Abcissa | X axis, the graph's horizontal axis; where test scores should be placed |
Ordinate | Y axis, the graph's vertical axis; where numbers indicative of the frequency of occurrence should be placed |
Bar Graph | Numbers indicative of frequency also appear on the Y axis, and reference to some categorization appears on the x-axis; rectangular bars are not contiguous |
Frequency Polygon | Expressed by a continuous line connecting the points where test scores or class intervals (as indicated on the X-axis) meet frequencies (as indicated on the Y-axis) |
Bell-Shaped Curve | Normal graphic representation of data |
Measure of Central Tendency | Statistic that indicates the average or midmost score between the extreme scores in a distribution |
Center of a Distribution | Arithmetic Mean |
Mean | The average, takes into account the actual numerical value of every score; Sum of observations divided by the number of ebservations or test scores; most appropriate measure of central tendency for interval or ratio data when distributions are believed to be approximately normal; the most stable and useful measure of a central tendency |
Median | Defined as the middle score in a distribution; scores are ordered in a list by magnitude, in either ascending or descending order. If the total number of scores ordered is an odd number, then the median will be the score that is exactly in the middle, with one-half of the remaining scores lying above it and the other half of the scores lying below it; average calculated by obtaining the average of the two middle scores; appropriate measure of central tendency for ordinal, interval, and ratio data |
When Median is Useful to measure Central Tendency | In cases where relatively few scores fall at the high end of the distribution or relatively few scores fall at the low end of the distribution |
Mode | Most frequenctly occurring score in a distribution of scores; |
Bimodal Distribution | When there are two scores that occur with the highest frequency; not a commonly used measure of central tendency; not calculated; one merely counts and determines which score occurs most frequently; not necessarily a unique point in a distribution; useful in conveying certain types of information; useful in analyses of a qualitative or verbal nature; useful to convey information IN ADDITION to the mean; |
Variability | Indication of how scores in a distribution are scattered or dispersed |
Measures of Variability | Include the range, the interquartile range, the semi-interquartile range, the average deviation, the standard deviation and the variance |
Range | Equal to the difference between the highest and the lowest scores; simplest measure of variability to calculate; Because it is based on values of the lowest and highest scores, one extreme score can radically alter the value of the range; provides a quick but gross description of the spread of scores |
Nature of the Range | When its value is based on extreme scores in a distribution, the resulting description of variation may be understated or overstated. Better measures include interquartile range and the semiquartile range |
Quartiles | Dividing test scores into four parts such that 25% of the test scores occur in each quarter; refers to a specific point |
Quarter | Refers to an interval |
Interquartile Range | Measure of variability equal to the difference between Q3 and Q1; ordinal statistic |
Semi-Interquartile Range | Equal to the interquartle range divided by 2; |
Shape of distribution | Provided by the relative distances from Q1 and Q3 from Q2 (the median) |
Perfectly symmetrical Distribution | Q1 and Q3 will be esactly the same distance from the median |
Skewness | Lack of symmetry |
Average Deviation (AD) | Tool that could be used to describe the amount of variability in a distribution; Deviation scored are added and divided by the total number of scores to arrive at the average deviatio |
Standard Deviation | Square of each score is used; a measure of variability equal to the square root of the average squared deviations about the mean; Equal to the square root of the variance; Standard deviation measures how much - on average - individual scores of a given group vary (or deviate) from the average (or mean) score for this same group. In other words, the value of standard deviation helps show how many subjects in the group score within a certain range of variation from the mean score for the entire group. Still other way to explain it is that standard deviation measures the spread of individual results around a mean of all the results; |
Variance | Equal to the arithmetic mean of the squares of the differences between the scores in a distribution and their mean; squaring and summing all the deviation scores and then dividing by the total number of scores |
If Standard Deviation is 14.10 | 1 Standard Deviation Unit is approximately equal to 14 units of measurement or to 14 test-score points |
n-1 | only makes a difference if n is 10 or more |
Skewness | The nature and extent to which symmetry is absent; indication of how the measurements in a distribution are distributed |
Positively Skewed Distribution | When relatively few of the scores fall at the high end of the distribution; for examination result, may indicate that the test was too difficult;the distance between Q3-Q2 > Q2-Q1 |
Negatively Skewed Distribution | When relatively few of the scores fall at the low end of the distribution; for examination results, may indicate that the test was too easy; Q3-Q2 |
Symmetrical Distribution | Distances from Q1 and Q3 to the median are the sme |
Kurtosis | Used to refer to the steepness of a distribution in its center; |
Descriptions of Distributions | Platykurtic |
Platykurtic | Relatively flat |
Leptokurtic | Relatively peaked |
Mesokurtic | Somewhere in the middle |
Abraham DeMoivre & Marquis de Laplace | First ones to develop the concept of normal curve |
Karl Friedrich Gauss | Made some substantial contributions to the concept of normal curve; LaPlace-Gaussian Curve |
Karl Pearson | Credited with being the first to refer to the curve as the normal curve; Gaussian curve |
Normal Curve | Bell-shaped, smooth, mathematically defined curve that is at its center. From the center, it tapers on both sides apporaching the X-axis asymptotically; symettrical, the mean, median, and mode all have the same exact value; has two tails |
Asymptotically | It approaches but never touches the axis |
Distribution of a normal curve | Ranges from negative infinity to positive infinity |
Characteristics of all Normal Distributions | 50% occur above the mean and 50% of the scores occur below the mean |
Tail | The area on the normal curve between 2 and 3 standard deviations above the mean; the area on the normal curve between -2 and -3 standard deviations below the mean |
Standard Score | Raw score that has been converted from one scale to another scale, where the latter scale has some arbitrarily set mean and standard deviation |
Why convert raw scores to standard scores | Standard scores are more easily interpretable than raw scores; with a standard score, the position of a testtaker's performance relative to other testtakers is readily apparent |
Systems for Standard Scores | z Scores |
Zero Plus or Minus One Scale | Type of standard score scale that may be thought of as the zero plus or minus one scale |
z Score | Results from the conversion of a raw score into a number indicating how many standard deviation units the raw score is below or above the mean of the distribution; provides a convenient context for comparing scores on the same and different tests; zero plus or minus one scale |
T Scores | Fifty plus or minus ten scale; a scale with a mean set at 50 and a standard deviation set at 10; discovered by W.A. McCall named it as such in honor of EL Thorndike; standard score system composed of a scale that ranges from 5 standard deviations below the mean to 5 standard deviations above the mean; |
Stanine | Contraction of words standard and nine; test scores are often represented as stanines; they take on whole values from 1 to 9 which represent a range of performance that is half of a standard deviation width |
Scores Obtained By Linear Transformation | One that retains a direct numerical relationship to the original raw score; the magnitude of difference between standard scores exactly parallels the differences between corresponding raw scores |
Nonlinear Transformation | May be required when the data under consideration are not normally distributed yet comparisons with normal distributions need to be made; resulting standard score does not necessarily have a direct numerical relationship to the original, raw score |
Normalizing the Distribution | Involves stretching the skewed curve into the shape of a normal curve and creating a corresponding scale of standard scores |