
A nontechnical analogy: a mother sees various bumps and shapes under a blanket at the bottom of a bed. When one shape moves toward the top of the bed, all the other bumps and shapes move toward the top also, so the mother concludes that what is under the blanket is a single thing: her child. Similarly, factor analysis takes as input a number of measures and tests, analogous to the bumps and shapes. Those that move together are considered a single thing, which it labels a factor. That is, in factor analysis the researcher assumes that a "child" is out there in the form of an underlying factor, and takes simultaneous movement (correlation) as evidence of its existence. If the correlation is spurious for some reason, this inference will be mistaken, so when conducting factor analysis it is important to take into account variables which might introduce spuriousness, such as antecedent causes.
Factor analysis is part of the general linear model (GLM) family of procedures and makes many of the same assumptions as multiple regression: linear relationships, interval or near-interval data, untruncated variables, proper specification (relevant variables included, extraneous ones excluded), lack of high multicollinearity, and multivariate normality for purposes of significance testing. Factor analysis generates a table in which the rows are the observed raw indicator variables and the columns are the factors or latent variables which explain as much of the variance in these variables as possible. The cells in this table are factor loadings, and the meaning of the factors must be induced by seeing which variables are most heavily loaded on which factors. This inferential labeling process can be fraught with subjectivity, as different researchers may impute different labels.
There are several different types of factor analysis, the most common being principal components analysis (PCA), which is preferred for purposes of data reduction. However, common factor analysis, also called "principal factor analysis" (PFA), is preferred for purposes of causal analysis and for confirmatory factor analysis in structural equation modeling, among other settings.
Principal components analysis (PCA), a.k.a. components analysis or factor analysis, vs. principal factor analysis (PFA), a.k.a. principal axis factoring (PAF), common factor analysis, or factor analysis:

PCA: Analyzes a correlation matrix in which the diagonal contains 1's. (This is not equivalent to analyzing the covariance matrix.)
PFA: Analyzes a correlation matrix in which the diagonal contains the communalities. (This is equivalent to analyzing the covariance matrix, which is also what structural equation modeling does.)

PCA: Accounts for the total variance of the variables. Factors, properly called components, reflect the common variance of the variables plus the unique variance. That is, manifest variables may be conceptualized as reflecting a combination of common and unique variance explained by the components, plus error variance not explained by the components.
PFA: Accounts for the covariation among the variables. Factors reflect the common variance of the variables, excluding unique (variable-specific) variance. That is, manifest variables may be conceptualized as reflecting a combination of common variance explained by the factors, plus unique and error variance not explained by the factors.

PCA: Components seek to reproduce the total variable variance as well as the correlations. That is, PCA accounts for the total variance of the variables.
PFA: Factors seek to reproduce only the correlations of the variables. That is, PFA accounts for the covariation among the variables.

PCA is thus a variance-focused approach; PFA is a correlation-focused approach.

PCA: For the first component, PCA creates a linear equation which extracts the maximum total variance from the variables; for the second component, PCA removes the variance explained by the first component and creates a second linear equation which extracts the maximum remaining variance; and so on, continuing until the components explain all the common and unique variance in the set of variables.
PFA: PFA seeks the least number of factors which can account for the covariance shared by a set of variables. For the first factor, PFA creates a linear equation which extracts the maximum covariance from the variables; for the second factor, PFA removes the covariance explained by the first factor and creates a second linear equation which extracts the maximum remaining covariance; and so on, continuing until the factors explain all the covariance in the set of variables.

PCA: Normally, components are orthogonal to (uncorrelated with) one another, though an oblique (correlated) option is available.
PFA: Normally, factors are orthogonal to (uncorrelated with) one another, though an oblique (correlated) option is available.

PCA: Adding variables to the model will change the factor loadings.
PFA: In principle, it is possible to add variables to the model without affecting the loadings of the existing factors.

PCA: Used when the research purpose is data reduction or exploration. PCA is not used in causal modeling (ex., not used with structural equation modeling).
PFA: Used when the research purpose is theory confirmation and causal modeling. A type of PFA is built into structural equation modeling programs, for instance.

PCA: In SPSS: Analyze, Data Reduction, Factor; click the Extraction button; from the "Method" drop-down, select "Principal components".
PFA: In SPSS: Analyze, Data Reduction, Factor; click the Extraction button; from the "Method" drop-down, select "Principal axis factoring".
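The diagonal difference between the two approaches can be sketched numerically. The following is a minimal Python/NumPy illustration, using a hypothetical correlation matrix; the PFA side is a single-pass extraction with squared multiple correlations (SMCs) as initial communality estimates, whereas production software normally iterates the communalities:

```python
import numpy as np

# Hypothetical 4-variable correlation matrix (illustrative values only)
R = np.array([
    [1.0, 0.6, 0.5, 0.4],
    [0.6, 1.0, 0.5, 0.4],
    [0.5, 0.5, 1.0, 0.3],
    [0.4, 0.4, 0.3, 1.0],
])

# PCA: eigendecomposition of R with 1's on the diagonal (total variance)
pca_vals, pca_vecs = np.linalg.eigh(R)
order = np.argsort(pca_vals)[::-1]
pca_vals, pca_vecs = pca_vals[order], pca_vecs[:, order]
pca_loadings = pca_vecs * np.sqrt(pca_vals)   # component loadings

# PFA: replace the diagonal with initial communality estimates (SMCs),
# i.e. each variable's squared multiple correlation with the others
Rinv = np.linalg.inv(R)
smc = 1 - 1 / np.diag(Rinv)
R_reduced = R.copy()
np.fill_diagonal(R_reduced, smc)
pfa_vals, pfa_vecs = np.linalg.eigh(R_reduced)
order = np.argsort(pfa_vals)[::-1]
pfa_vals, pfa_vecs = pfa_vals[order], pfa_vecs[:, order]
pfa_loadings = pfa_vecs * np.sqrt(np.maximum(pfa_vals, 0))

print("PCA first-component loadings:", np.round(pca_loadings[:, 0], 3))
print("PFA first-factor loadings:  ", np.round(pfa_loadings[:, 0], 3))
```

Because the PFA diagonal contains communalities (all below 1), the eigenvalues of the reduced matrix, and hence the loadings, are smaller than their PCA counterparts, reflecting the exclusion of unique variance.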
Warning: Factor analysis is not a silver bullet. Simulations comparing factor analysis with structural equation modeling (SEM) on simulated data indicate that, at least in some circumstances, factor analysis may not identify the correct number of latent variables, or even come close. While factor analysis may demonstrate that a particular model with a given number of latent variables is not inconsistent with the data, researchers should understand that other models with different numbers of latent variables may also have good fit by SEM techniques.
A Q-mode issue has to do with negative factor loadings. In conventional factor analysis of variables, loadings are loadings of variables on factors, and a negative loading indicates a negative relation of the variable to the factor. In Q-mode factor analysis, loadings are loadings of cases (often individuals) on factors, and a negative loading indicates that the case/individual displays responses opposite to those who load positively on the factor. In conventional factor analysis, a loading approaching zero indicates the given variable is unrelated to the factor. In Q-mode factor analysis, a loading approaching zero indicates the given case is near the mean for the factor. Cluster analysis is now more common than Q-mode factor analysis. Note, however, that correlations in factor analysis are treated within a general linear model which takes control variables into account, whereas cluster analysis uses correlations simply as similarity measures. For this reason, some researchers still prefer Q-mode factor analysis for clustering purposes.
The following modes are rare.
In the SPSS example below, focused on subjects' music preferences (coded from 1 = "like it" to 3 = "dislike it"), the red cells show the loadings for the measured (row) variables most associated with each of the six extracted components (factors). The green cell illustrates a weak to moderate cross-loading. Ideally, the researcher wants a "simple factor structure," with all main loadings greater than .70 and no cross-loadings greater than .40 (some say greater than .3). Usually, as here, actual patterns fall short of simple factor structure, though this example comes close. Rap music preference in component 3 is the most clearly and heavily loaded. Component 1 is the most diverse, associated with disliking classical, opera, Broadway, and big band music and cross-loaded with being less educated.
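The simple-structure rule of thumb can be checked mechanically. Below is a small Python/NumPy sketch with a hypothetical rotated loading matrix; the .70 and .40 cutoffs are the ones cited above:

```python
import numpy as np

# Hypothetical rotated loading matrix: 5 variables x 2 components
loadings = np.array([
    [0.82, 0.12],
    [0.78, 0.05],
    [0.75, 0.38],   # borderline cross-loading
    [0.10, 0.85],
    [0.08, 0.79],
])

# Main loading: largest absolute loading for each variable
main = np.abs(loadings).max(axis=1)
# Cross-loading: second-largest absolute loading for each variable
cross = np.sort(np.abs(loadings), axis=1)[:, -2]

simple_structure = bool(np.all(main > 0.70) and np.all(cross < 0.40))
print("main loadings:", main)
print("largest cross-loadings:", cross)
print("simple factor structure:", simple_structure)
```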
In the SPSS example below, again on analysis of music preferences, 18 components (factors) would be needed to explain 100% of the variance in the data. However, using the conventional criterion of stopping when the initial eigenvalue drops below 1.0, only 6 of the 18 factors were actually extracted in this analysis. These six account for 72% of the variance in the data.
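The Kaiser criterion and the cumulative-variance figures come directly from the eigenvalues of the correlation matrix, since the total variance to be explained equals the number of variables. A minimal Python/NumPy sketch with a hypothetical five-item correlation matrix:

```python
import numpy as np

# Hypothetical correlation matrix for 5 survey items (illustrative values)
R = np.array([
    [1.0, 0.7, 0.6, 0.2, 0.1],
    [0.7, 1.0, 0.6, 0.2, 0.1],
    [0.6, 0.6, 1.0, 0.1, 0.1],
    [0.2, 0.2, 0.1, 1.0, 0.5],
    [0.1, 0.1, 0.1, 0.5, 1.0],
])

eigvals = np.sort(np.linalg.eigvalsh(R))[::-1]
# Kaiser criterion: retain components with eigenvalue > 1.0
n_retained = int((eigvals > 1.0).sum())
# Percent of total variance explained (total variance = number of variables)
pct_cum = 100 * eigvals.cumsum() / R.shape[0]

print("eigenvalues:", np.round(eigvals, 3))
print("components retained:", n_retained)
print("cumulative % variance:", np.round(pct_cum, 1))
```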
The reproduced correlation residuals matrix may help the researcher to identify particular correlations which are ill reproduced by the factor model with the current number of factors. By experimenting with different models with different numbers of factors, the researcher may assess which model best reproduces the correlations which are most critical to his or her research purpose.
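The reproduced correlation matrix is simply the loadings matrix times its transpose, and the residuals are its differences from the observed correlations. A Python/NumPy sketch with a hypothetical correlation matrix and a two-component model (the .05 flag threshold is the one SPSS uses when counting nonredundant residuals):

```python
import numpy as np

# Hypothetical four-variable correlation matrix
R = np.array([
    [1.0, 0.6, 0.5, 0.4],
    [0.6, 1.0, 0.5, 0.4],
    [0.5, 0.5, 1.0, 0.3],
    [0.4, 0.4, 0.3, 1.0],
])

# Loadings for the first k = 2 components
vals, vecs = np.linalg.eigh(R)
order = np.argsort(vals)[::-1][:2]
loadings = vecs[:, order] * np.sqrt(vals[order])

# Reproduced correlations and their residuals
R_reproduced = loadings @ loadings.T
residuals = R - R_reproduced
off_diag = residuals[~np.eye(len(R), dtype=bool)]
print("max |residual|:", round(float(np.abs(off_diag).max()), 3))
# SPSS flags nonredundant residuals with absolute values > .05
print("share of residuals > .05:", float(np.mean(np.abs(off_diag) > 0.05)))
```

Refitting with a different number of components and re-running this check shows directly which correlations a given model reproduces well or badly.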
Oblique rotations, discussed below, allow the factors to be correlated, and so a factor correlation matrix is generated when oblique is requested. Normally, however, an orthogonal method such as varimax is selected and no factor correlation matrix is produced as the correlation of any factor with another is zero.
Problems arise even when the number of categories is greater than two. Spurious factors may be created not because items are similar in meaning but because they are similar in difficulty (Gorsuch, 1974; Lawrence, Shaw, Baker, Baron-Cohen, & David, 2004). Treating ordinal variables as interval is a form of measurement error and hence involves attenuation of correlation. This is why basing exploratory factor analysis (EFA) on a matrix of polychoric correlations, which are designed for ordinal data, results in higher factor loadings and higher eigenvalues as a rule. Monte Carlo studies by Joreskog & Sorbom (1986) uphold the desirability of basing EFA on polychoric matrices, as does research by Muthen & Kaplan (1985) and by Gilley & Uhlig (1993). Polychoric correlation matrices can be created in PRELIS, the front end to LISREL, described by Joreskog & Sorbom (1986). See the discussion of levels of data.
COMPARING FACTOR ANALYSIS, MULTIDIMENSIONAL SCALING, AND CLUSTER ANALYSIS

(Factor = factor analysis; MDS = multidimensional scaling; Cluster = cluster analysis.)

Multivariate? (Uses partial coefficients which control for other variables in the model.)
Factor: Yes.
MDS: No.
Cluster: No.

Group both variables and cases?
Factor: Designed for variables, but the data matrix could be flipped to factor cases (Q-mode).
MDS: Designed for variables; the data matrix could be flipped to scale cases.
Cluster: Hierarchical clustering gives a choice of clustering either. Other methods are designed for clustering cases; the data matrix could be flipped to cluster variables.

How many groups?
Factor: Use the Kaiser criterion (eigenvalues > 1) or the scree plot. The number of factors is influenced by the number of variables.
MDS: Minimize stress or use the scree plot. The number of groups is not influenced by the number of variables, and MDS may yield fewer groups.
Cluster: In hierarchical clustering, stop when the dendrogram distance jump is large. In two-step clustering, use the lowest BIC and the largest ratio of distance change.

How to label groups?
Factor: Infer from factor loadings.
MDS: Infer from which objects cluster in the p-space map. Some researchers then confirm groups in cluster analysis.
Cluster: Infer from group memberships of cases.

Criteria for good model fit?
Factor: Cumulative % of variance explained in the eigenvalue table; simple factor structure; high communalities; low reproduced correlation residuals. Maximum likelihood and GLS extraction methods have goodness-of-fit tests.
MDS: R-squared > .60; scatterplots of linear (covariates) or nonlinear (factors) fit form a 45-degree line.
Cluster: Low proximity coefficient in the agglomeration table for hierarchical clustering; low mean square error in the SPSS ANOVA table for k-means clustering, or in SAS a high overall R-square and CCC > 3; low BIC and high ratio of distance in the auto-clustering table for two-step clustering.

Save group membership if cases are grouped?
Factor: Yes; factor scores can be saved for cases when factoring variables, or, in Q-mode, factor scores reflect group membership tendencies.
MDS: No; this would have to be done manually.
Cluster: All three clustering methods can save the cluster membership number.

Most central output?
Factor: Table of factor loadings.
MDS: Perceptual map.
Cluster: Cluster membership table for all methods; tree diagram (dendrogram) for hierarchical clustering.

How to tell which variables are most important?
Factor: Rotated factor loadings.
MDS: Decomposition of normalized stress table in Proxscal.
Cluster: Variable-wise importance plot in two-step clustering (the predictor importance plot in SPSS 20).

How to spot influential cases?
Factor: Not available; consider preprocessing with regression, which has casewise diagnostics and can save influence measures.
MDS: A solution can be obtained for each individual when the input is a rectangular matrix; the weirdness index is available for INDSCAL/WMDS models. Also consider preprocessing with regression.
Cluster: Distance in the cluster membership table in k-means clustering.

Assumptions?
Factor: Assumptions of general linear models, such as linearity and normally distributed variables.
MDS: GLM assumptions do not apply.
Cluster: GLM assumptions do not apply.
where
varlist is a list of variable names separated by commas
meanslist is a list of the means of variables, in the same order as varlist
stddevlist is a list of standard deviations of variables, in the same order
CORR statements define a correlation matrix, with variables in the same order (data above are for illustration; one may have more or fewer CORR statements as needed according to the number of variables).
Note the period at the end of the MATRIX DATA and END DATA commands.
Then, if the MATRIX DATA command is part of the same control syntax working file, add the FACTOR command as usual but add the subcommand "/MATRIX=IN(*)" (without the quotation marks). If the MATRIX DATA command is not part of the same syntax set but has been run earlier, the matrix data file name is substituted for the asterisk.
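For readers working outside SPSS, the same summary-statistics input suffices: the covariance matrix is fully implied by the correlations and standard deviations, and extraction can proceed from the correlation matrix with no raw cases at all. A minimal Python/NumPy sketch with purely illustrative numbers:

```python
import numpy as np

# Analogue of SPSS MATRIX DATA input: factoring from summary statistics
# rather than raw cases (all numbers are purely illustrative)
stddevs = np.array([1.2, 0.8, 2.5])
R = np.array([
    [1.0, 0.4, 0.3],
    [0.4, 1.0, 0.2],
    [0.3, 0.2, 1.0],
])

# The implied covariance matrix: cov_ij = r_ij * sd_i * sd_j
D = np.diag(stddevs)
cov = D @ R @ D

# Extraction can proceed from the correlation matrix directly
eigvals = np.sort(np.linalg.eigvalsh(R))[::-1]
print("eigenvalues of R:", np.round(eigvals, 3))
```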
Using confirmatory factor analysis in structural equation modeling, having several or even a score of indicator variables for each factor will tend to yield a model with more reliability, greater validity, higher generalizability, and stronger tests of competing models than will CFA with two or three indicators per factor, all other things being equal. However, the researcher must take account of the statistical artifact that models with fewer variables yield apparently better fit as measured by SEM goodness-of-fit coefficients, all other things being equal.
However, "the more, the better" may not hold when there is a possibility of suboptimal factor solutions ("bloated factors"). Too many overly similar items will mask true underlying factors, leading to suboptimal solutions. For instance, items like "I like my office," "My office is nice," and "I like working in my office" may create an "office" factor when the researcher is trying to investigate the broader factor of "job satisfaction." To avoid such suboptimization, the researcher should start with a small set of the most defensible (highest face validity) items which represent the range of the factor (ex., ones dealing with work environment, coworkers, and remuneration in a study of job satisfaction). Assuming these load on the same job satisfaction factor, the researcher then adds one additional variable at a time, keeping only items which continue to load on the job satisfaction factor, and noting when the factor begins to break down. This stepwise strategy results in the most defensible final factors.
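The stepwise strategy can be sketched as a loop: compute first-factor loadings for the current item set plus one candidate, and retain the candidate only if it still loads on the factor. The Python/NumPy sketch below uses hypothetical correlations, a single-pass principal-factor extraction with SMC communalities, and an assumed .40 retention cutoff:

```python
import numpy as np

def first_factor_loadings(R):
    # Single-pass principal-factor loadings (SMC initial communalities)
    Rr = R.copy()
    np.fill_diagonal(Rr, 1 - 1 / np.diag(np.linalg.inv(R)))
    vals, vecs = np.linalg.eigh(Rr)
    v = vecs[:, -1] * np.sqrt(max(vals[-1], 0.0))
    return v if v.sum() >= 0 else -v   # fix sign for readability

# Hypothetical correlations among 5 candidate job-satisfaction items;
# item 5 barely correlates with the rest
R = np.array([
    [1.00, 0.55, 0.50, 0.45, 0.10],
    [0.55, 1.00, 0.50, 0.45, 0.10],
    [0.50, 0.50, 1.00, 0.40, 0.10],
    [0.45, 0.45, 0.40, 1.00, 0.10],
    [0.10, 0.10, 0.10, 0.10, 1.00],
])

core = [0, 1, 2]                 # defensible starting set
for candidate in [3, 4]:
    trial = core + [candidate]
    loadings = first_factor_loadings(R[np.ix_(trial, trial)])
    if loadings[-1] > 0.40:      # keep items that still load on the factor
        core = trial
print("retained items:", core)
```

With these illustrative numbers, item 4 is retained and item 5 is dropped, mirroring the "note when the factor begins to break down" advice above.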
Computation: To compute KMO overall, the numerator is the sum of squared correlations of all variables in the analysis (except the 1.0 selfcorrelations of variables with themselves, of course). The denominator is this same sum plus the sum of squared partial correlations of each variable i with each variable j, controlling for others in the analysis. The concept is that the partial correlations should not be very large if one is to expect distinct factors to emerge from factor analysis. See Hutcheson and Sofroniou, 1999: 224.
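The computation described above can be sketched directly, with the partial correlations obtained from the inverse of the correlation matrix (a standard identity: the partial correlation of i and j controlling for all others is -A_ij / sqrt(A_ii * A_jj), where A is the inverse of R). A Python/NumPy sketch with a hypothetical correlation matrix:

```python
import numpy as np

def kmo_overall(R):
    """Overall KMO: sum of squared correlations divided by that sum
    plus the sum of squared partial (anti-image) correlations."""
    A = np.linalg.inv(R)
    d = np.sqrt(np.diag(A))
    # Partial correlation of i and j, controlling for all other variables
    partial = -A / np.outer(d, d)
    off = ~np.eye(len(R), dtype=bool)
    r2 = (R[off] ** 2).sum()
    p2 = (partial[off] ** 2).sum()
    return r2 / (r2 + p2)

# Hypothetical four-variable correlation matrix
R = np.array([
    [1.0, 0.6, 0.5, 0.4],
    [0.6, 1.0, 0.5, 0.4],
    [0.5, 0.5, 1.0, 0.3],
    [0.4, 0.4, 0.3, 1.0],
])
kmo = kmo_overall(R)
print("KMO overall =", round(kmo, 3))
```

Large partial correlations inflate the denominator and drive KMO down, matching the intuition that distinct factors should not emerge when the correlations are mostly "explained away" by the other variables.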
SPSS: In SPSS, KMO is found under Analyze > Statistics > Data Reduction > Factor > Variables (input variables) > Descriptives > Correlation Matrix; check KMO and Bartlett's test of sphericity and also check Anti-image; then Continue > OK. The KMO output is KMO overall. The diagonal elements of the anti-image correlation matrix are the individual KMO statistics for each variable.
The factor invariance test, discussed above, is a structural equation modeling technique (available in AMOS, for ex.) which tests for deterioration in model fit when factor loadings are constrained to be equal across sample groups.
The comparison measures method requires computation of various measures which compare factor attributes of the two samples. Factor comparison is discussed by Levine (1977: 37-54), who describes these factor comparison measures:
However, occasionally an oblique rotation will still result in a set of factors whose intercorrelations approach zero. This, indeed, is the test of whether the underlying factor structure of a set of variables is orthogonal. Orthogonal rotation mathematically ensures that the resulting factors are uncorrelated.
Also, oblique rotation is necessary as part of hierarchical factor analysis, which seeks to identify higher-order factors on the basis of correlated lower-level ones.
When modeling, oblique rotation may be used as a filter. Data are first analyzed by oblique rotation and the factor correlation matrix is examined. If the factor correlations are small (ex., < .32, corresponding to less than 10% variance explained), then the researcher may feel warranted in assuming orthogonality in the model. If the correlations are larger, then covariance between the factors should be assumed (ex., in structural equation modeling, one adds double-headed arrows between latents).
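This filtering rule reduces to a single threshold check on the factor correlation matrix. A trivial Python/NumPy sketch with hypothetical oblimin output (note .32 squared is roughly .10, i.e. 10% shared variance):

```python
import numpy as np

# Hypothetical factor correlation matrix from an oblimin rotation
phi = np.array([
    [1.00, 0.21, 0.15],
    [0.21, 1.00, 0.08],
    [0.15, 0.08, 1.00],
])

off = phi[~np.eye(3, dtype=bool)]
# < .32 corresponds to < ~10% shared variance (.32**2 ~ .10)
orthogonal_ok = bool(np.abs(off).max() < 0.32)
print("treat factors as orthogonal:", orthogonal_ok)
```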
For purposes other than modeling, such as seeing if test items sort themselves out on factors as predicted, orthogonal rotation is almost universal.
HFA is a two-stage process. First an oblique (oblimin) factor analysis is conducted on the raw dataset. As it is critical in HFA to obtain the simplest factor structure possible, it is recommended to run oblimin for several different values of delta, not just the default delta = 0. A delta of 0 gives the most oblique solutions; the more negative the delta the researcher specifies (in the SPSS "Factor Analysis: Rotation" dialog, invoked by clicking the Rotation button), the less oblique the factors become. To override the default delta of 0, the researcher enters a value less than or equal to 0.8.
When the researcher feels the simplest factor structure has been obtained, one has a correlated set of lowerorder factors. Factor scores or a correlation matrix of factors from the first stage can be input to a secondstage orthogonal factor analysis (ex., varimax) to generate one or more higherorder factors.
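The second stage can be sketched by factoring the factor correlation matrix itself. A minimal Python/NumPy illustration with hypothetical stage-one factor correlations, extracting a single higher-order factor by eigendecomposition:

```python
import numpy as np

# Stage 1 output: correlation matrix of three oblique lower-order factors
# (hypothetical values)
phi = np.array([
    [1.00, 0.45, 0.40],
    [0.45, 1.00, 0.35],
    [0.40, 0.35, 1.00],
])

# Stage 2: factor the factor correlations to get higher-order loadings
vals, vecs = np.linalg.eigh(phi)              # ascending eigenvalues
g_loading = vecs[:, -1] * np.sqrt(vals[-1])   # loadings on the higher-order factor
g_loading = g_loading if g_loading.sum() >= 0 else -g_loading
print("higher-order factor loadings:", np.round(g_loading, 3))
```

The substantial correlations among the lower-order factors are what justify extracting the higher-order factor at all; with near-zero correlations (see the oblique-filter discussion) there is nothing for stage two to explain.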
Note, however, that this orthogonalization comes at a price. Now, instead of explicit variables, one is modeling in terms of factors, the labels for which are difficult to impute. Statistically, multicollinearity is eliminated by this procedure, but in reality it is hidden in the fact that all variables have some loading on all factors, muddying the purity of meaning of the factors.
A second research use for component scores is simply to be able to use fewer variables in, say, a correlation matrix, in order to simplify presentation of the associations.
Note also that factor scores are quite different from factor loadings. Factor scores are coefficients of cases on the factors, whereas factor loadings are coefficients of variables on the factors.
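The distinction can be made concrete: loadings form a variables-by-factors matrix, while scores form a cases-by-factors matrix. A Python/NumPy sketch using simulated data and one common way of computing scores, the regression method (scores = Z R⁻¹ L, where Z is the standardized data and L the loadings):

```python
import numpy as np

rng = np.random.default_rng(0)
# Simulated standardized data: 200 cases, 4 variables (illustrative only)
Z = rng.standard_normal((200, 4))
Z = (Z - Z.mean(axis=0)) / Z.std(axis=0)

R = np.corrcoef(Z, rowvar=False)
vals, vecs = np.linalg.eigh(R)                # ascending eigenvalues
loadings = vecs[:, -2:] * np.sqrt(vals[-2:])  # variables x factors (k = 2)

# Regression-method factor scores: coefficients of cases on the factors
scores = Z @ np.linalg.inv(R) @ loadings      # cases x factors

print("loadings shape (variables x factors):", loadings.shape)
print("scores shape (cases x factors):", scores.shape)
```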
Common factor analysis (PFA) determines the least number of factors which can account for the common variance in a set of variables. This is appropriate for determining the dimensionality of a set of variables such as a set of items in a scale, specifically to test whether one factor can account for the bulk of the common variance in the set, though PCA can also be used to test dimensionality. Common factor analysis has the disadvantage that it can generate negative eigenvalues, which are meaningless.
Copyright 1998, 2008, 2009, 2010, 2011, 2012 by G. David Garson.
Last update: 2/9/2012.