Exploratory and Confirmatory Factor Analyses of the Arabic Version of the Childhood Autism Rating Scale (CARS-2)

Overview of the CARS-2 Factor Analysis Study

  • This research, entitled "Exploratory and confirmatory factor analyses of the Arabic version of the Childhood Autism Rating Scale," was conducted by Bander Alotaibi (University of Hail, Saudi Arabia) and Abdulhadi Alotaibi (Umm Al-Qura University, Saudi Arabia).
  • The study evaluates the factor structure, reliability, and validity of the Arabic version of the Childhood Autism Rating Scale, Second Edition (CARS-2).
  • The sample consisted of 301 children diagnosed with Autism Spectrum Disorder (ASD) according to the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) criteria.
  • Key Findings:
      - Internal consistency reliability was 0.79.
      - Inter-rater reliability was 0.65.
      - The intraclass correlation coefficient (ICC) was 0.76.
      - Exploratory Factor Analysis (EFA) suggested a three-factor solution: Communications, Emotions, and Senses and Physical.
      - Confirmatory Factor Analysis (CFA) confirmed a 14-item, three-factor model that adequately fits the data (RMSEA = 0.08).
      - The findings support the continued use and relevance of the Arabic version of the CARS-2 in identifying and assessing ASD.

Context and Evolution of ASD Diagnosis

  • Prevalence Estimates: According to the Autism and Developmental Disabilities Monitoring Network (2018), ASD prevalence is estimated at one in 59 children.
  • Clinical Characteristics: ASDs are neurodevelopmental disorders defined by skill deficits, cognitive/motor delays, and sensory sensitivities.
  • Diagnostic Transitions:
      - DSM-IV Criteria: Included Autistic Disorder (AD), Asperger Syndrome (AS), and Childhood Disintegrative Disorder (CDD) under the Pervasive Developmental Disorder category. Pervasive Developmental Disorder Not Otherwise Specified (PDD-NOS) was previously included.
      - DSM-5 Changes: Removed impaired language as a primary marker; replaced it with deficits in social communication and restrictive/repetitive interests and behaviors. Social Communication Disorder (SCD) was added.
      - Clinical Challenges: Evaluations are difficult due to continuous modifications in diagnostic criteria, specifically the move from DSM-IV to DSM-5.

Assessment Tools: The CARS and CARS-2

  • The original Childhood Autism Rating Scale (CARS) was published in 1980 as a clinical observation tool utilizing a four-point scale.
  • The CARS-2 (published in 2010) includes two versions:
      - Standard Form (CARS2-ST): Maintains the original scale format.
      - High-Functioning (CARS2-HF): Designed for individuals aged six years and older with an estimated IQ of 80+ who communicate fluently.
  • Methodology of Assessment: Ratings are based on direct observations by clinicians combined with collateral information from parents or teachers.
  • Sensitivity Data:
      - CARS2-ST sensitivity based on DSM-5 is 0.84, compared to 0.81 for DSM-IV-TR.
      - Diagnostic agreement between CARS and DSM-5 is approximately 84%.
  • Known Limitations: The CARS has a historical propensity to classify young children with intellectual disabilities (ID) as having autism (Lord, 1995).

Study Methodology and Participants

  • Sample Location: The Unit of Evaluation and Diagnosis at Hail Charitable Association for Children with Disability in Hail, Saudi Arabia.
  • Participant Demographics (N = 301):
      - Gender: 77.4% Male (n = 233), 22.6% Female (n = 68).
      - Age: 47.8% were 2-5 years old; 52.2% were 6-12 years old.
      - Diagnoses: 56.8% Autistic Disorder (AD); 43.2% Social Communication Disorder (SCD).
  • Exclusion Criteria: Participants with Attention Deficit Hyperactivity Disorder (ADHD) associated with ID and stereotyped movements were excluded due to uncertain nosological status.
  • Evaluation Team: At minimum, a licensed clinical psychologist, child psychiatrist, education specialist, speech pathologist, and occupational therapist (expertise ranging from 5 to 15+ years).
  • Procedure:
      - Psychiatrists provided DSM-5 clinical diagnoses independently.
      - Clinical psychologists and speech pathologists rated the CARS-2 independently of the psychiatrist's diagnosis to reduce rater bias.
      - Data were derived from the CARS2-ST (for children under 6) and the CARS2-HF (for children aged 6+).

Statistical Analysis Framework

  • Reliability Metrics: Cronbach’s α (internal consistency), Intraclass Correlation Coefficient (ICC), and Cohen’s κ (agreement between CARS-2 and DSM-5).
  • Diagnostic Accuracy: Receiver operating characteristic (ROC) analyses, sensitivity, specificity, and Area Under the Curve (AUC).
  • Construct Validity:
      - Data was randomly split into Subsample 1 (n = 151) and Subsample 2 (n = 150).
      - Exploratory Factor Analysis (EFA): Used Principal Axis Factoring (PAF) with oblique Promax rotation. Kaiser–Meyer–Olkin (KMO) measure was 0.875, exceeding the 0.6 threshold. Bartlett’s test of sphericity was statistically significant.
      - Confirmatory Factor Analysis (CFA): Used Analysis of Moment Structures (Amos) 25.0 to evaluate model fit using indices like χ², CFI, TLI, RMSEA, AIC, and BIC.
  • Retention Criteria: Eigenvalues > 1.00; factor pattern loadings > 0.40.
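
Cronbach’s α, the internal-consistency metric listed above, can be computed from an item-score matrix as in the minimal Python sketch below. The function name `cronbach_alpha` and the toy scores are illustrative assumptions, not the study's code or data. The formula is the standard one: α = k/(k-1) × (1 - sum of item variances / variance of total scores), where k is the number of items.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])                      # number of items
    items = list(zip(*scores))              # transpose: one tuple per item
    item_var_sum = sum(variance(col) for col in items)   # sample variances
    total_var = variance([sum(row) for row in scores])   # variance of totals
    return (k / (k - 1)) * (1 - item_var_sum / total_var)


# Toy data (hypothetical): 4 respondents x 3 items.
toy_scores = [
    [1, 2, 2],
    [2, 2, 3],
    [3, 4, 3],
    [4, 4, 4],
]
alpha = cronbach_alpha(toy_scores)
print(round(alpha, 3))
```

In practice each row would hold one child's ratings on the scale items; the same function applies to a subscale by passing only that factor's items.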

Research Results: Diagnostic Accuracy and Reliability

  • Participant Scores: Mean CARS-2 score was 36.50 (SD = 5.27), ranging from 15 to 49.
  • Ideal Cutoff Point: A cutoff score of ≥ 26 was determined for children younger than 13.
  • Diagnostic Efficacy of Cutoff 26:
      - Sensitivity: 0.96.
      - Specificity: 0.70.
      - Correctly classified 250 of 301 children.
      - Positive Predictive Value (PPV): 76.2%.
      - Negative Predictive Value (NPV): 97.2%.
      - AUC: 0.65 (95% CI = 0.53-0.78, P = 0.03).
  • Reliability Statistics:
      - Internal consistency (total scores): α = 0.79.
      - Inter-rater agreement (Cohen’s κ): 0.65 (moderate agreement).
      - Intraclass Correlation Coefficient (ICC): 0.76.
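
The diagnostic-efficacy figures above all derive from a 2x2 classification table. The sketch below shows how each is defined. The function name `diagnostic_metrics` and the counts are hypothetical and chosen only for illustration; the summary does not report the study's actual table, so these counts are not meant to reproduce its results.

```python
def diagnostic_metrics(tp, fp, tn, fn):
    """Return sensitivity, specificity, PPV, NPV, and accuracy from a 2x2 table."""
    return {
        "sensitivity": tp / (tp + fn),   # true positives among all true cases
        "specificity": tn / (tn + fp),   # true negatives among all non-cases
        "ppv": tp / (tp + fp),           # true cases among positive results
        "npv": tn / (tn + fn),           # non-cases among negative results
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
    }


# Hypothetical counts for illustration only (not the study's confusion matrix).
metrics = diagnostic_metrics(tp=48, fp=15, tn=35, fn=2)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Note that PPV and NPV depend on the proportion of true cases in the sample, whereas sensitivity and specificity do not.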

Research Results: Validation and Factor Structure

  • The EFA produced a three-factor solution accounting for 55.83% of the common variance.
  • Factor 1: "Communications"
      - Variance: 40.40%.
      - Items: Imitation (0.758), Verbal communication (0.743), Non-verbal communication (0.570), Level and consistency of intellectual response (0.484), Visual Response (0.424), Relation to people (0.429).
      - Internal consistency: α = 0.70.
  • Factor 2: "Emotions"
      - Variance: 8.29%.
      - Items: Emotional response (0.584), Fear and nervousness (0.532), General impressions (0.449), Adaptation to change (0.400).
      - Internal consistency: α = 0.67.
  • Factor 3: "Senses and Physical"
      - Variance: 7.13%.
      - Items: Body use (0.463), Object use (0.889), Taste, smell, touch response and use (0.773), Activity level (0.592).
      - Internal consistency: α = 0.65.
  • Non-loading Item: Listening Response did not exceed the 0.40 loading threshold.
  • Factor Correlations: Medium to large correlations between factors.
      - Communications/Emotions: 0.643.
      - Communications/Senses and Physical: 0.613.
      - Emotions/Senses and Physical: 0.569.
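
The loading-based retention rule can be sketched as follows: an item is assigned to the factor on which it loads most strongly, and is dropped if no loading exceeds 0.40. The helper `assign_item` is a hypothetical name, and the dictionaries contain only loadings reported in this summary (the Listening Response values are the two given in the Discussion); cross-loadings that are not reported are simply omitted.

```python
LOADING_THRESHOLD = 0.40  # retention criterion stated in the source


def assign_item(loadings, threshold=LOADING_THRESHOLD):
    """Return the factor with the largest absolute loading,
    or None if no loading exceeds the threshold."""
    factor, value = max(loadings.items(), key=lambda kv: abs(kv[1]))
    return factor if abs(value) > threshold else None


# Loadings as reported in this summary; unreported cross-loadings are omitted.
listening_response = {"Communications": 0.21, "Senses and Physical": 0.37}
object_use = {"Senses and Physical": 0.889}

print(assign_item(listening_response))  # no loading above 0.40, so not retained
print(assign_item(object_use))
```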

Confirmatory Factor Analysis (CFA) Fit Indices

  • The CFA tested several models on Subsample 2 (n = 150).
  • Initial Model (15 items): χ²(df = 87) = 195.097, CFI = 0.86, TLI = 0.84, RMSEA = 0.09.
  • 14-Item Model (excluding Listening Response): χ²(df = 74) = 158.159, CFI = 0.89, TLI = 0.86, RMSEA = 0.08.
  • Final Model (with 14 items and two error covariance paths):
      - χ²(df = 72) = 141.270
      - CFI = 0.91 (adequate fit)
      - TLI = 0.88 (adequate fit)
      - RMSEA = 0.08 (good fit)
      - GFI = 0.89
      - NFI = 0.83
      - AIC = 207.270
      - CAIC = 339.621
  • Paths were added between error terms for items 1 and 2, and items 3 and 4 to improve the model.
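
The degrees of freedom and information criteria above can be checked with a short calculation. The sketch below rests on several assumptions not stated in the source: each item loads on exactly one factor, one loading per factor is fixed to 1 for identification, the three factors are freely correlated, and the criteria follow the common definitions AIC = χ² + 2q and CAIC = χ² + q(ln N + 1), with q the number of free parameters. The function names are illustrative. Under these assumptions the calculation gives df = 72, AIC = 207.270, and CAIC ≈ 339.621 for the final model, matching the reported values.

```python
import math


def free_parameters(n_items, n_factors, n_error_covariances=0):
    """Free parameters in a simple-structure CFA with correlated factors,
    assuming one loading per factor is fixed to 1."""
    loadings = n_items - n_factors
    error_variances = n_items
    factor_variances = n_factors
    factor_covariances = n_factors * (n_factors - 1) // 2
    return (loadings + error_variances + factor_variances
            + factor_covariances + n_error_covariances)


def degrees_of_freedom(n_items, q):
    """Distinct sample variances/covariances minus free parameters."""
    return n_items * (n_items + 1) // 2 - q


def rmsea(chi2, df, n):
    """Point estimate of RMSEA using the (n - 1) convention."""
    return math.sqrt(max(chi2 - df, 0.0) / (df * (n - 1)))


n = 150            # Subsample 2
chi2 = 141.270     # final model chi-square from the source
q = free_parameters(n_items=14, n_factors=3, n_error_covariances=2)
df = degrees_of_freedom(14, q)
aic = chi2 + 2 * q
caic = chi2 + q * (math.log(n) + 1)

print(q, df, round(aic, 3), round(caic, 3), round(rmsea(chi2, df, n), 2))
```

The same parameter count gives df = 87 for the 15-item model and df = 74 for the 14-item model without error covariances, consistent with the values listed for those models.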

Discussion and Significance

  • Relevance to Cultures: Results suggest the CARS-2 factor structure is appropriate for the Arabic context, in line with findings from both Western and non-Western populations, including Swedish, Japanese, and Indian samples.
  • Symptom Domains: The three-factor solution aligns more closely with DSM-5 symptom domains than with those of DSM-IV.
  • Item Ambiguity:
      - Listening Response: Wording such as "use his senses" is vague; clinicians may interpret it as a social/communicative tool or a sensory pattern. Its loading was split between Communications (0.21) and Senses and Physical (0.37).
      - Visual Response: Loaded on Senses and Physical, differing from previous studies (e.g., Moulton et al., 2016) where it loaded on Communications.
      - Level and Consistency of Intellectual Response: Loading on Communications suggests clinicians view intellectual variability as a core feature of the disorder.
  • Cutoff Comparison: The ideal Arabic cutoff of 26 is consistent with Lebanese studies (Akoury-Dirani et al., 2013) but lower than suggested cutoffs in Japan (30) and India (33).

Limitations and Future Directions

  • Sample Bias: The sample was drawn from a specific clinical population, and the study did not compare results against other measures such as the Autism Diagnostic Observation Schedule (ADOS) or the Autism Behavior Checklist (ABC).
  • Test-Retest: Test-retest reliability was not determined in this study.
  • Age Variability: While the wide age range (2-12) is a strength, it may obscure findings specific to narrow age brackets.
  • Future Recommendations:
      - Distinguish more clearly between sensory and social concepts in item terminology.
      - Use concurrent administration of tests specifically for study purposes.
      - Directly compare factors identified across different discrete age groups.