# Psychometric Analysis Tool

<div class="container" id="bkmrk-psychometric-analysi"><header class="header"><div class="subtitle"><span style="color: rgb(0, 0, 0);">A Complete User Guide &amp; Best Practices</span></div></header><nav class="toc">## 📑 Table of Contents

- [Overview](#bkmrk-overview-the-psychom)
- [Getting Started](#bkmrk-getting-started-1-ac)
- [Publication Selection](#bkmrk-publication-selectio)
- [Analysis Types](#bkmrk-analysis-types-%F0%9F%94%AC-ind)
- [Quality Indicators](#bkmrk-quality-indicators-%26)
- [Data Preprocessing](#bkmrk-data-preprocessing-m)
- [Interpreting Results](#bkmrk-interpreting-results)
- [Best Practices](#bkmrk-best-practices-sampl)
- [Troubleshooting](#bkmrk-troubleshooting-comm)
- [Technical Details](#bkmrk-technical-details-st)

</nav><main><section id="bkmrk-overview-the-psychom">## Overview

The **Psychometric Analysis Tool** is a sophisticated statistical analysis component within WASPL that evaluates the quality and reliability of educational assessments. It provides comprehensive psychometric analysis capabilities for educators and researchers to validate their test instruments according to professional measurement standards.

### <span class="emoji">📊</span> Statistical Analysis

Comprehensive reliability analysis using Cronbach's Alpha, item discrimination, difficulty analysis, and item-total correlations.

### <span class="emoji">🎯</span> Quality Assessment

Automated quality indicators with professional thresholds and recommendations for test improvement.

### <span class="emoji">📋</span> Multi-Publication Analysis

Compare multiple test administrations or combine data for robust statistical analysis.

### <span class="emoji">🔍</span> Data Validation

Built-in detection of methodological issues, outliers, and data quality problems.

</section><section id="bkmrk-getting-started-1-ac">## Getting Started

<div class="workflow-steps"><div class="step-card"><div class="step-number">1</div></div></div>### Access the Tool

Navigate to your test in WASPL Editor and select the **Psychometrics** tab. Only tests with EXAM mode publications will show analysis options.

<div class="workflow-steps"><div class="step-card">  
</div><div class="step-card"><div class="step-number">2</div></div></div>### Review Publications

The tool automatically loads all eligible publications. Review the summary statistics and quality indicators for each publication.

<div class="workflow-steps"><div class="step-card">  
</div><div class="step-card"><div class="step-number">3</div></div></div>### Select Data

Choose which publications to include in your analysis. Use quick selection tools or manual selection based on your research needs.

<div class="workflow-steps"><div class="step-card">  
</div><div class="step-card"><div class="step-number">4</div></div></div>### Configure Analysis

Select analysis type (Individual, Grouped, or Comparative) and configure data preprocessing options.

<div class="workflow-steps"><div class="step-card">  
</div><div class="step-card"><div class="step-number">5</div></div></div>### Run Analysis

Execute the psychometric analysis and review the comprehensive results with recommendations.

<div class="workflow-steps"><div class="step-card">  
</div><div class="step-card"><div class="step-number">6</div></div></div>### Export Results

Generate professional reports in PDF format or export raw data for further analysis.

### 💡 Prerequisites

<div class="info-box">- **EXAM Mode Publications**: Only publications in EXAM mode are eligible for psychometric analysis
- **Minimum Sample Size**: At least 10 participants recommended for basic analysis
- **Complete Responses**: Best results require high completion rates (80%+)

</div></section><section id="bkmrk-publication-selectio">## Publication Selection

### Understanding Publication Cards

Each publication is displayed with comprehensive information to help you make informed selection decisions:

<div class="metrics-grid"><div class="metric-card"><div class="metric-value">👥</div><div class="metric-label">Participant Count</div></div></div>Total number of students who attempted the test

<div class="metrics-grid"><div class="metric-card">  
</div><div class="metric-card"><div class="metric-value">✅</div><div class="metric-label">Completion Rate</div></div></div>Percentage of students who completed all items

<div class="metrics-grid"><div class="metric-card">  
</div><div class="metric-card"><div class="metric-value">⏱️</div><div class="metric-label">Average Time</div></div></div>Mean completion time for the assessment

<div class="metrics-grid"><div class="metric-card">  
</div><div class="metric-card"><div class="metric-value">🔍</div><div class="metric-label">Data Quality</div></div></div>Automated detection of anomalies or issues

### Quick Selection Tools

#### <span class="emoji">☑️</span> Select All

Include all available publications for maximum sample size

#### <span class="emoji">🕐</span> Most Recent

Select the 3 most recent publications for current performance analysis

#### <span class="emoji">📈</span> Largest Samples

Choose publications with the highest participant counts for statistical power

### Filtering and Sorting

- **Search Filter**: Find publications by name or keyword
- **Sort Options**: Order by date, participant count, completion rate, or alphabetically
- **Minimum Participants**: Set threshold to filter out small samples

### ⚠️ Sample Size Recommendations

<div class="warning-box">- **N ≥ 100**: Required for robust IRT analysis and factor analysis
- **N ≥ 50**: Minimum for exploratory factor analysis
- **N ≥ 30**: Sufficient for reliable Cronbach's Alpha estimates
- **N &lt; 30**: Limited to basic descriptive statistics

</div></section><section id="bkmrk-analysis-types-%F0%9F%94%AC-ind">## Analysis Types

### <span class="emoji">🔬</span> Individual Analysis

**Purpose**: Analyze each publication separately for comparison

**Use Case**: Compare performance across different administrations, groups, or time periods

**Output**: Separate reliability and item statistics for each publication

### <span class="emoji">📊</span> Grouped Analysis

**Purpose**: Combine all selected publications into one comprehensive analysis

**Use Case**: Maximize sample size for robust statistical estimates

**Output**: Single set of psychometric statistics based on combined data

### <span class="emoji">🔀</span> Comparative Analysis

**Purpose**: Global analysis plus between-group comparisons

**Use Case**: Research studies comparing different populations or conditions

**Output**: Combined statistics plus significance tests between groups

### 💡 Recommendation

**Grouped Analysis** is recommended for most educational applications as it provides the most reliable statistical estimates by maximizing sample size. Use Individual Analysis when you need to compare specific administrations or investigate changes over time.

</section><section id="bkmrk-quality-indicators-%26">## Quality Indicators &amp; Thresholds

### Reliability Categories (Cronbach's Alpha)

#### A - Excellent

α ≥ 0.90

<div class="quality-thresholds"><div class="threshold-card threshold-excellent"><small>Outstanding reliability for high-stakes testing</small></div><div class="threshold-card threshold-good">  
</div></div>#### B - Good

0.80 ≤ α &lt; 0.90

<div class="quality-thresholds"><div class="threshold-card threshold-good"><small>Good reliability for most educational purposes</small></div><div class="threshold-card threshold-acceptable">  
</div></div>#### C - Acceptable

0.70 ≤ α &lt; 0.80

<div class="quality-thresholds"><div class="threshold-card threshold-acceptable"><small>Acceptable for formative assessment</small></div><div class="threshold-card threshold-poor">  
</div></div>#### D - Poor

α &lt; 0.70

<div class="quality-thresholds"><div class="threshold-card threshold-poor"><small>Needs improvement before use</small></div></div>### Item Quality Standards

<table class="spec-table"><thead><tr><th>Metric</th><th>Good</th><th>Acceptable</th><th>Problematic</th><th>Interpretation</th></tr></thead><tbody><tr><td>**Difficulty**</td><td>30-70%</td><td>20-80%</td><td>&lt;20% or &gt;80%</td><td>Percentage of students who answered correctly</td></tr><tr><td>**Discrimination**</td><td>≥0.40</td><td>0.30-0.39</td><td>&lt;0.30</td><td>Ability to distinguish high from low performers</td></tr><tr><td>**Item-Total Correlation**</td><td>≥0.30</td><td>0.20-0.29</td><td>&lt;0.20</td><td>Consistency with overall test performance</td></tr><tr><td>**Point-Biserial**</td><td>≥0.25</td><td>0.15-0.24</td><td>&lt;0.15</td><td>Alternative discrimination measure</td></tr></tbody></table>

### 🎯 Quality Interpretation

<div class="highlight-box">- **Green Items**: Meet or exceed quality standards - retain these items
- **Yellow Items**: Acceptable quality but could be improved
- **Red Items**: Below standards - consider revision or removal

</div></section><section id="bkmrk-data-preprocessing-m">## Data Preprocessing

### Methodological Issue Detection

The tool automatically identifies common methodological issues that can affect analysis validity:

### <span class="emoji">🔄</span> Multiple Attempts

**Issue**: Students taking the test multiple times

**Impact**: Learning effects, violation of independence

**Solution**: Use only first attempts or best attempts

### <span class="emoji">⚠️</span> Incomplete Data

**Issue**: Students who didn't complete the test

**Impact**: Selection bias, reduced statistical power

**Solution**: Exclude incomplete responses or use imputation

### <span class="emoji">📈</span> Sample Size

**Issue**: Insufficient sample size for chosen analysis

**Impact**: Unreliable estimates, reduced power

**Solution**: Combine publications or limit analysis scope

### <span class="emoji">⏱️</span> Timing Anomalies

**Issue**: Extremely fast or slow completion times

**Impact**: Invalid response patterns

**Solution**: Automatic outlier detection and exclusion

### Quality Control Options

- **Multiple Attempts Exclusion**: Automatically keep only first attempts
- **Completion Threshold**: Set minimum percentage of items completed
- **Timing Filters**: Remove responses with suspicious timing patterns
- **Response Pattern Analysis**: Detect random or non-engaged responding

### ⚠️ Statistical Assumptions

Psychometric analysis assumes:

<div class="warning-box">- Independence of observations (no collaboration)
- Unidimensional measurement (items measure the same construct)
- Sufficient sample size for stable estimates
- Honest responding (students trying their best)

</div></section><section id="bkmrk-interpreting-results">## Interpreting Results

### Overall Test Quality

The analysis provides an overall grade (A-D) based on multiple quality indicators:

#### 📊 Analysis Results Overview

**Overall Grade:** B (Good Quality)

**Cronbach's Alpha:** 0.84 (Good Reliability)

**Sample Size:** 156 participants

**Items Analysis:** 12 Good, 6 Acceptable, 2 Problematic

### Item-Level Analysis

Each test item receives detailed statistical analysis:

<table class="spec-table"><thead><tr><th>Item</th><th>Difficulty</th><th>Discrimination</th><th>Item-Total r</th><th>Status</th><th>Recommendation</th></tr></thead><tbody><tr><td>Item 1</td><td>65%</td><td>0.45</td><td>0.42</td><td style="color: var(--success-color);">✓ Good</td><td>Retain - excellent quality</td></tr><tr><td>Item 2</td><td>35%</td><td>0.32</td><td>0.28</td><td style="color: var(--warning-color);">⚠ Acceptable</td><td>Consider slight revision</td></tr><tr><td>Item 3</td><td>15%</td><td>0.18</td><td>0.12</td><td style="color: var(--danger-color);">✗ Problematic</td><td>Review or remove - too difficult</td></tr></tbody></table>

### Recommendations

#### ✅ Actions for Test Improvement

<div class="success-box">- **Retain high-quality items** (discrimination ≥ 0.40)
- **Revise problematic items** with low discrimination or extreme difficulty
- **Consider removing items** that don't contribute to test reliability
- **Add more items** if overall reliability is below 0.80

</div></section><section id="bkmrk-best-practices-sampl">## Best Practices

### Sample Size Guidelines

#### 🎯 For Classroom Assessment

<div class="analysis-types"><div class="analysis-type">- Minimum N = 20 for basic reliability
- Target N = 30+ for stable estimates
- Combine classes when possible

</div><div class="analysis-type">  
</div></div>#### 🔬 For Research Studies

<div class="analysis-types"><div class="analysis-type">- Minimum N = 100 for IRT analysis
- Target N = 200+ for complex models
- Power analysis for group comparisons

</div><div class="analysis-type">  
</div></div>#### 📊 For High-Stakes Testing

<div class="analysis-types"><div class="analysis-type">- Target N = 500+ for operational use
- Multiple field test administrations
- Cross-validation with independent samples

</div></div>### Data Quality Checklist

#### ✓ Before Running Analysis

<div class="highlight-box">- Verify test was administered under standardized conditions
- Check for adequate completion rates (&gt;80% recommended)
- Review timing data for suspicious patterns
- Ensure sample represents intended population
- Document any special circumstances during administration

</div>### Interpreting Low Reliability

#### 🔍 Common Causes of Poor Reliability

<div class="warning-box">- **Too few items**: Reliability increases with test length
- **Heterogeneous content**: Items measuring different constructs
- **Poor item quality**: Items with low discrimination
- **Inappropriate difficulty**: Items too easy or too hard
- **Small sample size**: Unstable estimates with N &lt; 30

</div></section><section id="bkmrk-troubleshooting-comm">## Troubleshooting

### Common Issues and Solutions

### <span class="emoji">❌</span> No Publications Available

**Cause**: Only EXAM mode publications are eligible

**Solution**: Ensure test has been published in EXAM mode with student data

### <span class="emoji">⚠️</span> Analysis Fails

**Cause**: Insufficient data or computational error

**Solution**: Check sample size, data completeness, and try simpler analysis

### <span class="emoji">📊</span> Unrealistic Results

**Cause**: Data quality issues or methodological problems

**Solution**: Review preprocessing options and data collection procedures

### <span class="emoji">🐌</span> Slow Performance

**Cause**: Large datasets or complex analysis

**Solution**: Reduce sample size or simplify analysis type

### Error Messages

<table class="spec-table"><thead><tr><th>Error</th><th>Meaning</th><th>Solution</th></tr></thead><tbody><tr><td>"Insufficient data"</td><td>Sample size too small</td><td>Select more publications or reduce analysis complexity</td></tr><tr><td>"No variance in responses"</td><td>All students gave same answers</td><td>Check item difficulty and administration conditions</td></tr><tr><td>"Matrix not positive definite"</td><td>Correlation matrix issues</td><td>Remove problematic items or increase sample size</td></tr><tr><td>"Analysis timeout"</td><td>Computation took too long</td><td>Reduce sample size or contact support</td></tr></tbody></table>

</section><section id="bkmrk-technical-details-st">## Technical Details

### Statistical Methods

<table class="spec-table"><thead><tr><th>Metric</th><th>Formula/Method</th><th>Purpose</th></tr></thead><tbody><tr><td>Cronbach's Alpha</td><td>α = (k/(k-1)) × (1 - Σσᵢ²/σₓ²)</td><td>Internal consistency reliability</td></tr><tr><td>Item Difficulty</td><td>p = Number correct / Total attempts</td><td>Proportion of students answering correctly</td></tr><tr><td>Item Discrimination</td><td>Point-biserial correlation</td><td>Ability to differentiate performance levels</td></tr><tr><td>Item-Total Correlation</td><td>Corrected correlation (item removed from total)</td><td>Consistency with overall performance</td></tr></tbody></table>

### Computational Features

- **Missing Data Handling**: Listwise deletion or pairwise correlations
- **Outlier Detection**: Z-score and timing-based filtering
- **Bootstrap Confidence Intervals**: For reliability estimates
- **Effect Size Calculations**: Cohen's d for group comparisons

### Export Formats

#### 📄 PDF Report

Professional formatted report with all statistics, charts, and recommendations

#### 📊 JSON Data

Raw statistical output for integration with other tools or custom analysis

#### 📈 CSV Export

Item-level statistics for spreadsheet analysis or graphing

### 🔧 Integration with WASPL

<div class="info-box">- **Test Repository**: Pulls item information and test structure
- **Results Database**: Accesses student response data
- **User Authentication**: Integrated with WASPL security system
- **Publication System**: Links to test administration records

</div></section></main><footer class="footer">This tool follows established psychometric standards and guidelines from organizations such as AERA, APA, and NCME.

**WASPL Platform** | Psychometric Analysis Guide Version 1.0 | Last Updated: June 2025

</footer></div>