# Psychometrics in WASPL

WASPL has a psychometric tool to estimate the quality of item and test. It helps also to measure the item difficulty level for CAT test creation

# Psychometric Data Generator

This tools generates fake data to simulate the definition levels. It is a calibration tool.

# Psychometric Data Generator - User Guide

<div class="container" id="bkmrk-psychometric-data-ge"><header class="header"># Psychometric Data Generator

<div class="subtitle">User Guide &amp; Technical Reference</div></header><nav class="toc">## 📑 Table of Contents

- [Overview](#bkmrk-overview-the-psychom)
- [Purpose and Applications](#bkmrk-purpose-and-applicat)
- [What the Generator Creates](#bkmrk-what-the-generator-c)
- [Quick Start Presets](#bkmrk-quick-start-presets-)
- [Expert Mode Configuration](#bkmrk-expert-mode-configur)
- [Cronbach's Alpha Categories](#bkmrk-cronbach%27s-alpha-cat)
- [Generation Process](#bkmrk-generation-process-1)
- [Technical Specifications](#bkmrk-technical-specificat)
- [Best Practices](#bkmrk-best-practices-for-d)
- [Troubleshooting](#bkmrk-troubleshooting-comm)
- [Integration with WASPL](#bkmrk-integration-with-was)

</nav><main><section id="bkmrk-overview-the-psychom">## Overview

The **Psychometric Data Generator** is a powerful tool designed to create realistic test datasets with valid psychometric metrics for WASPL assessments. This tool generates simulated student responses that maintain statistically sound characteristics, making it ideal for testing, demonstrations, training, and quality validation.

</section><section id="bkmrk-purpose-and-applicat">## Purpose and Applications

### Primary Uses

- **Testing &amp; Validation**: Generate datasets to test WASPL's analytical capabilities
- **Demonstrations**: Create realistic data for showcasing platform features
- **Training**: Provide educational datasets for learning psychometric concepts
- **Quality Assurance**: Test detection algorithms with known data characteristics
- **Research**: Generate controlled datasets for psychometric research

### Key Benefits

- **Realistic Data**: Simulated responses follow actual response patterns
- **Controlled Quality**: Target specific reliability coefficients (Cronbach's α)
- **Instant Generation**: Create datasets in seconds rather than months
- **Educational Value**: Understand the relationship between item quality and test reliability

</section><section id="bkmrk-what-the-generator-c">## What the Generator Creates

The Psychometric Data Generator produces:

### 1. Student Response Data

- **Individual Responses**: Simulated answers for each student to each test item
- **Response Patterns**: Realistic distribution following Item Response Theory (IRT)
- **Consistency Modeling**: Variable response consistency based on student ability

### 2. Psychometric Metrics

- **Cronbach's Alpha**: Test reliability coefficient (internal consistency)
- **Item Discrimination**: How well items differentiate between students
- **Item Difficulty**: Distribution of item difficulty parameters
- **Response Timing**: Realistic completion times per item

### 3. Statistical Properties

- **Score Distribution**: Normal or custom distributions of total scores
- **Item-Total Correlations**: Relationships between item and total performance
- **Standard Errors**: Measurement precision indicators
- **Missing Data**: Realistic patterns of incomplete responses

</section><section id="bkmrk-quick-start-presets-">## Quick Start Presets

The generator offers three pre-configured presets for immediate use:

### <span class="emoji">🎯</span> Realistic Demo

<div class="preset-card">- **Target**: α ≥ 0.85 (Grade B)
- **Quality**: High-quality items (80% good items)
- **Use Case**: Professional demonstrations and standard testing
- **Characteristics**: Balanced difficulty, good discrimination

</div>### <span class="emoji">🔍</span> Detection Test

<div class="preset-card">- **Target**: α ≈ 0.40 (Grade D)
- **Quality**: Mixed quality with problematic items
- **Use Case**: Testing quality detection algorithms
- **Characteristics**: Includes poor items, low reliability

</div>### <span class="emoji">📚</span> Educational Training

<div class="preset-card">- **Target**: α ≥ 0.75 (Grade C)
- **Quality**: Acceptable quality for learning
- **Use Case**: Training and educational purposes
- **Characteristics**: Moderate quality, instructional value

</div></section><section id="bkmrk-expert-mode-configur">## Expert Mode Configuration

For advanced users, Expert Mode provides full control over generation parameters:

### Core Parameters

- **Target Cronbach's Alpha**: Set desired reliability (0.5 - 0.95)
- **Minimum Discrimination**: Item quality threshold (0.1 - 0.6)
- **Response Consistency**: Student behavior variability (0.1 - 0.8)
- **Sample Size**: Number of students to simulate
- **Missing Data Rate**: Percentage of incomplete responses

### Advanced Options

- **Timing Generation**: Include realistic completion times
- **Debug Mode**: Additional diagnostic information
- **Custom Distributions**: Specify ability and difficulty distributions

</section><section id="bkmrk-cronbach%27s-alpha-cat">## Cronbach's Alpha Categories (A, B, C, D)

The generator uses standard psychometric thresholds to categorize test reliability:

### Category A - Excellent <span class="badge badge-success">α ≥ 0.9</span>

<div class="category-grid"><div class="category-card category-a">- **Interpretation**: Outstanding reliability
- **Suitable For**: High-stakes testing, certification exams
- **Characteristics**: Very consistent measurement, minimal measurement error

</div><div class="category-card category-b">  
</div></div>### Category B - Good <span class="badge badge-info">0.8 ≤ α &lt; 0.9</span>

<div class="category-grid"><div class="category-card category-b">- **Interpretation**: Good reliability
- **Suitable For**: Most educational assessments, research
- **Characteristics**: Reliable measurement with acceptable error

</div><div class="category-card category-c">  
</div></div>### Category C - Acceptable <span class="badge badge-warning">0.7 ≤ α &lt; 0.8</span>

<div class="category-grid"><div class="category-card category-c">- **Interpretation**: Acceptable reliability
- **Suitable For**: Formative assessment, initial testing
- **Characteristics**: Adequate for most purposes, some measurement error

</div><div class="category-card category-d">  
</div></div>### Category D - Insufficient <span class="badge badge-danger">α &lt; 0.7</span>

<div class="category-grid"><div class="category-card category-d">- **Interpretation**: Poor reliability
- **Suitable For**: Pilot testing, diagnostic purposes only
- **Characteristics**: High measurement error, results should be interpreted cautiously

</div></div></section><section id="bkmrk-generation-process-1">## Generation Process

<div class="process-steps"><div class="step-card"><div class="step-number">1</div></div></div>### Configuration

<div class="process-steps"><div class="step-card">1. Select a Quick Start preset or choose Expert Mode
2. Configure generation parameters
3. Select target test and publication(s)
4. Review settings and estimated generation time

</div><div class="step-card"><div class="step-number">2</div></div></div>### Validation

<div class="process-steps"><div class="step-card">- System validates configuration parameters
- Checks for realistic parameter combinations
- Estimates generation time and resource requirements

</div><div class="step-card"><div class="step-number">3</div></div></div>### Generation

<div class="process-steps"><div class="step-card">- Creates simulated response matrix
- Applies psychometric models (IRT/CTT)
- Calculates reliability and item statistics
- Generates timing data (if enabled)

</div><div class="step-card"><div class="step-number">4</div></div></div>### Results

<div class="process-steps"><div class="step-card">- Displays generation summary
- Shows achieved vs. target metrics
- Provides data quality indicators
- Saves results to selected publication(s)

</div></div></section><section id="bkmrk-technical-specificat">## Technical Specifications

### Supported Models

<table class="spec-table"><thead><tr><th>Model</th><th>Description</th><th>Use Case</th></tr></thead><tbody><tr><td>**Classical Test Theory (CTT)**</td><td>Traditional reliability analysis</td><td>Standard psychometric evaluation</td></tr><tr><td>**Item Response Theory (IRT)**</td><td>Modern psychometric modeling</td><td>Advanced measurement precision</td></tr><tr><td>**Rasch Model**</td><td>Specific IRT implementation for dichotomous items</td><td>Educational assessment</td></tr></tbody></table>

### Data Format

- **Response Matrix**: Students × Items binary/polytomous responses
- **Metadata**: Student IDs, item parameters, session information
- **Timing Data**: Response times in milliseconds
- **Quality Metrics**: Comprehensive psychometric statistics

### Performance

<table class="spec-table"><thead><tr><th>Dataset Size</th><th>Student Count</th><th>Generation Time</th></tr></thead><tbody><tr><td>Small Datasets</td><td>&lt; 50 students</td><td>&lt; 1 second</td></tr><tr><td>Medium Datasets</td><td>50-200 students</td><td>1-2 seconds</td></tr><tr><td>Large Datasets</td><td>200+ students</td><td>2-5 seconds</td></tr></tbody></table>

</section><section id="bkmrk-best-practices-for-d">## Best Practices

### For Demonstrations

<div class="highlight-box">- Use "Realistic Demo" preset
- Target α ≥ 0.85 for professional appearance
- Include timing data for realistic simulation

</div>### For Testing &amp; QA

<div class="info-box">- Use "Detection Test" preset for algorithm validation
- Mix high and low quality items
- Test edge cases with extreme parameters

</div>### For Training

<div class="warning-box">- Use "Educational Training" preset
- Show progression from poor to excellent reliability
- Demonstrate impact of item quality on overall test reliability

</div>### For Research

<div class="highlight-box">- Use Expert Mode for precise control
- Document all parameter settings
- Validate against real data when possible

</div></section><section id="bkmrk-troubleshooting-comm">## Troubleshooting

### Common Issues

- **Generation Fails**: Check parameter ranges and test selection
- **Poor Quality Results**: Adjust discrimination thresholds
- **Unrealistic Data**: Review consistency and timing parameters

### Performance Optimization

- Limit student count for faster generation
- Disable timing data if not needed
- Use appropriate quality thresholds

</section><section id="bkmrk-integration-with-was">## Integration with WASPL

The generated data integrates seamlessly with:

- **Results Analysis**: Full psychometric reporting
- **CAT System**: Adaptive testing calibration
- **Quality Dashboard**: Real-time monitoring
- **Export Functions**: Multiple format support

</section></main><footer class="footer">This tool is part of the WASPL Developer Tools suite, designed to support comprehensive assessment development and validation workflows.

**WASPL Platform** | Documentation Version 1.0 | Last Updated: June 2025

</footer></div>

# Psychometric Analysis Tool

<div class="container" id="bkmrk-psychometric-analysi"><header class="header"><div class="subtitle"><span style="color: rgb(0, 0, 0);">A Complete User Guide &amp; Best Practices</span></div></header><nav class="toc">## 📑 Table of Contents

- [Overview](#bkmrk-overview-the-psychom)
- [Getting Started](#bkmrk-getting-started-1-ac)
- [Publication Selection](#bkmrk-publication-selectio)
- [Analysis Types](#bkmrk-analysis-types-%F0%9F%94%AC-ind)
- [Quality Indicators](#bkmrk-quality-indicators-%26)
- [Data Preprocessing](#bkmrk-data-preprocessing-m)
- [Interpreting Results](#bkmrk-interpreting-results)
- [Best Practices](#bkmrk-best-practices-sampl)
- [Troubleshooting](#bkmrk-troubleshooting-comm)
- [Technical Details](#bkmrk-technical-details-st)

</nav><main><section id="bkmrk-overview-the-psychom">## Overview

The **Psychometric Analysis Tool** is a sophisticated statistical analysis component within WASPL that evaluates the quality and reliability of educational assessments. It provides comprehensive psychometric analysis capabilities for educators and researchers to validate their test instruments according to professional measurement standards.

### <span class="emoji">📊</span> Statistical Analysis

Comprehensive reliability analysis using Cronbach's Alpha, item discrimination, difficulty analysis, and item-total correlations.

### <span class="emoji">🎯</span> Quality Assessment

Automated quality indicators with professional thresholds and recommendations for test improvement.

### <span class="emoji">📋</span> Multi-Publication Analysis

Compare multiple test administrations or combine data for robust statistical analysis.

### <span class="emoji">🔍</span> Data Validation

Built-in detection of methodological issues, outliers, and data quality problems.

</section><section id="bkmrk-getting-started-1-ac">## Getting Started

<div class="workflow-steps"><div class="step-card"><div class="step-number">1</div></div></div>### Access the Tool

Navigate to your test in WASPL Editor and select the **Psychometrics** tab. Only tests with EXAM mode publications will show analysis options.

<div class="workflow-steps"><div class="step-card">  
</div><div class="step-card"><div class="step-number">2</div></div></div>### Review Publications

The tool automatically loads all eligible publications. Review the summary statistics and quality indicators for each publication.

<div class="workflow-steps"><div class="step-card">  
</div><div class="step-card"><div class="step-number">3</div></div></div>### Select Data

Choose which publications to include in your analysis. Use quick selection tools or manual selection based on your research needs.

<div class="workflow-steps"><div class="step-card">  
</div><div class="step-card"><div class="step-number">4</div></div></div>### Configure Analysis

Select analysis type (Individual, Grouped, or Comparative) and configure data preprocessing options.

<div class="workflow-steps"><div class="step-card">  
</div><div class="step-card"><div class="step-number">5</div></div></div>### Run Analysis

Execute the psychometric analysis and review the comprehensive results with recommendations.

<div class="workflow-steps"><div class="step-card">  
</div><div class="step-card"><div class="step-number">6</div></div></div>### Export Results

Generate professional reports in PDF format or export raw data for further analysis.

### 💡 Prerequisites

<div class="info-box">- **EXAM Mode Publications**: Only publications in EXAM mode are eligible for psychometric analysis
- **Minimum Sample Size**: At least 10 participants recommended for basic analysis
- **Complete Responses**: Best results require high completion rates (80%+)

</div></section><section id="bkmrk-publication-selectio">## Publication Selection

### Understanding Publication Cards

Each publication is displayed with comprehensive information to help you make informed selection decisions:

<div class="metrics-grid"><div class="metric-card"><div class="metric-value">👥</div><div class="metric-label">Participant Count</div></div></div>Total number of students who attempted the test

<div class="metrics-grid"><div class="metric-card">  
</div><div class="metric-card"><div class="metric-value">✅</div><div class="metric-label">Completion Rate</div></div></div>Percentage of students who completed all items

<div class="metrics-grid"><div class="metric-card">  
</div><div class="metric-card"><div class="metric-value">⏱️</div><div class="metric-label">Average Time</div></div></div>Mean completion time for the assessment

<div class="metrics-grid"><div class="metric-card">  
</div><div class="metric-card"><div class="metric-value">🔍</div><div class="metric-label">Data Quality</div></div></div>Automated detection of anomalies or issues

### Quick Selection Tools

#### <span class="emoji">☑️</span> Select All

Include all available publications for maximum sample size

#### <span class="emoji">🕐</span> Most Recent

Select the 3 most recent publications for current performance analysis

#### <span class="emoji">📈</span> Largest Samples

Choose publications with the highest participant counts for statistical power

### Filtering and Sorting

- **Search Filter**: Find publications by name or keyword
- **Sort Options**: Order by date, participant count, completion rate, or alphabetically
- **Minimum Participants**: Set threshold to filter out small samples

### ⚠️ Sample Size Recommendations

<div class="warning-box">- **N ≥ 100**: Required for robust IRT analysis and factor analysis
- **N ≥ 50**: Minimum for exploratory factor analysis
- **N ≥ 30**: Sufficient for reliable Cronbach's Alpha estimates
- **N &lt; 30**: Limited to basic descriptive statistics

</div></section><section id="bkmrk-analysis-types-%F0%9F%94%AC-ind">## Analysis Types

### <span class="emoji">🔬</span> Individual Analysis

**Purpose**: Analyze each publication separately for comparison

**Use Case**: Compare performance across different administrations, groups, or time periods

**Output**: Separate reliability and item statistics for each publication

### <span class="emoji">📊</span> Grouped Analysis

**Purpose**: Combine all selected publications into one comprehensive analysis

**Use Case**: Maximize sample size for robust statistical estimates

**Output**: Single set of psychometric statistics based on combined data

### <span class="emoji">🔀</span> Comparative Analysis

**Purpose**: Global analysis plus between-group comparisons

**Use Case**: Research studies comparing different populations or conditions

**Output**: Combined statistics plus significance tests between groups

### 💡 Recommendation

**Grouped Analysis** is recommended for most educational applications as it provides the most reliable statistical estimates by maximizing sample size. Use Individual Analysis when you need to compare specific administrations or investigate changes over time.

</section><section id="bkmrk-quality-indicators-%26">## Quality Indicators &amp; Thresholds

### Reliability Categories (Cronbach's Alpha)

#### A - Excellent

α ≥ 0.90

<div class="quality-thresholds"><div class="threshold-card threshold-excellent"><small>Outstanding reliability for high-stakes testing</small></div><div class="threshold-card threshold-good">  
</div></div>#### B - Good

0.80 ≤ α &lt; 0.90

<div class="quality-thresholds"><div class="threshold-card threshold-good"><small>Good reliability for most educational purposes</small></div><div class="threshold-card threshold-acceptable">  
</div></div>#### C - Acceptable

0.70 ≤ α &lt; 0.80

<div class="quality-thresholds"><div class="threshold-card threshold-acceptable"><small>Acceptable for formative assessment</small></div><div class="threshold-card threshold-poor">  
</div></div>#### D - Poor

α &lt; 0.70

<div class="quality-thresholds"><div class="threshold-card threshold-poor"><small>Needs improvement before use</small></div></div>### Item Quality Standards

<table class="spec-table"><thead><tr><th>Metric</th><th>Good</th><th>Acceptable</th><th>Problematic</th><th>Interpretation</th></tr></thead><tbody><tr><td>**Difficulty**</td><td>30-70%</td><td>20-80%</td><td>&lt;20% or &gt;80%</td><td>Percentage of students who answered correctly</td></tr><tr><td>**Discrimination**</td><td>≥0.40</td><td>0.30-0.39</td><td>&lt;0.30</td><td>Ability to distinguish high from low performers</td></tr><tr><td>**Item-Total Correlation**</td><td>≥0.30</td><td>0.20-0.29</td><td>&lt;0.20</td><td>Consistency with overall test performance</td></tr><tr><td>**Point-Biserial**</td><td>≥0.25</td><td>0.15-0.24</td><td>&lt;0.15</td><td>Alternative discrimination measure</td></tr></tbody></table>

### 🎯 Quality Interpretation

<div class="highlight-box">- **Green Items**: Meet or exceed quality standards - retain these items
- **Yellow Items**: Acceptable quality but could be improved
- **Red Items**: Below standards - consider revision or removal

</div></section><section id="bkmrk-data-preprocessing-m">## Data Preprocessing

### Methodological Issue Detection

The tool automatically identifies common methodological issues that can affect analysis validity:

### <span class="emoji">🔄</span> Multiple Attempts

**Issue**: Students taking the test multiple times

**Impact**: Learning effects, violation of independence

**Solution**: Use only first attempts or best attempts

### <span class="emoji">⚠️</span> Incomplete Data

**Issue**: Students who didn't complete the test

**Impact**: Selection bias, reduced statistical power

**Solution**: Exclude incomplete responses or use imputation

### <span class="emoji">📈</span> Sample Size

**Issue**: Insufficient sample size for chosen analysis

**Impact**: Unreliable estimates, reduced power

**Solution**: Combine publications or limit analysis scope

### <span class="emoji">⏱️</span> Timing Anomalies

**Issue**: Extremely fast or slow completion times

**Impact**: Invalid response patterns

**Solution**: Automatic outlier detection and exclusion

### Quality Control Options

- **Multiple Attempts Exclusion**: Automatically keep only first attempts
- **Completion Threshold**: Set minimum percentage of items completed
- **Timing Filters**: Remove responses with suspicious timing patterns
- **Response Pattern Analysis**: Detect random or non-engaged responding

### ⚠️ Statistical Assumptions

Psychometric analysis assumes:

<div class="warning-box">- Independence of observations (no collaboration)
- Unidimensional measurement (items measure the same construct)
- Sufficient sample size for stable estimates
- Honest responding (students trying their best)

</div></section><section id="bkmrk-interpreting-results">## Interpreting Results

### Overall Test Quality

The analysis provides an overall grade (A-D) based on multiple quality indicators:

#### 📊 Analysis Results Overview

**Overall Grade:** B (Good Quality)

**Cronbach's Alpha:** 0.84 (Good Reliability)

**Sample Size:** 156 participants

**Items Analysis:** 12 Good, 6 Acceptable, 2 Problematic

### Item-Level Analysis

Each test item receives detailed statistical analysis:

<table class="spec-table"><thead><tr><th>Item</th><th>Difficulty</th><th>Discrimination</th><th>Item-Total r</th><th>Status</th><th>Recommendation</th></tr></thead><tbody><tr><td>Item 1</td><td>65%</td><td>0.45</td><td>0.42</td><td style="color: var(--success-color);">✓ Good</td><td>Retain - excellent quality</td></tr><tr><td>Item 2</td><td>35%</td><td>0.32</td><td>0.28</td><td style="color: var(--warning-color);">⚠ Acceptable</td><td>Consider slight revision</td></tr><tr><td>Item 3</td><td>15%</td><td>0.18</td><td>0.12</td><td style="color: var(--danger-color);">✗ Problematic</td><td>Review or remove - too difficult</td></tr></tbody></table>

### Recommendations

#### ✅ Actions for Test Improvement

<div class="success-box">- **Retain high-quality items** (discrimination ≥ 0.40)
- **Revise problematic items** with low discrimination or extreme difficulty
- **Consider removing items** that don't contribute to test reliability
- **Add more items** if overall reliability is below 0.80

</div></section><section id="bkmrk-best-practices-sampl">## Best Practices

### Sample Size Guidelines

#### 🎯 For Classroom Assessment

<div class="analysis-types"><div class="analysis-type">- Minimum N = 20 for basic reliability
- Target N = 30+ for stable estimates
- Combine classes when possible

</div><div class="analysis-type">  
</div></div>#### 🔬 For Research Studies

<div class="analysis-types"><div class="analysis-type">- Minimum N = 100 for IRT analysis
- Target N = 200+ for complex models
- Power analysis for group comparisons

</div><div class="analysis-type">  
</div></div>#### 📊 For High-Stakes Testing

<div class="analysis-types"><div class="analysis-type">- Target N = 500+ for operational use
- Multiple field test administrations
- Cross-validation with independent samples

</div></div>### Data Quality Checklist

#### ✓ Before Running Analysis

<div class="highlight-box">- Verify test was administered under standardized conditions
- Check for adequate completion rates (&gt;80% recommended)
- Review timing data for suspicious patterns
- Ensure sample represents intended population
- Document any special circumstances during administration

</div>### Interpreting Low Reliability

#### 🔍 Common Causes of Poor Reliability

<div class="warning-box">- **Too few items**: Reliability increases with test length
- **Heterogeneous content**: Items measuring different constructs
- **Poor item quality**: Items with low discrimination
- **Inappropriate difficulty**: Items too easy or too hard
- **Small sample size**: Unstable estimates with N &lt; 30

</div></section><section id="bkmrk-troubleshooting-comm">## Troubleshooting

### Common Issues and Solutions

### <span class="emoji">❌</span> No Publications Available

**Cause**: Only EXAM mode publications are eligible

**Solution**: Ensure test has been published in EXAM mode with student data

### <span class="emoji">⚠️</span> Analysis Fails

**Cause**: Insufficient data or computational error

**Solution**: Check sample size, data completeness, and try simpler analysis

### <span class="emoji">📊</span> Unrealistic Results

**Cause**: Data quality issues or methodological problems

**Solution**: Review preprocessing options and data collection procedures

### <span class="emoji">🐌</span> Slow Performance

**Cause**: Large datasets or complex analysis

**Solution**: Reduce sample size or simplify analysis type

### Error Messages

<table class="spec-table"><thead><tr><th>Error</th><th>Meaning</th><th>Solution</th></tr></thead><tbody><tr><td>"Insufficient data"</td><td>Sample size too small</td><td>Select more publications or reduce analysis complexity</td></tr><tr><td>"No variance in responses"</td><td>All students gave same answers</td><td>Check item difficulty and administration conditions</td></tr><tr><td>"Matrix not positive definite"</td><td>Correlation matrix issues</td><td>Remove problematic items or increase sample size</td></tr><tr><td>"Analysis timeout"</td><td>Computation took too long</td><td>Reduce sample size or contact support</td></tr></tbody></table>

</section><section id="bkmrk-technical-details-st">## Technical Details

### Statistical Methods

<table class="spec-table"><thead><tr><th>Metric</th><th>Formula/Method</th><th>Purpose</th></tr></thead><tbody><tr><td>Cronbach's Alpha</td><td>α = (k/(k-1)) × (1 - Σσᵢ²/σₓ²)</td><td>Internal consistency reliability</td></tr><tr><td>Item Difficulty</td><td>p = Number correct / Total attempts</td><td>Proportion of students answering correctly</td></tr><tr><td>Item Discrimination</td><td>Point-biserial correlation</td><td>Ability to differentiate performance levels</td></tr><tr><td>Item-Total Correlation</td><td>Corrected correlation (item removed from total)</td><td>Consistency with overall performance</td></tr></tbody></table>

### Computational Features

- **Missing Data Handling**: Listwise deletion or pairwise correlations
- **Outlier Detection**: Z-score and timing-based filtering
- **Bootstrap Confidence Intervals**: For reliability estimates
- **Effect Size Calculations**: Cohen's d for group comparisons

### Export Formats

#### 📄 PDF Report

Professional formatted report with all statistics, charts, and recommendations

#### 📊 JSON Data

Raw statistical output for integration with other tools or custom analysis

#### 📈 CSV Export

Item-level statistics for spreadsheet analysis or graphing

### 🔧 Integration with WASPL

<div class="info-box">- **Test Repository**: Pulls item information and test structure
- **Results Database**: Accesses student response data
- **User Authentication**: Integrated with WASPL security system
- **Publication System**: Links to test administration records

</div></section></main><footer class="footer">This tool follows established psychometric standards and guidelines from organizations such as AERA, APA, and NCME.

**WASPL Platform** | Psychometric Analysis Guide Version 1.0 | Last Updated: June 2025

</footer></div>