# Psychometric Data Generator

This tools generates fake data to simulate the definition levels. It is a calibration tool.

# Psychometric Data Generator - User Guide

<div class="container" id="bkmrk-psychometric-data-ge"><header class="header"># Psychometric Data Generator

<div class="subtitle">User Guide &amp; Technical Reference</div></header><nav class="toc">## 📑 Table of Contents

- [Overview](#bkmrk-overview-the-psychom)
- [Purpose and Applications](#bkmrk-purpose-and-applicat)
- [What the Generator Creates](#bkmrk-what-the-generator-c)
- [Quick Start Presets](#bkmrk-quick-start-presets-)
- [Expert Mode Configuration](#bkmrk-expert-mode-configur)
- [Cronbach's Alpha Categories](#bkmrk-cronbach%27s-alpha-cat)
- [Generation Process](#bkmrk-generation-process-1)
- [Technical Specifications](#bkmrk-technical-specificat)
- [Best Practices](#bkmrk-best-practices-for-d)
- [Troubleshooting](#bkmrk-troubleshooting-comm)
- [Integration with WASPL](#bkmrk-integration-with-was)

</nav><main><section id="bkmrk-overview-the-psychom">## Overview

The **Psychometric Data Generator** is a powerful tool designed to create realistic test datasets with valid psychometric metrics for WASPL assessments. This tool generates simulated student responses that maintain statistically sound characteristics, making it ideal for testing, demonstrations, training, and quality validation.

</section><section id="bkmrk-purpose-and-applicat">## Purpose and Applications

### Primary Uses

- **Testing &amp; Validation**: Generate datasets to test WASPL's analytical capabilities
- **Demonstrations**: Create realistic data for showcasing platform features
- **Training**: Provide educational datasets for learning psychometric concepts
- **Quality Assurance**: Test detection algorithms with known data characteristics
- **Research**: Generate controlled datasets for psychometric research

### Key Benefits

- **Realistic Data**: Simulated responses follow actual response patterns
- **Controlled Quality**: Target specific reliability coefficients (Cronbach's α)
- **Instant Generation**: Create datasets in seconds rather than months
- **Educational Value**: Understand the relationship between item quality and test reliability

</section><section id="bkmrk-what-the-generator-c">## What the Generator Creates

The Psychometric Data Generator produces:

### 1. Student Response Data

- **Individual Responses**: Simulated answers for each student to each test item
- **Response Patterns**: Realistic distribution following Item Response Theory (IRT)
- **Consistency Modeling**: Variable response consistency based on student ability

### 2. Psychometric Metrics

- **Cronbach's Alpha**: Test reliability coefficient (internal consistency)
- **Item Discrimination**: How well items differentiate between students
- **Item Difficulty**: Distribution of item difficulty parameters
- **Response Timing**: Realistic completion times per item

### 3. Statistical Properties

- **Score Distribution**: Normal or custom distributions of total scores
- **Item-Total Correlations**: Relationships between item and total performance
- **Standard Errors**: Measurement precision indicators
- **Missing Data**: Realistic patterns of incomplete responses

</section><section id="bkmrk-quick-start-presets-">## Quick Start Presets

The generator offers three pre-configured presets for immediate use:

### <span class="emoji">🎯</span> Realistic Demo

<div class="preset-card">- **Target**: α ≥ 0.85 (Grade B)
- **Quality**: High-quality items (80% good items)
- **Use Case**: Professional demonstrations and standard testing
- **Characteristics**: Balanced difficulty, good discrimination

</div>### <span class="emoji">🔍</span> Detection Test

<div class="preset-card">- **Target**: α ≈ 0.40 (Grade D)
- **Quality**: Mixed quality with problematic items
- **Use Case**: Testing quality detection algorithms
- **Characteristics**: Includes poor items, low reliability

</div>### <span class="emoji">📚</span> Educational Training

<div class="preset-card">- **Target**: α ≥ 0.75 (Grade C)
- **Quality**: Acceptable quality for learning
- **Use Case**: Training and educational purposes
- **Characteristics**: Moderate quality, instructional value

</div></section><section id="bkmrk-expert-mode-configur">## Expert Mode Configuration

For advanced users, Expert Mode provides full control over generation parameters:

### Core Parameters

- **Target Cronbach's Alpha**: Set desired reliability (0.5 - 0.95)
- **Minimum Discrimination**: Item quality threshold (0.1 - 0.6)
- **Response Consistency**: Student behavior variability (0.1 - 0.8)
- **Sample Size**: Number of students to simulate
- **Missing Data Rate**: Percentage of incomplete responses

### Advanced Options

- **Timing Generation**: Include realistic completion times
- **Debug Mode**: Additional diagnostic information
- **Custom Distributions**: Specify ability and difficulty distributions

</section><section id="bkmrk-cronbach%27s-alpha-cat">## Cronbach's Alpha Categories (A, B, C, D)

The generator uses standard psychometric thresholds to categorize test reliability:

### Category A - Excellent <span class="badge badge-success">α ≥ 0.9</span>

<div class="category-grid"><div class="category-card category-a">- **Interpretation**: Outstanding reliability
- **Suitable For**: High-stakes testing, certification exams
- **Characteristics**: Very consistent measurement, minimal measurement error

</div><div class="category-card category-b">  
</div></div>### Category B - Good <span class="badge badge-info">0.8 ≤ α &lt; 0.9</span>

<div class="category-grid"><div class="category-card category-b">- **Interpretation**: Good reliability
- **Suitable For**: Most educational assessments, research
- **Characteristics**: Reliable measurement with acceptable error

</div><div class="category-card category-c">  
</div></div>### Category C - Acceptable <span class="badge badge-warning">0.7 ≤ α &lt; 0.8</span>

<div class="category-grid"><div class="category-card category-c">- **Interpretation**: Acceptable reliability
- **Suitable For**: Formative assessment, initial testing
- **Characteristics**: Adequate for most purposes, some measurement error

</div><div class="category-card category-d">  
</div></div>### Category D - Insufficient <span class="badge badge-danger">α &lt; 0.7</span>

<div class="category-grid"><div class="category-card category-d">- **Interpretation**: Poor reliability
- **Suitable For**: Pilot testing, diagnostic purposes only
- **Characteristics**: High measurement error, results should be interpreted cautiously

</div></div></section><section id="bkmrk-generation-process-1">## Generation Process

<div class="process-steps"><div class="step-card"><div class="step-number">1</div></div></div>### Configuration

<div class="process-steps"><div class="step-card">1. Select a Quick Start preset or choose Expert Mode
2. Configure generation parameters
3. Select target test and publication(s)
4. Review settings and estimated generation time

</div><div class="step-card"><div class="step-number">2</div></div></div>### Validation

<div class="process-steps"><div class="step-card">- System validates configuration parameters
- Checks for realistic parameter combinations
- Estimates generation time and resource requirements

</div><div class="step-card"><div class="step-number">3</div></div></div>### Generation

<div class="process-steps"><div class="step-card">- Creates simulated response matrix
- Applies psychometric models (IRT/CTT)
- Calculates reliability and item statistics
- Generates timing data (if enabled)

</div><div class="step-card"><div class="step-number">4</div></div></div>### Results

<div class="process-steps"><div class="step-card">- Displays generation summary
- Shows achieved vs. target metrics
- Provides data quality indicators
- Saves results to selected publication(s)

</div></div></section><section id="bkmrk-technical-specificat">## Technical Specifications

### Supported Models

<table class="spec-table"><thead><tr><th>Model</th><th>Description</th><th>Use Case</th></tr></thead><tbody><tr><td>**Classical Test Theory (CTT)**</td><td>Traditional reliability analysis</td><td>Standard psychometric evaluation</td></tr><tr><td>**Item Response Theory (IRT)**</td><td>Modern psychometric modeling</td><td>Advanced measurement precision</td></tr><tr><td>**Rasch Model**</td><td>Specific IRT implementation for dichotomous items</td><td>Educational assessment</td></tr></tbody></table>

### Data Format

- **Response Matrix**: Students × Items binary/polytomous responses
- **Metadata**: Student IDs, item parameters, session information
- **Timing Data**: Response times in milliseconds
- **Quality Metrics**: Comprehensive psychometric statistics

### Performance

<table class="spec-table"><thead><tr><th>Dataset Size</th><th>Student Count</th><th>Generation Time</th></tr></thead><tbody><tr><td>Small Datasets</td><td>&lt; 50 students</td><td>&lt; 1 second</td></tr><tr><td>Medium Datasets</td><td>50-200 students</td><td>1-2 seconds</td></tr><tr><td>Large Datasets</td><td>200+ students</td><td>2-5 seconds</td></tr></tbody></table>

</section><section id="bkmrk-best-practices-for-d">## Best Practices

### For Demonstrations

<div class="highlight-box">- Use "Realistic Demo" preset
- Target α ≥ 0.85 for professional appearance
- Include timing data for realistic simulation

</div>### For Testing &amp; QA

<div class="info-box">- Use "Detection Test" preset for algorithm validation
- Mix high and low quality items
- Test edge cases with extreme parameters

</div>### For Training

<div class="warning-box">- Use "Educational Training" preset
- Show progression from poor to excellent reliability
- Demonstrate impact of item quality on overall test reliability

</div>### For Research

<div class="highlight-box">- Use Expert Mode for precise control
- Document all parameter settings
- Validate against real data when possible

</div></section><section id="bkmrk-troubleshooting-comm">## Troubleshooting

### Common Issues

- **Generation Fails**: Check parameter ranges and test selection
- **Poor Quality Results**: Adjust discrimination thresholds
- **Unrealistic Data**: Review consistency and timing parameters

### Performance Optimization

- Limit student count for faster generation
- Disable timing data if not needed
- Use appropriate quality thresholds

</section><section id="bkmrk-integration-with-was">## Integration with WASPL

The generated data integrates seamlessly with:

- **Results Analysis**: Full psychometric reporting
- **CAT System**: Adaptive testing calibration
- **Quality Dashboard**: Real-time monitoring
- **Export Functions**: Multiple format support

</section></main><footer class="footer">This tool is part of the WASPL Developer Tools suite, designed to support comprehensive assessment development and validation workflows.

**WASPL Platform** | Documentation Version 1.0 | Last Updated: June 2025

</footer></div>