Problem
DataGenerator accepts user input without validation:
num_records can be negative or extremely large
seed can be any value
- YAML config structure not validated
Impact
- Potential DoS through memory exhaustion
- Unexpected behavior with invalid inputs
- Poor error messages
- Security risk
Tasks
Validation Rules
num_records: Must be 1 <= n <= 10,000,000
seed: Must be non-negative integer if provided
- Config must have required keys: 'data_generation', 'fields'
- Each field must have: 'name', 'type', 'values'
References
Problem
DataGeneratoraccepts user input without validation:num_recordscan be negative or extremely largeseedcan be any valueImpact
Tasks
_validate_num_records()method_validate_config()method_validate_seed()method if needed__init__()Validation Rules
num_records: Must be 1 <= n <= 10,000,000seed: Must be non-negative integer if providedReferences