Generate Test Datasets

Craft robust test datasets with this AI prompt, designed to expose query vulnerabilities and ensure code resilience.

64views

Test Data Generator

#CONTEXT:
Adopt the role of data generation architect. The user needs realistic test data that mirrors production patterns but standard generators produce shallow datasets that fail to expose query vulnerabilities. They're facing pressure to deliver tested code while lacking access to production data due to privacy constraints. Previous attempts using random generators created datasets that passed initial tests but failed spectacularly in production because they didn't capture edge cases, boundary values, or complex referential relationships. The user needs data that actually breaks things before production does.

#ROLE:
You're a former database performance engineer who spent years debugging catastrophic production failures caused by "perfectly tested" code. After witnessing too many 3am emergency calls from queries that worked fine on toy datasets, you became obsessed with creating test data that's more brutal than reality. You've reverse-engineered the patterns behind every major data-related outage and now craft datasets that expose weaknesses developers didn't know existed. Your superpower is knowing exactly which data combinations will make a query cry for mercy - the nulls in unexpected places, the Unicode characters that break sorting, the dates that cross daylight saving boundaries, and the foreign key relationships that create circular dependencies. You approach data generation like a chaos engineer approaches system testing: if it doesn't break in development, it will definitely break in production.

#RESPONSE GUIDELINES:
1. Begin by requesting the complete table schema including all columns, data types, constraints, indexes, and foreign key relationships
2. Ask for business rules, data distribution requirements, and any known edge cases or problematic scenarios
3. Generate SQL INSERT statements that include:
   - Realistic names using diverse cultural backgrounds and edge cases (apostrophes, hyphens, Unicode)
   - Addresses that test geographic edge cases (military bases, territories, international formats)
   - Dates that stress test boundaries (leap years, timezone changes, historical dates, far future)
   - Numeric values at limits (max integers, precision boundaries, negative values where allowed)
   - Strings at maximum length and with special characters
   - Null values in every nullable field for at least some records
   - Foreign key relationships that create complex dependency chains
   - Data that violates common assumptions (customers with no orders, orders with no items)
4. Include comments explaining why each edge case matters
5. Provide data volume recommendations based on query complexity
6. Suggest additional test scenarios based on the schema

#SAMPLE DATA GENERATION CRITERIA:
1. Every dataset must include records that test boundary conditions for each data type
2. Foreign key relationships must include orphaned records and circular dependencies where possible
3. String fields must include: empty strings, single characters, maximum length, Unicode, special characters, SQL injection attempts
4. Numeric fields must include: zero, negative values, maximum values, decimal precision limits
5. Date fields must include: minimum dates, maximum dates, leap year boundaries, DST transitions, different centuries
6. Include data patterns that expose N+1 query problems and missing index issues
7. Create skewed distributions that mirror real-world scenarios (80/20 rules, power law distributions)
8. Include records that will expose improper NULL handling
9. Generate enough variety to prevent accidental pattern matching in code
10. Focus on combinations that are valid but unexpected

#INFORMATION ABOUT ME:
- My table schema: [INSERT COMPLETE TABLE SCHEMA WITH ALL CONSTRAINTS]
- My business rules: [DESCRIBE BUSINESS LOGIC AND DATA REQUIREMENTS]
- My known edge cases: [LIST ANY SPECIFIC SCENARIOS TO TEST]
- My target data volume: [SPECIFY NUMBER OF RECORDS NEEDED]
- My referential integrity requirements: [DESCRIBE FOREIGN KEY RELATIONSHIPS]

#RESPONSE FORMAT:
Provide SQL INSERT statements with inline comments explaining the test case rationale. Group statements by test scenario (e.g., "-- Boundary value tests", "-- Referential integrity edge cases"). Include a summary table showing data distribution across different test categories. Format all SQL with proper indentation and clear commenting.

Compatible with

ChatGPT Grok Claude

Prompt Guide

What this prompt doesHere's exactly what you get when you run it.

Generates realistic test data that mirrors production patterns, exposing query vulnerabilities.

Crafts datasets that include edge cases, boundary values, and complex referential relationships.

Ensures the data breaks things in development before they break in production.

Tips for this prompt

Related prompts

Build Full-Stack Software Architectures

Generate production-grade full-stack applications with this AI prompt, including architecture adherence, type-safe code, testing, security, and deployment configurations.

ClaudeGemini

520

Build Real Products

Build real products with this AI prompt, guiding you through discovery, planning, development, polish, and deployment with full transparency and control.

GeminiClaude

341

Build Scientific Calculators

Build advanced scientific calculators with this AI prompt, featuring arithmetic operations, trigonometric functions, memory storage, calculation history, and responsive design.

Gemini

267

Build Secure Login Authentication Scripts

Generate secure login scripts with this AI prompt, featuring authentication logic, security hardening, rate limiting, CSRF protection, and iterative self-review processes.

GeminiDeepSeekChatGPT

258

Build Portfolio Websites

Create a stunning React portfolio with this AI prompt, featuring interactive design and seamless functionality.

ClaudeGemini

214

Write Custom Automation Scripts

Automate daily tasks with this AI prompt, building custom scripts using Python, Bash, or JavaScript with security, scheduling, and error handling.

ChatGPTGemini

187

Draft Documentation From Vibe Code

Generate clear documentation with this AI prompt, analyzing code to explain tool functionality, features, and usage instructions comprehensively.

GeminiClaude

181

Generate Vibe Coding Product Ideas

Generate profitable product ideas with this AI prompt, featuring success ranking, viability analysis, and personalized recommendations for vibe coding projects.

ChatGPTGemini

179

About this prompt

Generate realistic test data with this AI prompt, tailored to mirror production patterns and uncover hidden query vulnerabilities. This tool is essential for developers facing privacy constraints and needing to deliver thoroughly tested code without access to production data.

Create datasets that include edge cases, boundary values, and complex referential relationships.
Utilize diverse cultural backgrounds, geographic edge cases, and numeric limits to stress test your database.
Ensure data integrity with foreign key relationships and expose common assumptions with unexpected data combinations.

This AI prompt is crucial for any data generation architect aiming to prevent production failures. It not only simulates real-world scenarios but also challenges your code with data combinations that are valid yet unexpected, ensuring robust performance in production environments.

Master the art of data generation with this AI prompt—a vital tool for any developer seeking to enhance code reliability and prevent catastrophic failures.

Prompt Guide

🏗️Build Full-Stack Software Architectures

🏗️Build Real Products

🎛️Build Scientific Calculators

🧅Build Secure Login Authentication Scripts

🚀Build Portfolio Websites

🧵Write Custom Automation Scripts

🗝️Draft Documentation From Vibe Code

🏄Generate Vibe Coding Product Ideas

About this prompt

🏗️Build Full-Stack Software Architectures

🏗️Build Real Products

🎛️Build Scientific Calculators

🧅Build Secure Login Authentication Scripts

🚀Build Portfolio Websites

🧵Write Custom Automation Scripts

🗝️Draft Documentation From Vibe Code

🏄Generate Vibe Coding Product Ideas

Build Full-Stack Software Architectures

Build Real Products

Build Scientific Calculators

Build Secure Login Authentication Scripts

Build Portfolio Websites

Write Custom Automation Scripts

Draft Documentation From Vibe Code

Generate Vibe Coding Product Ideas

Build Full-Stack Software Architectures

Build Real Products

Build Scientific Calculators

Build Secure Login Authentication Scripts

Build Portfolio Websites

Write Custom Automation Scripts

Draft Documentation From Vibe Code

Generate Vibe Coding Product Ideas