How It Works
From signup to data in under 3 minutes
1. Sign Up & Get Your Key
Create a free account. Your API key is generated instantly — no credit card required.
2. Generate Data
Call the API with your desired table type, row count, and sector. Get results in seconds.
3. Build & Ship
Use realistic synthetic data for testing, training ML models, or compliance workflows.
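As a rough sketch, step 2 can be expressed in Python using only the standard library. The endpoint, headers, and body fields are copied from the curl example on this page. The function names `build_request` and `generate` are hypothetical, and the response is assumed to be a JSON document because the request asks for `"format": "json"`; its exact shape is not described here.

```python
import json
import urllib.request

# Endpoint taken from the curl example on this page.
API_URL = "https://api.vynfi.com/v1/generate/quick"


def build_request(api_key: str, rows: int = 1000) -> urllib.request.Request:
    """Build (but do not send) a POST request for synthetic journal entries."""
    payload = {
        "preset": "retail_small",
        "tables": ["journal_entries"],
        "rows": {"journal_entries": rows},
        "format": "json",
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def generate(api_key: str, rows: int = 1000):
    """Send the request and decode the body (assumed to be JSON)."""
    with urllib.request.urlopen(build_request(api_key, rows), timeout=30) as resp:
        return json.load(resp)
```

Keeping request construction separate from sending makes the payload easy to inspect or unit test without touching the network.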
Integrate in Minutes
First-class SDKs for your stack. Or just use curl.
curl https://api.vynfi.com/v1/generate/quick \
  -H "Authorization: Bearer vf_live_7mN4kP2x..." \
  -H "Content-Type: application/json" \
  -d '{
    "preset": "retail_small",
    "tables": ["journal_entries"],
    "rows": { "journal_entries": 1000 },
    "format": "json"
  }'
Built for Every Use Case
Audit Testing
Generate realistic journal entries with known anomalies for audit analytics testing. Calibrated to real-world distributions.
Fintech Development
Build and test financial applications with production-quality synthetic data. No real customer data exposure.
Academic Research
Create large-scale datasets for fraud detection, process mining, and financial ML research.
Compliance Validation
Test SOX, Basel III, and IFRS workflows with realistic synthetic data. Full COSO control mappings and evaluation reports.
Quality by Design
Statistical validation built into the generation engine
Benford MAD
Mean absolute deviation (MAD) from Benford's first-digit distribution. Rated 'close conformity', the top band in Nigrini's criteria.
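For readers who want to check this metric on their own data, here is a minimal sketch of a first-digit Benford MAD calculation. It is not the engine's code. The function names are hypothetical, and the band cut-offs (0.006, 0.012, 0.015) are the first-digit thresholds commonly attributed to Nigrini, stated here as an assumption.

```python
import math

# Expected Benford first-digit probabilities: P(d) = log10(1 + 1/d).
BENFORD = {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def first_digit(x: float) -> int:
    """Leading nonzero digit of a nonzero number, via scientific notation."""
    return int(f"{abs(x):.14e}"[0])


def benford_mad(values) -> float:
    """Mean absolute deviation between observed and Benford first-digit shares."""
    counts = [0] * 10
    for v in values:
        if v != 0:  # zero has no leading digit
            counts[first_digit(v)] += 1
    n = sum(counts)
    if n == 0:
        raise ValueError("need at least one nonzero value")
    return sum(abs(counts[d] / n - BENFORD[d]) for d in range(1, 10)) / 9


def nigrini_band(mad: float) -> str:
    """Map a first-digit MAD to a conformity band (assumed thresholds)."""
    if mad <= 0.006:
        return "close conformity"
    if mad <= 0.012:
        return "acceptable conformity"
    if mad <= 0.015:
        return "marginally acceptable conformity"
    return "nonconformity"
```

Powers of two are a classic Benford-conforming sequence, so `benford_mad(2 ** k for k in range(1000))` makes a convenient sanity check.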
F1 Score Delta
Target: ML fraud detectors trained on synthetic data score within 3% of the F1 achieved by real-data baselines.
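A small sketch of how such a gap could be measured follows. The page does not say whether the 3% is a relative or an absolute difference, so the hypothetical `f1_delta` helper supports both; the confusion-matrix counts in the usage note are made up for illustration.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """F1 from true positives, false positives, and false negatives."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def f1_delta(real_f1: float, synth_f1: float, relative: bool = True) -> float:
    """Gap between a real-data baseline F1 and a synthetic-data F1.

    relative=True divides the gap by the baseline; relative=False returns
    the absolute difference in F1 points.
    """
    gap = abs(real_f1 - synth_f1)
    if not relative:
        return gap
    if real_f1 == 0:
        raise ValueError("relative delta is undefined for a zero baseline")
    return gap / real_f1
```

For example, a baseline with 80 true positives, 20 false positives, and 20 false negatives compared against a synthetic-trained detector with 78, 22, and 22 gives a relative delta just under the 3% target.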
Copula Families
Gaussian, Clayton, Gumbel, Frank, and Student-t copulas model complex inter-variable dependencies.
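To illustrate what a copula contributes, the sketch below draws dependent uniform pairs from a Clayton copula by conditional inversion and measures the resulting rank correlation. It is a textbook construction, not the engine's implementation; the function names and the choice of theta are illustrative assumptions.

```python
import random


def sample_clayton(n: int, theta: float, rng: random.Random):
    """Draw n pairs (u, v) from a Clayton copula with parameter theta > 0."""
    if theta <= 0:
        raise ValueError("theta must be positive")
    pairs = []
    for _ in range(n):
        u = 1.0 - rng.random()  # uniform on (0, 1], avoids u == 0
        w = 1.0 - rng.random()
        # Invert the conditional distribution C(v | u) = w for v.
        v = (u ** -theta * (w ** (-theta / (1 + theta)) - 1) + 1) ** (-1 / theta)
        pairs.append((u, v))
    return pairs


def kendall_tau(pairs) -> float:
    """Kendall rank correlation by direct pair counting, O(n^2)."""
    n = len(pairs)
    score = 0
    for i in range(n):
        for j in range(i + 1, n):
            s = (pairs[i][0] - pairs[j][0]) * (pairs[i][1] - pairs[j][1])
            score += (s > 0) - (s < 0)
    return score / (n * (n - 1) / 2)
```

For a Clayton copula the theoretical Kendall's tau is theta / (theta + 2), so samples drawn with theta = 2 should show a tau near 0.5. Mapping each uniform margin through an inverse CDF (for example, of an amount distribution) would then yield correlated variables with the desired marginals.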
Anomaly Types
Spanning 5 categories: timing, amount, relationship, pattern, and structural anomalies with ground-truth labels.
Built on Real-World Research
The DataSynth engine was calibrated against 155 real-world datasets, encompassing 364 million journal entries and 2.4 billion line items across industries and geographies.
155 Real-World Datasets
Analyzed for distribution calibration and statistical benchmarking across 10 industry sectors.
364 Million Journal Entries
In the calibration corpus used to derive realistic financial patterns and temporal dynamics.
2.4 Billion Line Items
Processed to build inter-table correlation models and cross-entity relationship graphs.
42 Country Packs
Localized tax, banking, and accounting standards for realistic regional data
Each pack includes locale configuration, multi-cultural naming, regional holidays, tax frameworks, banking standards, and accounting frameworks.
Simple, Transparent Pricing
Start free. Scale as you grow.
Powered by DataSynth, a purpose-built Rust engine with 16 crates and counting.
Ready to generate your first dataset?
10,000 credits free every month. No credit card required.
You scrolled all the way down. We respect that.