Generate data
Instantly generate accurate and private synthetic data to make your dataset privacy compliant. We are offering the synthetic data api empowering everyone, compromising nothing.
Operations
Unlock operational testing without touching sensitive data. Streamline processes, stay compliant, and move faster.
“We needed to test dashboards and workflows without exposing real data. Now we test freely, keep innovating and stay compliant.”
— Head of Operations
Machine Learning
Train smarter models without waiting for access to data. Generate balanced, private, ready-to-use data instantly.
“As our teams grew, data access lagged and drained momentum. Teams now generate data on demand, tailored to their needs.”
— Data Engineer
API for builders
We want to offer humans and things instant access to synthetic data. Whether you’re powering a realtime application or training AI.
Tabular Data Synthesis
Generate structured datasets with the same statistical properties as your real data.
-
Privacy safe replication for sharing and analysis.
-
Model ready synthetic data that preserves correlations.
-
Outputs to match business rules and data types.
Time Series Generation
Create synthetic time series data that preserves patterns and seasonalities.
-
Capture trends and noise to mimic the real world.
-
Enable forecasting without relying on production data.
-
Variable-length sequences across time resolutions.
curl -X POST https://api.getsimula.com/v1/synthesize \
-d ‘{
“source”: “customer_data.csv”,
“schema_version”: “v2.3”,
“type”: “tabular”,
“privacy”: {
“level”: “high”,
“differential_privacy”: true,
“epsilon”: 0.8
},
“generation”: {
“records”: 10000,
“random_seed”: 42,
“sampling_temperature”: 0.9
},
“output”: {
“format”: “csv”,
“destination”: “s3://synthetic-datasets/test-bucket/churn_2025_06.csv”,
“validate”: true,
“validation_report”: “quality+privacy”
}
}’
Get data done!
No hidden fees, straightforward rates that scale with you.
Developer
Start with the fully-featured free tier, then pay for only what you use as you grow.
1500 free per month
then € 0,03/credit
→ 2 concurrent jobs
→ 1500 free credits
→ 1 business day email support
Enterprise
Standardize synthetic data and privacy engineering across your enterprise.
custom setup,
fixed and usage fee
→ 20 concurrent jobs
→ Hosted environment on AWS
→ 24/7 phone and chat support
→ Expert data consult
Questions, answered
Everything you need, from how credits are calculated and how much compute they buy.
What is a credit?
A credit is a simple unit measuring API call duration for cloud compute. 100 credits is equal to 5 minutes of API duration.
How are credits calculated?
Credit usage is based on the duration from the time you issue an API call until the point your tasks terminate, rounded up to the nearest minute. Each minute of API duration bills as 20 credits for cloud compute.
Do credits roll over?
The free credits included in your subscription are reset each month and do not roll over.
What if I go over my free monthly credits?
If a customer has not entered credit card information, their service use will be suspended after the free monthly credits have been used. Once a credit card is provided, usage will be metered and the customer will be charged for the total number of credits consumed at the end of a month.
How much data per worker instance?
This answer varies based on the complexity and type of data. On a typical CSV dataset containing 156 characters per record and a mix of numeric and categorical data, a Simula cloud instance is able to synthesize approximately 6k records per 100 credits, including training. A Simula cloud instance can transform approximately 133k records per 100 credits, or classify 133k records when using a standard policy to redact or label personal data types.
How many worker instances concurrently?
Simula’s cloud workers are built to scale linearly, so you can scale concurrent containers to meet your needs. By default, each Developer account is limited to 2 concurrent workers. These limits are applied across the entire account and are not per user.
Request a dataset
"*" indicates required fields
Building Europe’s
synthetic data factory.
Cliff and Mark founded Simula to fix Europe’s synthetic data gap. Their engine spins up GDPR safe datasets on demand, and the plug and play API lets any team boost ML accuracy.
The Simula
“The simulacrum is never that which conceals the truth, it is the truth which conceals that there is none. The simulacrum is true.”
— Jean Baudrillard
We were obsessed. Not with data but with the simulacrum. The copy without an original. The echo louder than its source.
In a world where reality is overwritten by models, why is data still treated as if it were mined from the earth; finite and fragile? We didn’t want more access. We wanted a new source. So we built Simula.
Not to extract data from the world, but to create data about the world; ethically, infinitely, synthetically. For machines, for people, for systems still waiting to be tested or imagined.
This isn’t just simulation. It’s rethinking the possible.