Early accessSome features may be unavailable
Process Mining

OCEL Event Logs for Process Mining

Generate synthetic event logs with known ground truth for algorithm benchmarking, conformance checking, and process mining research.

The Data Problem in Process Mining

Extracting event logs from ERP systems is the hardest part of any process mining project

Weeks of Extraction

Extracting event logs from SAP (BKPF/EKKO/VBAK tables), Oracle, or Dynamics requires weeks of SQL development, domain expertise, and data cleansing.

Incomplete Timestamps

Production systems often lack granular timestamps. Change documents may be missing, creating gaps in the event sequence that break conformance checking.

Privacy Constraints

Event logs contain employee IDs, customer references, and transaction amounts. Anonymization removes the attributes that make process mining useful.

No Ground Truth

Real-world process logs don't come with labeled variants. Evaluating conformance-checking algorithms requires known reference models that production data can't provide.

8 Process Types

End-to-end business processes with realistic activity sequences and timing distributions

1

Procure-to-Pay (P2P)

Purchase requisition through invoice payment and GL posting

2

Order-to-Cash (O2C)

Sales order through delivery, billing, and cash collection

3

Record-to-Report (R2R)

Journal entry posting through trial balance and close

4

Hire-to-Retire (H2R)

Employee onboarding through payroll and offboarding

5

Plan-to-Produce (P2P-M)

Production planning through manufacturing execution

6

Issue-to-Resolution (I2R)

Incident creation through triage, fix, and closure

7

Quote-to-Cash (Q2C)

Opportunity through proposal, contract, and revenue recognition

8

Acquire-to-Dispose (A2D)

Fixed asset acquisition through depreciation and disposal

Built for Process Mining

Variant Analysis

Configurable process variant distributions. Set the happy-path ratio, define deviation patterns, and control rework loop frequencies for realistic process models.

Conformance Checking

Each generated log includes the reference BPMN model. Measure fitness, precision, and generalization against known ground truth — no manual annotation required.

OCEL 2.0 Multi-Object

Native OCEL 2.0 output with multiple object types per event. Track how a purchase order, goods receipt, and invoice interact across the process lifecycle.

pm4py Integration

Generated logs are directly loadable with pm4py. Includes helper scripts for discovery (Alpha, Heuristic, Inductive Miner) and conformance checking out of the box.

Export Formats

Native compatibility with every major process mining platform

XES 2.0

IEEE standard for event log interchange. Compatible with ProM, PM4Py, and academic tooling.

Celonis IBC

Native Celonis Intelligence Base Connector format for direct upload to Celonis EMS.

Disco CSV

Fluxicon Disco-compatible CSV with case ID, activity, and timestamp columns.

Parquet / OCEL 2.0

Columnar format for large-scale analysis. Native OCEL 2.0 JSON-LD for multi-object event logs.

pm4py Integration Example

import vynfi
import pm4py

client = vynfi.Client(api_key="vf_live_...")

job = client.generate(
    sector="manufacturing",
    module="process_mining",
    process_type="procure_to_pay",
    cases=10_000,
    variant_count=25,
    happy_path_ratio=0.65,
    output_format="xes",
)

# Load directly into pm4py
log = pm4py.read_xes(job.download_path())
net, im, fm = pm4py.discover_petri_net_inductive(log)
fitness = pm4py.fitness_token_based_replay(log, net, im, fm)
print(f"Fitness: {fitness['average_trace_fitness']:.3f}")
Live demo

Try it — process mining on VynFi data

This runs pm4py on our Supply Chain OCEL dataset. No setup, no install — just explore.

Powered by pm4py · Data from VynFi/vynfi-supply-chain-ocel

Skip the extraction — start mining

Generate production-quality event logs in minutes, not weeks. 5,000 free credits to start.