Our forensic analysis assembles the strongest available evidence that your IP was used to train AI models — combining technical analysis, dataset investigation, and documented methodology designed for legal scrutiny.
In 2025, two federal judges sided with AI companies — not because training was authorized, but because plaintiffs hadn't produced forensic evidence that specific works were in the data. One judge noted that plaintiffs with better evidence will often prevail. The evidentiary bar isn't impossibly high. It just hasn't been met.
No single approach produces definitive evidence on its own. We layer independent methods to build the most complete forensic picture available.
We investigate training data supply chains directly — tracing whether your copyrighted works appear in documented datasets, crawl indices, and disclosed data sources. Direct evidence of inclusion, not inference.
We prompt models to reproduce your work, document everything, and score each output against your source material and a control set of similar content that wasn't in training. If the model reconstructs your work with significantly higher fidelity than the controls, that gap is the signal.
Where model weights are publicly available, we conduct analysis that goes beyond what's possible through API-only access. Direct access to model parameters enables deeper detection sensitivity that closed-source models don't permit. Particularly effective on fine-tuned models and community checkpoints.
Forensic integrity matters more than marketing claims. This is how we position our findings.
No methodology — ours or anyone else's — can prove with absolute certainty that a specific work was in a training set. The science is advancing rapidly, but honest limitations exist. We document them.
The strongest available body of evidence from multiple independent approaches. More substantive than what plaintiffs have recently brought to court, documented with transparent methodology so your legal team knows exactly what they're working with.
A targeted analysis of your highest-priority assets against major AI models — combining dataset investigation with technical analysis. Delivered as a documented report with full methodology, stated confidence levels, and clear next steps for your legal team.