Surfaced
Find the story your documents never gave up.
For journalists and researchers.
The problem
Most investigative work starts with documents, often thousands of them: PDFs that have never been indexed, never been searchable, never been read by anyone outside the organisation that produced them. Reading them manually takes months. Ctrl+F only finds the terms you already know to search for, and it cannot see into scanned pages that have no text layer. The patterns, the anomalies, the connections that make a story stay buried, not because they are not there, but because there has never been a practical way to surface them.
How it works
Point Surfaced at your document collection
A folder of PDFs, scanned or text-based
Surfaced extracts and cleans the text
Applying OCR where needed
NLP and clustering analyse the collection
Language patterns, topics, and statistical relationships across every document
Receive a structured research report
Cluster analysis, topic models, classifications, and statistical findings ready for further investigation
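To give a feel for the analysis step, here is a minimal sketch of how already-extracted text might be grouped by shared vocabulary. It is an illustration only, not Surfaced's actual pipeline: it assumes the PDF extraction and OCR have already happened, it swaps in a simple TF-IDF weighting with greedy cosine-similarity grouping in place of a full clustering or topic model, and the names STOPWORDS, tokenize, tfidf_vectors, cosine, and cluster, along with the 0.2 threshold, are hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list for the illustration.
STOPWORDS = {"the", "and", "for", "were", "was", "its", "that", "with"}

def tokenize(text):
    """Lowercase, keep alphabetic tokens of 3+ letters, drop stopwords."""
    return [t for t in re.findall(r"[a-z]{3,}", text.lower()) if t not in STOPWORDS]

def tfidf_vectors(docs):
    """Return one {term: weight} dict per document (smoothed TF-IDF)."""
    token_lists = [tokenize(d) for d in docs]
    doc_freq = Counter(term for tokens in token_lists for term in set(tokens))
    n = len(docs)
    vectors = []
    for tokens in token_lists:
        counts = Counter(tokens)
        vectors.append({
            term: count * (math.log((1 + n) / (1 + doc_freq[term])) + 1)
            for term, count in counts.items()
        })
    return vectors

def cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def cluster(docs, threshold=0.2):
    """Greedy grouping: join the first cluster whose seed document is
    similar enough, otherwise start a new cluster."""
    vectors = tfidf_vectors(docs)
    clusters = []  # each cluster is a list of document indices
    for i, vec in enumerate(vectors):
        for members in clusters:
            if cosine(vec, vectors[members[0]]) >= threshold:
                members.append(i)
                break
        else:
            clusters.append([i])
    return clusters

docs = [
    "The council approved the road repair budget for the northern district.",
    "Road repair budget overruns were reported to the council.",
    "Hospital staffing levels fell below the agreed nursing ratio.",
    "Nursing staff shortages left the hospital below its staffing ratio.",
]
groups = cluster(docs)
print(groups)
```

On these four sample sentences the two road-budget documents land in one group and the two hospital-staffing documents in another. A real collection would need a proper stopword list and a more robust clustering method, but the idea is the same: documents are compared by the words they share, not by a keyword someone thought to search for.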
The output
A timestamped research folder containing a cluster analysis report, topic model output, document classifications, and statistical reports including chi-square and ANOVA results. Everything is exportable and ready to feed into your own analysis or reporting workflow.
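For readers unfamiliar with the chi-square results mentioned above, the sketch below shows what such a statistic measures: whether two ways of categorising documents (for example, cluster membership against document type) are more strongly associated than chance would suggest. This is a generic textbook calculation, not Surfaced's reporting code; the function name chi_square and the example counts are made up for illustration.

```python
def chi_square(table):
    """Pearson chi-square statistic and degrees of freedom for a
    contingency table given as a list of rows of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Count expected in this cell if rows and columns were independent.
            expected = row_totals[i] * col_totals[j] / total
            statistic += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, dof

# Made-up counts: rows are two clusters, columns are two document types.
observed = [
    [10, 20],
    [20, 10],
]
stat, dof = chi_square(observed)
print(round(stat, 3), dof)
```

With one degree of freedom, a statistic above roughly 3.84 is significant at the 5% level, so a table like this one would be flagged as worth a closer look. The statistic only indicates an association; it does not explain why the association exists.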
Who it is for
Surfaced is built for investigative journalists, academic researchers, and policy analysts working with large collections of unstructured documents. If you have ever stared at a folder of PDFs and wondered what is actually in there, Surfaced is for you.
Book an AI Discovery Call
Tell us about your workflow. We will tell you exactly what we can build and how long it will take.
Book a call