Glossary / Legal Tech

eDiscovery & Semantic Search

Electronic discovery (e-discovery) refers to discovery in legal proceedings such as litigation, government investigations, or Freedom of Information Act requests, where the information sought is in electronic format (ESI).

The Data Explosion

In modern lawsuits, evidence isn't just paper. It's millions of emails, Slack messages, PDFs, and database records. Manually reviewing this data is impossible. eDiscovery tools essentially "Find the needle in the haystack" using technology.

The EDRM Model

The Electronic Discovery Reference Model (EDRM) defines the standard workflow:

Semantic Search vs. Keyword Search

Traditional discovery relied on keywords (e.g., finding documents containing "fraud"). However, fraudsters might use code words.

Semantic Search uses Natural Language Processing (NLP) and Vector Embeddings to understand the meaning and context. It can find a document about "cooking the books" even if the word "fraud" is never mentioned, because the AI understands the semantic similarity.

Technology-Assisted Review (TAR)

Modern Legal Tech Software employs TAR (Technology-Assisted Review) where senior lawyers train an AI on a small set of documents ("Seed Set"), and the AI then predicts the relevance of the remaining millions of documents (Predictive Coding), saving thousands of billable hours.

Related Terms