You are here
Equivio
Equivio: the standard for near-duplicate identification
Equivio is the de-facto industry standard for near-duplicate identification. Equivio's advanced near-duplicate detection process significantly reduces the time and cost associated with preliminary document review.
Research shows that 70-80% of electronic documents are either duplicates or near-duplicates. By detecting and grouping near-duplicates, Equivio generates immediate, concrete benefits.
Key features
- Handles all types of near-duplicates – documents with minor text changes (such as form letters to customers or versions of a contract), same text with different formatting (wording in bold, italics etc.) or same text in different file type (Word and PDF)
- Supports broad range of formats – Microsoft text tools (Word, PowerPoint, Excel), email, ZIP, XML, HTML, source code, OCR-generated files
- User-defined threshold for similarity
- Allows the user to define the source set of documents using a broad set of parameters, such as file extensions, file sizes and modification time and date
- Equivio reads files only once, the first time they are introduced to the system
- Equivio deconstructs files into logical parts such as chapters or email body and attachments.
- Supports multi-lingual environments

Call 






