AI-Powered Document Similarity Detection System
Document Detective is an intelligent system developed by Agriculture and Agri-Food Canada (AAFC) that helps the Access to Information and Privacy (ATIP) team find identical or very similar documents across government repositories more quickly. The system extracts documents from AAFC SharePoint, OneDrive, Outlook, and end-user desktops and identifies duplicates and near-duplicates among them, improving efficiency in information management and access processes.
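The source does not describe how Document Detective compares documents. As a purely illustrative sketch of one common approach to this kind of task, the Python below flags exact duplicates with a content hash and near-duplicates with Jaccard similarity over word shingles. Every name here (`normalize`, `fingerprint`, `shingles`, `jaccard`, `classify`), the shingle size, and the 0.8 threshold are assumptions chosen for the example, not details of the actual system.

```python
from __future__ import annotations

import hashlib
import re


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial formatting differences are ignored."""
    return re.sub(r"\s+", " ", text.lower()).strip()


def fingerprint(text: str) -> str:
    """SHA-256 of the normalized text; equal fingerprints mean identical content."""
    return hashlib.sha256(normalize(text).encode("utf-8")).hexdigest()


def shingles(text: str, k: int = 3) -> set[tuple[str, ...]]:
    """Return the set of overlapping k-word sequences in the normalized text."""
    words = normalize(text).split()
    if len(words) < k:
        # Short texts yield a single shingle (or none if the text is empty).
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a: str, b: str, k: int = 3) -> float:
    """Jaccard similarity of the two shingle sets: |A & B| / |A | B|."""
    sa, sb = shingles(a, k), shingles(b, k)
    if not sa and not sb:
        return 1.0  # two empty documents are treated as identical
    return len(sa & sb) / len(sa | sb)


def classify(a: str, b: str, threshold: float = 0.8) -> str:
    """Label a pair of documents; the threshold is an assumed, tunable value."""
    if fingerprint(a) == fingerprint(b):
        return "duplicate"
    if jaccard(a, b) >= threshold:
        return "near-duplicate"
    return "distinct"
```

In this sketch, two copies of a memo that differ only in spacing or letter case would be labelled `duplicate`, while a copy with a word appended would typically be labelled `near-duplicate`. Pairwise comparison like this does not scale to large repositories; a production system would more plausibly use an indexing technique such as MinHash or embeddings, which is again an assumption rather than something the source states.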
This system is currently in production and is primarily used by Government of Canada employees within AAFC's ATIP operations. The system does not process personal information; it focuses instead on document metadata and content analysis. Its scalable and flexible architecture allows it to grow with organizational needs and potentially be deployed in other federal departments to improve document management government-wide.
Document Detective operates with full transparency regarding its capabilities and limitations. The system has been designed to support public administration objectives while maintaining appropriate safeguards. Users should understand that this is an automated tool designed to assist, not replace, human decision-makers in the critical task of managing access to government information.