Digitizing Historical Immigration Records
This system uses Amazon Textract, an optical character recognition (OCR) service from Amazon Web Services (AWS), to extract and organize information from over 11,000 pages of Canada Gazette immigration records. The project was developed by Library and Archives Canada to make historical immigration documents more accessible for genealogy research and public record searches.
The system automatically reads text and data from scanned historical documents, structures the information in searchable formats, and makes it available through an Elasticsearch database. This allows researchers and the general public to quickly find relevant immigration information without manually reviewing thousands of pages of historical archives.
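The flow described above (OCR output, then structured records, then a search index) can be illustrated with a minimal Python sketch. It is not the code used by Library and Archives Canada. It assumes the OCR result arrives as a Textract-style response, that is, a dictionary with a "Blocks" list whose LINE blocks carry "Text" and optionally "Page", and that records are loaded through the Elasticsearch bulk API as newline-delimited JSON. The function names, the index name "gazette-pages", the source identifier "sample-issue", and the placeholder text are all hypothetical and chosen only for illustration.

```python
import json


def lines_by_page(textract_response):
    """Group the text of LINE blocks by page number, in the order they appear."""
    pages = {}
    for block in textract_response.get("Blocks", []):
        if block.get("BlockType") != "LINE":
            continue  # skip PAGE, WORD and other block types
        page = block.get("Page", 1)  # assumption: treat a missing page number as page 1
        pages.setdefault(page, []).append(block.get("Text", ""))
    return pages


def to_bulk_ndjson(pages, index_name, source_id):
    """Build a bulk-API request body: one action line plus one document line per page."""
    out = []
    for page in sorted(pages):
        action = {"index": {"_index": index_name, "_id": f"{source_id}-p{page}"}}
        doc = {"source": source_id, "page": page, "text": "\n".join(pages[page])}
        out.append(json.dumps(action))
        out.append(json.dumps(doc))
    return "\n".join(out) + "\n"  # the bulk body must end with a newline


# Invented placeholder input shaped like a Textract response (not real record data).
sample_response = {
    "Blocks": [
        {"BlockType": "PAGE", "Page": 1},
        {"BlockType": "LINE", "Page": 1, "Text": "PLACEHOLDER LINE ONE"},
        {"BlockType": "LINE", "Page": 1, "Text": "PLACEHOLDER LINE TWO"},
        {"BlockType": "WORD", "Page": 1, "Text": "PLACEHOLDER"},
        {"BlockType": "LINE", "Page": 2, "Text": "PLACEHOLDER LINE THREE"},
    ]
}

pages = lines_by_page(sample_response)
bulk_body = to_bulk_ndjson(pages, "gazette-pages", "sample-issue")
print(bulk_body)
```

In this sketch each page becomes one searchable document. A real pipeline would more likely split pages into individual entries (for example one record per person) before indexing, but how the original system did this is not stated here.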
Please note: This system has been retired and is no longer in active use. However, the digitized records remain available for research purposes. While in operation, the system processed personal information such as names, dates, and immigration details contained in historical government records. Library and Archives Canada employees and members of the public alike can access the indexed information for genealogical research.