Entraînement de l'IA sur des œuvres protégées et l'économie créative du Canada
Les systèmes d'IA de pointe sont entraînés sur des œuvres canadiennes protégées sans consentement ni compensation. La Loi sur le droit d'auteur n'a pas d'exception pour l'entraînement de l'IA. Les industries créatives contribuant 53 G$ au PIB font face au déplacement par les alternatives générées par l'IA.
Frontier AI systems are trained on vast corpora that include copyrighted works by Canadian creators — books, journalism, music, visual art, code, and academic research — without consent, compensation, or attribution. This creates a dual hazard: it may violate creators' existing rights under Canadian copyright law, and the resulting AI systems increasingly substitute for the creative labour that produced the training data, eroding the economic foundations of Canada's cultural industries.
The legal landscape is actively contested — and Canadian litigation is now at the forefront. In November 2024, a coalition of Canada's leading news publishers — The Canadian Press, Torstar, The Globe and Mail, Postmedia, and CBC/Radio-Canada — sued OpenAI for copyright infringement through unauthorized scraping of news content to train ChatGPT. In November 2025, the Ontario Superior Court ruled it had jurisdiction to hear the case, rejecting OpenAI's motion to dismiss and awarding the plaintiffs $260,000 in costs. This is the first Canadian case to directly address copyrighted content use for AI training.
Separately, Vancouver author J.B. MacKinnon filed four class actions in BC Supreme Court in 2025 against Meta, Anthropic, Databricks, and NVIDIA, alleging they used "the Pile" — an 800-gigabyte dataset containing roughly 196,640 unlicensed books, many by Canadian authors — to train their models. In total, no fewer than six class actions were proposed across Federal and Provincial Courts in Quebec and British Columbia in 2025. Internationally, Anthropic agreed to a $1.5 billion settlement in Bartz v. Anthropic (August 2025), covering approximately 500,000 pirated works — the largest copyright settlement in U.S. history — with direct implications for Canadian authors whose works were in the training datasets.
Canada's Copyright Act does not contain a text-and-data-mining (TDM) exception for AI training, unlike Japan, the UK, Singapore, or the EU's conditional exception. The government launched a formal consultation on copyright and AI in October 2023, receiving close to 1,000 responses. ISED published its "What We Heard" report on February 11, 2025, finding no stakeholder consensus: creators argued unauthorized AI training violates existing law, while technology companies contended TDM extracts factual patterns rather than expressive content. No Copyright Act amendments have been tabled as of March 2026.
The Standing Committee on Canadian Heritage (CHPC) commenced a study on the effects of AI on creative industries in September 2025, hearing from the Coalition for the Diversity of Cultural Expressions, Copibec, SOCAN, and others. The committee moved to drafting instructions in November 2025. Canadian creators' organizations are unified in demanding a licensing regime. SOCAN stated that AI companies "ingesting and training models on copyrighted musical works without permission from, compensation for, or credit to creators is not fair use, but theft." The Writers' Union of Canada stated unequivocally that "AI training, including text and data mining, on published material without author permission is copyright infringement." Access Copyright, Copibec, and Music Publishers Canada have taken similar positions.
Canada's creative industries contribute approximately $55.5 billion annually to GDP and employ over 600,000 people (2020 figures). The structural risk is that AI systems trained on the output of these industries erode their economic viability, creating a paradox where the training data pipeline depends on an industry that AI is simultaneously undermining.
Préjudices
Les systèmes d'IA entraînés sur des œuvres canadiennes protégées sans consentement, compensation ou attribution produisent des résultats qui concurrencent le travail créatif original. Plusieurs recours collectifs canadiens ont été déposés alléguant l'utilisation non autorisée de matériel protégé.
Le contenu généré par l'IA réduit la demande de travail créé par des humains, érodant les fondements économiques des industries culturelles canadiennes. Les éditeurs, médias et professionnels créatifs canadiens signalent des revenus en baisse attribués aux substituts générés par l'IA.
Preuves
9 rapports
-
Close to 1,000 responses; no stakeholder consensus on AI training and copyright
- J.B. MacKinnon on Why He's Suing the AI Giants Source principale
MacKinnon filed four class actions against Meta, Anthropic, Databricks, NVIDIA for using 'the Pile' dataset with 196,640 unlicensed books
- Anthropic agrees to pay $1.5B US to settle author class action Source principale
Bartz v. Anthropic $1.5B settlement covering ~500,000 pirated works — largest copyright settlement in U.S. history
-
Ontario Superior Court ruled it has jurisdiction to hear Canadian publishers' copyright lawsuit against OpenAI; $260K costs awarded
-
AI training on published material without author permission is copyright infringement
-
SOCAN: AI training on copyrighted music 'is not fair use, but theft'
-
Copibec called for concrete action rather than continued study
-
CHPC committee studying AI and creative industries since September 2025
-
Six class actions proposed across Federal and Provincial Courts in 2025
Détails de la fiche
Recommandations de politiqueévalué
Clarify whether AI training on copyrighted works constitutes fair dealing or requires licensing under Canadian law
INDU Committee / Government consultationEstablish mandatory transparency requirements for AI training data provenance
EU AI Act modelDevelop compensation mechanisms for Canadian creators whose works are used in AI training
Access Copyright / SOCANÉvaluation éditoriale évalué
Les industries créatives du Canada — 53 G$ au PIB, plus de 650 000 emplois — sont simultanément exploitées pour les données d'entraînement de l'IA et déplacées par les systèmes résultants. La Loi sur le droit d'auteur n'a pas d'exception ETD, créant une incertitude juridique. Le gouvernement a consulté mais n'a pas légiféré.
Entités impliquées
Fiches connexes
- Labour Market Shifts in AI-Exposed Occupations and Early-Career Employment Stagnationrelated
- Large Language Model Training Data and Canadian Privacy Rightsrelated
Taxonomieévalué
Historique des modifications
| Version | Date | Modification |
|---|---|---|
| v1 | 10 mars 2026 | Initial publication |