Auditor General Found CRA's $18-Million AI Chatbot Gave Incorrect Tax Answers

Confirmed Contested Significant

Canada's $18M tax chatbot answered only two of six basic questions correctly, the Auditor General found.

Occurred: February 1, 2020 (month) Reported: December 12, 2025

Field: narrative

Reports (sources available for citation)

[1] CBC News (2025-10-21) — In scathing report, AG finds CRA call centres are slow to answer and often inaccurate Supports: Auditor General found CRA chatbot Charlie provided accurate answers only a third of the time [2] 980 CJME (2025-10-25) — CRA must fix human responses before pursuing AI, experts say Supports: AI experts warned that AI will replicate or amplify errors from inaccurate human agents [3] iPhone in Canada (2025-12-12) — Auditor General Slams Ottawa's $18 Million CRA Chatbot 'Charlie' Supports: CRA spent $18 million on chatbot Charlie; AG found it gave wrong answers 66% of the time [4] Unpublished (2025-12-12) — The CRA spent $18M on 'Charlie,' a new tax information chatbot that is wrong most of the time Supports: Charlie met a 70% accuracy threshold in internal testing; upgraded to generative AI in November 2025

Text to add citations to

The Canada Revenue Agency announced Charlie, an AI-powered chatbot, in February 2020 and launched it in March 2020 to answer taxpayer questions about tax filing, benefits, and CRA services. By the time of the Auditor General's review in October 2025, the system had processed over 18 million questions at a cost of approximately $18 million (iPhone in Canada, 2025; Unpublished, 2025).

The Auditor General's October 2025 report found significant accuracy problems. When tested with common taxpayer questions, the chatbot answered only two out of six questions correctly (CBC News, 2025; iPhone in Canada, 2025). The Auditor General also noted that other publicly available AI tools answered five out of six of the same questions correctly.

With 18 million queries processed, the error rate identified by the Auditor General raises concerns about the accuracy of tax information provided to Canadians through the system. Taxpayers who relied on Charlie's responses may have received incorrect information about filing requirements, deadlines, or eligibility for benefits. The CRA chatbot carries the implicit authority of the federal tax agency, and users have no straightforward way to know when the chatbot's answer is wrong.

The Auditor General also noted that Charlie had met only a 70% accuracy threshold in CRA's own internal testing — meaning inaccurate responses 30% of the time (Unpublished, 2025). The Secretary of State (Canada Revenue Agency), Wayne Long, responded publicly: "We've got a lot of room for improvement. We know it, we accept the report and we're going to do better." The CRA acknowledged it could not confirm real-world accuracy without a comprehensive review of every interaction. In November 2025, the CRA upgraded Charlie to a generative AI version and reported pre-release testing showed approximately 90% accuracy, though the agency acknowledged it could not confirm real-world accuracy (Unpublished, 2025).

Materialized From

AI Confabulation in Consequential Canadian Contexts

Harms

When tested by the Auditor General with six common taxpayer questions, CRA's chatbot Charlie answered only two correctly, providing wrong information about tax obligations, filing requirements, and CRA procedures. The chatbot had processed an estimated 18 million queries over its lifetime.

MisinformationService DisruptionSignificantPopulation

CRA spent $18 million on an AI chatbot without adequate accuracy testing or ongoing quality monitoring, deploying it as an official government information source that carried the implicit authority of the federal tax agency.

MisinformationService DisruptionModeratePopulation

Evidence

4 reports

In scathing report, AG finds CRA call centres are slow to answer and often inaccurate Primary source
Media — CBC News (Oct 21, 2025)
Auditor General found CRA chatbot Charlie provided accurate answers only a third of the time
CRA must fix human responses before pursuing AI, experts say
Media — 980 CJME (Oct 25, 2025)
AI experts warned that AI will replicate or amplify errors from inaccurate human agents
Auditor General Slams Ottawa's $18 Million CRA Chatbot 'Charlie'
Media — iPhone in Canada (Dec 12, 2025)
CRA spent $18 million on chatbot Charlie; AG found it gave wrong answers 66% of the time
The CRA spent $18M on 'Charlie,' a new tax information chatbot that is wrong most of the time
Media — Unpublished (Dec 12, 2025)
Charlie met a 70% accuracy threshold in internal testing; upgraded to generative AI in November 2025

Record details

Responses & Outcomes

Nov 1, 2025Canada Revenue Agencyinstitutional actionActive

Upgraded Charlie to a generative AI version following the Auditor General's findings; reported pre-release testing showed approximately 90% accuracy but acknowledged it could not confirm real-world accuracy

Editorial Assessment assessed

The federal tax authority spent $18 million on an AI chatbot (iPhone in Canada, 2025; Unpublished, 2025) that the Auditor General found gave incorrect answers to basic tax questions (CBC News, 2025). The chatbot processed over 18 million queries, raising concerns about the accuracy of tax information provided to Canadians through the system.

Entities Involved

Canada Revenue Agency

deployer

AI Systems Involved

CRA AI Chatbot

AI-powered chatbot 'Charlie' deployed to answer taxpayer questions about tax filing, benefits, and CRA services; processed over 18 million questions and was found by the Auditor General to answer only two out of six test questions correctly

Related Records

AI in Canadian Government Automated Decision-Makingrelated

Taxonomyassessed

Domain

Public ServicesFinance & Banking

Harm type

MisinformationService Disruption

AI pathway

Deployment ContextMonitoring Absent

Lifecycle phase

DeploymentMonitoring

AIID: Incident #1310

Changelog

Changelog
Version	Date	Change
v1	Mar 8, 2026	Initial publication
v2	Mar 11, 2026	Neutrality and factuality review: corrected AG report date from December 2025 to October 2025 (report tabled October 21, 2025); fixed launch date (announced February 2020, launched March 2020); replaced inferred wrong-answer topics with AG's actual comparison to other AI tools; corrected 70% accuracy framing (AG cited this as evidence of inadequacy, not CRA's defense); clarified Secretary of State pushback was about broader report findings; removed three fabricated policy recommendation attributions (AG's formal recommendations addressed call centre operations, not chatbot-specific measures).

Version 2

← All incidents