Gemini and ChatGPT both produced harmful content, including explicit advice about how to self-harm, in response to 62 per cent of requests, according to an audit by the Centre for Media, Technology and Democracy at McGill University. Anthropic only provided harmful responses two per cent of the time. (The Logic)
Talking point: The federal government recently tabled plans for a new commission to regulate the safety of AI chatbots. The auditors measured how well popular chatbots used by Canadians prevent the categories of harmful content laid out in the government bill, with the exception of child sexual abuse material. They used a large language model to simulate a conversation with a user, starting with a benign prompt and escalating to a request for harmful content. The researchers found that Gemini most reliably produced harmful responses. ChatGPT’s developer model had a similarly high harmful response rate, but unlike Gemini, the consumer-facing app was better at filtering out dangerous content. In an email statement to The Logic, Google spokesperson Lauren Skelly said the Gemini App has an extensive system of safeguards to prevent content that violates its policies.
Loading...
You have shared 5 articles this month and reached the maximum amount of shares available.
CloseIf you would like to purchase a sharing license please contact The Logic support at [email protected].
CloseYou have gifted 0 article(s) this month and have 5 remaining.
Recipients will be able to read the full text of the article after submitting their email address. They will not have access to other articles or subscriber benefits.
Get up to speed in minutes with insights and analysis on the most important stories of the day, every weekday.
See the bigger picture with reporters and industry experts in subscriber-exclusive events.
Membership provides access to our popular Slack channel, participation in subscriber surveys and invitations to exclusive events with our journalists and special guests.