harmful content (Entity)

Associated Press 39 related

The EU warns of possible action after the US imposes travel bans on five Europeans, saying it will defend its “regulatory autonomy against unjustified measures”

What You Need to Know for Your Next Trip Mohd Haider / Benzinga : Elon Musk Reacts To US Travel Ban On Former EU Commissioner: ‘...Gets His Dessert’ Sky News : EU warns of possible action after US bar...

2025-12-25 View

The Verge 9 related

Researchers claim that prompts framed as riddle-like poems could skirt AI chatbots' safety features designed to block production of explicit or harmful content

Riddle-like poems tricked chatbots into spewing hate speech and helping design nuclear weapons and nerve agents.

2025-12-05 View

Financial Times 2 related

Ofcom chief Melanie Dawes warns social media companies to prove their algorithms protect under-18s from seeing harmful content, or face enforcement action

Ofcom chief Melanie Dawes reveals she held meetings with US AI firms over Online Safety Act — Tech companies will be subject …

2025-10-31 View

Bloomberg 6 related

How AI is increasingly being used to replace human content moderators, who say that the tech is not yet capable of reliably identifying harmful content

Kevin decided on a career in content moderation after his YouTube recommendations took a bewildering swerve.

2025-08-23 View

Reuters 12 related

The European Commission says that France, Spain, Italy, Denmark, and Greece will test a blueprint for an age verification app meant to protect children online

if I live that long [embedded post] @bluespacecanary : Honestly I wish the EU would hurry up and do the thing it actually wants (banning American tech companies from operating there) so it can get a p...

2025-07-15 View

TechCrunch 7 related

OpenAI launches the “Safety evaluations hub”, a webpage showing how its models score on tests for harmful content generation, jailbreaks, and hallucinations

OpenAI is moving to publish the results of its internal AI model safety evaluations more regularly in what the outfit …

2025-05-15 View

The Guardian 18 related

Ofcom outlines 40+ child safety measures for websites and apps to introduce from July 2025 or face large fines under the Online Safety Act, including age checks

Companies will be legally required to block children's access to harmful content under UK's Online Safety Act or face large fines

2025-04-24 View

Reuters 2 related

Court documents: India criticizes Elon Musk's X for wrongly labeling an official website, used to notify tech firms of harmful content, as a “censorship portal”

India has criticised Elon Musk's X for wrongly labelling as a “censorship portal” an official website …

2025-03-28 View

Bloomberg 1 related

Microsoft identifies hackers in the US, Iran, the UK, Hong Kong, and Vietnam who bypassed guardrails on AI tools and sold access to other malicious groups

US and overseas hackers sold access to tools, which were then used to generate harmful content, Microsoft says.

2025-02-27 View

Financial Times 10 related

Anthropic details Constitutional Classifiers, a protective LLM layer designed to stop AI model jailbreaking by monitoring inputs and outputs for harmful content

inputs designed to bypass its safety training and force it to produce outputs that might be harmful. Our new technique is a step towards robust jailbreak defenses. Read the blog post: https://anthropi...

2025-02-04 View

harmful content

Related Entities

Top Voices

Explore Further

Coverage Timeline