The EU warns of possible action after the US imposes travel bans on five Europeans, saying it will defend its “regulatory autonomy against unjustified measures”
What You Need to Know for Your Next Trip
Mohd Haider / Benzinga: Elon Musk Reacts To US Travel Ban On Former EU Commissioner: ‘...Gets His Dessert’
Sky News: EU warns of possible action after US bar...
Researchers claim that prompts framed as riddle-like poems could skirt AI chatbots' safety features designed to block production of explicit or harmful content
Riddle-like poems tricked chatbots into spewing hate speech and helping design nuclear weapons and nerve agents.
Ofcom chief Melanie Dawes warns social media companies to prove their algorithms protect under-18s from seeing harmful content, or face enforcement action
Ofcom chief Melanie Dawes reveals she held meetings with US AI firms over Online Safety Act — Tech companies will be subject …
How AI is increasingly being used to replace human content moderators, who say that the tech is not yet capable of reliably identifying harmful content
Kevin decided on a career in content moderation after his YouTube recommendations took a bewildering swerve.
The European Commission says that France, Spain, Italy, Denmark, and Greece will test a blueprint for an age verification app meant to protect children online
if I live that long
@bluespacecanary: Honestly I wish the EU would hurry up and do the thing it actually wants (banning American tech companies from operating there) so it can get a p...
OpenAI launches the “Safety evaluations hub”, a webpage showing how its models score on tests for harmful content generation, jailbreaks, and hallucinations
OpenAI is moving to publish the results of its internal AI model safety evaluations more regularly in what the outfit …
Ofcom outlines 40+ child safety measures for websites and apps to introduce from July 2025 or face large fines under the Online Safety Act, including age checks
Companies will be legally required to block children's access to harmful content under UK's Online Safety Act or face large fines
Court documents: India criticizes Elon Musk's X for wrongly labeling an official website, used to notify tech firms of harmful content, as a “censorship portal”
India has criticised Elon Musk's X for wrongly labelling as a “censorship portal” an official website …
Microsoft identifies hackers in the US, Iran, the UK, Hong Kong, and Vietnam who bypassed guardrails on AI tools and sold access to other malicious groups
US and overseas hackers sold access to tools, which were then used to generate harmful content, Microsoft says.
Anthropic details Constitutional Classifiers, a protective LLM layer designed to stop AI model jailbreaking by monitoring inputs and outputs for harmful content
inputs designed to bypass its safety training and force it to produce outputs that might be harmful. Our new technique is a step towards robust jailbreak defenses. Read the blog post: https://anthropi...