AI Security

Model abuse, prompt injection, AI safety tooling, supply chain risk, and red teaming.

Latest: 2026-08-01 12:39 UTC. 13 source-backed, 0 watch.

Latest Signals

AI Security feed

2026-08-01 12:39 UTC

Overview of the AI Security Market from Hacker News Discussion

A Hacker News submission points to a field guide by Zeltser covering the AI security market. The thread currently has three points and no comments.

Hacker News USEFUL NOW PRACTICAL

SOURCE-BACKED 79%

2026-07-31 23:17 UTC

Framework choice explains only ~0.06% of agentic AI security outcomes

A study analyzing 7,020 trials found that the choice of AI framework accounts for approximately 0.06% of security outcomes in agentic AI systems. This suggests that other factors play a much larger role in determining AI security.

Hacker News OPEN SOURCE TECHNICAL

SOURCE-BACKED 79%

2026-07-31 14:02 UTC

Research on Stealthy Audio Prompt Injections Targeting Multimodal LLMs

A new study explores stealthy concurrent audio prompt injections as an attack vector against multimodal large language model (LLM) agents. The research highlights potential vulnerabilities in how these models process audio inputs.

Hacker News SECURITY TECHNICAL

SOURCE-BACKED 78%

2026-07-30 19:38 UTC

Prompt Injection Remains a Vulnerability in LLM Applications

Prompt injection attacks continue to pose risks in large language model (LLM) applications, as discussed in a Hacker News thread. These attacks exploit the way LLMs process input prompts, leading to potential security breaches.

Hacker News SECURITY GENERAL

SOURCE-BACKED 78%

2026-07-27 21:56 UTC

Microsoft unveils cost-effective AI security tools outperforming rivals

Microsoft has introduced new AI security tools that it claims outperform competing platforms at a lower cost.

Ars Technica AI USEFUL NOW PRACTICAL

SOURCE-BACKED 92%

2026-07-27 16:25 UTC

Microsoft launches global AI red teaming initiative to boost AI security

Microsoft's External Red Team Alliance (EXTRA) is a global initiative partnering with universities and experts to identify AI risks and improve security testing. EXTRA aims to enhance the resilience of advanced AI systems through collaborative red teaming efforts.

Microsoft Security Blog RESEARCH TECHNICAL

SOURCE-BACKED 95%

2026-07-27 15:38 UTC

From /Init to Code Execution with Opus 5 – An Indirect Prompt Injection Story

A write-up on veganmosfet.codeberg.page, surfaced via Hacker News. Going by its title, it describes an indirect prompt injection that leads from /init to code execution with Opus 5.

Hacker News SECURITY TECHNICAL

SOURCE-BACKED 78%

2026-07-24 05:38 UTC

Aligning AI Chess Agents with Human Reasoning for Safer Decision-Making

The paper explores aligning complex AI reasoning agents with human conceptual models to improve AI security and safety. It emphasizes the need to characterize and integrate insights from agents with different reasoning architectures for predictable deployment.

arXiv RESEARCH TECHNICAL

SOURCE-BACKED 95%

2026-07-15 03:21 UTC

Frontier AI Agents Tested for Autonomous Clinical AI Security Audits

Researchers propose an evaluation task to assess whether advanced AI agents can independently conduct structured security audits on clinical AI models. This aims to address the challenge of detecting adversarial vulnerabilities that could harm patients without requiring extensive human expertise.

arXiv TECHNICAL

SOURCE-BACKED 95%

2026-04-02 16:00 UTC

Google Workspace Tackles Indirect Prompt Injection Threats

Google's GenAI Security Team outlines a continuous approach to mitigate indirect prompt injection (IPI), an evolving threat vector targeting AI systems. The strategy focuses on ongoing detection and response to protect Google Workspace users.

Google Security Blog TECHNICAL

SOURCE-BACKED 95%

2025-12-08 18:03 UTC

Architecting Security for Agentic Capabilities in Chrome

Posted by Nathan Parker, Chrome security team. Chrome has been advancing the web’s security for well over 15 years, and we’re committed to meeting new challenges and opportunities with AI. Billions of people trust Chrome...

Google Security Blog PRACTICAL

SOURCE-BACKED 91%

2025-08-25 16:01 UTC

GitHub outlines VS Code defenses against prompt injection attacks

GitHub Security Lab highlights risks of indirect prompt injections in VS Code that can expose tokens, files, or run code without consent. The post details VS Code features that help mitigate these vulnerabilities.

GitHub Security Lab TECHNICAL

SOURCE-BACKED 93%

2025-06-13 16:03 UTC

Google outlines layered defense strategy against prompt injection attacks

Google's GenAI Security Team highlights emerging threats from generative AI and proposes a layered defense approach to mitigate prompt injection attacks. The strategy aims to enhance security as generative AI adoption grows rapidly.

Google Security Blog TECHNICAL

SOURCE-BACKED 95%