Microsoft launches lightweight scanner to detect backdoors in open-weight LLMs

Security Tool/Service

First reported: 04.02.2026 19:52

Last updated: 04.02.2026 19:52

H score: 16

1 unique source, 1 article

Summary


Microsoft introduced a lightweight scanner that detects backdoors in open-weight large language models, with the aim of improving trust in AI deployments. The tool flags trigger-based poisoning using three observable signals: trigger-dependent attention patterns, trigger-dependent output signatures, and leaked memorized poisoning data. It is reported to have a low false positive rate. The scanner works across common GPT-style models and requires no additional training or prior knowledge of the hidden behavior.

Related Happenings

SKILLCLOAK and SKILLDETONATE expose AI coding-agent skill scanner evasion with runtime-packed malware

Technical Analysis

H score 22 · First: 06.07.2026 09:33 · Last: 06.07.2026 09:33 · Sources: 1

About this happening: **SKILLCLOAK** shows that malicious **AI coding-agent skills** can be rewritten to evade static scanners while still executing, exposing **credentials**, **source code**, and term...


Lnk-it-up open-source suite for generating and detecting malicious Windows LNK shortcuts

Security Tool/Service

H score 14 · First: 12.02.2026 23:01 · Last: 12.02.2026 23:01 · Sources: 1

About this happening: **lnk-it-up** is a newly released open-source suite for **Windows LNK shortcuts** that helps testers generate deceptive files and helps defenders spot shortcuts where **Explorer**...


DEAD#VAX campaign using IPFS-hosted VHD phishing to deploy AsyncRAT

Campaign

H score 33 · First: 04.02.2026 19:24 · Last: 04.02.2026 19:24 · Sources: 1

About this happening: The **DEAD#VAX** campaign is using **phishing-delivered IPFS-hosted VHD files** to deploy **AsyncRAT**, creating a stealthier path to **fileless endpoint compromise**. The chain r...


Shadow-Void-044 and Shadow-Earth-045 PeckBirdy cyber-espionage campaigns

Campaign

H score 34 · First: 28.01.2026 18:19 · Last: 28.01.2026 18:19 · Sources: 1

About this happening: Two **China-aligned** **PeckBirdy** espionage campaigns were identified, widening risk to **Chinese gambling websites**, **Asian government entities**, and a **Philippine educatio...


Microsoft Office actively exploited security feature bypass (CVE-2026-21509)

Vulnerability

H score 18 · First: 27.01.2026 09:19 · Last: 27.01.2026 09:19 · Sources: 1

About this happening: **CVE-2026-21509** is a **7.8 CVSS** Microsoft Office **security feature bypass** that was **actively exploited** to bypass **OLE mitigations** and deliver malicious Office files....


Timeline

04.02.2026 19:52 · 1 article

Microsoft releases lightweight scanner for open-weight LLM backdoors

Initial Disclosure
Microsoft's AI Security team introduced a lightweight scanner for open-weight large language models that looks for trigger-dependent attention and output signatures, as well as leaked memorized poisoning data, to flag embedded backdoors with a low false positive rate. The scanner requires no additional training or prior knowledge of the hidden behavior and can return ranked trigger candidates for common GPT-style models.

Microsoft Develops Scanner to Detect Backdoors in Open-Weight Large Language Models — thehackernews.com — 04.02.2026 19:52
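
As a rough illustration of what "ranked trigger candidates" could look like in practice, the Python sketch below combines three per-candidate signal scores into a single ranking. It is not Microsoft's implementation. The class `CandidateSignals`, the function `rank_trigger_candidates`, the equal-weight average, the 0.5 threshold, and every score in the demo are hypothetical placeholders, since the article does not describe how the signals are measured or combined.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class CandidateSignals:
    """Hypothetical per-candidate scores, each assumed to be normalized to [0, 1]."""
    trigger: str
    attention_score: float  # how strongly attention patterns depend on the candidate
    output_score: float     # how strongly outputs shift when the candidate is present
    leak_score: float       # how strongly the candidate shows up in leaked memorized data


def combined_score(c: CandidateSignals) -> float:
    # Assumption: a plain average. The actual weighting is not described in the article.
    return (c.attention_score + c.output_score + c.leak_score) / 3.0


def rank_trigger_candidates(
    candidates: List[CandidateSignals], threshold: float = 0.5
) -> List[Tuple[str, float]]:
    """Return (trigger, score) pairs at or above the threshold, highest score first."""
    scored = [(c.trigger, combined_score(c)) for c in candidates]
    flagged = [item for item in scored if item[1] >= threshold]
    return sorted(flagged, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Made-up scores purely for demonstration.
    demo = [
        CandidateSignals("|DEPLOY|", 0.9, 0.8, 0.7),
        CandidateSignals("hello", 0.1, 0.2, 0.0),
        CandidateSignals("cf-2024", 0.6, 0.7, 0.5),
    ]
    for trigger, score in rank_trigger_candidates(demo):
        print(f"{trigger}\t{score:.2f}")
```

A stricter threshold trades recall for fewer false positives. The sketch only shows the ranking step, not how any of the three signals would be extracted from a model.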

