Why does image-based phishing bypass email security?

Traditional email security tools rely on text analysis, keyword matching, natural language processing, and URL extraction to detect threats. When the message is rendered as a single image, there is no text for these tools to parse. The email body appears empty to content filters, allowing the phishing message to reach the inbox undetected.

How is image-based phishing detected?

Detection requires optical character recognition (OCR) to extract text from the image and computer vision to identify brand logos, QR codes, and other visual indicators of phishing. These technologies allow security tools to read and analyze the visual content just as a human recipient would.

What is the difference between image-based phishing and steganography?

Image-based phishing displays the phishing message visually as an image to bypass text filters. Steganography hides malicious payloads inside image files so they are invisible to both humans and scanners. In image-based phishing, the threat is what you see. In steganography, the threat is what you cannot see.

What is Image-Based Phishing?

Q: What is image-based phishing?

Image-based phishing is a technique where the entire email body is delivered as an embedded image (PNG, JPEG, or GIF) instead of rendered HTML text. The message content, including logos, text, and URLs, is baked into the image file, preventing text-based email security filters from parsing or analyzing the content.

Image-based phishing renders the entire email message as an embedded image to bypass text-based security filters. Detection requires OCR or computer vision to extract and analyze the visual content.

Image-Based Phishing Explained

Image-based phishing is a social engineering technique where the entire email body is delivered as an embedded image rather than rendered HTML text. Instead of writing the phishing message in a format that email clients render as selectable text, attackers compose the message in a design tool, export it as a PNG, JPEG, or GIF, and embed that image file inline. The result looks identical to a normal email from the recipient's perspective, but contains zero parseable text for security tools to analyze. MITRE ATT&CK classifies phishing under Initial Access (T1566), and image-based delivery represents one of the most effective evasion variants within that technique.

How Image-Based Phishing Works

The attack follows a consistent pattern:

Message composition. The attacker creates a convincing email layout (brand logos, formatted text, call-to-action buttons) and exports the entire composition as a single image file. Every element that would normally be HTML, including headings, body copy, footer disclaimers, and URLs, is flattened into pixels.
Inline embedding or remote hosting. The image is either embedded directly into the email body using Base64 encoding or hosted on a reputable domain (cloud storage, image hosting services) and referenced via an HTML tag. Remote hosting adds a layer of evasion because the email filter sees only a URL pointing to a trusted domain rather than the phishing content itself.
Link delivery. The attacker wraps the entire image in a clickable hyperlink () so that clicking anywhere on the image redirects the victim to a credential harvesting page. In some variants, no clickable link exists at all. The image displays a URL that victims must type manually, eliminating any link for security tools to scan.
QR code variants. A growing subset of image-based phishing embeds QR codes in a technique known as quishing within the image. This shifts the attack to mobile devices, where endpoint protection is typically weaker and users are more likely to authenticate without scrutiny.

Research analyzing 386 verified phishing emails found that "Text in Image" was the most prevalent obfuscation technique at 47%, appearing in nearly half of all phishing samples studied. The technique was significantly correlated with successful antispam evasion.

Why Image-Based Phishing Evades Traditional Filters

Conventional email security relies on multiple text-dependent analysis methods, all of which fail against image-only messages:

Keyword and pattern matching. Filters that flag suspicious phrases ("verify your account," "payment failed," "click here immediately") find nothing to match because no text exists in the email body.
Natural language processing. NLP models trained to detect urgency, impersonation patterns, and social engineering language cannot process pixel data.
URL extraction and reputation checking. When the URL is rendered inside an image rather than coded as an HTML anchor, security tools cannot extract or evaluate it. The phishing link is invisible to automated scanning.
Sender-content correlation. Filters that compare the sender domain against the email content for consistency (for example, flagging a message that claims to be from a bank but originates from a freemail address) cannot perform this analysis without parseable body text.

These gaps explain why image-based phishing has become a preferred evasion method, particularly when combined with impersonation techniques to strengthen the social engineering pretext.

Image-Based Phishing Detection from IRONSCALES

IRONSCALES uses computer vision and OCR to analyze image content within emails, extracting embedded text, identifying brand logos, and detecting QR codes that text-based scanners cannot parse.

Related Terms

Email Attack of the Day is a daily series from IRONSCALES spotlighting real phishing attacks caught by Adaptive AI and our community of 35,000+ security professionals. Each post breaks down a real attack. What it looked like, why it worked, and what to do about it.

Explore More Articles

Say goodbye to Phishing, BEC, and QR code attacks. Our Adaptive AI automatically learns and evolves to keep your employees safe from email attacks.

For Enterprises

For MSPs & MSSPs

Protect Better

Simplify Operations

Empower Your Org

17,000+ Customers and Counting

Case Studies

Reviews

Osterman Research: The (Higher) Business Cost of Phishing

Case Study: Telit

Our Awards

How IRONSCALES Works

Platform Overview

API Integration

Artificial Intelligence

Human Element

Agentic Capabilities

Agents Overview

Red-Teaming Agent

Phishing SOC Agent

Phishing Simulation Agent

Platform Tours

BY USE CASE

Business Email Compromise

Advanced Malware & URL Attacks

Account Takeover Attacks

Email Encryption

Deepfake Attack Protection

DMARC Management

Phishing Simulation Testing

Security Awareness Training

BY PLATFORM

BY PROJECT

BY INDUSTRY

BY ROLE

LEARN

Blog

Threat Intelligence Center

Cybersecurity Glossary

Resource Library

Guides

Platform Tours

CONNECT

Events

Newsletter

LinkedIn

The Hidden Gaps in SEG Protection

New Gartner® Email Security Magic Quadrant™

Attack of the Day Explorer

Phishing Prevention

Spear Phishing

Voice Phishing

BY TYPE

MSPs and MSSPs

Resellers

Technology Partners

ENGAGE

Become Partner

Partner Portal

Partner with IRONSCALES

What is Image-Based Phishing?

Image-Based Phishing Explained

How Image-Based Phishing Works

Why Image-Based Phishing Evades Traditional Filters

Image-Based Phishing Detection from IRONSCALES

Related Terms

Explore More Articles

What is Direct Send?

What is S/MIME?

What is Data Detection and Response (DDR)?

What is Direct Send?

What is S/MIME?

What is Data Detection and Response (DDR)?

What is Multi-Tenancy Security?

What is ESP Abuse (Email Service Provider Abuse)?

What is Display Name Spoofing?

What is Multi-Tenancy Security?

What is ESP Abuse (Email Service Provider Abuse)?

What is Display Name Spoofing?