Challenge 12: Unicode Homoglyph Substitution

Difficulty: Hard
Category: Signature Detection Bypass


Objective

Bypass the scanner’s signature detection by replacing one or more characters in the payload with visually identical characters from different Unicode blocks. The string looks the same to the human eye but is stored as a different byte sequence, so it no longer matches the signature.

Scanner Behavior

The scanner performs static byte-pattern matching against file contents. It searches for the following strings as exact ASCII byte sequences:

Matching is done purely on byte values: each signature is compared against the file as the bytes of its ASCII characters. The scanner does not perform visual similarity analysis, Unicode normalization, or homoglyph detection.
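
The matching described above can be modeled in a few lines of Python. This is only a sketch of the stated behavior, not the scanner's actual implementation: the name `scan` is hypothetical, and `SIGNATURES` holds a made-up placeholder rather than any of the challenge's real strings.

```python
# Hypothetical signature list; the challenge's real strings are not reproduced here.
SIGNATURES = [b"EXAMPLE-SIGNATURE"]


def scan(data: bytes, signatures=SIGNATURES) -> list:
    """Return every signature that occurs in data as an exact byte sequence."""
    # bytes.__contains__ is a plain substring search on raw byte values;
    # no decoding, normalization, or visual comparison takes place.
    return [sig for sig in signatures if sig in data]
```

Because the comparison never decodes the file, any text whose bytes differ from the signature, even by a single character's encoding, produces no match under this model.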

Rules

Hints

  1. Not all characters that look like a are actually a. The Cyrillic а (U+0430) looks identical to Latin a (U+0061) but is encoded differently: in UTF-8, U+0430 is the two bytes 0xD0 0xB0, while U+0061 is the single byte 0x61.
  2. Unicode contains many characters that closely resemble ASCII letters, spread across different scripts and blocks (Cyrillic, Greek, mathematical alphanumeric symbols, etc.).
  3. Replacing even one character with its homoglyph breaks the ASCII byte pattern match.
  4. The same technique underlies real-world phishing attacks (IDN homograph attacks), and it defeats scanners that compare raw bytes without normalization or confusable mapping.
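
The byte-level difference behind these hints can be checked directly. The sketch below uses Python's standard `unicodedata` module; the string `"sample"` is a neutral placeholder rather than one of the scanner's signatures, and `non_ascii_letters` is a hypothetical helper showing the kind of crude homoglyph check the scanner does not perform.

```python
import unicodedata

LATIN_A = "\u0061"     # LATIN SMALL LETTER A
CYRILLIC_A = "\u0430"  # CYRILLIC SMALL LETTER A

for ch in (LATIN_A, CYRILLIC_A):
    # Print code point, official name, and UTF-8 bytes in hex.
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)} -> {ch.encode('utf-8').hex()}")
# U+0061 LATIN SMALL LETTER A -> 61
# U+0430 CYRILLIC SMALL LETTER A -> d0b0

# Placeholder text, not one of the challenge's signatures.
original = "sample"
lookalike = original.replace(LATIN_A, CYRILLIC_A, 1)  # swap only the first "a"

pattern = original.encode("ascii")
print(pattern in original.encode("utf-8"))   # True
print(pattern in lookalike.encode("utf-8"))  # False


def non_ascii_letters(text):
    """Return (index, code point, name) for each non-ASCII alphabetic character."""
    return [
        (i, f"U+{ord(ch):04X}", unicodedata.name(ch, "UNKNOWN"))
        for i, ch in enumerate(text)
        if ch.isalpha() and not ch.isascii()
    ]


print(non_ascii_letters(lookalike))
# [(1, 'U+0430', 'CYRILLIC SMALL LETTER A')]
```

A check like `non_ascii_letters` would also flag legitimate non-Latin text, so it is best read as an illustration of the gap in the scanner rather than a production detector.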
