Unicode Character Inspector

Samples:

Summary

7Characters
6Unique
10UTF-8 bytes
1Non-ASCII
6BMP
1Supplementary

Character Breakdown (7 characters)

H
U+0048
LATIN CAPITAL LETTER H
Letter
UTF-8
0x48
UTF-16
U+0048
HTML
H
JS/JSON
H
e
U+0065
LATIN SMALL LETTER E
Letter
UTF-8
0x65
UTF-16
U+0065
HTML
e
JS/JSON
e
l
U+006C
LATIN SMALL LETTER L
Letter
UTF-8
0x6C
UTF-16
U+006C
HTML
l
JS/JSON
l
l
U+006C
LATIN SMALL LETTER L
Letter
UTF-8
0x6C
UTF-16
U+006C
HTML
l
JS/JSON
l
o
U+006F
LATIN SMALL LETTER O
Letter
UTF-8
0x6F
UTF-16
U+006F
HTML
o
JS/JSON
o
U+0020
SPACE
Separator
UTF-8
0x20
UTF-16
U+0020
HTML
JS/JSON
🌍
U+1F30D
NONCHARACTER (U+1F30D)
Other
UTF-8
0xF0 0x9F 0x8C 0x8D
UTF-16
U+D83C U+DF0D
HTML
🌍
JS/JSON
\u{1F30D}

What This Tool Does

Unicode Character Inspector is built for deterministic developer and agent workflows.

Inspect any text character-by-character: code points, UTF-8/16/32 byte sequences, HTML entities, JS escapes, and Unicode categories. 100% client-side.

Use How to Use for execution steps and FAQ for constraints, policies, and edge cases.

Last updated:

This tool is provided as-is for convenience. Output should be verified before use in any production or critical context.

Agent Invocation

Best Path For Builders

Browser workflow

Runs instantly in the browser with private local processing and copy/export-ready output.

Browser Workflow

This tool is optimized for instant in-browser execution with local data handling. Run it here and copy/export the output directly.

/unicode-inspector/

For automation planning, fetch the canonical contract at /api/tool/unicode-inspector.json.

How to Use Unicode Character Inspector

  1. 1

    Paste or type your text

    Enter any text in the input box — plain ASCII, accented letters, emoji, CJK characters, RTL scripts, or a mix. You can also click a sample button to load a ready-made example like 'Emoji mix' or 'Zero-width'.

  2. 2

    Read the Summary panel

    The Summary row shows total character count, unique code points, total UTF-8 byte count, non-ASCII character count, and how many characters fall inside vs. outside the Basic Multilingual Plane (BMP).

  3. 3

    Inspect individual character cards

    Each character gets its own card showing: the glyph, its Unicode code point (e.g. U+1F30D), official name, category badge (Letter, Emoji, Control…), UTF-8 bytes, UTF-16 code units, HTML entity, and JavaScript escape sequence.

  4. 4

    Copy any encoding

    Click the small 'copy' button next to any encoding value — UTF-8 bytes, UTF-16 units, HTML entity, or JS escape — to copy that representation directly to your clipboard. Use this to paste exact escape sequences into your source code.

Frequently Asked Questions

What is Unicode Character Inspector?
Unicode Character Inspector is a free online tool that breaks any text string into its individual characters and shows each one's Unicode code point (e.g. U+1F30D), UTF-8 byte sequence, UTF-16 code units, UTF-32 value, HTML entity, JavaScript escape, and Unicode category — instantly in your browser.
How do I use Unicode Character Inspector?
Paste or type any text into the input box. Each character appears as a card showing its code point, name, category badge, and all encoding representations. Use the sample buttons to try emoji, accented letters, CJK, RTL, or zero-width characters.
Is Unicode Character Inspector free?
Yes. Unicode Character Inspector is completely free with no account or sign-up required.
Does Unicode Character Inspector send my text to a server?
No. All character inspection happens entirely in your browser using the JavaScript Unicode API. Your text never leaves your device.
Why do emoji show two UTF-16 units?
Emoji and other supplementary characters (above U+FFFF) fall outside the Basic Multilingual Plane. In UTF-16 they use a surrogate pair: a high surrogate (D800–DBFF) plus a low surrogate (DC00–DFFF). The inspector shows both units — the exact values JavaScript's string engine uses internally.