Convert text to/from Unicode escapes, HTML entities, UTF-8 bytes, code points, normalization forms, ASCII codes, and transliterate between scripts. All in your browser with complete privacy.
Type or paste any text into the input box. Stats update in real-time showing characters, unique chars, code points, and UTF-8 byte count.
Select from 20+ modes: \uXXXX escapes, HTML entities, UTF-8 bytes, code points, ASCII decimal/hex/binary/octal, NFC/NFD/NFKC/NFKD normalization, Cyrillic/Greek transliteration, full-width, ASCII filter, script detection, and more.
Click Convert or use the Swap button to exchange input and output. Copy results to clipboard or clear to start fresh.
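The live stats described above can be approximated in a few lines of JavaScript. This is a minimal sketch, not the tool's actual code: `textStats` is a hypothetical helper, and it assumes "characters" means UTF-16 code units (what `String.length` reports), which may differ from how the tool counts them.

```javascript
// Hypothetical helper; the tool's real implementation is not shown in the text.
function textStats(text) {
  // Array.from iterates a string by code point, so an emoji counts once.
  const codePoints = Array.from(text);
  return {
    utf16Units: text.length,                          // UTF-16 code units
    codePoints: codePoints.length,                    // Unicode code points
    uniqueCodePoints: new Set(codePoints).size,       // distinct code points
    utf8Bytes: new TextEncoder().encode(text).length, // size when encoded as UTF-8
  };
}

console.log(textStats("h\u00E9llo \u{1F600}"));
// { utf16Units: 8, codePoints: 7, uniqueCodePoints: 6, utf8Bytes: 11 }
```

The emoji explains the gap between the counts: it is one code point, two UTF-16 code units, and four UTF-8 bytes.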
The FreeNestTools Unicode Converter is a free, browser-based tool that instantly converts text between multiple Unicode formats. Whether you need to encode text as \uXXXX escape sequences for JavaScript strings, generate HTML entities for web pages, view UTF-8 byte representations, or decode escaped Unicode back to readable text — this tool handles it all in real-time, right in your browser.
Unicode is the global standard for encoding text across all modern systems. Every character you see — from basic Latin letters to Chinese ideographs, Arabic script, mathematical symbols, and emoji — has a unique code point (like U+0041 for 'A' or U+1F600 for '😀'). This tool reveals those code points and provides multiple ways to represent them.
This tool is essential for web developers who need to embed Unicode escapes in JavaScript, JSON, or CSS. Software engineers use it to debug encoding issues and work with internationalized text. Content creators use HTML entities to ensure special characters render correctly across all browsers. Students and educators studying character encoding, UTF-8, and internationalization find it invaluable for learning and experimentation.
Supported formats include: \uXXXX Escape, &#XXXX; Decimal, &#xXXXX; Hex, UTF-8 Bytes, U+XXXX Code Point, Decimal Code Point, Decode Escapes, NFC/NFD/NFKC/NFKD, To Cyrillic, To Greek, Full-width, Script Detect, ASCII Decimal, ASCII Hex, ASCII Binary, ASCII Octal, ASCII Only, and Decode ASCII.
The \uXXXX escape format represents each character as a Unicode escape sequence using four hexadecimal digits — for example, \u0041 for 'A' or \u00E9 for 'é'. This is commonly used in JavaScript, JSON, and C-family languages. Characters outside the Basic Multilingual Plane (BMP) — like most emoji — are represented as surrogate pairs (e.g., \uD83D\uDE00 for '😀').
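As a rough illustration of the \uXXXX format, the sketch below escapes every UTF-16 code unit of a string. `toUnicodeEscapes` is a hypothetical name chosen for this example, and the uppercase hex digits are an assumption about output style. Because JavaScript strings are stored as UTF-16, characters outside the BMP come out as surrogate pairs without any special handling.

```javascript
// Hypothetical helper: escape each UTF-16 code unit as \uXXXX.
function toUnicodeEscapes(text) {
  let out = "";
  for (let i = 0; i < text.length; i++) {
    const unit = text.charCodeAt(i); // one 16-bit code unit
    out += "\\u" + unit.toString(16).toUpperCase().padStart(4, "0");
  }
  return out;
}

console.log(toUnicodeEscapes("A"));         // \u0041
console.log(toUnicodeEscapes("\u00E9"));    // \u00E9
console.log(toUnicodeEscapes("\u{1F600}")); // \uD83D\uDE00
```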
HTML entities (&#XXXX; for decimal and &#xXXXX; for hexadecimal) let you embed any Unicode character safely in HTML documents. The UTF-8 bytes view shows the actual byte sequence used in the most common encoding standard — invaluable for debugging file encoding issues. The U+ code point display shows the official Unicode designation, and the Decode mode converts any of these representations back into the actual characters.
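The entity, byte, and code point views can each be sketched with standard JavaScript APIs (`codePointAt`, `TextEncoder`). The function names below are hypothetical and the space-separated, uppercase output is an assumption; the tool's own formatting may differ.

```javascript
// Hypothetical helpers; each iterates by code point so emoji are handled whole.
function toDecimalEntities(text) {
  return Array.from(text, (ch) => `&#${ch.codePointAt(0)};`).join("");
}

function toHexEntities(text) {
  return Array.from(
    text,
    (ch) => `&#x${ch.codePointAt(0).toString(16).toUpperCase()};`
  ).join("");
}

function toUtf8Hex(text) {
  // TextEncoder always produces UTF-8 bytes.
  const bytes = new TextEncoder().encode(text);
  return Array.from(bytes, (b) =>
    b.toString(16).toUpperCase().padStart(2, "0")
  ).join(" ");
}

function toCodePointNotation(text) {
  return Array.from(
    text,
    (ch) => "U+" + ch.codePointAt(0).toString(16).toUpperCase().padStart(4, "0")
  ).join(" ");
}

console.log(toDecimalEntities("\u00E9"));       // &#233;
console.log(toHexEntities("\u20AC"));           // &#x20AC;
console.log(toUtf8Hex("\u{1F600}"));            // F0 9F 98 80
console.log(toCodePointNotation("A\u{1F600}")); // U+0041 U+1F600
```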
All processing happens entirely in your browser using client-side JavaScript. Your text is never uploaded to any server; it never leaves your device. This ensures your content remains completely private and secure. There is no registration, no hidden charges, and no usage limits. Convert as much text as you need, as often as you like.
Unicode to Language features extend the tool beyond simple encoding. The Normalization modes (NFC, NFD, NFKC, NFKD) transform text into its canonical or compatibility forms — essential for text comparison, database indexing, and resolving encoding mismatches. To Cyrillic and To Greek transliterate Latin text into Cyrillic or Greek scripts using standard mapping systems. Full-width converts standard ASCII characters to their full-width Unicode equivalents used in East Asian typography. Script Detect analyzes text and reveals which Unicode scripts and blocks are present with a visual breakdown.
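To see why normalization matters for text comparison, consider the sketch below, which uses the built-in `String.prototype.normalize`. The helper names are hypothetical, and the full-width mapping (adding 0xFEE0 to printable ASCII, with the space mapped to U+3000) is one common approach assumed here, not necessarily what the tool does.

```javascript
// Two visually identical spellings of "é".
const composed = "\u00E9";    // single code point U+00E9
const decomposed = "e\u0301"; // "e" followed by U+0301 COMBINING ACUTE ACCENT

// Hypothetical helper: compare two strings after normalizing both to one form.
function equalAfterNormalizing(a, b, form = "NFC") {
  return a.normalize(form) === b.normalize(form);
}

// Hypothetical helper: map printable ASCII to its full-width counterpart.
function toFullWidth(text) {
  return Array.from(text, (ch) => {
    const cp = ch.codePointAt(0);
    if (cp === 0x20) return "\u3000"; // assumption: space becomes ideographic space
    if (cp >= 0x21 && cp <= 0x7e) return String.fromCodePoint(cp + 0xfee0);
    return ch; // everything else is left unchanged
  }).join("");
}

console.log(composed === decomposed);                     // false
console.log(equalAfterNormalizing(composed, decomposed)); // true
console.log("\uFB01".normalize("NFKC"));                  // fi
console.log(toFullWidth("A1!") === "\uFF21\uFF11\uFF01"); // true
```

The compatibility forms (NFKC, NFKD) also fold full-width letters back to ASCII, so `toFullWidth("ABC").normalize("NFKC")` returns `"ABC"`.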
The built-in ASCII Converter provides complete ASCII code conversion. ASCII Decimal, Hex, Binary, and Octal modes display each character's ASCII code in different number bases — invaluable for programmers, students learning about encoding, and debugging network protocols. ASCII Only filters out all non-ASCII characters, leaving only the 128 standard ASCII characters. Decode ASCII intelligently detects and converts ASCII codes (decimal, hex, binary, and octal formats) back into readable text, supporting mixed-format input for maximum flexibility.
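The ASCII modes amount to printing each character code in a chosen base. A minimal sketch follows; `toAsciiCodes` and `asciiOnly` are hypothetical names, and the zero-padding widths (8 binary digits, 3 octal digits, 2 hex digits) are assumptions about the display format.

```javascript
// Hypothetical helper: list each character's ASCII code in base 2, 8, 10, or 16.
function toAsciiCodes(text, base) {
  const width = { 2: 8, 8: 3, 10: 0, 16: 2 }[base]; // assumed padding per base
  if (width === undefined) throw new RangeError("unsupported base: " + base);
  return Array.from(text, (ch) => {
    const code = ch.codePointAt(0);
    if (code > 0x7f) throw new RangeError("not an ASCII character: " + ch);
    return code.toString(base).toUpperCase().padStart(width, "0");
  }).join(" ");
}

// Hypothetical helper: drop everything outside the 128 ASCII characters.
function asciiOnly(text) {
  return text.replace(/[^\x00-\x7F]/g, "");
}

console.log(toAsciiCodes("Hi", 10)); // 72 105
console.log(toAsciiCodes("Hi", 16)); // 48 69
console.log(toAsciiCodes("Hi", 2));  // 01001000 01101001
console.log(toAsciiCodes("Hi", 8));  // 110 151
console.log(asciiOnly("caf\u00E9 \u2615!")); // caf !
```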
This tool is ideal for web developers writing JavaScript, JSON, and HTML with special characters, software engineers debugging encoding issues in internationalized applications, linguists and translators working with transliteration across writing systems, content management teams ensuring proper character rendering across global websites, SEO professionals optimizing for multi-language content, students learning about character encoding standards, and anyone who needs to convert between Unicode representations quickly and accurately. For generating fancy text styles, try the Fancy Text Generator.
In \uXXXX Escape mode, each character is converted to its \uXXXX escape sequence. For example, "Hello" becomes \u0048\u0065\u006C\u006C\u006F. Characters outside the BMP (like emoji) appear as surrogate pairs like \uD83D\uDE00.
To decode, select Decode Escapes mode, paste the escaped text (e.g., \u0048\u0065\u006C\u006C\u006F) into the input, and click Convert. The tool automatically recognizes and decodes \uXXXX sequences, &#XXXX; HTML entities, &#xXXXX; hex entities, and U+XXXX notation. It handles mixed content (regular text mixed with escapes) seamlessly.
Supported formats include \uXXXX escapes, decimal and hex HTML entities, UTF-8 Bytes (the encoded byte sequence, e.g., F0 9F 98 80), U+XXXX (standard Unicode code point notation), Decimal (plain decimal code point numbers), Decode Escapes (converts any format back to text), NFC/NFD/NFKC/NFKD (Unicode normalization forms), To Cyrillic (Latin text transliteration to Cyrillic), To Greek (Latin text transliteration to Greek), Full-width (converts ASCII to full-width Unicode characters), and Script Detect (analyzes text to identify all Unicode scripts present).
A code point is the unique number assigned to a character, written as U+XXXX. For example, the Latin letter 'A' has code point U+0041 (decimal 65), the Euro sign '€' is U+20AC (decimal 8364), and the grinning face emoji '😀' is U+1F600 (decimal 128512). Code points range from U+0000 to U+10FFFF, providing space for over 1.1 million characters.
Most emoji lie outside the BMP, for example in the range U+1F300 to U+1FAF6. In JavaScript/JSON, these require surrogate pairs: two \uXXXX sequences that combine to form the full code point. For example, '😀' (U+1F600) becomes \uD83D\uDE00. This tool correctly handles surrogate pairs and shows both the escaped form and the decoded emoji. Modern JavaScript (but not JSON) also supports \u{1F600} brace notation for direct code point escaping.
Decimal HTML entities use &#XXXX; (e.g., &#233; for é) and hex entities use &#xXXXX; (e.g., &#xE9; for é). They are essential when you need to display characters that have special meaning in HTML (like < and >) or characters not available on your keyboard. Because numeric entities consist only of ASCII characters, they also render correctly regardless of the page's character encoding.
In JavaScript source code, escape sequences inside string literals (e.g., "\u0048\u0065") are automatically interpreted by the JavaScript engine.
To convert a \uXXXX string literal into the actual characters programmatically, use JSON.parse('"\\u0048\\u0065"') or the native String.fromCodePoint() method. For a visual approach, just paste your escape sequences into this tool in Decode Escapes mode and click Convert. You can also use the Swap button to move the result back to the input for further conversion.
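For the programmatic route, the sketch below decodes the notations mentioned above in a single pass. `decodeEscapes` and its regular expression are illustrative assumptions, not the tool's own code; a single combined pattern is used so that decoded output is never re-scanned as a new escape.

```javascript
// One pattern covering \u{...}, \uXXXX, &#xHEX;, &#DEC;, and U+XXXX.
const ESCAPE_PATTERN =
  /\\u\{([0-9a-fA-F]{1,6})\}|\\u([0-9a-fA-F]{4})|&#[xX]([0-9a-fA-F]+);|&#([0-9]+);|U\+([0-9a-fA-F]{4,6})/g;

// Hypothetical helper: replace every recognized escape with its character.
function decodeEscapes(input) {
  return input.replace(
    ESCAPE_PATTERN,
    (match, braced, unit, hexEntity, decEntity, uPlus) => {
      // \uXXXX is a UTF-16 code unit; adjacent surrogate halves recombine
      // into one character once both have been decoded.
      if (unit !== undefined) return String.fromCharCode(parseInt(unit, 16));
      const codePoint =
        decEntity !== undefined
          ? parseInt(decEntity, 10)
          : parseInt(braced ?? hexEntity ?? uPlus, 16);
      // Leave out-of-range values untouched instead of throwing.
      return codePoint <= 0x10ffff ? String.fromCodePoint(codePoint) : match;
    }
  );
}

console.log(decodeEscapes("\\u0048\\u0065\\u006C\\u006C\\u006F")); // Hello
console.log(decodeEscapes("caf&#233; costs &#x20AC;5 (U+0041)"));  // café costs €5 (A)
console.log(JSON.parse('"\\u0048\\u0065"'));                       // He
console.log(String.fromCodePoint(0x1f600));                        // 😀
```

`JSON.parse` only understands \uXXXX inside a quoted JSON string, so the helper is useful mainly when the input mixes several notations with plain text.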