Text to Unicode

List Unicode code points for all characters in a text string.

Word Count Limit: 50

Upload File

Share on Social Media:

Our online Text to Unicode converter is a user-friendly tool that allows you to convert plain text into its corresponding Unicode code points. It provides flexibility to input text directly into the designated text area or upload data from a text file using the "Upload File" button.

 

How to Use

 

Input Text:

  • Enter or paste the text you want to convert into the provided text area.
  • Alternatively, use the "Upload File" button to load text from a file.

Convert to Unicode:

  • Click the "Convert to Unicode" button to initiate the conversion process.

View Results:

  • The resulting sequence of Unicode code points separated by spaces will be displayed below the text area.

Copy to Clipboard:

  • Click the "Copy to Clipboard" button to easily copy the Unicode code points to your clipboard.

Save as a Text File:

  • Use the "Save as TXT" button to download the Unicode code points as a text file to your device

 

Unicode Encoding

Unicode represents each character with a unique numerical value known as a code point. Code points are typically written in hexadecimal and prefixed with "U+" (e.g., U+0041 for the letter 'A').

Unicode defines code points in the range U+0000 to U+10FFFF, providing space for a vast number of characters from various scripts and symbol sets:

  • ASCII / Latin Alphabet: U+0020 to U+007F
  • Emoticons: U+1F600 to U+1F64F
  • Common Punctuation: U+2000 to U+206F
  • Mathematical Symbols: U+2200 to U+22FF
  • Cyrillic Alphabet: U+0400 to U+04FF
  • Hiragana and Katakana: U+3040 to U+30FF
  • Chinese, Japanese, Korean (CJK) Ideographs: U+4E00 to U+9FFF

For more information see the Unicode Standard

 

Using Unicode to Create HTML Entities

HTML entities are special codes used to represent reserved characters and symbols in HTML documents. These entities begin with an ampersand (&) and end with a semicolon (;). HTML entities are especially useful when you want to display characters that have special meanings in HTML (e.g., <>&) or characters that may not be directly typed on a keyboard, such as currency symbols, emoticons, mathematical symbols, etc.

To create HTML Entity:

  • Identify the Unicode code point for the character you want to represent.
  • Use the format &#[code_point]; to create an HTML entity, replacing [code_point] with the actual Unicode code point. Code points can be expressed in either decimal or hexadecimal format. In the latter case, the 'x' is used as a prefix to indicate hexadecimal notation.

Examples:

For the letter 'A' :

  • Unicode: U+0041
  • HTML Entity (decimal): &#65; 
  • HTML Entity (hexadecimal): &#x41;

For the euro sign '€':

  • Unicode: U+20AC
  • HTML Entity (decimal): &#8364; 
  • HTML Entity (hexadecimal): &#x20AC;

 

Some HTML entities can be also represented using the character's name. For example:  &euro;, &nbsp; (non-breaking space), etc. 

For more details see 8.5 Named character references — HTML5