Java Tutorial: Unicode in Java

By Xah Lee. Date: . Last updated: .

One thing great about Java is that it is Unicode based. That means, you can use characters from writing systems that are not english alphabets (For example, Chinese or math symbols), not just in data strings, but in function and variable names too.

Here's a example code using Unicode characters in class names and variable names.

class  {
    String  = "north";
    double π = 3.14159;

class UnicodeTest {
    public static void main(String[] arg) {
        方 x1 = new ();
        System.out.println( x1.北 );
        System.out.println( x1.π );

Any character in source code can also be represented by its Unicode number. By starting with \u followed by its 4 digits hexadecimal code.

class TestUniEsp \u007b
    static \u0069nt \u611b = 3;
    public static void main(String[] arg) {
    System.out.println( \u611b );

In the above example, \u007b is the left curly braces “{”, \u0069 is lowercase “i”, \u611b is the Chinese char “愛” (meaning love).


For a good discussion of exactly what char is allowed in identifier, see: languages with full Unicode support By David Hopwood. @