Python: Unicode Tutorial 🐍

By Xah Lee. Date: 2005-01-30. Last updated: 2022-11-05.

This page describe working with unicode in python 3.

[see Unicode: Character Set, Encoding, UTF-8, Codepoint]

Python 2 Unicode Tutorial

For Python 2, see Python 2: Unicode Tutorial

Python Source Code Encoding

Python Source Code Encoding

Set Input/Output to UTF-8

Python: Set STDOUT Encoding to UTF-8

Unicode in String

Python 3's string is a sequence of unicode characters. You do not need the u in u"…", but you can add it for familiarity with python 2. The u has no meaning.

Unicode Characters in Variable and Function Names

Python 3 allows Unicode characters in variable and function names, but they must be letter characters. Non-letter characters are not allowed. [see What Characters Are Unicode Letter]

def φ(n):
    return n + 1

α = 4
print(φ(α))
# 5

♥ = 4

print(♥)

#     ♥ = 4
#     ^
# SyntaxError: invalid character in identifier
error: cannot format -: '♥'

Python Unicode

Unicode Tutorial 🐍
Python 2: Unicode Tutorial
Python Source Code Encoding
Set STDOUT Encoding to UTF-8
Unicode Escape
Get Unicode Name, Codepoint
Convert File Encoding