Invisible Character from Twitter

By Xah Lee. Date: . Last updated: .

When copying text from Twitter or Google Plus, you'll get a invisible Unicode character the “U+FEFF: ZERO WIDTH NO-BREAK SPACE” (aka BYTE ORDER MARK). And sometimes “U+A0: NO-BREAK SPACE” too.

When you copy and paste to your blog, these invsibile character may not be what you want.

for the tech savvy emacs users, you can use the following emacs lisp code to solve the problem:

ELisp: Replace Invisible Unicode Chars

You can write a {Perl, Python, Ruby, Bash} script to solve the problem. See:

see also Unicode BOM Hack