diff mercurial/lsprofcalltree.py @ 26879:a24b98f4e03c

encoding: re-escape U+DCxx characters in toutf8b input (issue4927) This is the final missing piece in fully round-tripping random byte strings through UTF-8b. While this issue means that UTF-8 <-> UTF-8b isn't fully bijective, we don't expect to ever see U+DCxx codepoints in "real" UTF-8 data, so it should remain bijective in practice.
author Matt Mackall <mpm@selenic.com>
date Thu, 05 Nov 2015 17:30:10 -0600
parents beae42f3d93b
children 071af8d385a9
line wrap: on
line diff