Mercurial > hg
view doc/hg.1.txt @ 9424:799373ff2554
highlight: fixes garbled text in non-UTF-8 environment
This patch treats all files inside repository as encoded by
locale's encoding when pygmentize.
We can assume that most files are written in locale's encoding,
but current implementation treats them as UTF-8.
So there's no way to specify the encoding of files.
Current implementation, db7557359636 (issue1341):
1. Convert original `text`, which is treated as UTF-8, to locale's encoding.
`encoding.tolocal()` is the method to convert from internal UTF-8 to local.
If original `text` is not UTF-8, e.g. Japanese EUC-JP, some characters
become garbled here.
2. pygmentize, with no UnicodeDecodeError.
This patch:
1. Convert original `text`, which is treated as locale's encoding, to unicode.
Pygments prefers unicode object than raw str. [1]_
If original `text` is not encoded by locale's encoding, some characters
become garbled here.
2. pygmentize, also with no UnicodeDecodeError :)
3. Convert unicode back to raw str, which is encoded by locale's.
.. [1] http://pygments.org/docs/unicode/
author | Yuya Nishihara <yuya@tcha.org> |
---|---|
date | Sat, 29 Aug 2009 15:24:15 +0900 |
parents | 1de6e7e1bb9f |
children | d6eecafaf12f |
line wrap: on
line source
HG(1) ===== Matt Mackall <mpm@selenic.com> :man source: Mercurial :man manual: Mercurial Manual NAME ---- hg - Mercurial source code management system SYNOPSIS -------- *hg* 'command' ['option']... ['argument']... DESCRIPTION ----------- The *hg* command provides a command line interface to the Mercurial system. COMMAND ELEMENTS ---------------- files ...:: indicates one or more filename or relative path filenames; see "FILE NAME PATTERNS" for information on pattern matching path:: indicates a path on the local machine revision:: indicates a changeset which can be specified as a changeset revision number, a tag, or a unique substring of the changeset hash value repository path:: either the pathname of a local repository or the URI of a remote repository. include::hg.1.gendoc.txt[] FILES ----- `.hgignore`:: This file contains regular expressions (one per line) that describe file names that should be ignored by *hg*. For details, see *hgignore(5)*. `.hgtags`:: This file contains changeset hash values and text tag names (one of each separated by spaces) that correspond to tagged versions of the repository contents. `/etc/mercurial/hgrc`, `$HOME/.hgrc`, `.hg/hgrc`:: This file contains defaults and configuration. Values in `.hg/hgrc` override those in `$HOME/.hgrc`, and these override settings made in the global `/etc/mercurial/hgrc` configuration. See *hgrc(5)* for details of the contents and format of these files. Some commands (e.g. revert) produce backup files ending in `.orig`, if the `.orig` file already exists and is not tracked by Mercurial, it will be overwritten. BUGS ---- Probably lots, please post them to the mailing list (See Resources below) when you find them. SEE ALSO -------- *hgignore(5)*, *hgrc(5)* AUTHOR ------ Written by Matt Mackall <mpm@selenic.com> RESOURCES --------- http://mercurial.selenic.com/[Main Web Site] http://selenic.com/hg[Source code repository] http://selenic.com/mailman/listinfo/mercurial[Mailing list] COPYING ------- Copyright \(C) 2005-2009 Matt Mackall. Free use of this software is granted under the terms of the GNU General Public License (GPL).