Differences between revisions 2 and 3
Revision 2 as of 2005-01-06 02:56:40
Size: 3122
Editor: ZoomQuiet
Comment:
Revision 3 as of 2009-12-25 07:10:12
Size: 3160
Editor: localhost
Comment: converted to 1.6 markup
Line 7: Line 7:
-- Zoom.Quiet [[[DateTime(2004-08-12T21:21:57Z)]]]
[[TableOfContents]]
-- Zoom.Quiet [<<DateTime(2004-08-12T21:21:57Z)>>]
<<TableOfContents>>
Line 12: Line 12:
 * [http://nocache.czug.org/blog/panjunyong/python-encoding Python encoding handling techniques] -- compiled by CZUG
 * [[http://nocache.czug.org/blog/panjunyong/python-encoding|Python encoding handling techniques]] -- compiled by CZUG
Line 14: Line 14:
 * [http://www.linuxforum.net/forum/showflat.php?Cat=&Board=python&Number=495520&page=&view=&sb=&o=&vc=1 Using Chinese in Python]
 * [http://www.linuxforum.net/forum/showthreaded.php?Cat=&Board=python&Number=495807&Search=true&Forum=python&Words=gb&Match=Entire%20Phrase&Searchpage=0&Limit=25&Old=allposts&Main=495520 Some considerations on when encoding conversion is needed]
 * [[http://www.linuxforum.net/forum/showflat.php?Cat=&Board=python&Number=495520&page=&view=&sb=&o=&vc=1|Using Chinese in Python]]
 * [[http://www.linuxforum.net/forum/showthreaded.php?Cat=&Board=python&Number=495807&Search=true&Forum=python&Words=gb&Match=Entire%20Phrase&Searchpage=0&Limit=25&Old=allposts&Main=495520|Some considerations on when encoding conversion is needed]]
Line 17: Line 17:
 * [http://www.linuxforum.net/forum/showflat.php?Cat=&Board=python&Number=352059&page=&view=&sb=&o=&vc=1 Choosing a page encoding]
 * [http://www.linuxforum.net/forum/showflat.php?Cat=&Board=python&Number=444261&page=&view=&sb=&o=&vc=1 Chinese text handling in Python CGI]
 * [[http://www.linuxforum.net/forum/showflat.php?Cat=&Board=python&Number=352059&page=&view=&sb=&o=&vc=1|Choosing a page encoding]]
 * [[http://www.linuxforum.net/forum/showflat.php?Cat=&Board=python&Number=444261&page=&view=&sb=&o=&vc=1|Chinese text handling in Python CGI]]
Line 21: Line 21:
 * [http://cjkpython.i18n.org/ cjkpython] -- standard Asian encoding support for Python
 * [[http://cjkpython.i18n.org/|cjkpython]] -- standard Asian encoding support for Python
Line 23: Line 23:
   * [http://www.donews.net/limodou/archive/2004/03/30/9429.aspx How to make Python support GB2312 decoding]
   * [http://pyrecord.freezope.org/download/gb2312.zip/down GB2312 decoding files]
   * [[http://www.donews.net/limodou/archive/2004/03/30/9429.aspx|How to make Python support GB2312 decoding]]
   * [[http://pyrecord.freezope.org/download/gb2312.zip/down|GB2312 decoding files]]
Line 26: Line 26:
 * [http://www.linuxforum.net/forum/gshowflat.php?Cat=&Board=python&Number=284910&page=3&view=collapsed&sb=5&o=all&fpart= Generating the internal code table for the GB2312 character set]
 * [http://www.linuxforum.net/forum/gshowflat.php?Cat=&Board=python&Number=387330&page=2&view=collapsed&sb=5&o=all&fpart= Implementing GB2312 encoding for Plone]
 * [[http://www.linuxforum.net/forum/gshowflat.php?Cat=&Board=python&Number=284910&page=3&view=collapsed&sb=5&o=all&fpart=|Generating the internal code table for the GB2312 character set]]
 * [[http://www.linuxforum.net/forum/gshowflat.php?Cat=&Board=python&Number=387330&page=2&view=collapsed&sb=5&o=all&fpart=|Implementing GB2312 encoding for Plone]]
Line 30: Line 30:
  * (code) [http://pyrecord.freezope.org/download/pygb2big.zip/down Converting between GB and Big5]
  * (code) [[http://pyrecord.freezope.org/download/pygb2big.zip/down|Converting between GB and Big5]]
Line 35: Line 35:
   * python [http://www.linuxforum.net/forum/showthreaded.php?Cat=&Board=python&Number=427317&page=&view=&sb=&o= Chinese solutions] gb2312 <==> utf8
   * python [[http://www.linuxforum.net/forum/showthreaded.php?Cat=&Board=python&Number=427317&page=&view=&sb=&o=|Chinese solutions]] gb2312 <==> utf8
Line 43: Line 43:
 * [http://www.cl.cam.ac.uk/~mgk25/unicode.html What is Unicode? What is UTF-8? UTF-16?] -- FAQ for xUnix
 * [http://www.donews.net/limodou/archive/2004/04/13/12002.aspx Detection function for UTF-8 encoded files]
 * Detection function upgraded! [http://www.donews.net/limodou/archive/2004/09/23/107404.aspx NewEdit-The change of UTF-8 detecting]
 * attachment:encdet.py The handling script mentioned on the Google list
 * [[http://www.cl.cam.ac.uk/~mgk25/unicode.html|What is Unicode? What is UTF-8? UTF-16?]] -- FAQ for xUnix
 * [[http://www.donews.net/limodou/archive/2004/04/13/12002.aspx|Detection function for UTF-8 encoded files]]
 * Detection function upgraded! [[http://www.donews.net/limodou/archive/2004/09/23/107404.aspx|NewEdit-The change of UTF-8 detecting]]
 * [[attachment:encdet.py]] The handling script mentioned on the Google list
Line 48: Line 48:
 * [http://www.linuxforum.net/forum/gshowflat.php?Cat=&Board=python&Number=427317&page=0&view=collapsed&sb=5&o=all&fpart= python Chinese solutions gb2312 <==> utf8 - China Linux Forum]
 * [[http://www.linuxforum.net/forum/gshowflat.php?Cat=&Board=python&Number=427317&page=0&view=collapsed&sb=5&o=all&fpart=|python Chinese solutions gb2312 <==> utf8 - China Linux Forum]]
Line 51: Line 51:
 * [http://www.chinesepython.org/cgi_bin/cgb.cgi/home.html ChinesePython (中蟒)]
 * [[http://www.chinesepython.org/cgi_bin/cgb.cgi/home.html|ChinesePython (中蟒)]]

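Let's gather the Python tips for handling Chinese text that we can find!

  • Let's all pitch in and summarize, aiming to build a common library for Chinese-language Python development.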

-- Zoom.Quiet [2004-08-12 21:21:57]

Principles

Standardize on Unicode!

Chinese text handling on the Web

Chinese language support

GB and Big5

  • How do you convert GB to Big5 in Python?
  • How do you convert Big5 to GB in Python?
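
One possible answer to the two questions above, as a minimal sketch: decode the bytes to Unicode text and re-encode them in the target encoding. It assumes Python 3 and its built-in `gb2312` and `big5` codecs; the function names `gb_to_big5` and `big5_to_gb` are made up for illustration and do not come from the linked pages.

```python
def gb_to_big5(data: bytes) -> bytes:
    """Re-encode GB2312 bytes as Big5 by passing through Unicode text."""
    text = data.decode("gb2312")   # bytes -> str (Unicode)
    return text.encode("big5")     # str -> bytes in the target encoding


def big5_to_gb(data: bytes) -> bytes:
    """Re-encode Big5 bytes as GB2312 by passing through Unicode text."""
    text = data.decode("big5")
    return text.encode("gb2312")


# "中文" is present in both character sets, so it survives the round trip.
gb_bytes = "中文".encode("gb2312")
big5_bytes = gb_to_big5(gb_bytes)
restored = big5_to_gb(big5_bytes)
```

This only changes the byte encoding. It does not map simplified characters to traditional ones or the reverse, and any character missing from the target character set raises `UnicodeEncodeError` unless an `errors` handler is passed to `encode`.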

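GB and Unicode

  • How do you convert GB to Unicode in Python?
  • How do you convert Unicode to GB in Python?
  • Personal advice --GreyRoar

    • 1. Unless it is really necessary, avoid converting at all; just use Unicode directly;
    • 2. Try not to convert back and forth;
    • 3. Do not assume Unicode is a superset.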
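
The questions and advice above can be sketched roughly as follows, assuming Python 3 and its built-in `gb2312` codec. The helper names `gb_to_unicode` and `unicode_to_gb` are hypothetical. The idea is to decode once at the input boundary, work on `str` everywhere, and encode once on output, with an explicit decision about what happens when a character has no GB2312 form.

```python
GB_SAMPLE = b"\xd6\xd0\xce\xc4"  # the bytes of "中文" in GB2312


def gb_to_unicode(data: bytes, encoding: str = "gb2312") -> str:
    """Decode GB-encoded bytes into Unicode text (a Python 3 str)."""
    return data.decode(encoding)


def unicode_to_gb(text: str, encoding: str = "gb2312",
                  errors: str = "strict") -> bytes:
    """Encode Unicode text as GB bytes; `errors` controls unmappable characters."""
    return text.encode(encoding, errors)


text = gb_to_unicode(GB_SAMPLE)      # decode once, at the input boundary
gb_again = unicode_to_gb(text)       # encode once, at the output boundary

# The conversion is not always reversible: this emoji has no GB2312 form,
# so strict encoding raises, while errors="replace" substitutes b"?".
emoji = "\U0001F600"
try:
    unicode_to_gb(emoji)
    strict_failed = False
except UnicodeEncodeError:
    strict_failed = True
lossy = unicode_to_gb(emoji, errors="replace")
```

Keeping the text as `str` between the two boundaries follows points 1 and 2 of the advice; the `errors` argument is where the cost of a lossy conversion becomes visible.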

UTF-8
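
A minimal sketch of a UTF-8 detection check, assuming Python 3: it tries a strict decode and reports whether it succeeded. This is not the detection function from the linked posts; the name `looks_like_utf8` is made up for illustration.

```python
def looks_like_utf8(data: bytes) -> bool:
    """Return True if `data` is a valid UTF-8 byte sequence."""
    try:
        data.decode("utf-8")       # strict decoding rejects malformed sequences
    except UnicodeDecodeError:
        return False
    return True


utf8_ok = looks_like_utf8("中文".encode("utf-8"))
gb_rejected = not looks_like_utf8("中文".encode("gb2312"))
```

Pure ASCII input also passes, since ASCII is a subset of UTF-8. A successful decode shows that the bytes are valid UTF-8, not that the author intended UTF-8, so short inputs in other encodings can occasionally pass by coincidence.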

Chinese text handling code snippets

Core support!

PythonInChinese (last edited 2009-12-25 07:10:12 by localhost)