Differences between revisions 2 and 8 (spanning 6 versions)
Revision 2 as of 2005-03-31 07:29:24
Size: 233
Editor: ZoomQuiet
Comment:
Revision 8 as of 2009-12-25 07:16:19
Size: 1599
Editor: localhost
Comment: converted to 1.6 markup
Deletions are marked like this. Additions are marked like this.
Line 2: Line 2:
'''   Unicode 的PythonIc 处理![http://www.jorendorff.com/articles/unicode/index.html 原文出处] '''Unicode 的PythonIc 处理![[http://www.jorendorff.com/articles/unicode/index.html|原文出处]]
Line 6: Line 5:
::-- ZoomQuiet [[[DateTime(2005-03-31T07:28:35Z)]]]
[[TableOfContents]]
::-- ZoomQuiet [<<DateTime(2005-03-31T07:28:35Z)>>]
<<TableOfContents>>
Line 10: Line 9:
'''草稿 by Jason Orendorff, 1 March 2002'''
程序员现在必须懂得 uncode 因为:
 * It is one of the cornerstones of software internationalization.
  * 这是软件国际化的王道
 * The Web is full of Unicode data.
  * Web 已经充满了 Unicode 的数据
 * XML and HTML are based on Unicode.
  * XML 和 HTML 技术是基于 Unicode 的
 * WinNT-based operating systems use Unicode for all string values internally.
  * WinNT 基础的技术都是 Unicode 来处理所有字串的
{{{
This article is a short course on Unicode programming.
本文指出了处理Unicode 的快速途径 Pythonic 式的!
}}}

== Unicode 基本介绍 ==
    * An Introduction to Unicode
<<Include(/Introduction)>>
== 编码 ==
<<Include(/Encodings)>>
    * Encodings
== Unicode 在 web ==
<<Include(/onWeb)>>
    * Unicode on the Web
== Unicode 在 HTML和XML ==
<<Include(UnicodeinHTMLandXML)>>
    * Unicode in HTML and XML
          o Specifying the Encoding
          o Numeric Character References
== Unicode 在JAVA ==
    * Unicode in Java
          o Unicode Text in Java Source Code
          o Unicode I/O in Java
          o Encoding and Decoding Java Strings
== Unicode 在 Python ==
<<Include(UnicodeInPython)>>
== Unicode 在 晕倒死 ==
    * Unicode and Windows Programming
== 接下来? ==
    * Where to Go Next

Unicode 的PythonIc 处理!原文出处

::-- ZoomQuiet [2005-03-31 07:28:35]

Unicode for Programmers

草稿 by Jason Orendorff, 1 March 2002 程序员现在必须懂得 uncode 因为:

  • It is one of the cornerstones of software internationalization.
    • 这是软件国际化的王道
  • The Web is full of Unicode data.
    • Web 已经充满了 Unicode 的数据
  • XML and HTML are based on Unicode.
    • XML 和 HTML 技术是基于 Unicode 的
  • WinNT-based operating systems use Unicode for all string values internally.
    • WinNT 基础的技术都是 Unicode 来处理所有字串的

This article is a short course on Unicode programming.
本文指出了处理Unicode 的快速途径 Pythonic 式的!

Unicode 基本介绍

  • An Introduction to Unicode

编码

  • Encodings

Unicode 在 web

  • Unicode on the Web

Unicode 在 HTML和XML

  • Unicode in HTML and XML
    • o Specifying the Encoding o Numeric Character References

Unicode 在JAVA

  • Unicode in Java
    • o Unicode Text in Java Source Code o Unicode I/O in Java o Encoding and Decoding Java Strings

Unicode 在 Python

::-- ZoomQuiet [2005-03-31 07:39:55]

Unicode in Python

Unicode 字串

Unicode Strings in Python

标准库的Unicode 支持

Unicode Support in the Python Standard Library

Unicode编码的文件

Unicode files and Python

输出Unicode字串

print and Unicode strings

Unicode和正则表达式

Python-cn 列表中讨论得来 {{{发件人: cpunion <[email protected]> 回复: [email protected] 收件人: [email protected] 日期: 2005-5-27 上午10:09 主题: Re: [python-chinese] 关于python正则表达式 的一个问题 }}}

  • 多字节文字一定要用unicode处理,先遵守这一点,再去做其它的。

   1 a = unicode ("""随着信息技术的发展,计算机应用渗透到社会生活的各个领域,特
   2 别是在电子商务中的应用,使人们对信息的依赖程度越来越大,从而使信息安全技
   3 术显得格外重要。信息安全技术主要是研究计算机系统信息的机密性、完整性、可
   4 获取性和真实性,它的核心是加密技术。加密技术根据加密密钥与解密密钥是否相
   5 同可分为对称加密技术(单密钥加密技术)和非对称加密技术(公开密钥加密技
   6 术)。加个叹号!加个问号?试试句号加引号。“试试叹号加引号!”。“试试问号
   7 加引号?” 加点废话"""
   8     , "utf-8")
   9 expression = unicode ("。|!|?|。”|!”|?", "utf-8")
  10 import re
  11 listSentence = re.split (expression, a)
  12 for i in listSentence:
  13    print i

Unicode 在 晕倒死

  • Unicode and Windows Programming

接下来?

  • Where to Go Next

PythonInUnicode (last edited 2009-12-25 07:16:19 by localhost)