Differences between revisions 7 and 8
Revision 7 as of 2007-04-13 01:44:15
Size: 1597
Editor: ZoomQuiet
Comment:
Revision 8 as of 2009-12-25 07:16:19
Size: 1599
Editor: localhost
Comment: converted to 1.6 markup
Deletions are marked like this. Additions are marked like this.
Line 2: Line 2:
'''Unicode 的PythonIc 处理![http://www.jorendorff.com/articles/unicode/index.html 原文出处] '''Unicode 的PythonIc 处理![[http://www.jorendorff.com/articles/unicode/index.html|原文出处]]
Line 5: Line 5:
::-- ZoomQuiet [[[DateTime(2005-03-31T07:28:35Z)]]]
[[TableOfContents]]
::-- ZoomQuiet [<<DateTime(2005-03-31T07:28:35Z)>>]
<<TableOfContents>>
Line 26: Line 26:
[[Include(/Introduction)]] <<Include(/Introduction)>>
Line 28: Line 28:
[[Include(/Encodings)]] <<Include(/Encodings)>>
Line 31: Line 31:
[[Include(/onWeb)]] <<Include(/onWeb)>>
Line 34: Line 34:
[[Include(UnicodeinHTMLandXML)]] <<Include(UnicodeinHTMLandXML)>>
Line 44: Line 44:
[[Include(UnicodeInPython)]] <<Include(UnicodeInPython)>>

Unicode 的PythonIc 处理!原文出处

::-- ZoomQuiet [2005-03-31 07:28:35]

Unicode for Programmers

草稿 by Jason Orendorff, 1 March 2002 程序员现在必须懂得 uncode 因为:

  • It is one of the cornerstones of software internationalization.
    • 这是软件国际化的王道
  • The Web is full of Unicode data.
    • Web 已经充满了 Unicode 的数据
  • XML and HTML are based on Unicode.
    • XML 和 HTML 技术是基于 Unicode 的
  • WinNT-based operating systems use Unicode for all string values internally.
    • WinNT 基础的技术都是 Unicode 来处理所有字串的

This article is a short course on Unicode programming.
本文指出了处理Unicode 的快速途径 Pythonic 式的!

Unicode 基本介绍

  • An Introduction to Unicode

编码

  • Encodings

Unicode 在 web

  • Unicode on the Web

Unicode 在 HTML和XML

  • Unicode in HTML and XML
    • o Specifying the Encoding o Numeric Character References

Unicode 在JAVA

  • Unicode in Java
    • o Unicode Text in Java Source Code o Unicode I/O in Java o Encoding and Decoding Java Strings

Unicode 在 Python

::-- ZoomQuiet [2005-03-31 07:39:55]

Unicode in Python

Unicode 字串

Unicode Strings in Python

标准库的Unicode 支持

Unicode Support in the Python Standard Library

Unicode编码的文件

Unicode files and Python

输出Unicode字串

print and Unicode strings

Unicode和正则表达式

Python-cn 列表中讨论得来 {{{发件人: cpunion <[email protected]> 回复: [email protected] 收件人: [email protected] 日期: 2005-5-27 上午10:09 主题: Re: [python-chinese] 关于python正则表达式 的一个问题 }}}

  • 多字节文字一定要用unicode处理,先遵守这一点,再去做其它的。

   1 a = unicode ("""随着信息技术的发展,计算机应用渗透到社会生活的各个领域,特
   2 别是在电子商务中的应用,使人们对信息的依赖程度越来越大,从而使信息安全技
   3 术显得格外重要。信息安全技术主要是研究计算机系统信息的机密性、完整性、可
   4 获取性和真实性,它的核心是加密技术。加密技术根据加密密钥与解密密钥是否相
   5 同可分为对称加密技术(单密钥加密技术)和非对称加密技术(公开密钥加密技
   6 术)。加个叹号!加个问号?试试句号加引号。“试试叹号加引号!”。“试试问号
   7 加引号?” 加点废话"""
   8     , "utf-8")
   9 expression = unicode ("。|!|?|。”|!”|?", "utf-8")
  10 import re
  11 listSentence = re.split (expression, a)
  12 for i in listSentence:
  13    print i

Unicode 在 晕倒死

  • Unicode and Windows Programming

接下来?

  • Where to Go Next

PythonInUnicode (last edited 2009-12-25 07:16:19 by localhost)