Differences between revisions 8 and 41 (spanning 33 versions)
Revision 8 as of 2005-03-24 01:06:36
Size: 2598
Editor: ZoomQuiet
Comment:
Revision 41 as of 2009-12-25 07:18:09
Size: 5919
Editor: localhost
Comment: converted to 1.6 markup
Deletions are marked like this. Additions are marked like this.
Line 2: Line 2:
[http://www-900.ibm.com/developerWorks/cn/linux/theme/special/index.shtml#python 可爱的Python 系列文章] '''深入Python 文本处理
'''
<<TableOfContents>>
<<Include(ObpRelating)>>
||{{attachment:cover-small.jpg}}||<<Include(TPiPauthor)>>||
''等不及中文版了!自个儿来!''
-- Zoom.Quiet [<<DateTime(2004-08-09T23:30:02Z)>>]
 * 071013 迁移到工程环境:http://code.google.com/p/openbookproject
  * SVN:http://openbookproject.googlecode.com/svn/trunk/TPiPy/zh
  * 堪误: http://code.google.com/p/openbookproject/issues/list
   * `注意`使用 `[TPiP]`前缀
   * '''[[http://openbookproject.googlecode.com/svn/trunk/TPiPy/README.txt|README.txt 工程历史记要]]'''
Line 4: Line 15:
的作者: '''
David Mertz
'''
是非常有名的Python 专家!
##[[Include(PythonZen)]]
Line 9: Line 17:
http://www-900.ibm.com/developerWorks/cn/i/p-mertz.gif <<Include(TPiP/revision)>>
= 资源/信息 =
 * TPiP中文翻译主页面就是本页。
   * http://wiki.woodpecker.org.cn/moin/TPiP/
   * 所有链接、变化请在本页更新,如果多个主页难免顾此失彼。
 * 加入翻译,获取权限
   * 在[[http://wiki.woodpecker.org.cn/moin/UserPreferences|woodpecker]]注册, 此处人员调配,任务认领。
   * 加入[[http://code.google.com/p/openbookproject/|google code]],此处源码仓库。
   * 加入[[http://groups.google.com/group/openbookproject|google group]],此乃讨论去处。和google code需要分别加入。
   * 请发信通报老大[[ZoomQuiet]],告知你的`啄木鸟ID`,`google ID`,以获得一站式批准
   * 其他问题请找[[lwl]]
 * 所有讨论/问题,以及代码签入请在标题使用`[TPiP]`前缀,例如“''[TPiP] [chap4] 50% 完成''”。
 * 讨论请到[[http://code.google.com/p/openbookproject/|openbookproject]] OBP 的[[http://groups.google.com/group/openbookproject|OBP group]] 讨论组。
 * 翻译和校对请到[[TPiP/revision|revision]]认领。需要啄木鸟ID。
 * 翻译使用[[http://code.google.com/p/openbookproject/source|svn]]签入签出即可。
   * 需要google code权限,否则只能签出。
   * [[http://code.google.com/p/openbookproject/source/checkout|使用说明]]/[[http://code.google.com/apis/gadgets/docs/tools.html#Checkout|如何checkout]]
   * [[http://www.subversion.org.cn/svnbook/|svnbook 中文版]] / [[http://internetducttape.com/2007/03/03/howto_google_code_hosting_subversion_tortoisesvn/|TortoiseSVN解说, 图文并茂]]
   * 在每段英文下以`同样格式`翻译成中文,方便校对,以及将来转换格式。
 * 校对请到[[http://code.google.com/p/openbookproject/issues/list|这里]]提交issue。
   * 注意使用`[TPiP]`标签
   * 最好能抄送译者以及其他校对人员,经讨论后由译者更新,这样避免冲突。
 * 英汉术语对照表,查看[[http://code.google.com/p/openbookproject/wiki/TechnoDict|技术翻译词典]]
   * 更新先修改[[http://openbookproject.googlecode.com/svn/trunk/TPiPy/zh/terms.txt|terms.txt]],使用svn
   * 再更改"[[http://code.google.com/p/openbookproject/wiki/TechnoDict|技术翻译词典]]"页面。
   * 这样可尽最大可能保持版本一致。
Line 11: Line 44:
是那种相信 esse est denunte 的 Foucauldian Berkeley。可以通过 [email protected] 与 David 联系;
他的[http://gnosis.cx/publish/ 个人 Web 页面] 上介绍了他的生活。
=== Mertz 授权 ===
作者授权:`David Mertz已经回信同意OpenBookProject翻译发布这本书的中文版`。原文经允许抄录如下:
{{{
2008/11/29 Wenliang Lu <[email protected]>:
> ---------- Forwarded message ----------
> From: David Mertz <[email protected]>
> Date: Mon, Dec 10, 2007 at 5:14 PM
> Subject: Re: TPiP chinese translation
> To: Wenliang Lu <[email protected]>
> Cc: "Dr. David Mertz" <[email protected]>
>
>
>> Hi David,
>> We have read your book of TPiP, which is an amazing book!
>> Just wondering can you grand us the permission to translate TPiP into
>> Chinese.
>> We will distribute at http://code.google.com/p/openbookproject/ for
>> NON-commercial purpose,
>> as this project aims to translate good python books into Chinese to
>> promote Python in China.
>
> I would be very happy to have this translation made and published at the
> Open Book Project. I am not of any mind to impose any republication
> permissions on anything I write (my articles are released to the public
> domain explicitly, for example).
>
> Per copyright law, it's not entirely simple to authorize, since the
> publisher maintains rights on the book. However, I am sufficiently
> comfortable that the agreement I have with the publisher to allow gratis
> online publication via my own site can reasonably be taken to cover
> publication of a translation as well.
>
> Moreover, at a practical level, while the book has sold moderately well, it
> is hardly the sort of large seller that the publisher will worry about
> translating. So there should not be any concrete issue with the translation
> and publication you suggest. Please proceed... let me know the progress as
> things advance, just for my own gratification.
>
> Yours, David...
>
> -----------------------------------------------------------------------
> mertz@ | The specter of free information is haunting the `Net! All the
> gnosis | powers of IP- and crypto-tyranny have entered into an unholy
> .cx | alliance...ideas have nothing to lose but their chains. Unite
> | against "intellectual property" and anti-privacy regimes!
}}}
Line 14: Line 91:
他发表的新书:[Python Text Processing in Python(原文)] = TPiP =
== acknowledgments ==
/AcKnowledgments FOLKS WHO HAVE MADE THIS BOOK BETTER
== intro ==
[[/Intro]] INTRODUCTION
== chap1 ==
[[/Chap1]] PYTHON BASICS
Line 16: Line 99:
''等不及中文版了!自个儿来!''
-- Zoom.Quiet [[[DateTime(2004-08-09T23:30:02Z)]]]
== chap2 ==
[[/Chap2]] BASIC STRING OPERATIONS
Line 19: Line 102:
[[TableOfContents]]
== FRONTMATTER -- PREFACE ==
-------------------------------------------------------------------
'''蟒之禅'''
{{{
== chap3 ==
[[/Chap3]] REGULAR EXPRESSIONS
Line 25: Line 105:
    Beautiful is better than ugly.
    Explicit is better than implicit.
    Simple is better than complex.
    Complex is better than complicated.
    Flat is better than nested.
    Sparse is better than dense.
    Readability counts.
    Special cases aren't special enough to break the rules.
    Although practicality beats purity.
    Errors should never pass silently.
    Unless explicitly silenced.
    In the face of ambiguity, refuse the temptation to guess.
    There should be one--and preferably only one--obvious way to do it.
    Although that way may not be obvious at first unless you're Dutch.
    Now is better than never.
    Although never is often better than *right* now.
    If the implementation is hard to explain, it's a bad idea.
    If the implementation is easy to explain, it may be a good idea.
    Namespaces are one honking great idea--let's do more of those!
      --Tim Peters, "The Zen of Python"
}}}
=== SECTION 1 -- What is Text Processing? ===
-------------------------------------------------------------------
== chap4 ==
[[/Chap4]] PARSERS AND STATE-MACHINES

== chap5 ==
[[/Chap5]] INTERNET TOOLS AND TECHNIQUES

== appendix_a ==
[[/AppendixA]] A SELECTIVE AND IMPRESSIONISTIC SHORT REVIEW OF PYTHON

== appendix_b ==
[[/AppendixB]] A DATA COMPRESSION PRIMER

== appendix_c ==
[[/AppendixC]] UNDERSTANDING UNICODE

== appendix_d ==
[[/AppendixD]] A STATE-MACHINE FOR ADDING MARKUP TO TEXT

== glossary ==
[[/glossary]] GLOSSARY TERMS
Line 50: Line 127:
=== SECTION 2 -- The Philosophy of Text Processing ===

-------------------------------------------------------------------

=== SECTION 3 -- What You'll Need to Use This Book ===
-------------------------------------------------------------------

=== SECTION 4 -- Conventions Used in This Book ===
-------------------------------------------------------------------

=== SECTION 5 -- A Word on Source Code Examples ===
-------------------------------------------------------------------

=== SECTION 6 -- External Resources ===
-------------------------------------------------------------------

=== 反馈 ===
 * 如何得到TPiP原文?
  * 呀呀呀??不好意思,直接到 David Mertz 的主页去拿是也乎 -- Zoom.Quiet
== 反馈 ==
<<Include(/discuss)>>

深入Python 文本处理

2007-10-30可愛的Py 图书工程重启倡议; 2008-03-30可爱的Python编辑约定

啄木鸟

{o} DiP

(!) Py食谱

{*} OBP/列表

(!) 博文出版>可爱Py

{*} CDBE

090827ZoomQuiet 关闭主要内部的浏览权限,开始半年期的图书发售沉默期!重开日期:100227

本色巨蟒升级版倡议...

cover-small.jpg

p-mertz.gif

可爱的Python 系列文章
的作者: David Mertz是非常有名的Python 专家!
是那种相信 esse est denunte 的 Foucauldian Berkeley。可以通过 [email protected] 与 David 联系;
他的个人 Web 页面 上介绍了他的生活。
由他撰写出版的::Text Processing in Python(原文)

等不及中文版了!自个儿来! -- Zoom.Quiet [2004-08-09 23:30:02]

翻译流程

借鉴一下CorePy2的工作方式。

  1. 翻译,一定要和原文的格式一样。
  2. 译者自校
  3. 二校
  4. 三校(如果需要的话, 且人手足够)
  5. 删除英文
    • 提取只有中文的版本,保存名使用后缀 _zh_SmartAscii.txt

  6. 整理工作,用脚本将这些稿件输出成MoinMoin, rest,LaTeX等。

    • 原文自称"Smart ASCII"格式
    • 看完本书,理应轻松写脚本搞定。
    • 书中就提供了好几种解析方法
    • 转换成MoinMoin格式,保存名使用后缀 _zh_MoinMoin.txt

    • 转换成RST格式,保存名使用后缀 _zh_RST.txt

TPiP 翻译进度

名称

行数

翻译

进度

译者自校进度

二校

进度

三校

进度

备注

Intro

介绍

485

Chap1

Python基础

5722

DuYanJun

Chap2

基本字符串操作

4827

digitalwit

Chap3

正则表达式

2835

wenliang.lu

(./)

(./)

Chap4

解析器和状态机

4607

wenliang.lu

(./)

0%

Chap5

互联网工具和技术

3923

wenliang.lu

(./)

AppendixA

python精要

1959

HuangYi

(./)

AppendixB

数据压缩入门

681

wenliang.lu

(./)

AppendixC

理解Unicode

327

wenliang.lu

(./)

AppendixD

给文本增加标记的状态机

373

wenliang.lu

(./)

glossary

词汇表

197

AcKnowledgments

鸣谢

149

资源/信息

  • TPiP中文翻译主页面就是本页。
  • 加入翻译,获取权限
    • woodpecker注册, 此处人员调配,任务认领。

    • 加入google code,此处源码仓库。

    • 加入google group,此乃讨论去处。和google code需要分别加入。

    • 请发信通报老大ZoomQuiet,告知你的啄木鸟IDgoogle ID,以获得一站式批准

    • 其他问题请找lwl

  • 所有讨论/问题,以及代码签入请在标题使用[TPiP]前缀,例如“[TPiP] [chap4] 50% 完成”。

  • 讨论请到openbookproject OBP 的OBP group 讨论组。

  • 翻译和校对请到revision认领。需要啄木鸟ID。

  • 翻译使用svn签入签出即可。

  • 校对请到这里提交issue。

    • 注意使用[TPiP]标签

    • 最好能抄送译者以及其他校对人员,经讨论后由译者更新,这样避免冲突。
  • 英汉术语对照表,查看技术翻译词典

Mertz 授权

作者授权:David Mertz已经回信同意OpenBookProject翻译发布这本书的中文版。原文经允许抄录如下:

2008/11/29 Wenliang Lu <[email protected]>:
> ---------- Forwarded message ----------
> From: David Mertz <[email protected]>
> Date: Mon, Dec 10, 2007 at 5:14 PM
> Subject: Re: TPiP chinese translation
> To: Wenliang Lu <[email protected]>
> Cc: "Dr. David Mertz" <[email protected]>
>
>
>> Hi David,
>> We have read your book of TPiP, which is an amazing book!
>> Just wondering can you grand us the permission to translate TPiP into
>> Chinese.
>> We will distribute at http://code.google.com/p/openbookproject/ for
>> NON-commercial purpose,
>> as this project aims to translate good python books into Chinese to
>> promote Python in China.
>
> I would be very happy to have this translation made and published at the
> Open Book Project.  I am not of any mind to impose any republication
> permissions on anything I write (my articles are released to the public
> domain explicitly, for example).
>
> Per copyright law, it's not entirely simple to authorize, since the
> publisher maintains rights on the book. However, I am sufficiently
> comfortable that the agreement I have with the publisher to allow gratis
> online publication via my own site can reasonably be taken to cover
> publication of a translation as well.
>
> Moreover, at a practical level, while the book has sold moderately well, it
> is hardly the sort of large seller that the publisher will worry about
> translating.  So there should not be any concrete issue with the translation
> and publication you suggest.  Please proceed... let me know the progress as
> things advance, just for my own gratification.
>
> Yours, David...
>
> -----------------------------------------------------------------------
> mertz@ | The specter of free information is haunting the `Net!  All the
> gnosis | powers of IP- and crypto-tyranny have entered into an unholy
> .cx    | alliance...ideas have nothing to lose but their chains.  Unite
>       | against "intellectual property" and anti-privacy regimes!

TPiP

acknowledgments

/AcKnowledgments FOLKS WHO HAVE MADE THIS BOOK BETTER

intro

/Intro INTRODUCTION

chap1

/Chap1 PYTHON BASICS

chap2

/Chap2 BASIC STRING OPERATIONS

chap3

/Chap3 REGULAR EXPRESSIONS

chap4

/Chap4 PARSERS AND STATE-MACHINES

chap5

/Chap5 INTERNET TOOLS AND TECHNIQUES

appendix_a

/AppendixA A SELECTIVE AND IMPRESSIONISTIC SHORT REVIEW OF PYTHON

appendix_b

/AppendixB A DATA COMPRESSION PRIMER

appendix_c

/AppendixC UNDERSTANDING UNICODE

appendix_d

/AppendixD A STATE-MACHINE FOR ADDING MARKUP TO TEXT

glossary

/glossary GLOSSARY TERMS

反馈

  • 如何得到TPiP原文?
    • 呀呀呀??不好意思,直接到 David Mertz 的主页去拿是也乎 -- Zoom.Quiet
  • David Mertz的声明 (http://gnosis.cx/TPiP/):

   This stuff is copyrighted by AW (except the code samples which are released to the public domain). Feel free to use this material personally; but no permission is given for further distribution beyond your personal use.
  • 是不是不能拿来翻译发布?
  • 我们是自用哪??没有进行出版发布的,保留他的版权声明,作我们自个儿的事儿就好…… ZoomQuiet

  • well, 下载了, 第四章很有意思阿, 状态机~~~, 附录d也很不错. 计划阅读之 --- hoxide
  • 因为有书出来,所以感觉还是有点担心版权问题. -- andelf

TPiP (last edited 2009-12-25 07:18:09 by localhost)