ChatGPT是多语种奇迹
日期:2023-04-06 10:00

(单词翻译:单击)

CB1ozrN4B,bEisao_*=bnV#]@VVc&

Culture

ws%!4W4LL72=;

文艺版块

5rU|fP8Dr]v0M7

Johnson

zzS!kA%;DvIKx

约翰逊专栏

I#H[ED70LJBC-L6

Speaking in many tongues

*EBGlV|7TA

讲多国语言

h2)VJTizdWGno6Y

ChatGPT may make things up, but it does so fluently in more than 50 languages.

1p&3drDUNfQT(8b791

ChatGPT可能会编假话,但它能用50多种语言流利地编假话i[CPQL(CMMZ~NioA2

SU18*WNb2T[@UgPWu|

The hype that followed ChatGPT's public launch last year was, even by the standards of tech innovations, extreme.

5KuVjj@up]v7

ChatGPT自去年公开发布后所引发的炒作,即使以科技创新的标准来看也是极端的(2#jcqI39@biAt9+

67R+#l6%-=NObiqF

OpenAI's natural-language system creates recipes, writes computer code and parodies literary styles.

GJZ]l.a4tz=Z

OpenAI的这一自然语言系统能创造食谱,编写计算机代码,模仿各种文学风格ZE*i8A;*4gbEhZsjJMw)

E%MbYvw#frY#|[3

Its latest iteration can even describe photographs.

wh#|;Uh;%lqjg

其最新版本甚至可以描述照片z,_Kgt-pm+6

ZU[r&PXjy+

It has been hailed as a technological breakthrough on a par with the printing press.

dWG4Y#A|Mw-nN]8(

ChatGPT被誉为与印刷机相媲美的技术突破!n(hkWlA(ZSN9_cUo3

r5n8ZezL#c^i

But it has not taken long for huge flaws to emerge, too.

vw!_AGW[Cp

但没过多久,巨大的缺陷也显现出来X1Ynh8hHuaMc_[20aXv

wM)oUYG3##0I,

It sometimes "hallucinates" non-facts that it pronounces with perfect confidence, insisting on those falsehoods when queried.

a@W=1!rwJk&7|XIl

它有时会"幻想"出并非事实的东西,并自信满满地把这些东西讲出来,就算被质疑也坚持这些谎言Lakfjcs-BuZevDBN

=*)jHNZKwJT=Hj7q=3rX

It also fails basic logic tests.

XmNKsKn]^n*Tc

它也未能通过基本的逻辑测试Ta=#X9WN0^Y&hqyr

g~&!dwB0-U

In other words, ChatGPT is not a general artificial intelligence, an independent thinking machine.

kPhLsLpPe[,Z

换句话说,ChatGPT不是通用人工智能,不是一台能独立思考的机器NLX|FfqktWY%pqEA

)KBXjO#AvQh

It is, in the jargon, a large language model.

3SR|s&g,5@rP8|%

用行话来说,它是一个大型语言模型ndu1!D&Gq4&Y

=Jc^0NItis#N,2b2T#;3

That means it is very good at predicting what kinds of words tend to follow which others, after being trained on a huge body of text -- its developer, OpenAI, does not say exactly from where -- and spotting patterns.

e2U|sp5p03kSe6@@

这意味着,在用大量文本进行训练后,它非常擅长预测哪些单词之后往往接着哪些其他单词并找出其中的规律,其开发者OpenAI没有具体说明这些文本的来源xs|boBqPWUrBFR

b1|=O-|Z|PJ+Sf9T

Amid the hype, it is easy to forget a minor miracle.

62f0robL;qR~9l]0h*X

在炒作中,很容易忘记一个小小的奇迹.84nlxn3]0Ihcv=H#o

#D9z338ib!8

ChatGPT has aced a problem that long served as a far-off dream for engineers: generating human-like language.

@mp0LBVJ@sp]

ChatGPT成功解决了一个长期以来一直被工程师们视为遥远梦想的问题:生成类似人类的语言=M=mhS|JaBg]Ao=

h1^5Y;*i7nk9uDjb

Unlike earlier versions of the system, it can go on doing so for paragraphs on end without descending into incoherence.

A-*~5yhZ;19n]WE_

与早期版本不同,ChatGPT可以长篇大段地一直说下去,而不会出现语句不通的情况Jt2MkaaBmGx%pVokoj

JRFx)LUaU6N-heu

And this achievement's dimensions are even greater than they seem at first glance.

2agAs|ys5Vk

这一成就的影响范围甚至比它在初看之时所表现的更大G_D63-m-uK)9aj0

gcoZ%#cTv*#6_x10Lu

ChatGPT is not only able to generate remarkably realistic English.

MFB4*xBgCdAj

ChatGPT不仅能生成非常逼真的英语Dk&=GtsrS5P+s)A(!

NI0yrl8X0]#2o

It is also able to instantly blurt out text in more than 50 languages -- the precise number is apparently unknown to the system itself.

b[ewDT(GHm;XZbe,E@

还能立即脱口而出50多种语言 -- 系统自己显然也不知道确切数字是多少-%*gYi!Or]*BNrVAmi#P

Ng()oL,fv1Eecus

Asked (in Spanish) how many languages it can speak, ChatGPT replies, vaguely, "more than 50", explaining that its ability to produce text will depend on how much training data is available for any given language.

*3yuKUsozDCy,l-L

当被问及(用西班牙语)它会说几种语言时,ChatGPT含糊地回答说"超过50种",并解释说,它以某种语言生成文本的能力取决于这一语言的训练数据有多少CTev24&[^Z%PN1t^

#B~vA|~BA=N^odT

Then, asked a question in an unannounced switch to Portuguese, it offers up a sketch of your columnist's biography in that language.

L4]lvfV)S0*+5

然后,在没有通知的情况下转而用葡萄牙语提问时,它又用葡萄牙语提供了您的专栏作家的生平简介,dT^OmdpXzIH58.Xlv

v([cjr91@J=wvk

Most of it was correct, but it had him studying the wrong subject at the wrong university.

5La9CwDR8sphgjmP

大部分内容是正确的,但他就读的大学和专业搞错了phPa|z~JAavi9I_-df

aF!u&[uPAq2^KK[c

The language itself was impeccable.

,O8Bme%Lw2%SQA@ch

而语言本身无可挑剔D_G2zL9ypu#

.9HOJ(Quzgk

Portuguese is one of the world's biggest languages.

_ig.UCeF5KG)idp2

葡萄牙语是世界上最大的语种之一U1V^;aLGLV

HdoPV+fJJJ,~%

Trying out a smaller language, your columnist probed ChatGPT in Danish, spoken by only about 5.5m people.

S4ZSYi&kmx!bP-AQ

为了试一个更小的语种,您的专栏作家又用丹麦语对ChatGPT进行了追问,大约只有550万人说丹麦语%%BWYO]&9#t0j-iSkNC

;%JNFC@53s4@

Danes do much of their online writing in English, so the training data for Danish must be orders of magnitude scarcer than what is available for English, Spanish or Portuguese.

MBhsa=K|g&t

丹麦人在网上写东西大部分都是用英语,所以丹麦语的训练数据肯定比英语、西班牙语或葡萄牙语能提供的训练数据要少几个数量级nD19AbU;Xw)Hg

;._yQcwBV.;e@3gb6.oS

ChatGPT's answers were factually askew but expressed in almost perfect Danish.

F1LBLIRy_Pv_H%I

ChatGPT的回答歪曲了事实,但其丹麦语几近完美j-|b!5-aee6x&

g1wg;L,.@a5!yWj-X

(A tiny gender-agreement error was the only mistake caught in any of the languages tested.)

vot4WmT9eh+X]0XoqV)

(在所有测试的语言中,只发现了一个微小的性别一致性错误1oPo!Qxi!9fEE。)

URyYz*BT!eEdD

Indeed, ChatGPT is too modest about its own abilities.

gga[pp8i4,r!nRCd|c%

的确,ChatGPT对自己的能力过于谦虚3UwlPtwwxr

]kjs+1Hye2mvhx

On request, it furnishes a list of 51 languages it can work in, including Esperanto, Kannada and Zulu.

GXsY~r7HOY.!

它应要求提供了它可以使用的51种语言的清单,其中包括世界语、卡纳达语和祖鲁语MGM.t]1RHozdWS=

cM74*pv!q+fE-C)c[

It declines to say that it can "speak" these languages, but rather "generates text" in them.

SODxu^E_Ai_

它拒绝说自己会"说"这些语言,而是说能用这些语言"生成文本"(zo+C,qYNTb18CHT.0

+5%18SqDzRwP

This is too humble an answer.

q93C.dbNRJ,

这个回答真是过谦了s2S1I;AVT9QD|7@[I

4[_l@af&=g-NHx4

Addressed in Catalan -- a language not on the list -- it replies in that language with a cheerful "Yes, I do speak Catalan -- what can I help you with?"

#Tra!tN!HYue

在用加泰罗尼亚语(这种语言不在清单上)和它说话时,它用这种语言愉快地回答道:"是的,我会说加泰罗尼亚语,有什么可以帮你的吗?"

sQrjJN2U#(P.#

A few follow-up questions do not trip it up in the slightest, including a query about whether it is merely translating answers first generated in another language into Catalan.

HgnW1|4aYxYF

一些后续的提问也丝毫没能让它出差错,包括询问它是否只是先用另一种语言生成答案,然后再翻译成加泰罗尼亚语gmu[xv;[_Q)2pi3,iK

D[0!j~9CpVE.EtPEc)2s

This, ChatGPT denies: "I don't translate from any other language; I look in my database for the best words and phrases to answer your questions."

Xs0!D(ehu0U

ChatGPT否认了这一点:"我不翻译任何其他语言,我在我的数据库中寻找最佳词句来回答您的问题SmO!_9Y.]OHS^%4ViyT9。"

iBD,J|olAoZ#

Who knows if this is true?

j=@ko=,R5~a_=sn54HnF

谁知道这是不是真的?

m5;Kqx)6h@

ChatGPT not only makes things up, but incorrectly answers questions about the very conversation it is having.

dLKBGJ0LpkEkM

ChatGPT不仅编造故事,而且错误地回答了有关正在进行的对话的问题q,sPhvNz|I9id

oQpMbq(gnzGVoL+8]7

(It has no "memory", but rather feeds the last few thousand words of each conversation back into itself as a new prompt.

BIf1S~p_lKF3)brBm

(它没有"记忆",而是将每次对话的最后几千个单词反馈给自己,作为新的提示符bmOadiS&4C)ZUny#OY@

pC|Z,S4KLaBS#+h,-

If you have been speaking English for a while it will "forget" that you asked a question in Danish earlier and say that the question was asked in English.)

Zj@1%n4TyvaU4Ad.fbR

如果你说了一段时间的英语,它就会"忘记"你之前用丹麦语问了一个问题,并说那个问题是用英语问的t.s;zoGIGTyS|3@akA|o。)

Wjtd]c89b2,[

ChatGPT is untrustworthy not just about the world, but even about itself.

~]64-YCp)ctq

ChatGPT在关于世界,甚至关于它自己的方面是不可信赖的&4N0BxYgfVKf4HOs4c(X

tFB1Ydk9rblzk

This should not overshadow the achievement of a model that can effortlessly mimic so many languages, including those with limited training data.

AjmW.d!49GFywS

但这不应该掩盖这一模型的成就,它可以毫不费力地模仿如此多的语言,包括那些训练数据有限的语言HTMp%*~~bTMfmWRU

W,aiAb|Ov6|Hqv@W&2

Speakers of smaller languages have worried for years about language technologies passing them by.

I2dq5l*KUjjC|iNeTYbn

多年来,较小语种的使用者一直担心语言技术会与他们擦肩而过*)cx5f=SH.^wWv

gkSHQAkDCj3

Their justifiable concern had two causes: the lesser incentive for companies to develop products in Icelandic or Maltese, and the relative lack of data to train them.

xsITg9tAs^wKyKj

他们这一合理担忧有两个原因:公司开发冰岛语或马耳他语产品的动力较小,以及训练数据相对缺乏w,jg^.;;TKc|n%QYRF[6

WHo_!|AyxU]

Somehow the developers of ChatGPT seem to have overcome such problems.

-+srl[1C.u_rfp_O[b!

ChatGPT的开发者似乎不知如何已经克服了这些问题72iO152WA=0yfqke

S7FA3=(AQZpKSS_-

It is too early to say what good the technology will do, but this alone gives one reason to be optimistic.

_xImp)CP!7F+(8M-

现在说这项技术会有什么好处还为时过早,但只是这一点就给了我们一个保持乐观的理由i((v%cfA[&=!_

4;I]xeRWZquRl+JS

As machine-learning techniques improve, they may not require the vast resources, in programming time or data, traditionally thought necessary to make sure smaller languages are not overlooked online.

&,roN0v94jgPd#S

随着机器学习技术的进步,它们可能不像之前以为的那样,需要编程时间或数据方面的大量资源,这会确保较小的语种不会在网上被忽视(+YC-#EQjz~RO7_

uaRjFBvV_lI(4Au]u0rKHGiklIP_#b#GDt51Lpla-e+
分享到