Science and technology
Computer vision
Eye robot
Poor eyesight remains one of the main obstacles to letting robots loose among humans. But it is improving, in part by aping natural vision.
ROBOTS are getting smarter and more agile all the time. They disarm bombs, fly combat missions, put together complicated machines, even play football. Why, then, one might ask, are they nowhere to be seen, beyond war zones, factories and technology fairs? One reason is that they themselves cannot see very well. And people are understandably wary of purblind contraptions bumping into them willy-nilly in the street or at home.
All that a camera-equipped computer sees is lots of picture elements, or pixels. A pixel is merely a number reflecting how much light has hit a particular part of a sensor. The challenge has been to devise algorithms that can interpret such numbers as scenes composed of different objects in space.
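To make that concrete, here is a minimal sketch in Python with NumPy (both assumed to be available; the tiny image is invented) of what the machine actually receives: a grid of numbers, with no notion of objects attached.

    import numpy as np

    # A tiny 4x4 greyscale "image": each entry is a pixel, a number recording
    # how much light hit that part of the sensor (0 = dark, 255 = bright).
    image = np.array([[ 12,  15, 200, 210],
                      [ 10,  18, 205, 215],
                      [  9,  14, 198, 220],
                      [ 11,  16, 202, 212]], dtype=np.uint8)

    # To the computer this is all there is; "the bright patch on the right is
    # an object" is an interpretation its algorithms still have to supply.
    print(image.shape)          # (4, 4)
    print(float(image.mean()))  # average brightness: the only free "summary"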
That interpretation comes naturally to people and, barring certain optical illusions, takes no time at all as well as precious little conscious effort. Yet emulating this feat in computers has proved tough.
In natural vision, after an image is formed in the retina it is sent to an area at the back of the brain, called the visual cortex, for processing. The first nerve cells it passes through react only to simple stimuli, such as edges slanting at particular angles. They fire up other cells, further into the visual cortex, which react to simple combinations of edges, such as corners. Cells in each subsequent area discern ever more complex features, with those at the top of the hierarchy responding to general categories like animals and faces, and to entire scenes comprising assorted objects. All this takes less than a tenth of a second.
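By way of a rough, hypothetical analogy (not a model of real neurons), one of those edge-tuned cells behaves like a small numerical filter that responds strongly where the light changes sharply from dark to bright at its preferred angle, and stays quiet elsewhere:

    import numpy as np

    # A 3x3 "cell" tuned to vertical dark-to-bright edges.
    vertical_edge = np.array([[-1, 0, 1],
                              [-1, 0, 1],
                              [-1, 0, 1]])

    edge_patch = np.array([[0, 0, 9],
                           [0, 0, 9],
                           [0, 0, 9]])   # dark on the left, bright on the right
    flat_patch = np.full((3, 3), 5)      # uniform brightness, no edge

    print(int((vertical_edge * edge_patch).sum()))  # strong response: 27
    print(int((vertical_edge * flat_patch).sum()))  # no response: 0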
The outline of this process has been known for years, and in the late 1980s Yann LeCun, now at New York University, pioneered an approach to computer vision that tries to mimic the hierarchical way the visual cortex is wired. He has been tweaking his convolutional neural networks, or ConvNets, ever since.
Seeing is believing
A ConvNet begins by swiping a number of software filters, each several pixels across, over the image, pixel by pixel. Like the brain's primary visual cortex, these filters look for simple features such as edges. The upshot is a set of feature maps, one for each filter, showing which patches of the original image contain the sought-after element. A series of transformations is then performed on each map in order to enhance it and improve the contrast.
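That first stage can be sketched in plain Python (a simplified illustration with hand-picked edge filters; a real ConvNet learns its filters and uses much faster code, but the arithmetic is the same): each filter is slid across the image one pixel at a time, its responses are collected into a feature map, and a squashing function then boosts the contrast.

    import numpy as np

    def convolve(image, filt):
        """Swipe one small filter over the image, pixel by pixel,
        and collect its responses into a feature map."""
        fh, fw = filt.shape
        h, w = image.shape
        out = np.zeros((h - fh + 1, w - fw + 1))
        for i in range(out.shape[0]):
            for j in range(out.shape[1]):
                out[i, j] = (image[i:i + fh, j:j + fw] * filt).sum()
        return out

    filters = [np.array([[-1, 0, 1]] * 3),    # looks for vertical edges
               np.array([[-1, 0, 1]] * 3).T]  # looks for horizontal edges

    frame = np.random.rand(8, 8)              # stand-in for a camera image
    feature_maps = [np.tanh(convolve(frame, f))  # tanh squashes and sharpens contrast
                    for f in filters]
    print(feature_maps[0].shape)              # (6, 6): one map per filter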
Next, the maps are swiped again, but this time rather than stopping at each pixel, the filter takes a snapshot every few pixels. That produces a new set of maps of lower resolution. These highlight the salient features while reining in computing power.
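The snapshot-every-few-pixels step amounts to subsampling, sketched below under the same assumptions (real systems often keep the average or the maximum of each small block instead, but the effect is the same: smaller maps, salient features kept, less arithmetic to do):

    import numpy as np

    def subsample(feature_map, step=2):
        """Keep a value every `step` pixels in each direction,
        halving the resolution and the subsequent work."""
        return feature_map[::step, ::step]

    def max_pool(feature_map, size=2):
        """Alternative: keep the strongest response in each size-by-size block."""
        h, w = feature_map.shape
        h, w = h - h % size, w - w % size          # trim to whole blocks
        blocks = feature_map[:h, :w].reshape(h // size, size, w // size, size)
        return blocks.max(axis=(1, 3))

    fmap = np.random.rand(6, 6)
    print(subsample(fmap).shape)  # (3, 3)
    print(max_pool(fmap).shape)   # (3, 3)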
The whole process is then repeated, with several hundred filters probing for more elaborate shapes rather than just a few scouring for simple ones. The resulting array of feature maps is run through one final set of filters. These classify objects into general categories, such as pedestrians or cars.
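Stacked end to end, the pipeline looks roughly like the sketch below, written with the PyTorch library for brevity (layer counts and sizes are invented for illustration and are not Dr LeCun's actual architecture): a bank of filters, a contrast-boosting squash, subsampling, a second and larger bank of filters, and a final classifying layer.

    import torch
    from torch import nn

    # Schematic ConvNet: filters -> squash -> subsample, repeated, then a
    # final set of "filters" that sorts the image into general categories.
    net = nn.Sequential(
        nn.Conv2d(1, 8, kernel_size=5),     # first bank of filters: 8 feature maps
        nn.Tanh(),                          # contrast-enhancing transformation
        nn.AvgPool2d(2),                    # snapshot every 2 pixels
        nn.Conv2d(8, 128, kernel_size=5),   # many more filters, probing richer shapes
        nn.Tanh(),
        nn.AvgPool2d(2),
        nn.Flatten(),
        nn.Linear(128 * 4 * 4, 2),          # classify: e.g. pedestrian vs car
    )

    x = torch.randn(1, 1, 28, 28)           # one 28x28 greyscale frame
    print(net(x).shape)                     # torch.Size([1, 2]): a score per category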
Many state-of-the-art computer-vision systems work along similar lines. The uniqueness of ConvNets lies in where they get their filters. Traditionally, these were simply plugged in one by one, in a laborious manual process that required an expert human eye to tell the machine what features to look for, in future, at each level. That made systems which relied on them good at spotting narrow classes of objects but inept at discerning anything else.
Dr LeCun's artificial visual cortex, by contrast, lights on the appropriate filters automatically as it is taught to distinguish the different types of object. When an image is fed into the unprimed system and processed, the chances are it will not, at first, be assigned to the right category. But, shown the correct answer, the system can work its way back, modifying its own parameters so that the next time it sees a similar image it will respond appropriately. After enough trial runs, typically 10,000 or more, it makes a decent fist of recognising that class of objects in unlabelled images.
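That trial-and-error loop is, in modern terms, supervised training by backpropagation. A minimal sketch, again in PyTorch and with invented data standing in for labelled photographs:

    import torch
    from torch import nn

    net = nn.Sequential(                     # a small ConvNet, as above
        nn.Conv2d(1, 8, 5), nn.Tanh(), nn.AvgPool2d(2),
        nn.Flatten(), nn.Linear(8 * 12 * 12, 3),
    )
    loss_fn = nn.CrossEntropyLoss()          # how wrong the current guess is
    optimiser = torch.optim.SGD(net.parameters(), lr=0.01)

    images = torch.randn(32, 1, 28, 28)      # stand-ins for labelled pictures
    labels = torch.randint(0, 3, (32,))      # the correct answers it is shown

    for trial in range(10_000):              # typically 10,000 runs or more
        guesses = net(images)                # assign each image to a category
        loss = loss_fn(guesses, labels)
        optimiser.zero_grad()
        loss.backward()                      # work back through the layers...
        optimiser.step()                     # ...modifying the parameters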
This still requires human input, though. The next stage is unsupervised learning, in which instruction is entirely absent. Instead, the system is shown lots of pictures without being told what they depict.
It knows it is on to a promising filter when the output image resembles the input. In a computing sense, resemblance is gauged by the extent to which the input image can be recreated from the lower-resolution output. When it can, the filters the system had used to get there are retained.
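In code, that criterion is essentially a reconstruction test. A simplified, autoencoder-style sketch (not Dr LeCun's actual procedure; sizes and data are invented): encode the picture into lower-resolution feature maps, try to recreate the input from them, and keep the filters once the recreation is close.

    import torch
    from torch import nn

    encoder = nn.Sequential(                     # filters plus subsampling
        nn.Conv2d(1, 8, 5, stride=2, padding=2), nn.Tanh())
    decoder = nn.ConvTranspose2d(                # tries to recreate the input
        8, 1, 5, stride=2, padding=2, output_padding=1)

    optimiser = torch.optim.Adam(
        list(encoder.parameters()) + list(decoder.parameters()), lr=1e-3)

    pictures = torch.randn(32, 1, 28, 28)        # unlabelled: nothing says what they show

    for step in range(1000):
        code = encoder(pictures)                 # lower-resolution output
        recreated = decoder(code)                # attempt to rebuild the input
        loss = ((recreated - pictures) ** 2).mean()
        optimiser.zero_grad()
        loss.backward()
        optimiser.step()
    # When the loss is small the input can be recreated well, and the
    # encoder's filters are judged promising and retained.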
In a tribute to nature's nous, the lowest-level filters arrived at in this unaided process are edge-seeking ones, just as in the brain. The top-level filters are sensitive to all manner of complex shapes.
Caltech-101, a database routinely used for vision research, consists of some 10,000 standardised images of 101 types of just such complex shapes, including faces, cars and watches. When a ConvNet with unsupervised pre-training is shown the images from this database it can learn to recognise the categories more than 70% of the time. This is just below what top-scoring hand-engineered systems are capable of, and those tend to be much slower.
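That 70% figure is an ordinary classification accuracy, measured on images the system was not trained on. A generic sketch (the trained `model` and the `test_set` iterator are placeholders, not a published benchmark script):

    import torch

    def accuracy(model, test_set):
        """Fraction of held-out images assigned to the correct category."""
        correct, total = 0, 0
        model.eval()
        with torch.no_grad():
            for images, labels in test_set:              # e.g. Caltech-101 photos
                predicted = model(images).argmax(dim=1)   # most likely category
                correct += (predicted == labels).sum().item()
                total += labels.numel()
        return correct / total

    # A ConvNet with unsupervised pre-training scores above 0.70 across the 101
    # Caltech-101 categories; top hand-engineered systems score slightly higher.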
This approach, which Geoffrey Hinton of the University of Toronto, a doyen of the field, has dubbed "deep learning", need not be confined to computer vision. In theory, it ought to work for any hierarchical system: language processing, for example. In that case individual sounds would be low-level features akin to edges, whereas the meanings of conversations would correspond to elaborate scenes.
For now, though, ConvNet has proved its mettle in the visual domain. Google has been using it to blot out faces and licence plates in its Streetview application. It has also come to the attention of DARPA, the research arm of America's Defence Department.
This agency provided Dr LeCun and his team with a small roving robot which, equipped with their system, learned to detect large obstacles from afar and correct its path accordingly, a problem that lesser machines often, as it were, trip over. The scooter-sized robot was also rather good at not running into the researchers. In a selfless act of scientific bravery, they strode confidently in front of it as it rode towards them at a brisk walking pace, only to see it stop in its tracks and reverse. Such machines may not quite yet be ready to walk the streets alongside people, but the day they can is surely not far off.