人工智能公司的数据争夺战(上)_国外媒体资讯

位置：首页 > 英语听力 > 国外媒体资讯 > 经济学人双语版 > 经济学人商业系列 > 正文

人工智能公司的数据争夺战(上)

日期:2023-08-24 10:00

(单词翻译:单击)

MP3点击下载

0826_1
0825_1
0812_1
0811_1
0729_1
0728_1
0715_1
0714_1
0701_1
0630_1
0617_1
0616_1
0603_1
0602_1
0520_1
0519_1
0513_1
0512_1
0429_1
0428_1
0421_1
0415_1
0414_1
0331_1
0325_1
0324_1
0311_1
0310_1
0303_1
0224_1
0218_1
0217_1
0210_1
0128_1
0127_1
0120_1
0113_1
0106_1
1230_1
1223_1
1216_1
1209_1
1203_1
1202_1
1125_1
1118_1
1111_1
1028_1
1021_1
1014_1
1005_1
0928_1
0928_2
0921_1
0921_2
0914_1
0914_2
0907_1
0907_2
0831_1
0831_2
0824_1
0824_2
0817_1
0817_2
0810_1
0810_2
0803_1
0803_2
0727_1
0727_2
0720_1
0720_2
0713_1
0713_2
0706_1
0706_2
0628_1
0628_2
0628_3
0628_4
0615_1
0615_2
0612_1
0611_1
0605_1
0604_1
0530_1
0529_1
0522_1
0521_1
0510_1
0509_1
0507_1
0502_1
0430_1
0424_1
0423_1
0413_1
0413_2
0402_1
0401_1
0318_1
0317_1
0304_1
0303_2
0302_1
0219_1
0218_2
0206
0205_1
0122_1
0121_1
0109_1
0108_1
1225_1
1224_1
1212_1
1211_1
1128_1
1127_1
1114_1
1113_1
1030_1
1029_1
1017_1
1016_1
1011_1
1010_1
0929_1
0928_3
0927_1
0925_1
0918_1
0917_1
0911_1
0910_1
0904_1
0903_1
0827_1
0826_2
0820_1
0819_1
0813_1
0812_2
0807_1
0806_1
0731_1
0730_1
0724_1
0723_1
0717_1
0716_1
0710_1
0708_1
0704_1
0702_1
0626_1
0624_1
0618_1
0616_2
0611_2
0609
0605_2
0602_2
0528_1
0525_1
0521_2
0519_2
0515_1
0512_2
0508_1
0505_1
0430_2
0429_2
0425_1
0424_2
0418_1
0417_1
0411_1
0410_1
0401_2
0331_2
0326_1
0321_1
0320_1
0311_2
0310_2
0306_1
0305_1
0224_2
0223_1
0222
0211_1
0210_2
0113_2
0113_3
0113_4
1223_2
1223_3
1213_1
1212_2
1202_2
1201_1
1130_1
1118_2
1117_1
1116_1
1104_1
1103_1
1102_1
1022_1
1021_2
1018_1
1017_2
0910_2
0909_1
0908_1
0907_3
0827_2
0826_3
0825_2
0824_3
0813_2
0812_3
0811_2
0810_3
0803_3
0802_1
0727_3
0726_1
0718_1
0717_2
0708_2
0704_2
0703_1
0627_1
0626_2
0623_1
0622_1
0612_2
0611_3
0606
0605_3
0530_2
0529_2
0523_1
0522_2
0517_1
0516_1
0513_2
0510_2
0509_2
0501
0430_3
0422_1
0419_1
0416_1
0415_2
0412
0411_2
0410_2
0404_1
0404_2
0328_1
0328_2
0325_2
0324_2
0315_1
0314_1
0307_1
0307_2
0301_1
0301_2
0225_1
0225_2
0218_3
0218_4
0201_1
0201_2
0121_2
0118_1
0111_1
0110_1
0107_1
0104_1
1227_1
1227_2
1226_1
1217_1
1217_2
1206_1
1206_2
1206_3
1128_2
1127_2
1121_1
1121_2
1114_2
1113_2
1106_1
1106_2
1031_1
1031_2
1023_1
1023_2
1015_1
1015_2
1002_1
0930_1
0919_1
0919_2
0911_2
0910_3
0908_2
0907_4
0906_1
0830_1
0829_1
0823_1
0822_1
0818_1
0817_3
0808_1
0807_2
0525_2
0524_1
0523_2
1116_2
1115_1
1111_2
1110_1
1109_1
1108_1
1107_1
1106_3
1104_2
1103_2
1102_2
1101_1
1016_2
1015_3
1014_2
1013_1
1012_1
1011_2
1010_2
1009_1
0914_3
0913_1
0815
0814_1
0809
0808_2
0710_2
0707_1
0622_2
0621_1
0510_3
0509_3
0508_2
0505_2
0504_1
0503_1
0410_3
0407_1
0406
0401_3
0317_2
0228_1
0227_1
0221_1
0220_1
0217_2
0216_1
0215_1
0213_1
0210_3
0209
0208
0207_1
0123_1
0120_2
0104_2
0102_1
1230_2
1229_1
1228_1
1223_4
1205_1
1202_3
1123_1
1122_1
1118_3
1117_2
1116_3
1102_3
1101_2
1031_3
1014_3
1011_3
1009_2
0930_2
0927_2
0706_3
0705_1
0704_3
0701_2
0616_3
0615_3
0614_1
0603_2
0602_3
0601
0516_2
0513_3
0512_3
0509_4
0506_1
0503_2
0429_3
0218_5
0217_3
0129_1
0126_1
0120_3
0115_1
0114_1
0112_1
0111_2
0108_2
0105_1
1221_1
1215_1
1214_1
1209_2
1202_4
1201_2
1127_3
1126_1
1125_2
1124_1
1102_4
1030_2
1029_2
1027_1
1023_3
1019_1
1012_2
1010_3
1009_3
1008_1
0930_3
0929_2
0925_2
0921_3
0918_2
0917_2
0914_4
0911_3
0908_3
0907_5
0906_2
0828_1
0827_3
0826_4
0825_3
0821_1
0820_2
0819_2
0818_2
0814_2
0813_3
0811_3
0810_4
0805_1
0804_1
0803_4
0731_2
0730_2
0728_2
0724_2
0723_2
0722_1
0720_3
0717_3
0713_3
0707_2
0701_3
0625_1
0623_2
0619_1
0618_2
0617_2
0616_4
0615_4
0612_3
0611_4
0602_4
0529_3
0526_1
0520_2
0519_3
0507_2
0505_3
0504_2
0430_4
0429_4
0427_1
0424_3
0416_2
0415_3
0414_2
0409_1
0407_2
0402_2
0330
0325_3
0323
0320_2
0319_1
0317_3
0311_3
0309
0302_2
0228_2
0216_2
0215_2
0213_2
0210_4
0205_2
0204_1
0203
0202
0129_2
0128_2
0127_2
0126_2
0123_2
0121_3
0120_4
0119_1
0115_2
0113_5
0112_2
0107_2
0105_2
0104_3
1230_3
1229_2
1226_2
1224_2
1223_5
1222_1
1219_1
1217_3
1216_2
1215_2
1212_3
1209_3
1208_1
1205_2
1204_1
1203_2
1202_5
1201_3
1126_2
1125_3
1124_2
1121_3
1119_1
1117_3
1114_3
1113_3
1112_1
1110_2
1107_2
1106_4
1104_3
1103_3
1031_4
1027_2
1024_1
1023_4
1022_2
1017_3
1016_3
1015_4
1014_4
1013_2
1011_4
1010_4
1009_4
0926_1
0925_3
0924_1
0923_1
0919_3
0918_3
0917_3
0916_1
0915_1
0912_1
0911_4
0910_4
0909_2
0904_2
0903_2
0902_1
0901_1
0829_2
0828_2
0827_4
0822_2
0821_2
0819_3
0818_3
0812_4
0811_4
0808_3
0805_2
0804_2
0801_1
0731_3
0730_3
0729_2
0725_1
0724_3
0722_2
0716_2
0715_2
0714_2
0711_1
0710_3
0709_1
0708_3
0707_3
0704_4
0703_2
0702_2
0701_4
0630_2
0627_2
0626_3
0625_2
0624_2
0623_3
0620_1
0619_2
0619_3
0618_3
0617_3
0616_5
0528_2
0527
0526_2
0523_3
0522_3
0521_3
0520_3
0512_4
0509_5
0508_3
0507_3
0506_2
0505_4
0504_3
0430_5
0429_5
0428_2
0425_2
0424_4
0423_2
0422_2
0421_2
0418_2
0417_2
0416_3
0415_4
0414_3
0411_3
0410_4
0409_2
0408_1
0404_3
0403_1
0327_1
0326_2
0325_4
0324_3
0321_2
0320_3
0319_2
0318_2
0317_4
0314_2
0313
0312_1
0311_4
0310_3
0307_3
0306_2
0305_2
0304_2
0303_3
0228_3
0227_2
0224_3
0221_2
0220_2
0219_2
0218_6
0217_4
0214
0213_3
0212
0211_2
0210_5
0207_2
0128_3
0127_3
0124_1
0123_3
0122_2
0121_4
0117_1
0116_1
1231
1230_4
1227_3
1226_3
1225_2
1224_3
1223_6
1220_1
1219_2
1218_1
1217_4
1216_3
1213_2
1212_4
1210_1
1209_4
1206_4
1205_3
1204_2
1203_3
1202_6
1129_1
1128_3
1127_4
1126_3
1125_4
1122_2
1120_1
1119_2
1118_4
1115_2
1114_4
1113_4
1112_2
1111_3
1108_2
1107_3
1106_5
1105_1
1104_4
1101_3
1031_5
1030_3
1029_3
1028_2
1028_3
1025_1
1024_2
1023_5
1022_3
1021_3
1018_2
1017_4
1016_4
1015_5
1014_5
1012_3
1011_5
1010_5
1009_5
1008_2
0930_4
0929_3
0927_3
0926_2
0925_4
0924_2
0924_3
0923_2
0922_1
0918_4
0917_4
0917_5
0916_2
0913_2
0912_2
0911_5
0910_5
0909_3
0909_4
0906_3
0905
0904_3
0829_3
0823_2
0823_3
0822_3
0821_3
0816
0814_3
0814_4
0813_4
0812_5
0808_4
0807_3
0806_2
0805_3
0802_2
0801_2
0731_4
0730_4
0729_3
0726_2
0725_2
0724_4
0723_3
0722_3
0719
0718_2
0717_4
0716_3
0715_3
0714_3
0712
0711_2
0711_3
0709_2
0708_4
0705_2
0704_5
0703_3
0702_3
0701_5
0628_5
0627_3
0626_4
0625_3
0624_3
0621_2
0620_2
0619_4
0618_4
0617_4
0614_2
0608
0607
0605_4
0604_2
0603_3
0531
0530_3
0529_4
0528_3
0525_3
0524_2
0522_4
0517_2
0516_3
0515_2
0514
0510_4
0509_6
0508_4
0507_4
0506_3
0503_3
0502_2
0428_3
0427_2
0426
0425_3
0424_5
0420
0419_2
0418_3
0416_4
0415_5
0414_4
0411_4
0409_3
0408_2
0407_3
0403_2
0401_4
0329
0328_3
0327_2
0326_3
0325_5
0322
0321_3
0320_4
0319_3
0318_3
0315_2
0314_3
0312_2
0311_5
0308
0307_4
0306_3
0305_3
0301_3
0227_3
0223_2
0221_3
0218_7
0216_3
0204_2
0201_3
0131
0130
0129_3
0127_4
0126_3
0125
0124_2
0123_4
0122_3
0121_5
0119_2
0118_2
0117_2
0116_2
0115_3
0114_2
0113_6
0112_3
0111_3
0110_2
0109_2
0108_3
0107_3
0107_4
0106_2
0105_3
0104_4
0103
0102_2
0101
1230_5
1229_3
1228_2
1227_4
1226_4
1225_3
1224_4
1223_7
1222_2
1221_2
1220_2
1219_3
1218_2
1217_5
1216_4
1215_3
1214_2
1213_3
1212_5
1211_2
1210_2
1209_5
1208_2
1207
1206_5
1205_4
1204_3
1203_4
1202_7
1201_4
1130_2
1129_2
1128_4
1127_5
1126_4
1125_5
1124_3
1123_2
1122_3
1121_4
1120_2
1120_3
1119_3
1118_5
1117_4
1116_4
1115_3
1114_5
1109_2
1108_3
1107_4
1106_6
1106_7
1105_2
1104_5
1103_4
1102_5
1101_4
1031_6
1030_4
1029_4
1028_4
1027_3
1026
1025_2
1024_3
1023_6
1022_4
1019_2
1017_5
1015_6
1014_6
1013_3
1012_4
1012_5
1011_6
1011_7
1010_6
1009_6
1008_3
1007
1006
1005_2
1004
1002_2
1001
0930_5
0928_4
0927_4
0927_5
0926_3
0925_5
0924_4
0923_3
0922_2
0921_4
0919_4
0918_5
0917_6
0916_3
0915_2
0914_5
0912_3
0912_4
0911_6
0910_6
0909_5
0908_4
0908_5
0902_2
0901_2
0831_3
0830_2
0829_4
0827_5
0823_4
0823_5
0822_4
0821_4
0820_3

Business

商业版块

Digging for digits

挖掘数据

A scramble for data is underway among AI companies.

各大人工智能公司正在抢夺数据iG=PXm|=jjJ2Lmb;Inh。

Not so long ago analysts were openly wondering whether artificial intelligence (AI) would be the death of Adobe, a maker of software for creative types.

不久前，分析人士还在公开猜测，人工智能是否会导致创意软件制造商Adobe的灭亡.0k;J0f@E8UWNPsr_。

New tools like DALL-E 2 and Midjourney, which conjure up pictures from text, seemed set to render Adobe’s image-editing offerings redundant.

像DALL-E 2和Midjourney这样的新工具能够根据文本描述而生成图像，似乎让Adobe的图像编辑功能变得多余ncI&d00o8RMov。

As recently as April, Seeking Alpha, a financial-news site, published an article headlined “Is AI the Adobe killer?”

就在4月份，财经新闻网站“寻找阿尔法”发表了一篇题为“AI会杀死Adobe吗？”的文章yPK;Y~~i*A=B7n@.。

Far from it.

完全不会7ba_ok3NsUgR6#6U。

Adobe has used its database of hundreds of millions of stock photos to build its own suite of AI tools, dubbed Firefly.

Adobe已经利用其容纳数亿张版权照片的数据库而建立了自己的人工智能工具套件，名为“萤火虫”4y0#R;kNyq[hp。

Since its release in March the software has been used to create over 1bn images, says Dana Rao, a company executive.

Adobe的高管达纳·拉奥表示，自3月份发布以来，萤火虫软件已被用于创建逾10亿张图片oAOgZ)jsi+J),s-。

By avoiding mining the internet for images, as rivals did, Adobe has skirted the deepening dispute over copyright that now dogs the industry.

通过避免像竞争对手那样在互联网上搜罗图像，Adobe规避了目前困扰着该行业的日益深化的版权纠纷问题V,8^!Z#2shsjzRXX9。

The firm’s share price has risen by 36% since Firefly was launched.

自萤火虫发布以来，Adobe的股价已经上涨了36%-*hegz8Zp68kY。

Adobe’s triumph over the doomsters illustrates a wider point about the contest for dominance in the fast-developing market for AI tools.

Adobe对末日预言者的胜利说明了一个更广泛的问题，这个问题关乎如何在快速发展的AI工具市场争夺主导地位i&P1Ly3-Qud63[#biY。

The supersize models powering the latest wave of so-called “generative” AI rely on oodles of data.

为最新一波“生成式”人工智能提供动力的超大模型依赖于海量数据A_f)1-f63z%^RQARi(Uc。

Having already helped themselves to much of the internet, often without permission, AI firms are now seeking out new data sources to sustain the feeding frenzy.

人工智能公司已经自助取用了互联网上的大量数据（通常未经许可），现在正在寻找新的数据来源，以继续给模型疯狂投喂5qK8guiz9h[2wZ2。

Meanwhile, companies with vast troves of the stuff are weighing up how best to profit from it.

与此同时，拥有大量数据的公司正在权衡如何最好地从中获利=8tu_e2JPT!Et。

A data land grab is under way.

一场数据土地掠夺正在进行,ykJFB+Y]3=!5rH6SCd3。

The two essential ingredients for an AI model are datasets, on which the system is trained, and processing power, through which the model detects relationships within and among those datasets.

人工智能模型的两个基本要素是数据集和处理能力，系统用数据集进行训练，模型通过处理发现数据集内部和不同数据集之间的关系6g(P&Hxq[rooM(%5!I=。

Those two ingredients are, to an extent, substitutes: a model can be improved either by ingesting more data or adding more processing power.

在某种程度上，这两个要素可互相替代：模型可以通过摄入更多数据而改进，也可以通过加强处理能力而改进EX(&k-EX8GJdXx#J=g。

The latter, however, is becoming difficult owing to a shortage of specialist AI chips, leading model-builders to be doubly focused on seeking out data.

然而，由于专业人工智能芯片的短缺，加强处理能力正变得越来越困难，这导致模型建造者们加倍专注于寻找数据^Heaa4=e%E0P,*D%8xp。

Demand for data is growing so fast that the stock of high-quality text available for training may be exhausted by 2026, reckons Epoch AI, a research outfit.

研究团队“纪元AI”估计，由于对数据的需求增长非常迅速，因此可用于训练的高质量文本储备可能在2026年耗尽F&uTK@CaXS.JgeTqhP。

The latest AI models from Google and Meta, two tech giants, are likely trained on over 1trn words.

谷歌和Meta这两家科技巨头的最新AI模型的训练数据可能会超过1万亿个单词30OS8Bb_JJhu65Z。

By comparison, the sum total of English words on Wikipedia, an online encyclopedia, is about 4bn.

相比之下，在线百科全书维基百科的英文单词总数约为40亿个OHhYk[Y#7+Dwb9]eiG。

It is not only the size of datasets that counts. The better the data, the better the model.

重要的不仅仅是数据集的大小p,;iHHA_MERq9eI*zyL。数据越好，模型就越好FFW_Q^#%4pQt。

Text-based models are ideally trained on long-form, well-written, factually accurate writing, notes Russell Kaplan of Scale AI, a data startup.

数据初创公司Scale AI的拉塞尔·卡普兰指出，对于基于文本的模型，最理想的训练数据是篇幅长、文笔好、符合事实的文字OL^b2XXFv5OKN。

Models fed this information are more likely to produce similarly high-quality output.

被输入这种信息的模型更有可能生成同样高质量的产出.24iGI=yf~i.#IQ。

Likewise, AI chatbots give better answers when asked to explain their working step by step, increasing demand for sources like textbooks.

同样，当AI聊天机器人被要求一步一步地解释原理时，它们会给出更好的答案，这就增加了对教科书等资源的需求&kXSwZwak!e。

Specialised information sets are also prized, as they allow models to be “fine-tuned” for more niche applications.

专业的信息集也很受重视，因为这些信息集可以让模型进行“微调”，以适应更小众领域的应用r!rq#uJq+EZ9k;95GZg。

Microsoft’s purchase of GitHub, a repository for software code, for $7.5bn in 2018 helped it develop a code-writing AI tool.

2018年，微软斥资75亿美元收购了软件代码库GitHub，这使微软开发出了一款编写代码的AI工具1-1wUwG-eV%FC。

As demand for data grows, accessing it is getting trickier, with content creators now demanding compensation for material that has been ingested into AI models.

随着对数据需求的增长，访问数据变得越来越棘手，内容创建者现在要求对输入给AI模型的材料收取报酬&mGvMz[sIbq。

A number of copyright-infringement cases have already been brought against model-builders in America.

在美国，已经有多起针对模型建造者的侵犯版权案件f&wwfwrg@2No|!.~m。

A group of authors, including Sarah Silverman, a comedian, are suing OpenAI, maker of ChatGPT, an AI chatbot, and Meta.

包括喜剧演员莎拉·西尔弗曼在内的一群作家正在起诉OpenAI（AI聊天机器人ChatGPT的制造商）和Meta]0ZxFs*vUvG13W&。

A group of artists are similarly suing Stability AI, which builds text-to-image tools, and Midjourney.

一群艺术家也在起诉Stability AI（开发文本转图像的工具）和Midjourney92%ziMA^]wM]d)。

分享到