经济学人:社交媒体数据的市场 细品弱水三千
日期:2013-01-03 14:39


Business A market for social-media data Sipping from the fire hose
商业 社交媒体数据的市场 细品弱水三千
Making sense of a torrent of tweets
MOST tweets are inane, but a million may contain valuable information.
Fed through clever algorithms, a torrent of microblogs can reveal changes in a nation's mood.
Hence the excitement about a new market: the sale and analysis of real-time social-media data.
DataSift, a start-up, will soon launch a marketplace for such information.
Analysing social media used to be a cottage industry.
Firms gathered data slowly and patchily, through mechanisms not built for the purpose.
Many online services kept their data locked up, because there was no way to make money from them. All this is changing.
Twitter was the first to move because it generates ever more data: the number of tweets per day now exceeds 230m, up more than 100% from the beginning of the year.
Twitter would like to turn its popularity into money, but rather than beefing up its own infrastructure, it plans to outsource the task of distributing and selling its data to DataSift and Gnip, another start-up.
Both DataSift and Gnip are striving to be "data platforms".
DataSift 和Gnip都在为成为"数据平台"而努力。
They collect and standardise information from all kinds of social-media services—not only Twitter, but also Facebook, YouTube and others.
Both Gnip and DataSift have built robust networks which can cope with massive amounts of data in real time.
And both are enforcing licensing rules: for instance, that a stream of tweets can be analysed but not republished.
Gnip, based in Boulder, Colorado, is more of a wholesale distributor. It charges $33,000 a month for a feed of half of all tweets.
Customers can also subscribe to feeds of tweets containing web links or certain keywords.
Buyers are mostly social-media monitoring companies, which analyse the data for a fee.
Sysomos, a Canadian firm, for example, allows firms to track in real time what people think about certain products.
DataSift serves both big corporations and individuals.
Customers can define sophisticated filters, for instance to find all tweets by men who are interested in a new product and live in London.
Charges for DataSift depend on the filter's complexity and the amount of data delivered.
The streams from Gnip and DataSift can be combined with data from more specialised firms that try to extract meaning from social-media data.
Lexalytics, for instance, analyses the sentiment of messages and posts.
Klout measures the influence of social-media users (some firms give people with a high Klout score preferential treatment).
Having a marketplace such as DataSift has already encouraged other social-media services to open their data vaults, says Nick Halstead, the founder of DataSift.
DataSift的创始人尼克?霍尔斯特德(Nick Halstead)说,有了像DataSift那样的数据交易市场,使得更多的社交媒体服务商开启其数据的仓库。
Financial firms have become interested in feeding such data into the algorithms they use to make investment decisions, says Chris Moody, Gnip's president.
Gnip的总裁克里斯?穆迪(Chris Moody)说,金融企业对此也有兴趣:将这样的数据放到他们的算法里,来做出投资决策。
And corporations are increasingly keen on combining social-media data with customer information.
Yet growth in this market could be held back—by privacy concerns.
Most people think that tweets are only up to 140 characters long.
But those who sip from Twitter's fire hose can get much more information, including a sender's location, the biography on his profile page and how many people have subscribed to his messages (see blog post on the map of a tweet).
Most of this information is freely available on Twitter's website.
But if users realise how their data are used, they may clam up.

  • privacyn. 隐私,隐居,秘密
  • definev. 定义,解释,限定,规定
  • certainadj. 确定的,必然的,特定的 pron. 某几个,某
  • extractn. 榨出物,精华,摘录 vt. 拔出,榨出,摘录,提取
  • socialadj. 社会的,社交的 n. 社交聚会
  • strivingn. 努力;斗争 v. 力争;奋斗;努力(strive的
  • keenadj. 锋利的,敏锐的,强烈的,精明的,热衷的
  • scoren. 得分,刻痕,二十,乐谱 vt. 记分,刻划,划线,
  • bouldern. 大圆石,巨砾
  • torrentn. 激流,山洪 adj. 奔流的,汹涌的