What Will Journalist-Programmers Do?

One of the most common questions I've gotten about our journalist/programmer scholarships comes from news organizations: "When can we hire them?" And recent developments suggest that the need for people with both journalism and programming skills is only going to increase. 数据挖掘交友

For Northwestern's Readership Institute blog, I wrote last week about the growing number of data-driven applications being published on on news Web sites. I used the Indianapolis Star's Data Central as a case study. It's worth pointing out, though, that the paper was able to publish most of these databases without involving professional programmers. This reflects one of the driving trends in technology: the development of tools that enable data-driven publishing with modest levels of technical skills.

数据挖掘工具

If the tools are getting easier for non-programmers to use, what would a person with both journalism and programming skills do at a news organization? 数据挖掘实验室

Some of the answers might be found in the experience of the Star and other news organizations (such as the Asbury Park Press) that are leading the way in deploying databases on their Web sites. Successful as these initiatives have been, I would argue that they are just a start. Some projects are clearly both more complex and potentially more rewarding -- for news organizations and their online audience -- than others. 数据挖掘交友

For the Readership Institute post, I began to put together a hierarchy of data-driven journalism. At the low end are the simplest kinds of projects in which the news organization doesn't do much beyond making the data available. At the high end are the most ambitious applications, in which the news organization adds value through smart interface development, journalistic analysis, creativity in presentation or connections to storytelling. 数据挖掘实验室

Level 1: Data delivery. Here a news organization obtains data and makes it available in a browsable form. There's no additional reporting and little functionality for the online user. The Star's CEO salaries database is an example.

数据挖掘交友

Level 2: Data search. This is by far the most common way data is made available. Users are expected to find relevant information by entering text into a search box. An example: The Cincinnati Enquirer's database of home sales prices. 数据挖掘论坛

Level 3: Data exploration. Compare the search results page for a typical searchable database like Cincinnati home sales prices to the browse options on Adrian Holovaty's chicagocrime.org. There's a search box on the page, but the site allows easy exploration of the data in a way most online databases do not. Click on any of the browse options and you are presented with additional links that you can click on and explore the information more thoroughly. I recently heard Adrian talk about his approach to developing database application. He talks about applying "The Treatment" to online data, by which he means, "Present it in ways that make it fun and serendipitous." His motto is: "Everything that can be linked should be linked." His work shows that searchability is just the beginning. 数据挖掘实验室

Level 4: Data visualization. Rows and columns are often not the best way to present data. For many databases, the most valuable thing a news organization can do is provide a way for people to visualize what the data show. The most obvious approaches involve mapping, at least for databases that have a geographic element. Thanks to Google and Yahoo!, it is relatively easy to add maps to any database that includes addresses. But the possibilities for data visualization go way beyond mapping. A site that is doing some very interesting things with data visualization is Digg.com, a tech-oriented site where content is prioritized based on user voting. Check out Digg Labs for some creative ways the Digg team is finding to prioritize news using visual interfaces.

数据挖掘论坛

Level 5: Data experiences and storytelling. When a news organization can effectively marry traditional reporting and storytelling with database development capabilities, truly new forms of journalism can emerge. Here are two examples of what I'm talking about: 数据挖掘交友

  • The Los Angeles Times' homicide map. What makes this project interesting is that behind the map is a page (actually a blog post) about every individual murder in Los Angeles this year. And for each murder, the Times allows comments (after staff review), which often take the form of tributes to the homicide victim. These comments are often poignant and compelling -- transforming dry statistics into human stories.
  • Politifact, a joint project of the St. Petersburg Times and Congressional Quarterly. This is a data-driven application designed "to help you find the truth in the presidential campaign."

I'd also list some examples of data-driven journalism created by the News21 reporting project, an initiative (funded by the Knight Foundation and the Carnegie Corporation) involving graduate students from Northwestern University, the University of Southern California, Columbia University and the University of California-Berkeley. (Disclosure: I have served as an adviser to the students and helped them work with Flash developers to build these projects.) These three examples marry original reporting and data-driven presentation: 数据挖掘实验室

  • Digital Trails, a story about how information about people is captured and stored as they go about their daily activities. The News21 reporters followed a young woman around the Washington area and identified every instance in which she left a "digital trail," then found out where that information was stored and how it might be shared with companies or the government. Underlying the multimedia reporting and Flash interface is a database in which every trail is a data element. Flash design and programming were done by From Scratch Design Studio of Washington, DC.
  • Government Data Mining, a project that started with the most complete list ever compiled about government data-mining programs. The student team then had the interesting idea of using an interface similar to the ones used by data-mining software to allow people to explore these programs, government agency by government agency. FromScratch worked on this project as well.
  • One Vote Under God, a comprehensive look at this year's presidential candidates and issues related to religion. The Flash interface for this project, developed by Michael Nix Design in Chicago, enables users to explore the candidates religious backgrounds and positions on religion-related issues, as well as to compare any pair of candidates.

I'm hoping that the journalist-programmers who graduate from our new program will both be more likely to come up with ideas like this, and be able to help make them happen.

[数据挖掘专家] [数据挖掘研究院] [数据挖掘论坛] [数据挖掘实验室]
上一篇:Sentiment analysis and consumer generated content
下一篇:a method to recognize specificity determining residues from multiple
最新评论共有 0 位网友发表了评论 , 查看所有评论
发表评论( 不能超过250字,需审核,请自觉遵守互联网相关政策法规。 )
匿名?
数据挖掘网站导航 数据挖掘论坛导航
  • 数据挖掘工具
  • 数据挖掘论坛
  • DataCruncher - Cognos
  • MineSet - MathSoft
  • Intelligent Miner - GainSmarts
  • Sqlserver - SAS - Clementine
  • CART - Weka - WizSoft
  • NeuroShell - ModelQuest
  • data mining tools - Darwin
  • 数据挖掘交友
  • 数据挖掘博客
  • 数据挖掘工具
  • 数据挖掘资源
  • 数据挖掘技术算法
  • 数据挖掘相关期刊、会议
  • 研究院联盟合作专区
  • 数据挖掘基础与相关技术
  • 数据挖掘厂商与就业
  • 数据挖掘研究者乐园
  • 知名厂商数据挖掘工具资料
  • 国内数据挖掘实验室
  • Foreign Data Mining Lab
  • 热点关注
  • 支持向量机算法及其代码实现
  • Boosting算法及其代码实现
  • K近邻算法
  • Kalman filter toolbox for Matlab
  • Decision Trees算法及其代码实现
  • 生物信息学--机器学习方法
  • [mlchina] ICML 2008 Call for Papers
  • Java Machine Learning Library
  • Paperless office? Only on paper
  • Normal Bayes 分类器
  • 论坛最新话题
  • Foundations of Statistical Natural Langu
  • Game Theory meet Data Mining: A Recent P
  • System Building: How does it help or hin
  • 数据挖掘与Clementine培训
  • 新手报到
  • 求 SASEM 客户流失预测分析
  • 数据挖掘工程师/搜索研究院—北京——无线
  • 数据挖掘入门介绍(如何着手数据挖掘)
  • Information Overload Survey Results
  • The INEX 2005 Workshop on Element Retrie
  • 相关资讯
  • 预言:50年后机器人威胁人类 数十亿人丧命智
  • Paperless office? Only on paper
  • Simplicity vs. Complexity
  • 生物信息学--机器学习方法
  • Java Machine Learning Library
  • IBM visualization software uses 3D avata
  • Combining classifiers to predict gene fu
  • Anyone has experience using data mining
  • The 3rd International Conference on Larg
  • A satisfied customer
  • 数据挖掘实验室资料
  • 数据挖掘博客地址
  • 数据挖掘实验室网站地址
  • Prepare for Medicare audits by using dat
  • 注册成为SAS用户与爱好者俱乐部会员
  • 水南梅
  • 明日烟
  • 新人报道
  • 下载
  • 厦门服务器托管,450元/月—0592-5177319 高
  • 买空间送域名--0592-5177319 高静