Posts Tagged ‘journalism’

How metadata could pay for newspapers

February 13th, 2010

What if newspapers published not just stories but databases? Dan Conover’s vision for the future of newspapers is inspired in part by his first reporting job, for NATO:

When we spotted something interesting, we recorded it in a highly structured way that could be accurately and quickly communicated over a two-way radio, to be transcribed by specialists at our border camp and relayed to intelligence analysts in Brussells.

The story, says Conover, is only one aspect of reporting. The other part? Gathering structured metadata, which could be stored in a database—or expressed as linked data.1

Newspapers already have classification systems and professional taxonomists. The New York Times’ classifications system, in use since 1851, now aggregates stories from the archives in Times Topics, a website and API.2

What if, in addition to these classifications, each story had even more structured metadata?
Capturing metadata ranges from automatic to manual. Some automatic capture is already standard (timestamps) or could be (saving GPS coordinates from a photo or storing timestamps), and some information needing manual capture (like the number of alarms of a fire) is already reported.

Dan compares the “old way” with his “new way”:

The old way:

Dan the reporter covers a house fire in 2005. He gives the street address, the date and time, who was victimized, who put it out, how extensive the fire was and what investigators think might have caused it. He files the story, sits with an editor as it’s reviewed, then goes home. Later, he takes a phone call from another editor. This editor wants to know the value of the property damaged in the fire, but nobody has done that estimate yet, so the editor adds a statement to that effect. The story is published and stored in an electronic archive, where it is searchable by keyword.

The new way:

Dan the reporter covers a house fire in 2010. In addition to a street address, he records a six-digit grid coordinate that isn’t intended for publication. His word-processing program captures the date and time he writes in his story and converts it to a Zulu time signature, which is also appended to the file.

As he records the names of the victimized and the departments involved in putting out the fire, he highlights each first reference for computer comparison. If the proper name he highlights has never been mentioned by the organization, Dan’s newswriting word processor prompts him to compare the subject to a list of near-matches and either associate the name with an existing digital file or approve the creation of a new one.

When Dan codes the story subject as “fire,” his word processor gives him a new series of fields to complete. How many alarms? Official cause? Forest fire (y/n)? Official damage estimate? Addresses of other properties damaged by the fire? And so on. Every answer he can’t provide is coded “Pending.”

Later, Dan sits with an editor as his story is reviewed, but a second editor decides not to call him at home because he sees the answer to the damage-estimate question in the file’s metadata. The story is published and archived electronically, along with extensive metadata that now exists in a relational database. New information (the name of victims, for instance) automatically generates new files, which are retained by the news organization’s database but not published.

And those information fields Dan coded as “Pending?” Dan and his editors will be prompted to provide that structured information later — and the prompting will continue until the data set is completed.

- Dan Conover in The “Lack of Vision” thing? Well, here’s a hopeful vision for you

And that data set? It might even be saleable, even though each individual story had perhaps been given away for free. Dan highlights some possibilities, and entire industries have grown around repackaging free and non-free data (e.g. U.S. Census data, phone book data). I think of mashups such as Everyblock and hyperlocal news sites like outside.in.

  1. Some news organizations, like the New York Times (see Linked Open Data) and the BBC (overview, tech blog) are already embracing linked data. []
  2. I delved into Times Topics’ taxonomy and vocabulary in an earlier post. []

Tags: , , , , , , ,
Posted in future of publishing, information ecosystem, semantic web | Comments (1)

Horizon scanning and the digital underbelly

March 29th, 2009

Gaynor Backhouse writes a great post about libraries, holding out for “a guided tour of the library’s digital underbelly”. My favorite part is her metaphor about horizon-scanning:

Horizon scanning is a bit like doing a jigsaw you’ve bought from a car boot sale: first of all, it comes in a plastic bag, so there’s no picture to guide you. Secondly, you can see from the myriad sizes of the different pieces that there’s more than one puzzle in there and, thirdly, you know, even as you are handing over your money, that you won’t have all the pieces to complete any one, particular puzzle. [JISC Libraries of the Future | Holding out for a hero: technology, the future and the renaissance of the university librarian.]

Gaynor manages JISC’s TechWatch, keeping up with tech trends for libraries.

I’m not quite sure what the library’s “digital underbelly” is. But this sampling of news art strikes me as one possible example.

Graphics section of the Chicago Tribune, September 9, 1938

Graphics section of the Chicago Tribune, September 9, 1938

The Art the Message: The Story Behind the Chicago Tribune Collection has the same feel of the behind-the-scenes tour Gaynor Backhouse described: “secret stuff” that only the curators know about. This collection was saved by Janet A. Ginsburg, who edits news aggregator trackernews.net and curates a collection of news retrospectives, hosted at her personal site.

For access to the physical collection (now known as the Janet A. Ginsburg Chicago Tribune Collection of the Michigan State University News Archive) contact MSU Communication professor Lucinda Davenport. Images from Janet’s news art exhibit can also be seen at Brainpickings and (with Portuguese commentary) at Segunda Língua. Found via Janet’s comment on Steven Berlin Johnson’s SXSW talk, Old Growth Media And The Future Of News.

Tags: , , , , ,
Posted in library and information science, old newspapers | Comments (0)

Somebody’s Got to Pay (for Investigative Reporting)

March 7th, 2009

Timothy Burke is my new hero. The death* of newspapers, he says, is a problem mainly because somebody’s got to pay for investigative reporting:

We don’t need newspapers to have film criticism or editorial commentary or consumer analysis of automobiles or comic strips or want ads or public records. It might be that existing online provision of those kinds of information could use serious improvement or has issues of its own. It might be that older audiences don’t know where to find some of that information, or have trouble consuming it in its online form. But there’s nothing that makes published newspapers or radio programming inherently superior at providing any of those functions, and arguably many things that make them quite inferior to the potential usefulness of online media. So throw the columnists and the reviewers and the lifestyle reporters off the newspaper liferaft.

So it comes down to independent, sustained investigation of public affairs. The argument that online media cannot provide this function comes down to money

Burke gives more details and examples, and calls for new funding models, including philanthropic and/or foundation money. He concludes that the “The end of the newspaper model of the last century doesn’t have to be the end of independent investigative reporting.”

Go read the whole thing.
*It seems like death and rebirth, to me, especially with some major newspapers reinventing themselves online. But that’s another matter.

Burke first came to my attention last year, from a talk he gave to the LC Working Group on the Future of Bibliographic Control at March’s meeting on the Users and Uses of Bibliographic Data. Burke represented and reflected upon the user perspective, as an academic who searches catalogs outside his area of expertise.

Via John Dupuis’s friendfeed.

Tags: , , , , , ,
Posted in future of publishing, intellectual freedom | Comments (1)