OpenBlock Setup/Matintenance Costs

As I wait for the Knight Foundation to give OpenBlock Rural the official “Go!” I’ve been talking with developers and working on nailing down an estimate of what it would take to get OpenBlock up and running. For that conversation, please tune in to this discussion thread on the eb code Google Group for developers.

Knight News Challenge Proposal: Crowdsourcing Data to Bring OpenBlock to Rural America

At the top of my To Do List this week is the completion of one of the proposals I’ve submitted to the Knight News Challenge this year. I’m posting it here in the hope that you’ll have some feedback on whether/how a service like this would be technically feasible. editorially useful and financially viable. I’m especially interested in hearing from editors of small papers, public records experts, civic/community organizers and anyone who’s worked on the OpenBlock code.

Under what conditions would you volunteer to help a project like this in your community? News organization — how much would you pay for a service like this? What characteristics would it need to have to make it worth your money? What else do you see here that needs further clarification?

(And a big hat-tip here to Penny Abernathy, the Knight Chair in Digital Media Economics here at the UNC-Chapel Hill School of Journalism and Mass Communication. She got this project kicked off with a grant from the McCormick Foundation and who is my co-pilot on this application.)

Here’s our draft pitch:

Crowdsourcing Data to Bring OpenBlock to Rural America

This project would create a co-op to develop and deploy public records databases at news organizations, especially those serving communities of fewer than 75,000 people, preparing those records for presentation and integration in an OpenBlock format.

These rural news organizations are struggling to move to the digital age in part because their staffs are so small they don’t have the capacity to identify, digitize, re-aggregate and map all the various public records available at the state and local level into databases that can be accessed intelligently by both reporters and the reading public.

The project would tackle the lack of capacity at rural papers from two directions. It would create a centralized repository of state, county and city schemas and datafeeds that could be easily used in OpenBlock. This a job well-suited for a small group of experts. In addition, the project will create a statewide corps of amateur data-checkers and records requesters. Data quality assurance and data gathering are jobs well-suited for a crowd of many people, each working on a small piece of the puzzle.

These volunteer citizen-journalists would actually be member-owners of a co-op business. Each task they perform would earn them additional shares in the company’s annual profits. We would generate revenue by charging rural newspapers a fee. The more records and the better their accuracy, the more news organizations would sign on for the service.

In some cases, volunteers would pick up CDs of data from county offices. In others volunteers would scan and upload PDFs of hand-written police incident reports. In still other cases, people would key into a database the information on those PDFs. This job is so big that no single small news organization could do it. But with a corps of member-owners working together, we could create a model for gathering valuable public records from rural America. To individual communities, these records are necessary to foster an informed civic dialog and healthy economy. But in aggregate, these records may also be able to shed light on trends in rural America that would otherwise go unreported.

Improving Delivery of News and Information to Geographic Communities

In small towns and rural America, the local newspaper is more than just a source of information and an engine of commerce.  More importantly, it fosters and builds geographic community and sets the agenda for public policy debate.  This project will foster civic and community engagement — first, by forming a network of knowledgeable volunteer citizen-journalists, and also, by making public records readily available and organized to support decision-making and accountability at all levels of government.

Unmet Needs

In many cases, data that is readily available in GeoRSS or at least CSV format from big cities (such as this example from San Francisco) is simply not available even in print from rural governments. For example, journalism students at the University of North Carolina working last semester to gather and organize public records in two rural counties for an OpenBlock application met with a number of obstacles (which they describe in their blogs) – ranging from significant photocopying fees to inappropriate redactions and denial of access to public information.

Even when acquisition of public datasets is relatively simple – for example, public health restaurant inspections — someone must request that data from a specific county be exported in fielded data format. It is inefficient for each rural news organization to make separate requests for this data in each of North Carolina’s 100  counties. In these cases, our public records coop would outline an initial request for the data for each county.

What’s New?

Currently there is no tool or service that can efficiently gather, format and publish public records on rural news organizations’ sites. In part, this is a technology problem that may soon be overcome with the alpha rollout of OpenBlock later in 2011. But a much bigger piece of the problem is the data itself – neither OpenBlock nor any other technology has the ability to obtain public records as fielded digital data and create a newsworthy user interface for all the various types of records a news organization might need.

Without a project like this there is no indication that OpenBlock will be a viable option for papers in rural communities.

What Will Change?

By the end of the project, we will have

•          at least one member-owner in each county in North Carolina

•          at least 12 news organizations subscribing to the service

•          at least one type of schema for which we’ve collected data from each county

Most importantly, we will have raised public awareness of open government and we will start seeing rural counties and towns publish public data in standardized, machine-readable formats on the Web.

What tasks/benchmarks need to be accomplished to develop your project and by when will you complete them?

How will you measure progress?

Do you see any risk in the development of your project?

How will people learn about what you are doing?

Is this a one-time experiment or do you think it will continue after the grant?

Copyediting and Computer Code

Every now and again I’ll find myself in a conversation with copyeditors about the future of their craft. One point I often bring up is that a big part of the job in online newsrooms needs to be overall QA of the site. And one of the most challenging workflows to support that is the copyediting of computer code. The example I always use to illustrate the point is the AP style on state abbreviations. If the Web developers define the abbreviation for California as “CA” instead of “Calif.” … well that’s something that should stick in the craw of every copyeditor until the code gets changed.

And now I have an actual piece of code to illustrate the example. (This comes from the code that runs OpenBlock — the much awaited open-source version of Adrian Holovaty’s EveryBlock. This isn’t meant to pick on that community. They’re doing difficult and needed work. And this could happen anywhere… which makes it a good anecdote.)

What’s the workflow in your newsroom for making sure that this gets changed to “Reporting Officers’ Names” before launch? Should the designers give editors a mock-up of all the static text elements (including words-as-graphics) on the page? Should the developers give editors printouts of all the tables that contain datafields that might get on the live site? Or do you just publish and come up with some sort of sampling scenario?

How does it work in your newsroom? How should it?


Ryan Thornburg is the author of the new online journalism textbook and newsroom manual, Producing Online News, available from CQPress.com

Job Post: OpenBlock, Django and Community Newspapers

Request for Proposals:
Specifications for Community News Tool Using Python and Django.

The School of Journalism & Mass Communication at the University of North Carolina at Chapel Hill, with funding from the McCormick Foundation, is developing business models and editorial products to help community newspapers transition to the digital age.

We are seeking someone who has experience with Python and the Django Web development framework to install a Django application called OpenBlock on a Web server and write a report that details the technical challenges, specifications and scope required for integrating OpenBlock into newspaper websites hosted by TownNews.com. The report would also propose potential alternatives that would be more efficient than using OpenBlock.

In order to write the report, the person we hire will need to perform these tasks:
1. Install the OpenBlock application on a server, and become familiar with its codebase.

2. Identify technical specifications for transforming data formats given to students by city and county government into geo-coded data formats optimized for use in OpenBlock. (See https://developer.openblockproject.org/wiki/Ideal%20Feed%20Formats) These technical specs might include the technical specs for building a site scraper (See https://developer.openblockproject.org/wiki/ScraperScripts) to retrieve the data, a feed parser or a program to impute the latitude and longitude of data types that are vaguely described in their original format from the government.

3. Identify high-level technical specifications for integrating an OpenBlock installation with the CSS styles, site navigation and URL structure of the news organizations so that users and search engines perceive the TownNews.com content and the OpenBlock content as a single site.

4. Contribute findings back to the OpenBlock project developers wiki at https://developer.openblockproject.org/wiki
We intend to select a candidate by December 1. The project would start immediately upon selection.

Please e-mail your proposals – including a proposed timeline, cost bid, resume, cover-letter and three references — to Christine Shia at shia AT email DOT unc DOT edu. Please include “Proposal – OpenBlock RFP” in your subject line.

Questions about this RFP can be addressed to Assistant Professor Ryan Thornburg at 919-962-4080 or ryan DOT thornburg AT unc DOT edu. Please include: “Query – OpenBlock RFP” in your subject line.