Knight News Challenge Proposal: Crowdsourcing Data to Bring OpenBlock to Rural America

At the top of my To Do List this week is the completion of one of the proposals I’ve submitted to the Knight News Challenge this year. I’m posting it here in the hope that you’ll have some feedback on whether/how a service like this would be technically feasible, editorially useful and financially viable. I’m especially interested in hearing from editors of small papers, public records experts, civic/community organizers and anyone who’s worked on the OpenBlock code.

Under what conditions would you volunteer to help a project like this in your community? News organizations: how much would you pay for a service like this? What characteristics would it need to have to make it worth your money? What else do you see here that needs further clarification?

(And a big hat-tip here to Penny Abernathy, the Knight Chair in Digital Media Economics here at the UNC-Chapel Hill School of Journalism and Mass Communication. She got this project kicked off with a grant from the McCormick Foundation and is my co-pilot on this application.)

Here’s our draft pitch:

Crowdsourcing Data to Bring OpenBlock to Rural America

This project would create a co-op to develop and deploy public records databases at news organizations, especially those serving communities of fewer than 75,000 people, preparing those records for presentation and integration in an OpenBlock format.

These rural news organizations are struggling to move to the digital age in part because their staffs are so small they don’t have the capacity to identify, digitize, re-aggregate and map all the various public records available at the state and local level into databases that can be accessed intelligently by both reporters and the reading public.

The project would tackle the lack of capacity at rural papers from two directions. It would create a centralized repository of state, county and city schemas and datafeeds that could be easily used in OpenBlock. This is a job well suited for a small group of experts. In addition, the project would create a statewide corps of amateur data-checkers and records requesters. Data quality assurance and data gathering are jobs well suited for a crowd of many people, each working on a small piece of the puzzle.

These volunteer citizen-journalists would actually be member-owners of a co-op business. Each task they perform would earn them additional shares in the company’s annual profits. We would generate revenue by charging rural newspapers a fee. The more records and the better their accuracy, the more news organizations would sign on for the service.

In some cases, volunteers would pick up CDs of data from county offices. In others, volunteers would scan and upload PDFs of hand-written police incident reports. In still other cases, people would key into a database the information on those PDFs. This job is so big that no single small news organization could do it. But with a corps of member-owners working together, we could create a model for gathering valuable public records from rural America. To individual communities, these records are necessary to foster an informed civic dialog and healthy economy. But in aggregate, these records may also be able to shed light on trends in rural America that would otherwise go unreported.

Improving Delivery of News and Information to Geographic Communities

In small towns and rural America, the local newspaper is more than just a source of information and an engine of commerce.  More importantly, it fosters and builds geographic community and sets the agenda for public policy debate.  This project will foster civic and community engagement — first, by forming a network of knowledgeable volunteer citizen-journalists, and also, by making public records readily available and organized to support decision-making and accountability at all levels of government.

Unmet Needs

In many cases, data that is readily available in GeoRSS or at least CSV format from big cities (such as this example from San Francisco) is simply not available even in print from rural governments. For example, journalism students at the University of North Carolina working last semester to gather and organize public records in two rural counties for an OpenBlock application met with a number of obstacles (which they describe in their blogs) – ranging from significant photocopying fees to inappropriate redactions and denial of access to public information.

Even when acquisition of public datasets is relatively simple (for example, public health restaurant inspections), someone must request that data from a specific county be exported in fielded data format. It is inefficient for each rural news organization to make separate requests for this data in each of North Carolina’s 100 counties. In these cases, our public records co-op would outline an initial request for the data for each county.

What’s New?

Currently there is no tool or service that can efficiently gather, format and publish public records on rural news organizations’ sites. In part, this is a technology problem that may soon be overcome with the alpha rollout of OpenBlock later in 2011. But a much bigger piece of the problem is the data itself – neither OpenBlock nor any other technology has the ability to obtain public records as fielded digital data and create a newsworthy user interface for all the various types of records a news organization might need.

Without a project like this there is no indication that OpenBlock will be a viable option for papers in rural communities.

What Will Change?

By the end of the project, we will have:

• at least one member-owner in each county in North Carolina

• at least 12 news organizations subscribing to the service

• at least one type of schema for which we’ve collected data from each county

Most importantly, we will have raised public awareness of open government and we will start seeing rural counties and towns publish public data in standardized, machine-readable formats on the Web.

What tasks/benchmarks need to be accomplished to develop your project and by when will you complete them?

How will you measure progress?

Do you see any risk in the development of your project?

How will people learn about what you are doing?

Is this a one-time experiment or do you think it will continue after the grant?

Welcome to JOMC 491: Public Affairs Reporting for New Media

With only a few weeks left before the start of the fall semester, I wanted to quickly give registered and prospective students a little bit of an idea about what we’ll be doing in Public Affairs Reporting for New Media this semester. Seats are still available, so act now!

The goal of the class will be to develop a new online editorial product for the newspapers in Whiteville and Washington, N.C., that will help them be a comprehensive and highly engaging source of news and information for their communities. (Perhaps something like Everyblock.com.)

So, the first thing to know about the class is that you will be expected to go to those cities — both about 2.5 hours from Chapel Hill — at least once and probably more during the semester. I’ll pick up the tab for your trips, but you will need to arrange your own transportation and schedule.

The reason we’ll be working with these two towns is that they are part of a larger effort being led by Knight journalism professor Penny Abernathy and funded by The McCormick Foundation (founding family of The Chicago Tribune) aimed at helping small newspapers make a financially sound transition to a digital economy.

So do you need to know anything about computer programming, or media economics or news reporting and editing? Not really, but you’ll probably be much better off if you’ve had exposure to at least one of those topics. If you haven’t then you’ll need to rely on your own curiosity, self-motivation and time commitment to ensure your success and happiness in the course.

The class is going to be structured unlike probably any other class you’ve taken at Carolina. First, it has the experiential service-learning component. That means less reading and note-taking from lectures. It means more class discussion and hands-on group projects. My goal is for this class to teach you, as much as anything else, how to clearly articulate and creatively solve messy, complex real-world problems. To do that, we’ll be using the context of improving public affairs reporting for the people of North Carolina by using new digital news tools and concepts.

What will you do in the class?
The first half of the class will be an introduction to the problem with the second half focused on trying out different solutions. In class, we’ll be discussing articles, brainstorming and prototyping (making models that can give us a better idea of how people might use our website). Outside of class, you’ll be keeping a 2x/week blog of reflections, reading articles, and working in groups to figure out what barriers stand in our way of building a great site and then figuring out for yourself how you will overcome those barriers. I promise to be your guide.

How will you be graded?
30% – You’ll launch your own blog and update it twice a week. Some weeks I will give you specific assignments (write a descriptive report about Whiteville, discuss the readings, etc.) but most of the time you’ll simply write about your experiences.

30% – Prototyping. In many classes, you may have been asked to write or create one big final project that demonstrates your knowledge of what you learned. But in this class, you’ll practice the art of “fertile failure” — trying a lot of ideas, making a lot of mistakes and learning from them. You will be rewarded for failing fast and failing smart. We will use everything from toothpicks to MySQL to build our prototypes. You’ll start by using the materials with which you’re comfortable and end the semester by using tools that terrified you just three months earlier. These will be different tools for each student.

30% – Participation. Come to every class with a lot of questions, fulfill your service obligation and participate in online discussions outside of class.

10% – Data management and public records assignments. A big part of our prototyping and brainstorming will be around how to obtain public records and make them useable in an online database. You’ll have a few projects to get you familiar with the basics of the technology and issues surrounding this topic.

I hope that gives you a rough idea of the class. I’ll be posting a full syllabus and calendar soon. But in the meanwhile, enjoy the rest of your summer and let me know if you have any questions.

Best,
Ryan