Possible Google philanthropic project for transparency

September 25th, 2009

Google has a philanthropic project underway that might be relevant to us. They are in the process of choosing which projects to pursue, determined by a vote:

http://www.project10tothe100.com/vote.html

Last fall we launched Project 10^100, a call for ideas to change the world by helping as many people as possible. Your response was overwhelming.

Make government more transparent

Create a website that enables people from any country or municipality to easily learn about the workings of their government, and rally their fellow citizens to take action to improve it. Numerous user ideas embraced variations on the theme of governmental transparency, with specific proposals ranging from publishing details of proposed laws and politicians’ voting records to making public budgets searchable online and leveraging social networks to let communities make their voices heard by their representatives by voting on pressing issues.

Suggestions that inspired this idea

1. Create a “govwatch” program that allows people to enter in geographic and other info and get back information about bills/laws that affect them

2. Empower individual voters with both online, real-time data on their political representatives’ activity, and tools to analyze, engage and influence outcomes

3. Increase the transparency of laws, eliminate duplicate ones and communicate them better to affected citizens

4. Share information on how municipalities and states use public funds

Have we forgotten how to have an opinion and still be fair?

September 9th, 2009

Maybe it was never true, but I have this sense that we’ve lost something in American public discourse over the last century. We’ve lost the conception of having an opinion and still being fair. It’s like we just can’t imagine both being true in the same brain. After watching the President’s speech tonight I realized that I feel seriously inhibited in what I say publicly because I want to maintain an impartial image so that people see GovTrack as an impartial source. Am I over concerned? I doubt it. This mistaken concept also underlies “professional journalism”, which is the style of most news operations now, and I think is perhaps the second greatest contributing factor to the downfall of news (after “The Internet”). More on that below.

People often mistake me as a liberal. And others mistake me as a conservative. Here’s a story about someone that did both. I’ve gotten some amusing feedback from people who mistook my GovTrack experiment in collaborative letter writing, for which I delievered an anti-gun-control letter to congressmen, as representing my own views. That couldn’t be further from the truth. If it were up to me, guns would be illegal. I explained this contradiction to someone who wrote me a letter. He said:

Julles: But don’t you see the similarities between what this administration is doing and what was done in Germany in the 30’s?

Then I replied:

Me: I really get personally offended sometimes. To compare a president who is trying to improve health care to a regime that killed however many millions is to belittle the damage and suffering done to anyone that experienced it. Disagree on policy all you want, but don’t belittle one of the world’s greatest tragedies.

And he replied:

Julles: HR3200 is a BAD bill . . . Open your eyes, kid.

(H.R. 3200 is the health care bill.) Expressions about eyes always strike a chord with me. But more to the point, I never told this guy I thought H.R. 3200 was a good bill. And, quite honestly, after the President’s speech tonight, I am not so enamored by where health care reform is going. In particular I wonder about the constitutional authority to require everyone to possess health insurance. I suspect it will be turned into a tax penalty to avoid a straightforward law and side-step constitutional questions.

I don’t have an agenda. But if I have an opinion, I may jeopardize the perception of fairness and accuracy in anything I do in the world of civics. Can I have an opinion and still be trusted to be fair when I put my nonpartisan hat on? I’m not even partisan. I vote Democratic, but so does most everyone else in the places I’ve ever lived. Am I allowed to say that? Have I lost credibility merely for being more open about my views?

And this is what I imagine journalists go through. They vote too, I hope. If they write for the New York Times, they probably live in New York and vote like most New Yorkers. But then they turn off their passion when they put their fingers down to the newsroom keyboard. And we suspend disbelief for a moment as we read their articles that journalists can’t have opinions and be fair at the same time. They make it easy for us to suspend disbelief because they write like they’re dead. No interest in the outcome. They’ve got to write a few words because they need to pay for the electricity that keeps their computers going, but if newspapers paid them to write a summary of the tax law they’d do that too. It doesn’t matter to them, at least as far as we can tell from reading.

This is ridiculous and, worse, counterproductive. I’d be more interested in news if articles pleaded with me that the issue was important, that it isn’t a conceptual exercise but that it even matters to the reporter. This is, apparently, how news used to be 100 to 250 years ago. It’s how the most compelling documentaries and long-form video news segments are today. Of course, it was also not very reliable 100-250 years ago. But I don’t think that dichotomy has to be so today. If we opened ourselves up to the idea that a reporter could have an opinion and still be fair, we wouldn’t need to suspend disbelief. Reporters wouldn’t have to die each time they start writing the next piece.

I don’t want reporters to die. Save the reporters. (Ironic hyperbole.)

GSA social media TOS review

May 29th, 2009

Recently the GSA has been negotiating on behalf of federal agencies special Terms of Service agreements with various social media services like YouTube to allow agencies to make use of these services — some of those agreements are now publicly available. My understanding is that GSA’s negotiations were necessary before agencies could use these services because of legal issues like liability. I’ve reviewed the TOS’s to see whether they address open-government concerns.

The use of non-governmental services like these as part of a governmental function raises several openness issues, which we rehashed in an earlier thread on the use of YouTube by Congress. To summarize, the issues include:

  • whether the service provider meets government web standards including accessibility, privacy, security, nondiscrimination, and archival access to media
  • whether the service providers require members of the public to enter into a contractual agreement with them (i.e. more terms of service) in order to access government content, what the public must agree to, and whether these additional terms with the public restrict what the public can do with and how the public can share government media obtained through the service
  • whether use of the service constitutes an endorsement of a particular brand or technology, or if it provides a significant business advantage to a profit-seeking entity
  • whether the service provides government media in a data format that does not impose technical and legal restrictions on users of the media

I think I need to include a special note about privacy. We can expect that any non-governmental site is going to track their users’ behavior as best they can because of the financial incentives of user-targeted advertising and selling demographic data. I don’t know to what extent any of the services below make use of this, but I expect that this is a major component of the revenue of all of them.

The GSA TOS are amendments to the standard TOS employed by the services. I haven’t read through any of the standard TOS, so of course I might be missing something.

I reviewed the TOS’s with respect to these issues. They had common elements.

No Advertising: The service agrees to not place advertisements on pages with government content (i.e. government “channels” and the like). This addresses part of the concern of endorsement. Of course, services may continue to display their brand name and link users to other parts of the service, so they are still able to promote their business.

No Cookies: The service agrees to not set cookies when a widget is placed *in* a government agency webpage. This means that the service gives up its ability to do the most advanced user tracking in the cases where the user may be unaware that they are even accessing a non-governmental service. The service may still track accesses by IP address which still may provide a more rudimentary means to track users but is more likely to be anonymous.

Closed Captioning: The service will provide the ability for government media to include closed-captioning for videos using industry standard practices, which is of course important for accessibility.

The TOS are linked from here:

https://forum.webcontent.gov/Default.asp?page=TOS_agreements

Here are reviews of each TOS:

AddThis.com

AddThis.com provides a “Social Bookmark & Feed Button Builder”. It’s a widget developers can put on their websites to help users share content on other social sites like Facebook. Because of what AddThis does, there are only a few concerns to be addressed. The TOS addresses both main concerns:

- No Advertising — on government “channels” on the AddThis site
(doesn’t seem to actually apply to AddThis).
- No Cookies — when placed on .gov/.mil websites.

https://forum.webcontent.gov/resource/resmgr/terms_of_service_w_socmed/addthistos_4.30.09_final_uns.pdf

Blip.tv

Blip.tv is a video hosting website, like YouTube. All of the privacy concerns above are relevant. The TOS includes:

- No Cookies — Blip.tv will allow the government to disable parts of its embeddable player that sets cookies.
- Closed Captioning.

Other aspects require some elaboration:

Ads-

The GSA TOS has a confusing section on advertising:

“Blip.tv reserves the right to run advertisements on any page on Blip.tv, but will not run advertisements in-stream or directly adjacent to user videos without the opt-in of the user who uploaded the video.”

It sounds like Blip.tv can place ads on the page, just not directly adjacent to or inside of government media.

Privacy-

The GSA TOS say explicitly that Blip.tv does not collect personally identifiable information about users, but does collect and use demographic data for targeted advertising. Users should expect to be asked demographic data.

https://forum.webcontent.gov/resource/resmgr/Docs/Blip_tv_-_Terms_of_Use_Agree.doc

Blist.com

This is a “social data discovery” tool where users can upload tabular data sets to share. I am actually going to skip a review of this TOS because I expect (or sincerely hope) that tabular data sets are shared with the public directly (as a bulk data download) besides through social tools.

Facebook

Facebook is a social networking site.

The negotiated TOS are not available to the public. Given the likely pervasiveness of the use of all of these tools in the future, it would be a shame if the GSA is facilitating government agencies’ use of third party services that violate the public’s expectations for government web content.

Flickr

Flickr is a photo sharing website. All of the concerns listed above are relevant to Flickr. There are no provisions in the TOS relevant to this review.

https://forum.webcontent.gov/resource/resmgr/Docs/Flickr_TOS_Agreement_Amended.doc

MySpace

MySpace is a social networking site.

The negotiated TOS are not available to the public. Given the likely pervasiveness of the use of all of these tools in the future, it would be a shame if the GSA is facilitating government agencies’ use of third party services that violate the public’s expectations for government web content.

SlideShare

SlideShare is a document (i.e. presentations) sharing tool. The TOS are not posted on the GSA website, but this appears to be a publishing mistake as it notes that the TOS are intended to be publicly available.

Vimeo

Vimeo is a video sharing tool. The TOS has one relevant provision, despite all of the concerns being relevant.

- No Advertising — on government “channels” on the Vimeo site

https://forum.webcontent.gov/resource/resmgr/terms_of_service_w_socmed/vimeo_tos_final_april2009.doc

YouTube

YouTube is a video sharing site.

The negotiated TOS are not available to the public. Given the likely pervasiveness of the use of all of these tools in the future, it would be a shame if the GSA is facilitating government agencies’ use of third party services that violate the public’s expectations for government web content.

Conclusion

While I am encouraged by the GSA’s forward thinking to make use of the latest technologies developed in the private sector, I believe that working with the private sector poses a number of risks to government data, to the public’s privacy and free speech rights, and to good governance. These risks can be minimized and some useful provisions have been included in the negotiated TOS’s along these lines, but far more careful thinking is necessary.

While several of the TOS addressed accessibility and privacy concerns, none of the TOS addressed security, nondiscrimination, archival access to media, the TOS the public are required to enter into to access government content through these services, and web media data formats.

Update: See also this related post.

Open Data is Civic Capital: Best Practices for “Open Government Data”

May 19th, 2009

I frequently see questions like how can I convince my government that open data is important?, and what should I do as a government web manager to make data open?. These and other questions came up at Transparency Camp a few months ago, and at the end of the conference Gunnar Hellekson of Red Hat, and later I, decided to take on the project of bringing together a repository of best-practices guides for technology’s role in an open government. We have a wiki page for the project which lists some of the guides we’d like to see written.

Since the conference I’ve been working on the first guide, Open Data is Civic Capital: Best Practices for “Open Government Data”, which you can read by following the link. The goal was 1) to motivate why open government data isn’t just an ideological issue but actually makes society more powerful, and can really make the world a better place, and 2) to outline some suggested priorities and recommendations for open government data, drawing on the recommendations of a number of past groups (e.g. the 8 Principles of Open Government Data, and others). Thanks for feedback to Gunnar, John Wonderlich, Carl Malamud, Joe Germuska, Kevin Lyons, and David Robinson. (They had a lot of great suggestions many of which I haven’t had the energy to follow through with yet.) The essay begins:

“Creating a well-informed public is a core value of representative government. It is a prerequisite for ensuring the best representatives are elected and a crucial component of government oversight—as well as being important in areas well beyond civics. This document speaks to why public government data (also called ‘public sector information’) is a valuable resource to society if put on the Web and shared freely with the public, and discusses how to go about doing it. We discuss technological considerations and end with sixteen guiding principles for best practices in open government data.”

Kevin Lyons, who works for the Nebraska State Legislature, began work on a best practices guide for the use of the PDF format. When is it appropriate, what to look out for. That’s up on the wiki and I’m sure your suggestions & revisions would be welcome.

Senate votes now in XML (success story)

May 5th, 2009
I couldn’t have said it better:
JohnWonderlich: woot! Go Senate XML! (votes data now posted, in policy reversal) http://bit.ly/Il7hF
I’ve been nagging about this for a while. Big thumbs up to the Senate webmaster for a quick turnaround time too.

The new world of government transparency through technology

April 18th, 2009

The big news lately is that the Center for Responsive Politics opened up their large database of normalized campaign contribution records under a Creative Commons license. I think this is more significant to the world of government transparency & technology than it might appear. Just around five years ago this world was quite different. Organizations like CRP were very much using technology to bring new insight to civics. That hasn’t changed. But organizations saw themselves as solitary entities whose primary mission was to provide a new direct-to-citizen service to the public. A web application, for instance. There’s no need for me to list off other examples — every advocacy and government transparency website was like that, to the best of my recollection. (Except maybe IMSP who seemed to be ahead of the pack.)

All that has changed, and I wish I could pinpoint exactly how that happened. The combination of “Web 2.0″ as a buzz-word and grassroots digital campaigning in 2004 probably had a lot to do with it. The Howard Dean presidential campaign got a boost (at least in terms of publicity if not poll numbers) from developers coming together to specialize the Drupal open source CMS for political campaigning (“CivicSpace”). That sent a message, even if no one quite recognized it at the time, that developers have a role to play in the world of civics and that cooperation was a viable model for getting things done. Not to say that the CivicSpace project invented this — I was working on GovTrack for a few years by that point and across the pond Tom Steinberg and the MySociety group had been thinking about open source civics for even longer. But I suspect, even in my own thinking, that CivicSpace crystalized some vague earlier notions of civic hacking.

The story isn’t over yet, though, because I don’t think any of this alone would have brought us to where we are today. Unfortunately, from this point forward I run the risk of giving too much credit to the things I know about and not enough credit elsewhere. Still, here’s how I see it. Four more things had to happen, independently. First, entrepreneur Mike Klein had to make a lot a lot a lot of money. Second, Dan Newman and David Moore had to build MAPLight.org and OpenCongress.org, respectively. These are, now, and especially were at the start, leading examples of how you can do really cool new things by mixing data sources (for MAPLight, mixing my GovTrack legislation data with campaign contribution data from CRP) or re-mixing data sources (for OpenCongress giving my legislation data a more social spin). Third, John Wonderlich had to start, quite by accident, the Open House Project — this was a crucial step in bridging the technology world with staffers for congressmen, especially with Speaker Pelosi’s office. The fourth bit was that Ellen Miller and Micah Sifry had to put it all together and form the Sunlight Foundation: funding from Mike going to two great technology projects (IMO these are Sunlight’s most important grantees) and a policy arm with teeth because of its pragmatic approach to connecting with policymakers.

That’s pretty much it, because from there things just make sense. Sunlight recruited great staff and steamrolled through the open government world stamping out the idea that each open government group should be in its own little world — by funding interaction, in a sense.

The expectations for government transparency advocacy changed. Groups had to walk the walk a bit more by sharing and collaborating. So now besides CRP’s data being opened up for anyone to remix we have the Taxpayers earmark data, the Sunlight Labs API, the MAPLight API, and probably several more databases. The New York Times API probably owes some of its inspiration to these changing expectations too. So it’s a whole new world now of not just open governenment, and not even open government data, but open government transparency advocacy data. (Is there a catchier name for that?)

Update on bulk data from Congress

April 17th, 2009

One of the Open House Project’s recommendations was that Congress share its legislative data with the public in bulk and I’ve had a long history of posts on the subject. Over at the Free Gov info blog (link), Bob Tapella, Public Printer at the Government Printing Office, tells us that they are responding to this recommendation. He writes in a comment (presumably it is really him):

We have recently been called upon by Congress in the joint explanatory statement on the H.R. 1105, to work with the Library of Congress, including the Congressional Research Service, and the Law Library of Congress, to discuss access to bulk data. Specifically, the language is as follows:

[JT: omitted --- I've posted it before here]

To address this request, a Legislative branch task force has been assembled consisting of representatives from the offices of the Secretary of the Senate, the Clerk of the House, the Library of Congress, Congressional Research Service, the Law Library of Congress, and GPO. This task force has already met and is working to develop a position on access to bulk data. We will look to this work and the review by Congress to help guide our work on making bulk data accessible.

Grin.

Check me out: My talk at Berkeley’s Free Culture Conference last year

April 10th, 2009

Watch a video of my talk at the Free Culture Conference last year on Civic Hacking. (Text and slides here.) It was my best talk yet. I’ve got another good one (if I do say so myself) coming up at CITP’s Studying Society in a Digital World conference in a few weeks at Princeton.

SPARQL OLE DB Provider

April 6th, 2009

Andy Gueritz announced on the mail list for my SemWeb RDF library for .NET that he has created an OLE provider for a SPARQL endpoint that is usable in Microsoft Excel. He wrote,

In a moment of insanity (but a great learning experience), I gave myself the challenge of writing an OLE DB provider for SPARQL. It is built on top of the SemWeb libary which has saved a substantial amount of effort and also brings some powerful functionality to the table very quickly (Thanks, Joshua!)

The provider as constructed implements a readonly OLE provider that supports all four SPARQL query types and interfaces to SemWeb through COM-Callable Wrapper. It is not extensively tested yet but seems to work with most of the queries I have now put through it, and of course being built on SemWeb it is able to read both local and remote SPARQL sources.

Moral of the story: populate Excel tables with SPARQL queries.

More here.

Try hacking for government transparency in GSoC

April 1st, 2009
Posted in Mono | No Comments »

Does the thought of “hacking Congress” entice you? I don’t mean breaking in to U.S. Capitol servers, of course, but putting your l33t hacking skillz to use to improve government transparency and civic engagement. The Sunlight Foundation (I have no affiliation) is a mentoring organization in Google Summer of Code 2009. Check it out.

Shameless plug: HackingCongress.org