Joshua Tauberer’s Curriculum Vitae

Joshua Ian Tauberer, Ph.D.

Washington, DC

Professional Experience

2010– POPVOX co-founder and chief technology officer
POPVOX.com is a disruptive online advocacy platform focusing on issues before the U.S. Congress, co-founded by Marci Harris, Rachna Choudhry, and me. As chief technology officer, I manage the development of our web and mobile products and the technology team, and I currently do most of the coding.
2003– GovTrack.us/Civic Impulse, LLC founder/owner
GovTrack.us is a reference and legislative tracking tool for bills before the U.S. Congress. GovTrack spurred the worldwide open government data movement. The website was incorporated into Civic Impulse, LLC, of which I am the sole member, in 2009 for a short-lived pilot project involving a small team of freelance writers. I currently manage occasional contractors. The website runs advertising and is profitable.
2004–2010 LARSA, Inc. director of software architecture
LARSA, Inc. develops desktop software for structural engineers, primarily used for bridge and earthquake analysis and design. As the director of software architecture, I led a rewrite of the user interface and an overhaul of the analysis backend, which included a significant amount of algorithm design and applied linear algebra, and I also supervised the creation of printed material and technical documentation. I have an ongoing consulting relationship. From 2000 to 2004 I was a software developer.
2007 The Open House Project co-author
This collaboration among open government advocates and professionals made a recommendation to then-Speaker of the House Nancy Pelosi on how to better use technology to further Congressional transparency.
2006 XML.com freelance writer
During this time I wrote three articles for XML.com on the semantic web, including a well-regarded technical introduction to the field (“What is RDF”), under the heading “Hacking Congress.”
2003 The Daily Princetonian features editor
I was the last "executive editor for Page 3".

Education

2010 Ph.D., University of Pennsylvania (Linguistics)
My doctoral dissertation, Learning [voice], investigated the phonetics-phonology interface of the so-called voice contrast through large-scale corpus studies with a focus on infant speech. The dissertation made use of machine learning algorithms (k-means, SVMs), maximum likelihood estimation, and large-scale data analysis. Teaching assistant responsibilities included Introduction to Cognitive Science, Introduction to Linguistics, and graduate-level Introduction to Phonetics. Service included several years editing the U. Penn Working Papers in Linguistics.
2008 M.A., University of Pennsylvania (Linguistics)
My master's thesis, Learning in the Face of Infidelity: Evaluating the Robust Interpretive Parsing/Constraint Demotion Model of Optimality Theory Language Acquisition, investigated the accuracy of a machine learning algorithm for language acquisition.
2004 A.B., Princeton University (Psychology)
My senior thesis, Crossing Bridges in Discourse Representation, was a psycholinguistic experiment investigating presuppositions. I also earned certificates (minors) in linguistics and applications of computing.
2000 Plainview-Old Bethpage John F. Kennedy High School, Plainview, New York

Publications

Technology (government transparency, semantic web)

Linguistics (academic publications)

Invited Talks / Media Appearances

Additional Presentations and Manuscripts

Press Clips (selected)

Honors

Professional Service

Additional Technology Projects

2007– OpenGovData.org participant and site maintainer
Formed out of a 2007 meeting of open government activists and professionals, this site presents the Eight Principles of Open Government Data and lists related information.
2005–2009 SemWeb .NET Library creator
An open source .NET library written in C# for working with RDF data for the Semantic Web. It is used in the GNOME application F-Spot, and possibly elsewhere.
2009 New Jersey Gang Survey Viewer co-creator/organizer
This is a visualization tool for the New Jersey State Police Street Gang Survey 2007 developed by five volunteers in Philadelphia over the course of a weekend in December 2009, as part of the Great American Hackathon.
2009 FlyOnTime.us co-creator
This entry for Sunlight Foundation's Apps for America contest, in collaboration with Josh Sulkin, is a mash-up of airline on-time flight statistics from the FAA with historical weather data from the NOAA. Mentions: White House open gov status report, The New York Times (3/12/11), NPR (3/14/10), The Washington Post (7/21/09), The Politico (6/24/09).
2007–2008 Praat-Py creator
This is an extension to the Praat program for phonetic analysis that allows scripts to be written in Python.
2008 U.S. Securities and Exchange Commission Corporate Ownership RDF Data
A semantic web RDF database of SEC data.
2006–2007 The Penn Lambda Calculator co-creator
This is a linguistic semantics pedagogical tool made in conjunction with Lucas Champollion and Maribel Romero.
2004–2007 Sender Verification Extension for Thunderbird creator
A Mozilla Thunderbird extension for verifying the domain name claimed in the From: address of emails using SPF, as a tool to combat phishing. Downloaded around 150,000 times.
2007 U.S. Census RDF Dataset creator
A 1-billion-triple RDF database of U.S. Census statistics, at the time the largest open, linked, and dereferenceable RDF database of real-world information.
1999 Webcytology co-creator
My first major web project (I was in high school), this was a winning entry in the 1999 ThinkQuest competition. It featured a cellular automata simulation, inspired by Conway's Game of Life, where users would design organisms with different biologically inspired properties. In collaboration with Andrew Kallem.