Beta Workshop Success

We just held our second, BETA, workshop to test out the DataBasic tools and activities.  Our invitation brought a wonderfully diverse set of journalists, students, community organizers, educators, and folks that work in the arts!  We had to limit attendance to 40 people, simply due to physical limitations in the room.


We gathered another round of invaluable feedback, documented some more of initial uses, brainstormed potential applications, and had a bunch of fun!  Here are two quick drawings participants made, comparing the lyrics that various musicians use with WordCounter and SameDiff:

IMG_2121 IMG_2140

BETA Workshop coming up on 12/8

We’re hosting a BETA workshop on our DataBasic suite of tools on Tues 12/8, from 6-8pm. This workshop is designed for journalists, educators and community organizations that are just starting to work with data. Register on our evenbrite page:

Opening Shot

The tools focus on understanding what is in a CSV file, and also starting to analyze large sets of text data in quantitative ways. We introduce each with a fun, hands-on activity, so it isn’t just staring at screens all evening 🙂 Read the invite for more about why you might want to attend. We’d appreciate your help testing these tools out and want to get more feedback from real folks before we launch them publicly!

(Plus free dinner!)


Video Shoot

Sometimes online tools for working with data can be confusing and overwhelming when you first visit them.  One way we can to try to address this is by having short, friendly introductory videos to tell you why you might want to use each tool.  We wrote some scripts, found some clothes that match the logos, and started shooting video intros for each of the three tools.

They are in post-production now, but you’ll be able to watch them soon on the homepage of each tool in our suite!  Here’s some photos to whet your appetite.


Trying to look casual is hard!


Haven’t had to do this much memorizing since grade school


What is DataBasic All About?

DataBasic is a suite of focused and simple tools and activities for journalists, data journalism classrooms and community advocacy groups.  We’re happy to announce that we’ve received funding from the Knight Foundation to build and test DataBasic over the next 6 months!


What is DataBasic?

Though there are numerous data analysis and visualization tools for novices there are some significant gaps that we have identified through prior research. DataBasic is designed to fill these gaps for people who do not know how to code and provide a low barrier to further learning about data analysis for storytelling.

In the first iteration of this project we will build three tools, develop three training activities and run one workshop with journalists and students for feedback. The three tools include:

  • WTFcsv: A web application that takes as input a CSV file and returns a summary of the fields, their data type, their range, and basic descriptive statistics. This is a prettier version of R’s “summary” command and aids at the outset of the data analysis process.
  • WordCounter: A basic word counting tool that takes unstructured text as input and returns word frequency, bigrams (two-word phrases) and trigrams (three-word phrases)
  • SameDiff: A tool that gives you various ways to compare two text documents, to see how they are similar and/or different.

More importantly, we’ll be providing an introductory video and simple training activities for each tool as a way to scaffold learning about data analysis at the same time as doing it. These activities will include fun datasets to start off with, and introduce vocabulary terms and the algorithms at work behind the scenes.  We strongly believe in building tools for learners, and will be putting those ideas into practice on these tools and activities.

Who is Building This?

Catherine D’Ignazio is an Assistant Professor of Data Visualization and Civic Media at Emerson College and a Fellow at the Engagement Lab. She has a background in software development, media analysis and the arts and currently teaches journalism students.

Rahul Bhargava is a Research Scientist at the MIT Center for Civic Media. He works in quantitative media analysis and leads data literacy workshops for students and community groups.

Is it Ready Yet?

We are still developing the first prototypes so we can try them out with folks. Expect to see more updates here as we build them out over the fall.