Open Government Data and Beer Analytics

🏛 + 🍺

Open Data Science Conference | East

Jasmine Dumas | @jasdumas | jasdumas.github.io

Friday, May 5th, 2017

Hi Boston!

Why should you care open government data?

To foster and improve data literacy & statistical comprehension within our community, Scientists and Engineers need to advocate for data that is provided in an consistent and accessible method suitable for analysis.

What this presentation is about (and not about)

2017 has been an interesting time to be a Data Scientist…

What even is, beer analytics?

Discovering analysis-ready beer datasets can be difficult

How to generally search for analysis-ready datasets

The U.S. Government Open Data Portal

The “clearinghouse” for open U.S. government data is located at data.gov. It also contains tools & resources to conduct research, develop web and mobile applications, and design data visualizations.

Examples of analysis-ready datasets

Examples of datasets that are not analysis-ready

Here is the point of all of this…

Just because it’s open doesn’t mean it’s accessible!

Necessity breeds innovation

I developed a R package for beer statistics, called ttbbeer

How I developed the ttbbeer package

Simple web scraping in R for historical tax rates

Insights from the data in the wild

Advocating for open data

https://www.data.gov/issues/request-id/32022/

https://www.data.gov/issues/request-id/32022/

Wrapping things up

Questions & Discussion!

ending slide