Sunday, September 28, 2014

Farmers Markets of Boston

Who doesn't love a good farmers market? Aside from the obvious benefits of access to freshly grown or recently homemade foods, I find the little details of the farmers market most rewarding. The dirt-stained hands and clothes of individuals who don't appear dirty all but rather more wholesome as a result of their profession. The early morning calm, the prideful eyes of vendors, and the curious hands and mouths of would-be buyers. Farmers markets somehow seem even more precious to those of us living in the twisted and tangled concrete  jungles that are modern American cities.

To pay homage to all the street corners and public squares in Boston where slow, local food thrives, I've put together the map below. Find the closest market to you. Don't forget to check the dates and times of operation. Some markets are summer or winter only. Also it should be noted that the source of this information was a set of disparate government files from 2013 (the latest available). It was quite a feat to turn them into something useable, the process for which I plan to go over in my next post.

Enjoy.





Wednesday, September 24, 2014

An Introduction to the Glut

As a relatively new convert to the field of data science, I find myself repeatedly and simultaneously excited, impressed, and humbled by the sheer volumes of data available to those willing to dig for it. The almost non-existent latency with which new information is gathered, stored, and disseminated globally is surely not a reality previous generations could have predicted.

Finding meaning in the veritable ever-accumulating glut of data that fills our lives, however, is no small feat. For that reason, I've chosen to chronicle my mining and munging, my scrubbing and scraping, in an attempt to share outcomes that I feel are either meaningful, interesting, or "cool." I hope and fully intend for my exposition of the process to be as intriguing as the outcomes themselves. After all, the more mundane the data set, the more extraordinary and creative of an approach required to transform that data into knowledge.

I plan to use a combination of relevant applications such as Alteryx and Tableau (both growing in popularity and power with each release) for data manipulation and visualization respectively. I fully expect Python and R to make their way into posts as well. I'm of the opinion that the future belongs to those who can both program software and perform statistical analysis. The days are numbered for statisticians incapable of implementing a MapReduce algorithm as well as for programmers able to write the algorithm but fall short when time comes to interpret the true value of the results. For that reason, I find that a combination of the tools above are almost limitless in their ability to acquire, crunch, and convey information: a proper data scientist's toolbox indeed.

Despite being at risk of becoming the often cited "jack of all trades, master of none," I find myself eager to test all the facets of being a modern data scientist before settling down into a deep trough of specific expertise. Hopefully you'll enjoy the escapades documented here as much as I do.