How do open source data analysis and visualizations by individuals compare to big data projects by data scientists using cutting edge software and infrastructure? Some clue may lie in looking at the impact of blogging on professional journalism. A few years ago I worked for Lycos and helped launch a blogging tool. Blogging was pretty new then and Blogger, Typepad, …
Category Archives: Context
Categorizing Big Data Startups
How to categorize Big Data Companies? These companies are a mix of startups and existing companies. One way to think of them is as follows: Data: Companies creating data, segmentation, content, data overlays and repackaging of existing data. (eg Rapleaf, Dow Jones, National Weather Service) Infrastructure: Companies creating tools to manage, store, deliver, optimize, monitor, or combine …
What is Big Data + Wordclouds
What is Big Data? There seem to be many definitions and descriptions. One way to try an uncover what the term means is through word clouds and phrase clouds. Using ManyEyes, I visualized an article from O’Reilly entitled “What is Big Data?” . Here are the clouds: The tag cloud seems to do a better job of identifying …
Crowdsourcing Soda Flavors + Big Data
A startup in Indianapolis Indiana called Uflavor has a process to create an infinite number of soda flavors from a crowd-sourced process. Inc Magazine just named them one of the three most innovative sites around social media. This presents an interesting problem in data analysis: how to get insights or trends from the large volume of data. The plan is …
What is Big Data?
What exactly is “big data”? Two interesting posts on the definition. First Andrew Brust on ZDNet offers a common definition: Big Data is about the technologies and practice of handling data sets so large that conventional database management systems cannot handle them efficiently, and sometimes cannot handle them at all. ?He goes on to note that the term …
Big Data BBQ: Incredients, Tools, Techniques, Recipes & Meals
Big Data BBQ will focus on the emergence of big data — large volumes of data, ways to analyze and find meaning in them, and methods of visualizing and reporting on data to make core business decisions. While it’s not a technical blog, at times it will look at the technical infrastructure if it impacts combining data and …