Tuesday, January 31, 2012

Chaos of Unstructured Data

From O'Reilly Radar: thoughts from a practicing CTO on the chaos of unstructured data.

"... The heart of data science is designing instruments to turn signals from the real world into actionable information. Fighting the data providers to give you those signals in a convenient form is a losing battle, so the key to success is getting comfortable with messy requirements and chaotic inputs. As an engineer, this can feel like a deal with the devil, as you have to accept error and uncertainty in your results. But the alternative is no results at all. ..."

