In Kdnuggets a good description and visualization of data management needed for data science.
Everything a Data Scientist Should Know About Data Management
For full-stack data science mastery, you must understand data management along with all the bells and whistles of machine learning. This high-level overview is a road map for the history and current state of the expansive options for data storage and infrastructure solutions. By Phoebe Wong and Robert Bennett.
To be a real “full-stack” data scientist, or what many bloggers and employers call a “unicorn,” you have to master every step of the data science process — all the way from storing your data, to putting your finished product (typically a predictive model) in production. But the bulk of data science training focuses on machine/deep learning techniques; data management knowledge is often treated as an afterthought. Data science students usually learn modeling skills with processed and cleaned data in text files stored on their laptop, ignoring how the data sausage is made. ... "
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment