To train and use machine learning methods, you need lots of data to start with. Very significant amounts. Here Google releases much training data. In the work I have done in this space, you almost never have enough. And in real contexts, you need more over time to detect drift from your solutions. Initially via DSC. See their post for more on data resources and methods.
Google releases massive visual databases for machine learning in Engadget by Richard Lawler.
Millions of images and YouTube videos, linked and tagged to teach computers what a spoon is.
It seems like we hear about a new breakthrough using machine learning nearly every day, but it's not easy. In order to fine-tune algorithms that recognize and predict patterns in data, you need to feed them massive amounts of already-tagged information to test and learn from. For researchers, that's where two recently-released archives from Google will come in. Joining other high-quality datasets, Open Images and YouTube8-M provide millions of annotated links for researchers to train their processes on. .... "
Monday, July 10, 2017
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment