A New Data Trove Could Teach Computers to Tell Blind People What They Need to Know In CACM
"Researchers at the University of Texas at Austin (UT Austin) are publishing a database of 31,000 images along with questions and answers about them, and challenging the machine-vision community to use this dataset to train machines as effective assistants for those with visual disabilities.
The dataset comes from the VizWiz application designed by Carnegie Mellon University scientists to help the blind.
The UT Austin team analyzed photos collected by VizWiz, and then presented the images and questions to Amazon's Mechanical Turk workers to supply a short-sentence answer.
A preliminary analysis of the data offers unique insights into the challenges machine vision faces in providing this kind of assistance. ..."