Now the wealth of information in YouTube videos—from mischievous cats and impossible stunts to documentaries and commencement speeches—will be available to researchers. The new YouTube-8M dataset includes 8 million YouTube video URLs (representing over 500,000 hours of video) is Google’s newest research breakthrough. The labeled dataset “enables researchers and students without access to big data or big machines to do their research at previously unprecedented scale,” according to Google’s blog. For quality control, they used only public videos with more than 1,000 views and built a vocabulary of entities (for example, from “acoustic guitar” to “Guitar Hero III: Legends of Rock” in the “Guitars” filter in the “Arts and Entertainment” category).
collectionsInnovation FestivalCurrent Issue
World Changing Ideas
New workplaces, new food sources, new medicine--even an entirely new economic system.
The major tech ecosystems that battle for our attention and dollars.
What’s next for hardware, software, and services.
The brave new world of automation, from AI to drones.
How our urban centers are building toward the future.
Most Creative People
See members of our Most Creative People in Business community: leaders who are shaping the future of business in creative ways.
An award-winning team of journalists, designers, and videographers who tell brand stories through Fast Company's distinctive lens.