To better understand the organization of the brain and the perceptual tendencies in humans, a team of four scientists is recording video from four head-mounted cameras – along with eye tracking and head movement – and assembling a massive database of more than 240 hours of first-person video that can be used by researchers everywhere.
“The brain is adapted to the world around us, but we don’t have good data on what the world actually looks like to human observers,” Mark Lescroart, an assistant professor and neuroscientist in the psychology department at the University of Nevada, Reno, said. “There are no collections of videos that sample the world the way that humans do – Hollywood cinematographers don’t zip the cameras around as fast as human eyes move, so movies don’t really reflect the way we take in the world.”
The team of neuroscientists and social scientists is setting out to build a visual database that more accurately reflects human visual experience. They will create a vast gallery of videos showing what people see as they go about their daily activities. Their Visual Experience Database will support future research in fields that rely on the analysis and recognition of images, including neuroscience, vision science, cognitive science, artificial intelligence and possibly the digital humanities and art.
To gather the videos, the scientists designed a headset-style camera system. While early versions look like a prototype for a Borg device, the team is streamlining the system to keep the weight down and improve wearability. It has two cameras facing forward to capture the scene and two cameras facing the eyes to track eye movement. Each of the four labs participating in the research will have five headsets.
The subjects wearing the headsets for the visual data gathering will range in age from 5 to 70 years old. They’ll go to a variety of spaces and engage in many activities: visiting museums and libraries, shopping, commuting, riding bikes, walking. The research team will analyze cues for 3D space perception and how people commonly experience walls, corners, landmarks and other built structures in the world.
“We wanted to use mini-computers, but they weren’t robust enough to handle our needs, so we ended up with a laptop in a backpack. It makes the headset a little more user-friendly, so our subjects wouldn’t be distracted by the tech,” Paul MacNeilage, assistant professor and neuroscientist in the College of Science at the University of Nevada, Reno, said. “We decided to go with a Pupil Labs product for the base and added devices to it. We didn’t want it to be too distracting for others, either.”
The system must collect GPS data, run four cameras, record three video streams at once, run its recording software, draw on a decent power supply, and read an internal motion sensor and accelerometer. The technology is much more involved than the stationary devices typically used for eye-tracking studies in the lab, which rely on a chin rest and a display.
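To give a concrete sense of what tying those streams together involves, here is a minimal sketch of one common approach: stamping every sample from every sensor against a single shared clock so the recordings can be aligned afterward. It is a hypothetical illustration, not the team’s actual recording software, and the stream names and sampling rates in it are assumptions.

```python
# A minimal sketch (not the project's real software) of logging multiple
# sensor streams against one shared clock so that video, eye tracking,
# GPS and motion data can be aligned later. Stream names and rates below
# are illustrative assumptions only.
import csv
import time

STREAMS = {
    "world_video": 30,   # forward-facing scene cameras (Hz, assumed)
    "eye_video": 120,    # eye-tracking cameras (Hz, assumed)
    "imu": 200,          # motion sensor / accelerometer (Hz, assumed)
    "gps": 1,            # position fixes (Hz, assumed)
}

def log_sample(writer, stream_name, payload):
    """Write one sample tagged with a shared monotonic timestamp."""
    writer.writerow([f"{time.monotonic():.6f}", stream_name, payload])

with open("session_log.csv", "w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["timestamp_s", "stream", "payload"])
    # A real recorder would poll each sensor in its own thread or process;
    # here we simply log one placeholder sample per stream.
    for name in STREAMS:
        log_sample(writer, name, "sample_placeholder")
```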
“This is definitely not the same type of eye movement study that is done in the lab, with a chin rest and a display showing pictures of the environment,” MacNeilage said. “This is out in the real world with people interacting with their environment.”
This is especially important to MacNeilage, who runs the Self Motion Lab at the University of Nevada, Reno, where graduate students are involved in the groundbreaking research.
Measuring head and body movement
“The system measures head and body movement through space,” he said. “This allows us to reconstruct visual input moment to moment and get insights on sensory-motor control. No existing database includes head motion.”
MacNeilage, Lescroart and graduate student Christian Sinnott tried out the headset, taking it outdoors. They got a few weird looks from those they passed on the street.
“This is like research in the wild,” Lescroart said. “Walking across campus and looking for objects will let us see how behavior changes when trying to navigate our environment. We’ll be able to ask ‘how does info we select change depending on tasks?’ We’ll see what it looks like when people move their heads and walk.”
“We’re building a first-person view video database to provide visual data that is more in line with human experience, which should help artificial intelligence make human-like decisions,” MacNeilage said. “We aim to find out what it looks like when people walk and move their heads to navigate through the world. We use a simple paradigm: finding out how people sample the visual environment with their eyes.”
Artificial intelligence
Another use for the visual database is informing artificial intelligence. While artificial intelligence is coming to the forefront of technology, it works only as well as the data it is trained on. Instead of systems trained solely on photos and videos curated from the internet, the team of scientists sees its database as a new source of more representative data.
“We aim to build our database with biases based on human perception,” MacNeilage said. Other visual databases used for AI have built-in biases. For example, photographers frame particular objects for particular purposes, whether for a commercial or a documentary, and security cameras record from fixed positions. “So scientists need more and better databases. Our database will have biases consistent with human behavior, which could be an advantage for AI.”
Current artificial intelligence systems that recognize visual content require millions of training examples to achieve good performance. The databases used to train such systems often draw photos and videos from the internet and thus do not represent the content that humans see on a daily basis. This new database will introduce human-centered biases into AI systems, which could have significant, positive implications for AI-based applications such as self-driving cars.
Across all data sources, the team anticipates generating approximately 80 GB of data per recording hour, for a total of about 20 terabytes of raw data.
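Those figures are easy to sanity-check: 240 hours at 80 GB per hour comes to roughly 19 terabytes, in line with the 20-terabyte estimate. A back-of-the-envelope sketch using only the numbers quoted in this article:

```python
# Back-of-the-envelope check of the storage estimate, using only the figures
# quoted in this article (80 GB per recording hour, 240 hours of video).
GB_PER_HOUR = 80
HOURS_PLANNED = 240

total_tb = GB_PER_HOUR * HOURS_PLANNED / 1000
print(f"Estimated raw data: {total_tb:.1f} TB")  # 19.2 TB, i.e. roughly 20 TB
```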
All data will be uploaded and stored in a central location on the University of Nevada, Reno’s high-performance "Pronghorn" cluster of computer servers, housed in the Tahoe Reno Data Center, a state-of-the-art facility operated by Switch, a company headquartered in Las Vegas, Nevada.
From the 20 terabytes of raw data, the team will catalog at least 240 hours of head-centered video, including the accompanying head and eye movements. They will make these data freely available to the public, enabling scientists, historians and even artists to benefit from the rich resource.
The team includes lead researcher Michelle Greene, an assistant professor of neuroscience and computational vision scientist from Bates College in Maine; Lescroart; MacNeilage; and Benjamin Balas, a visual and cognitive neuroscientist and associate professor of psychology at North Dakota State University.
The project will extend MacNeilage's investigations of head movements in natural environments, Lescroart's work on structural scene representations, Greene's studies of the statistics of object location, co-occurrence and visual search, and Balas' research on environmental effects on the development of perception. The database as a whole will also provide a highly valuable source of stimuli for ongoing fMRI experiments.