New Technique Identifies Human Actions in Videos

NBC News Clone summarizes the latest on: New Technique Identifies Human Actions Videos N121786 - Technology and Innovation | NBC News Clone. This article is rewritten and presented in a simplified tone for a better reader experience.

A technique for intelligently picking human actions out of videos shares many similarities with understanding everyday language.
The system was able to identify the parts of the weight-lifting event.
The system was able to identify the parts of the weight-lifting event.Hamed Pirsiavash

Research at MIT has produced a system that can intelligently pick human actions and movements out of videos. What makes it possible isn't some high-resolution sensor, but a clever way of applying language rules to images.

Let's say a computer's task is to identify, in a few minutes of video, a person making a sandwich. Of course the person must take out the bread, spread the mayo and add the ingredients, but perhaps not in that order. Do they slice the tomato first? Do they put the bread away in the middle? Such variance in human activity makes identifying such simple things difficult.

Hamed Pirsiavash, of MIT's Computer Science and Artificial Intelligence Laboratory, took a unique approach. By using algorithms normally applied to understanding human language, he was able to improve the quality of a system for understanding human actions.

You might say "he walked to the store" or "he went to the shop," but generally you can't avoid saying "he" or "to" when talking about a man going somewhere. Pirsiavash applied this logic to computer vision: Now the computer knows that whether a person does tomato or lettuce first, they always take out bread before spreading the mayo, and the second slice always happens last.

The system was able to identify the parts of the weight-lifting event.
The system was able to identify the parts of the weight-lifting event.Hamed Pirsiavash

Using this technique, his system (after being "trained" on more structured video) was able to watch video of Olympic contestants and identify what event was shown, based on its individual portions: the run, release and throw of the javelin, for instance — even if certain portions are obscured or not pictured.

Such a system could be used to watch for actions on video feeds, from alerting medics if someone collapses to monitoring athletic training or physical rehabilitation.

Pirsiavash told NBC News in an email that the field of computer vision is blowing up as processors speed up and more data is made available — but if his research is any indication, it takes more than raw computing power to make sense of it.

×
AdBlock Detected!
Please disable it to support our content.

Related Articles

Donald Trump Presidency Updates - Politics and Government | NBC News Clone | Inflation Rates 2025 Analysis - Business and Economy | NBC News Clone | Latest Vaccine Developments - Health and Medicine | NBC News Clone | Ukraine Russia Conflict Updates - World News | NBC News Clone | Openai Chatgpt News - Technology and Innovation | NBC News Clone | 2024 Paris Games Highlights - Sports and Recreation | NBC News Clone | Extreme Weather Events - Weather and Climate | NBC News Clone | Hollywood Updates - Entertainment and Celebrity | NBC News Clone | Government Transparency - Investigations and Analysis | NBC News Clone | Community Stories - Local News and Communities | NBC News Clone