We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Depth based enlarged temporal dimension of 3D deep convolutional network for activity recognition.
- Authors
Singh, Roshan; Dhillon, Jagwinder Kaur; Kushwaha, Alok Kumar Singh; Srivastava, Rajeev
- Abstract
An activity takes many seconds to complete which makes it a spatiotemporal structure. Many contemporary techniques tried to learn activity representation using convolutional neural network from such structures to recognize activities from videos. Nevertheless, these representation failed to learn complete activity because they utilized very few video frames for learning. In this work we use raw depth sequences considering its capabilities to record geometric information of objects and apply proposed enlarged time dimension convolution to learn features. Due to these properties, depth sequences are more discriminatory and insensitive to lighting changes as compared to RGB video. As we use raw depth data, time to do preprocessing are also saved. The 3 dimensional space-time filters have been used over increased time dimension for feature learning. Experimental results demonstrated that by lengthening the temporal resolution over raw depth data, accuracy of activity recognition has been improved significantly. We also studied the impact of different spatial resolution and conclude that accuracy stabilizes at larger spatial sizes. We shows the state-of-the-art results on three human activity recognition depth datasets: NTU-RGB + D, MSRAction3D and MSRDailyActivity3D.
- Subjects
ARTIFICIAL neural networks; HUMAN activity recognition; MACHINE learning; DIMENSIONS; BONE lengthening (Orthopedics)
- Publication
Multimedia Tools & Applications, 2019, Vol 78, Issue 21, p30599
- ISSN
1380-7501
- Publication type
Article
- DOI
10.1007/s11042-018-6425-3