Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Filters








9 Hits in 3.2 sec

Toyota Smarthome Untrimmed: Real-World Untrimmed Videos for Activity Detection [article]

Rui Dai, Srijan Das, Saurav Sharma, Luca Minciullo, Lorenzo Garattoni, Francois Bremond, Gianpiero Francesca
2022 arXiv   pre-print
In this paper, we introduce a new untrimmed daily-living dataset that features several real-world challenges: Toyota Smarthome Untrimmed (TSU).  ...  We provide an analysis of the real-world challenges featured by our dataset, highlighting the open issues for detection algorithms.  ...  Toyota Smarthome Trimmed Vs Untrimmed dataset The Toyota Smarthome Trimmed dataset contains only a single activity instance per video.  ... 
arXiv:2010.14982v2 fatcat:ddfxeoxiyrbsjhldb7c2lq222y

Toyota Smarthome: Real-World Activities of Daily Living

Srijan Das, Rui Dai, Michal Koperski, Luca Minciullo, Lorenzo Garattoni, Francois Bremond, Gianpiero Francesca
2019 2019 IEEE/CVF International Conference on Computer Vision (ICCV)  
In this paper, we introduce a large real-world video dataset for activities of daily living: Toyota Smarthome.  ...  Activities were annotated with both coarse and fine-grained labels. These characteristics differentiate Toyota Smarthome from other datasets for activity recognition.  ...  Acknowledgement The authors are grateful to Sophia Antipolis -Mediterranean "NEF" computation cluster for providing resources and support.  ... 
doi:10.1109/iccv.2019.00092 dblp:conf/iccv/DasDKMGBF19 fatcat:dy3twymez5fz3iqvce4wqmlz3u

AAN: Attributes-Aware Network for Temporal Action Detection [article]

Rui Dai, Srijan Das, Michael S. Ryoo, Francois Bremond
2023 arXiv   pre-print
By leveraging CLIP features, AAN outperforms state-of-the-art approaches on two popular action detection datasets: Charades and Toyota Smarthome Untrimmed datasets.  ...  Although the CLIP visual features exhibit discriminative properties for various vision tasks, particularly in object encoding, they are suboptimal for long-term video understanding.  ...  The authors are also grateful to the OPAL infrastructure from Université Côte d'Azur for providing resources and support.  ... 
arXiv:2309.00696v1 fatcat:kpqf7v7mnjf6fhcf7leqm6d4gm

PDAN: Pyramid Dilated Attention Network for Action Detection

Rui Dai, Srijan Das, Luca Minciullo, Lorenzo Garattoni, Gianpiero Francesca, Francois Bremond
2021 2021 IEEE Winter Conference on Applications of Computer Vision (WACV)  
The authors are grateful to the OPAL infrastructure from Université Côte d'Azur for providing resources and support.  ...  Consequently, both the DAL and NL layers explore the whole content of the video. Real-world untrimmed videos [38] have long duration, large temporal variance, and concurrent actions.  ...  Evaluation datasets We evaluate our PDAN on three challenging datasets: MultiTHUMOS [38] , Charades [29] and an Toyota Smarthome Untrimmed (TSU) [5] dataset.  ... 
doi:10.1109/wacv48630.2021.00301 fatcat:evbdo6jiwbcdnjp3uk7imu772m

Real-Time Elderly Monitoring for Senior Safety by Lightweight Human Action Recognition [article]

Han Sun, Yu Chen
2022 arXiv   pre-print
Real-time monitoring and action recognition are essential to raise an alert timely when abnormal behaviors or unusual activities occur.  ...  In this paper, leveraging the Independently-Recurrent neural Network (IndRNN) we propose a novel Real-time Elderly Monitoring for senior Safety (REMS) based on lightweight human action recognition (HAR  ...  Beside NTU RGB+D, the Toyota Smarthome Untrimmed (TSU) dataset is newly published and contains 536 video streams with an average of 21 minutes, which annotated with 51 activities.  ... 
arXiv:2207.10519v1 fatcat:j7fg5l7z75burc6hw3gvsmnsne

A Large-scale Study of Spatiotemporal Representation Learning with a New Benchmark on Action Recognition [article]

Andong Deng, Taojiannan Yang, Chen Chen
2023 arXiv   pre-print
BEAR is a collection of 18 video datasets grouped into 5 categories (anomaly, gesture, daily, sports, and instructional), which covers a diverse set of real-world applications.  ...  Our observation suggests that current state-of-the-art cannot solidly guarantee high performance on datasets close to real-world applications, and we hope BEAR can serve as a fair and challenging evaluation  ...  Toyota smarthome: Real-world activities of daily living.  ... 
arXiv:2303.13505v2 fatcat:6dyqjybdmzcyro43b6brn4s4ji

ElderSim: A Synthetic Data Generation Platform for Human Action Recognition in Eldercare Applications

Hochul Hwang, Cheongjae Jang, Geonwoo Park, Junghyun Cho, Ig-Jae Kim
2021 IEEE Access  
including RGB videos, two-and three-dimensional skeleton trajectories.  ...  From the experiments following several newly proposed scenarios that assume different real and synthetic dataset configurations for training, we observe a noticeable performance improvement by augmenting  ...  The authors would also like to show their gratitude to Youngjoong Kwon for her initial work with the simulation development and Haetsal Lee for his comments that improved the quality of the manuscript.  ... 
doi:10.1109/access.2021.3051842 fatcat:g7wvvxx32fczvhbnanvhk5sfii

ElderSim: A Synthetic Data Generation Platform for Human Action Recognition in Eldercare Applications [article]

Hochul Hwang, Cheongjae Jang, Geonwoo Park, Junghyun Cho, Ig-Jae Kim
2020 arXiv   pre-print
including RGB videos, two- and three-dimensional skeleton trajectories.  ...  From the experiments following several newly proposed scenarios that assume different real and synthetic dataset configurations for training, we observe a noticeable performance improvement by augmenting  ...  Acknowledgment The authors thank Jaewang Lee for his supportive work with realistic 3D modeling of implemented features in ElderSim.  ... 
arXiv:2010.14742v1 fatcat:jujkionbjnep3knwiakkl6bt7i

Table of Contents

2019 2019 IEEE/CVF International Conference on Computer Vision (ICCV)  
NVIDIA) liii Toyota Smarthome: Real-World Activities of Daily Living 833 Srijan Das (INRIA), Rui Dai (INRIA), Michal Koperski (INRIA), Luca Minciullo (Toyota-Europe), Lorenzo Garattoni (Toyota-Europe  ...  of Maryland), and David Crandall (Indiana University) StartNet: Online Detection of Action Start in Untrimmed Videos 5541 Mingfei Gao (University of Maryland), Mingze Xu (Indiana University), Larry Davis  ... 
doi:10.1109/iccv.2019.00004 fatcat:5aouo4scprc75c7zetsimylj2y