Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Semantic and structural analysis of TV diving programs

  • Computer Network and Internet
  • Published:
Journal of Computer Science and Technology Aims and scope Submit manuscript

Abstract

Automatic content analysis of sports videos is a valuable and challenging task. Motivated by analogies between a class of sports videos and languages, the authors propose a novel approach for sports video analysis based on compiler principles. It integrates both semantic analysis and syntactic analysis to automatically create an index and a table of contents for a sports video. Each shot of the video sequence is first annotated and indexed with semantic labels through detection of events using domain knowledge. A grammar-based parser is then constructed to identify the tree structure of the video content based on the labels. Meanwhile, the grammar can be used to detect and recover errors during the analysis. As a case study, a sports video parsing system is presented in the particular domain of diving. Experimental results indicate the proposed approach is effective.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Dimitrova N, Zhang H J, Shahraray B, Sezan I, Huang T, Zakhor A. Applications of video-content analysis and retrieval.IEEE Multimedia, 2002, 9(3): 42–55.

    Article  Google Scholar 

  2. Li F C, Gupta A, Sanccki E, He L, Rui Y. Browsing digital video. InProc. ACM Conference on Human Factors in Computing Systems, Hague, Netherlands, Apr. 2000, pp.169–176.

  3. Zhang H J, Gong Y H, Smoliar S W, Tan S Y. Automatic parsing of news video. InProc. IEEE International Conference on Multimedia Computing and Systems, Boston, MA, USA, May 1994, pp.45–54.

  4. Wang W, Gao W. Automatic segmentation of news items based on video and audio features.Journal of Computer Science and Technology, Mar. 2002, 17(2): 189–195.

    Article  Google Scholar 

  5. Babaguchi N, Kawai Y, Kitahashi T. Event based indexing of broadcasted sports video by intermodal collaboration.IEEE Trans. Multimedia, Mar. 2002, 4(1): 68–75.

    Article  Google Scholar 

  6. Duan L Y, Xu M, Chua T S, Tian Q, Xu C S. A midlevel representation framework for semantic sports video analysis. InProc. ACM Multimedia, Berkeley, CA, USA, Nov. 2003, pp.33–44.

  7. Rui Y, Gupta A, Acero A. Automatically extracting highlights for TV baseball programs. InProc. ACM Multimedia, Marina del Rey, CA, USA, Oct. 2000, pp.105–115.

  8. Hanjalic A. Generic approach to highlights extraction from a sports video. InProc. IEEE International Conference on Image Processing, Barcelona, Spain, Sep. 2003, 1: 1–4.

  9. Assfalg J, Bertini M, Colombo C, Bimbo A D. Semantic annotation of sports videos.IEEE Multimedia, 2002, 9(2): 52–60.

    Article  Google Scholar 

  10. Duan L Y, Xu M, Yu X D, Tian Q. A unified framework for semantic shot classification in sports videos. InProc. ACM Multimedia, Juan-les-Pins, France, Dec. 2002, pp.419–420.

  11. Zhong D, Chang S F. Structure analysis of sports video using domain models. InProc. IEEE International Conference on Multimedia and Expo, Tokyo, Japan, Aug. 2001, pp.713–716.

  12. Xie L, Chang S F, Divakaran A, Sun H. Structure analysis of soccer video with hidden Markov models. InProc. International Conference on Acoustic, Speech, and Signal Processing, Orlando, FL, USA, May 2002, 4: 4096–4099.

  13. Rui Y, Huang T S, Mehrotra S. Constructing table-of-content for videos.Multimedia Systems, 1999, 7(5): 359–368.

    Article  Google Scholar 

  14. Lienhart R. Comparison of automatic shot boundary detection algorithms. InProc. SPIE Storage and Retrieval for Imagee and Video Databases, San Jose, CA, USA, Jan. 1999, pp.291–301.

  15. Zhong Y, Zhang H J, Jain A K. Automatic caption localization in compressed video.IEEE Trans. Pattern Analysis and Machine Intelligence, Apr. 2002, 22(4): 385–392.

    Article  Google Scholar 

  16. Wang F, Li J T, Zhang Y D, Lin S X. Automatically extracting highlights for diving video. InProc. China National Computer Conference, Beijing, China, Nov. 2003, 1: 471–475.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fei Wang.

Additional information

This work was supported in part by the State Physical Culture Administration of China under Grant No.02005.

Fei Wang was born in 1977. He is a Ph.D. candidate at Institute of Computing Technology (ICT), the Chinese Academy of Sciences (CAS). He received the B.S. degree in electrical engineering from Zhejiang University in 1999 and the M.S degree in computer science from Graduate School of the Chinese Academy of Sciences in 2001. His current research interests include content-based video analysis and retrieval.

Jin-Tao Li was born in 1962. He is a professor and Ph.D. supervisor at ICT, CAS. His main research areas include multimedia data compression, virtual reality, and home network.

Yong-Dong Zhang was born in 1973. He is an associate professor at ICT, CAS. His main research areas include multimedia data compression and multimedia information retrieval.

Shou-Xun Lin was born in 1948. He is a professor and Ph.D. supervisor at ICT, CAS. His main research areas include multimedia technology and systems.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, F., Li, JT., Zhang, YD. et al. Semantic and structural analysis of TV diving programs. J. Comput. Sci. & Technol. 19, 928–935 (2004). https://doi.org/10.1007/BF02973456

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02973456

Keywords