Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1645953.1646037acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Probabilistic skyline queries

Published:02 November 2009Publication History

ABSTRACT

The ability to deal with uncertain information is becoming increasingly important for modern database applications. Whereas a conventional (certain) object is usually represented by a vector from a multidimensional feature space, an uncertain object is represented by a multivariate probability density function (PDF). This PDF can be defined either discretely (e.g. by a histogram) or continuously in parametric form (e.g. by a Gaussian Mixture Model). For a database of uncertain objects, the users expect similar data analysis techniques as for a conventional database of certain objects. An important analysis technique for certain objects is the skyline operator which finds maximal or minimal vectors with respect to any possible attribute weighting. In this paper, we propose the concept of probabilistic skylines, an extension of the skyline operator for uncertain objects. In addition, we propose efficient and effective methods for determining the probabilistic skyline of uncertain objects which are defined by a PDF in parametric form (e.g. a Gaussian function or a Gaussian Mixture Model). To further accelerate the search, we elaborate how the computation of the probabilistic skyline can be supported by an index structure for uncertain objects. An extensive experimental evaluation demonstrates both the effectiveness and the efficiency of our technique.

References

  1. R. S. Blum, Y. Zhang, B. M. Sadler, and R. J. Kozick. Approximation of correlated nongaussian noise pdfs using gaussian mixture models, published. In American University, Washington DC, 1999.Google ScholarGoogle Scholar
  2. C. Böhm, A. Pryakhin, and M. Schubert. The gauss-tree: Efficient object identification in databases of probabilistic feature vectors. In ICDE, page 9, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. Börzsönyi, D. Kossmann, and K. Stocker. The skyline operator. In ICDE, pages 421--430, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. R. Cheng, D. V. Kalashnikov, and S. Prabhakar. Evaluating probabilistic queries over imprecise data. In SIGMOD, pages 551--562, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. R. Cheng, Y. Xia, S. Prabhakar, R. Shah, and J. S. Vitter. Efficient indexing methods for probabilistic threshold queries over uncertain data. In VLDB, pages 876--887, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. J. Chomicki, P. Godfrey, J. Gryz, and D. Liang. Skyline with presorting: Theory and optimizations. In Intelligent Information Systems, pages 595--604, 2005.Google ScholarGoogle Scholar
  7. N. N. Dalvi and D. Suciu. Answering queries from statistics and probabilistic views. In VLDB, pages 805--816, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Faradjian, J. Gehrke, and P. Bonnet. Gadt: A probability space adt for representing and querying the physical world. In ICDE, pages 201--211, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. D. Kossmann, F. Ramsak, and S. Rost. Shooting stars in the sky: An online algorithm for skyline queries. In VLDB, pages 275--286, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Q. Li, B. Moon, and I. Lopez. Skyline index for time series data. IEEE Transactions on Knowledge and Data Engineering, 16(6):669--684, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. X. Lin, Y. Yuan, W. Wang, and H. Lu. Stabbing the sky: Efficient skyline computation over sliding windows. In ICDE, pages 502--513, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. K. Lu, Y. Qian, D. Rodríguez, W. Rivera, and M. Rodriguez. Wireless sensor networks for environmental monitoring applications: A design framework. In GLOBECOM, pages 1108--1112, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  13. A. M. Mainwaring, D. E. Culler, J. Polastre, R. Szewczyk, and J. Anderson. Wireless sensor networks for habitat monitoring. In WSNA, pages 88--97, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. D. Papadias, Y. Tao, G. Fu, and B. Seeger. An optimal and progressive algorithm for skyline queries. In SIGMOD, pages 467--478, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. Pei, B. Jiang, X. Lin, and Y. Yuan. Probabilistic skylines on uncertain data. In VLDB, pages 15--26, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. B. Sarikaya, M. A. Alim, and S. Rezaei. Integrating wireless eegs into medical sensor networks. In IWCMC, pages 1369--1374, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. A. D. Sarma, O. Benjelloun, A. Y. Halevy, and J. Widom. Working models for uncertain data. In ICDE, page 7, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. K.-L. Tan, P.-K. Eng, and B. C. Ooi. Efficient progressive skyline computation. In VLDB, pages 301--310, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Y. Tao, R. Cheng, X. Xiao, W. K. Ngai, B. Kao, and S. Prabhakar. Indexing multi-dimensional uncertain data with arbitrary probability density functions. In VLDB, pages 922--933, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. A. K. H. Tung, Z. Huang, H. Lu, and B. C. Ooi. Continuous skyline queries for moving objects. IEEE Transactions on Knowledge and Data Engineering, 18(12):1645--1658, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. D.-L. Yu and D.-W. Yu. Detecting sensor faults for a chemical reactor rig via adaptive neural network model. In ISNN (3), pages 544--549, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Probabilistic skyline queries

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management
        November 2009
        2162 pages
        ISBN:9781605585123
        DOI:10.1145/1645953

        Copyright © 2009 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 2 November 2009

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate1,861of8,427submissions,22%

        Upcoming Conference

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader