Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Filters








1,550 Hits in 5.3 sec

Materialized View Selection Based on Adaptive Genetic Algorithm and Its Implementation with Apache Hive

Dongjin Yu, Wensheng Dou, Zhixiang Zhu, Jiaojiao Wang
2015 International Journal of Computational Intelligence Systems  
We establish a cost model that integrates the query, maintenance and storage costs to evaluate the performance of approaches and measure the fitness of an individual in the genetic algorithm.  ...  Both the simulation results and experiments on Apache Hive show that the approximately optimal solution for selecting materialized views can be obtained effectively using the approach presented.  ...  The authors would also like to thank anonymous reviewers and the colleagues in UC Santa Barbara who made valuable suggestions to improve the quality of the paper.  ... 
doi:10.1080/18756891.2015.1113744 fatcat:junv3falinao5do5slj4nsnvyq

Evaluating the Open Source Data Containers for Handling Big Geospatial Raster Data

Fei Hu, Mengchao Xu, Jingchao Yang, Yanshou Liang, Kejin Cui, Michael M. Little, Christopher S. Lynnes, Daniel Q. Duffy, Chaowei Yang
2018 ISPRS International Journal of Geo-Information  
This paper provides a comprehensive evaluation of six popular data containers (i.e., Rasdaman, SciDB, Spark, ClimateSpark, Hive, and MongoDB) for handling multi-dimensional, array-based geospatial raster  ...  These containers optimize their performance from different aspects, such as data organization modeling, indexing, and data pipelines.  ...  ClimateSpark was developed by Conflicts of Interest: The authors declare no conflict of interest.  ... 
doi:10.3390/ijgi7040144 fatcat:csbbnucfbzd2za4ghkqnyclihm

CloudETL

Xiufeng Liu, Christian Thomsen, Torben Bach Pedersen
2014 Proceedings of the 18th International Database Engineering & Applications Symposium on - IDEAS '14  
The results show that CloudETL scales very well and significantly outperforms the dimensional ETL capabilities of Hive both with respect to performance and programmer productivity.  ...  We present different performance optimizations including a purpose-specific data placement policy to co-locate data.  ...  Experiments In this section, we empirically evaluate the performance of CloudETL by studying 1) the performance of processing different DW schemas, including a star schema and schemas with an SCD and a  ... 
doi:10.1145/2628194.2628249 dblp:conf/ideas/LiuTP14 fatcat:rxyx3q63unbohptqbcjuldomqi

Multi-hive bee foraging algorithm for multi-objective optimal power flow considering the cost, loss, and emission

Hanning Chen, Ma Lian Bo, Yunlong Zhu
2014 International Journal of Electrical Power & Energy Systems  
This paper proposes a multi-hive multi-objective bee algorithm (M 2 OBA) for optimal power flow (OPF) in power systems.  ...  Our algorithm uses the concept of Pareto dominance and comprehensive learning mechanism to determine the flight direction of a bee and maintains nondominated solution vectors in external archive based  ...  Acknowledgement This research is partially supported by National Natural Science Foundation of China und Grant 61105067, 61203161, 61174164 and 51205389.  ... 
doi:10.1016/j.ijepes.2014.02.017 fatcat:ka6fij5lrjgx5ntwg4r4flg6ra

Bee Foraging Algorithm Based Multi-Level Thresholding For Image Segmentation

Zhicheng Zhang, Jianqin Yin
2020 IEEE Access  
The proposed algorithm provides different flying trajectories for different types of bees and takes both single-dimensional and multi-dimensional search aiming to maintain a proper balance between exploitation  ...  Multi-level thresholding is one of the essential approaches for image segmentation. Determining the optimal thresholds for multi-level thresholding needs exhaustive searching which is time-consuming.  ...  EXPERIMENTS AND RESULTS In this section, the experiments and results for evaluating the performance of the proposed BFA applied on multi-level thresholding segmentation for a number of benchmark images  ... 
doi:10.1109/access.2020.2966665 fatcat:pfq3jo2fnrho5hygcdb7styiku

A hybrid swarm-based algorithm for single-objective optimization problems involving high-cost analyses

Enrico Ampellio, Luca Vassio
2016 Swarm Intelligence  
AsBeC is designed to provide fast convergence speed, high solution accuracy and robust performance over a wide range of problems.  ...  In many technical fields, single-objective optimization procedures in continuous domains involve expensive numerical simulations.  ...  Their collaboration was fundamental in the successfully application of AsBeC algorithm to turbine design.  ... 
doi:10.1007/s11721-016-0121-6 fatcat:g4tedfua3vbl5cehc6yglssoxm

Design and implementation of a real-time interactive analytics system for large spatio-temporal data

Shiming Zhang, Yin Yang, Wei Fan, Marianne Winslett
2014 Proceedings of the VLDB Endowment  
Third, OceanRT contains a novel storage scheme that optimizes for queries with joins and multi-dimensional selections, which are common for large spatiotemporal data.  ...  Although there already exist systems for querying big data in real time, OceanRT's performance stands out due to several novel designs and components that address key efficiency and scalability issues  ...  and a storage scheme that reduces file fragmentation and serves the purpose of a multi-dimensional primary index.  ... 
doi:10.14778/2733004.2733079 fatcat:gqnny6fhz5c2pklcg53wlayyli

Hadoop GIS

Ablimit Aji, Fusheng Wang, Hoang Vo, Rubao Lee, Qiaoling Liu, Xiaodong Zhang, Joel Saltz
2013 Proceedings of the VLDB Endowment  
Hadoop-GIS is available as a set of library for processing spatial queries, and as an integrated software package in Hive.  ...  Support of high performance queries on large volumes of spatial data becomes increasingly important in many application domains, including geospatial problems in numerous fields, location based services  ...  The spatial indexing aware query optimization will take advantage of RESQUE for efficient spatial query support in Hive.  ... 
doi:10.14778/2536222.2536227 fatcat:z7w7hmd23na4flngnhyfvapbe4

Hadoop-GIS: A High Performance Spatial Data Warehousing System over MapReduce

Ablimit Aji, Fusheng Wang, Hoang Vo, Rubao Lee, Qiaoling Liu, Xiaodong Zhang, Joel Saltz
2013 Proceedings of the VLDB Endowment  
Hadoop-GIS is available as a set of library for processing spatial queries, and as an integrated software package in Hive.  ...  Support of high performance queries on large volumes of spatial data becomes increasingly important in many application domains, including geospatial problems in numerous fields, location based services  ...  Acknowledgments This work is supported in part by PHS Grant UL1RR025008 from the CTSA program, R24HL085343 from the NHLBI, by Grant Number R01LM009239 from the NLM, by NCI Contract No.  ... 
pmid:24187650 pmcid:PMC3814183 fatcat:v5ov5f6vhzcklbkoyztq7zovda

GeoSpark SQL: An Effective Framework Enabling Spatial Queries on Spark

Zhou Huang, Yiran Chen, Lin Wan, Xia Peng
2017 ISPRS International Journal of Geo-Information  
Hive and Spark.  ...  , and (3) spatial query optimization methods under Spark.  ...  In our experiment, each test case has been run 10 times and the average time cost is taken for performance evaluation.  ... 
doi:10.3390/ijgi6090285 fatcat:ixe7urqadzb5zcbtyxbc4zheva

DGFIndex for Smart Grid: Enhancing Hive with a Cost-Effective Multidimensional Range Index [article]

Yue Liu, Songlin Hu, Tilmann Rabl, Wantao Liu, Hans-Arno Jacobsen, Kaifeng Wu, Jian Chen, Jintao Li
2014 arXiv   pre-print
Unlike the existing indexes in Hive, which stores all combinations of multiple dimensions, DGFIndex only stores the information of cubes. This leads to smaller index size and faster query processing.  ...  Our comprehensive experiments show that DGFIndex can save significant disk space in comparison with the existing indexes in Hive and the query performance with DGFIndex is 2-50 times faster than existing  ...  EXPERIMENTS AND RESULTS In this section, we evaluate the DGFIndex and compare it with the existing indexes in Hive.  ... 
arXiv:1404.5686v3 fatcat:sooqbhisa5e2fpgagmpn63dsyq

DGFIndex for smart grid

Yue Liu, Songlin Hu, Tilmann Rabl, Wantao Liu, Hans-Arno Jacobsen, Kaifeng Wu, Jian Chen, Jintao Li
2014 Proceedings of the VLDB Endowment  
Unlike the existing indexes in Hive, which stores all combinations of multiple dimensions, DGFIndex only stores the information of cubes. This leads to smaller index size and faster query processing.  ...  Our comprehensive experiments show that DGFIndex can save significant disk space in comparison with the existing indexes in Hive and the query performance with DGFIndex is 2-50 times faster than existing  ...  EXPERIMENTS AND RESULTS In this section, we evaluate the DGFIndex and compare it with the existing indexes in Hive.  ... 
doi:10.14778/2733004.2733021 fatcat:zvlcirruajeu5csyzlkcdro6fm

NoSQL Databases for RDF: An Empirical Evaluation [chapter]

Philippe Cudré-Mauroux, Iliya Enchev, Sever Fundatureanu, Paul Groth, Albert Haque, Andreas Harth, Felix Leif Keppmann, Daniel Miranker, Juan F. Sequeda, Marcin Wylot
2013 Lecture Notes in Computer Science  
In recent years, much effort was spent on optimizing native RDF stores and on repurposing relational query engines for large-scale RDF processing.  ...  This work is, to the best of our knowledge, the first systematic attempt at characterizing and comparing NoSQL stores for RDF processing.  ...  Indexing and storage schemas have a large impact on the performance of a database. Sidirourgos et al. [17] use a single system to evaluate the performance impact of different approaches.  ... 
doi:10.1007/978-3-642-41338-4_20 fatcat:ubefu6jkkfgvxpqbuensoaxbq4

A performance evaluation of Hive for scientific data management

Taoying Liu, Jing Liu, Hong Liu, Wei Li
2013 2013 IEEE International Conference on Big Data  
of underlying storage facilities and indexing mechanism.  ...  A complete strategy of migrating SSDB to Hive is described in detail including query HQL implementation, data partition schema and adjustments of underlying storage facilities.  ...  Rubao Lee and Dr. Yin Huai, for their insightful suggestions. We would also thank Dixin Tang, Liechun Zhou and Shufang Wang, they did valuable work to support this research.  ... 
doi:10.1109/bigdata.2013.6691696 dblp:conf/bigdataconf/LiuLLL13 fatcat:mvgsipp2dbeujh6r3t3caexzca

ABC-SVM: Artificial Bee Colony and SVM Method for Microarray Gene Selection and Multi Class Cancer Classification

Hala M. Alshamlan, Ghada H. Badr, Yousef A. Alohali
2016 International Journal of Machine Learning and Computing  
We evaluate the performance of the proposed ABC-SVM algorithm by conducting extensive experiments on six binary and multi-class microarrays dataset.  ...  In this paper, we propose apply ABC algorithm in analyzing microarray dataset.  ...  In this section, we evaluate the overall performance of gene selection methods using six popular binary and multi-class microarray cancer datasets, which were downloaded from http://www.gems-system.org  ... 
doi:10.18178/ijmlc.2016.6.3.596 fatcat:3b3xd7uerfd65doprkjb434e6i
« Previous Showing results 1 — 15 out of 1,550 results