A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
Filters
Materialized View Selection Based on Adaptive Genetic Algorithm and Its Implementation with Apache Hive
2015
International Journal of Computational Intelligence Systems
We establish a cost model that integrates the query, maintenance and storage costs to evaluate the performance of approaches and measure the fitness of an individual in the genetic algorithm. ...
Both the simulation results and experiments on Apache Hive show that the approximately optimal solution for selecting materialized views can be obtained effectively using the approach presented. ...
The authors would also like to thank anonymous reviewers and the colleagues in UC Santa Barbara who made valuable suggestions to improve the quality of the paper. ...
doi:10.1080/18756891.2015.1113744
fatcat:junv3falinao5do5slj4nsnvyq
Evaluating the Open Source Data Containers for Handling Big Geospatial Raster Data
2018
ISPRS International Journal of Geo-Information
This paper provides a comprehensive evaluation of six popular data containers (i.e., Rasdaman, SciDB, Spark, ClimateSpark, Hive, and MongoDB) for handling multi-dimensional, array-based geospatial raster ...
These containers optimize their performance from different aspects, such as data organization modeling, indexing, and data pipelines. ...
ClimateSpark was developed by
Conflicts of Interest: The authors declare no conflict of interest. ...
doi:10.3390/ijgi7040144
fatcat:csbbnucfbzd2za4ghkqnyclihm
CloudETL
2014
Proceedings of the 18th International Database Engineering & Applications Symposium on - IDEAS '14
The results show that CloudETL scales very well and significantly outperforms the dimensional ETL capabilities of Hive both with respect to performance and programmer productivity. ...
We present different performance optimizations including a purpose-specific data placement policy to co-locate data. ...
Experiments In this section, we empirically evaluate the performance of CloudETL by studying 1) the performance of processing different DW schemas, including a star schema and schemas with an SCD and a ...
doi:10.1145/2628194.2628249
dblp:conf/ideas/LiuTP14
fatcat:rxyx3q63unbohptqbcjuldomqi
Multi-hive bee foraging algorithm for multi-objective optimal power flow considering the cost, loss, and emission
2014
International Journal of Electrical Power & Energy Systems
This paper proposes a multi-hive multi-objective bee algorithm (M 2 OBA) for optimal power flow (OPF) in power systems. ...
Our algorithm uses the concept of Pareto dominance and comprehensive learning mechanism to determine the flight direction of a bee and maintains nondominated solution vectors in external archive based ...
Acknowledgement This research is partially supported by National Natural Science Foundation of China und Grant 61105067, 61203161, 61174164 and 51205389. ...
doi:10.1016/j.ijepes.2014.02.017
fatcat:ka6fij5lrjgx5ntwg4r4flg6ra
Bee Foraging Algorithm Based Multi-Level Thresholding For Image Segmentation
2020
IEEE Access
The proposed algorithm provides different flying trajectories for different types of bees and takes both single-dimensional and multi-dimensional search aiming to maintain a proper balance between exploitation ...
Multi-level thresholding is one of the essential approaches for image segmentation. Determining the optimal thresholds for multi-level thresholding needs exhaustive searching which is time-consuming. ...
EXPERIMENTS AND RESULTS In this section, the experiments and results for evaluating the performance of the proposed BFA applied on multi-level thresholding segmentation for a number of benchmark images ...
doi:10.1109/access.2020.2966665
fatcat:pfq3jo2fnrho5hygcdb7styiku
A hybrid swarm-based algorithm for single-objective optimization problems involving high-cost analyses
2016
Swarm Intelligence
AsBeC is designed to provide fast convergence speed, high solution accuracy and robust performance over a wide range of problems. ...
In many technical fields, single-objective optimization procedures in continuous domains involve expensive numerical simulations. ...
Their collaboration was fundamental in the successfully application of AsBeC algorithm to turbine design. ...
doi:10.1007/s11721-016-0121-6
fatcat:g4tedfua3vbl5cehc6yglssoxm
Design and implementation of a real-time interactive analytics system for large spatio-temporal data
2014
Proceedings of the VLDB Endowment
Third, OceanRT contains a novel storage scheme that optimizes for queries with joins and multi-dimensional selections, which are common for large spatiotemporal data. ...
Although there already exist systems for querying big data in real time, OceanRT's performance stands out due to several novel designs and components that address key efficiency and scalability issues ...
and a storage scheme that reduces file fragmentation and serves the purpose of a multi-dimensional primary index. ...
doi:10.14778/2733004.2733079
fatcat:gqnny6fhz5c2pklcg53wlayyli
Hadoop GIS
2013
Proceedings of the VLDB Endowment
Hadoop-GIS is available as a set of library for processing spatial queries, and as an integrated software package in Hive. ...
Support of high performance queries on large volumes of spatial data becomes increasingly important in many application domains, including geospatial problems in numerous fields, location based services ...
The spatial indexing aware query optimization will take advantage of RESQUE for efficient spatial query support in Hive. ...
doi:10.14778/2536222.2536227
fatcat:z7w7hmd23na4flngnhyfvapbe4
Hadoop-GIS: A High Performance Spatial Data Warehousing System over MapReduce
2013
Proceedings of the VLDB Endowment
Hadoop-GIS is available as a set of library for processing spatial queries, and as an integrated software package in Hive. ...
Support of high performance queries on large volumes of spatial data becomes increasingly important in many application domains, including geospatial problems in numerous fields, location based services ...
Acknowledgments This work is supported in part by PHS Grant UL1RR025008 from the CTSA program, R24HL085343 from the NHLBI, by Grant Number R01LM009239 from the NLM, by NCI Contract No. ...
pmid:24187650
pmcid:PMC3814183
fatcat:v5ov5f6vhzcklbkoyztq7zovda
GeoSpark SQL: An Effective Framework Enabling Spatial Queries on Spark
2017
ISPRS International Journal of Geo-Information
Hive and Spark. ...
, and (3) spatial query optimization methods under Spark. ...
In our experiment, each test case has been run 10 times and the average time cost is taken for performance evaluation. ...
doi:10.3390/ijgi6090285
fatcat:ixe7urqadzb5zcbtyxbc4zheva
DGFIndex for Smart Grid: Enhancing Hive with a Cost-Effective Multidimensional Range Index
[article]
2014
arXiv
pre-print
Unlike the existing indexes in Hive, which stores all combinations of multiple dimensions, DGFIndex only stores the information of cubes. This leads to smaller index size and faster query processing. ...
Our comprehensive experiments show that DGFIndex can save significant disk space in comparison with the existing indexes in Hive and the query performance with DGFIndex is 2-50 times faster than existing ...
EXPERIMENTS AND RESULTS In this section, we evaluate the DGFIndex and compare it with the existing indexes in Hive. ...
arXiv:1404.5686v3
fatcat:sooqbhisa5e2fpgagmpn63dsyq
DGFIndex for smart grid
2014
Proceedings of the VLDB Endowment
Unlike the existing indexes in Hive, which stores all combinations of multiple dimensions, DGFIndex only stores the information of cubes. This leads to smaller index size and faster query processing. ...
Our comprehensive experiments show that DGFIndex can save significant disk space in comparison with the existing indexes in Hive and the query performance with DGFIndex is 2-50 times faster than existing ...
EXPERIMENTS AND RESULTS In this section, we evaluate the DGFIndex and compare it with the existing indexes in Hive. ...
doi:10.14778/2733004.2733021
fatcat:zvlcirruajeu5csyzlkcdro6fm
NoSQL Databases for RDF: An Empirical Evaluation
[chapter]
2013
Lecture Notes in Computer Science
In recent years, much effort was spent on optimizing native RDF stores and on repurposing relational query engines for large-scale RDF processing. ...
This work is, to the best of our knowledge, the first systematic attempt at characterizing and comparing NoSQL stores for RDF processing. ...
Indexing and storage schemas have a large impact on the performance of a database. Sidirourgos et al. [17] use a single system to evaluate the performance impact of different approaches. ...
doi:10.1007/978-3-642-41338-4_20
fatcat:ubefu6jkkfgvxpqbuensoaxbq4
A performance evaluation of Hive for scientific data management
2013
2013 IEEE International Conference on Big Data
of underlying storage facilities and indexing mechanism. ...
A complete strategy of migrating SSDB to Hive is described in detail including query HQL implementation, data partition schema and adjustments of underlying storage facilities. ...
Rubao Lee and Dr. Yin Huai, for their insightful suggestions. We would also thank Dixin Tang, Liechun Zhou and Shufang Wang, they did valuable work to support this research. ...
doi:10.1109/bigdata.2013.6691696
dblp:conf/bigdataconf/LiuLLL13
fatcat:mvgsipp2dbeujh6r3t3caexzca
ABC-SVM: Artificial Bee Colony and SVM Method for Microarray Gene Selection and Multi Class Cancer Classification
2016
International Journal of Machine Learning and Computing
We evaluate the performance of the proposed ABC-SVM algorithm by conducting extensive experiments on six binary and multi-class microarrays dataset. ...
In this paper, we propose apply ABC algorithm in analyzing microarray dataset. ...
In this section, we evaluate the overall performance of gene selection methods using six popular binary and multi-class microarray cancer datasets, which were downloaded from http://www.gems-system.org ...
doi:10.18178/ijmlc.2016.6.3.596
fatcat:3b3xd7uerfd65doprkjb434e6i
« Previous
Showing results 1 — 15 out of 1,550 results