Open Access Open Access  Restricted Access Subscription Access

Feature-Based High Availability Mechanism for Extreme Aggregation Tasks in Real-Time Data Stream Processing

Wei-Long Ding,
Yan-Bo Han,
Jing Wang,
Zhuo-Feng Zhao,

Abstract


Data stream under Cloud or IoT (Internet of Things) implies real-time and continuous processing, during which availability guarantee is essential but expensive. For extreme aggregation tasks, traditional HA mechanisms require vast space at run-time and much longer recovery latency at fail-time especially under worse input. In this paper, feature-based high availability mechanism is proposed for extreme aggregation tasks, in which space-bounded feature is maintained through random sampling over time-based sliding window and failed tasks can be recovered precisely with high probability in an efficient way. The probabilistic effectiveness has been proved theoretically, and meanwhile the acceptable tradeoff between related overheads and performance has been demonstrated by comprehensive experiments on both synthetic and real data.

Keywords


Extreme aggregation; High availability; Data stream; Sampling

Citation Format:
Wei-Long Ding, Yan-Bo Han, Jing Wang, Zhuo-Feng Zhao, "Feature-Based High Availability Mechanism for Extreme Aggregation Tasks in Real-Time Data Stream Processing," Journal of Internet Technology, vol. 14, no. 2 , pp. 327-339, Mar. 2013.

Full Text:

PDF

Refbacks

  • There are currently no refbacks.





Published by Executive Committee, Taiwan Academic Network, Ministry of Education, Taipei, Taiwan, R.O.C
JIT Editorial Office, Office of Library and Information Services, National Dong Hwa University
No. 1, Sec. 2, Da Hsueh Rd., Shoufeng, Hualien 974301, Taiwan, R.O.C.
Tel: +886-3-931-7314  E-mail: jit.editorial@gmail.com