Big Data Storage and Management: Challenges and Opportunities

Abstract : The paper focuses on a currently very popular topic, Big Data. We describe and discuss its characteristics in terms of eleven V’s (Volume, Velocity, Variety, Veracity, etc.) and Big Data quality. These characteristics represent both data and process challenges. We then turn to the problems of Big Data storage and management. The principles of NoSQL databases are explained, including their categorization. We also briefly describe the Hadoop and MapReduce technologies, as well as their inefficiency for some interactive queries and for applications in the domains of large-scale graph processing and streaming data. NoSQL databases and Hadoop M/R are designed to take advantage of cloud computing architectures and allow massive computations to be run inexpensively and efficiently. The term Big Data 1.0 was introduced for these technologies. We continue with some newer approaches, currently called Big Data 2.0 processing systems. In particular, four categories of them are introduced and discussed: General Purpose Big Data Processing Systems, Big SQL Processing Systems, Big Graph Processing Systems, and Big Stream Processing Systems. Attention is then devoted to Big Analytics, the main application area for Big Data storage and processing. We argue that enterprises with complex, heterogeneous environments no longer want to adopt a BI access point for just one data source (Hadoop). More heterogeneous software platforms are needed. Even Hadoop has become a multi-purpose engine for ad hoc analysis. Finally, we mention some problems with Big Data. We also point out that Big Data creates a new type of digital divide. Having access to and knowledge of Big Data technologies gives companies and people a competitive edge in today’s data-driven world.
Document type :
Conference papers
Cited literature : 17 references

https://hal.inria.fr/hal-01852621
Contributor : Hal Ifip
Submitted on : Thursday, August 2, 2018 - 9:49:35 AM
Last modification on : Thursday, August 2, 2018 - 9:56:08 AM
Long-term archiving on : Saturday, November 3, 2018 - 1:02:42 PM

File

467210_1_En_3_Chapter.pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Citation

Jaroslav Pokorný. Big Data Storage and Management: Challenges and Opportunities. 12th International Symposium on Environmental Software Systems (ISESS), May 2017, Zadar, Croatia. pp.28-38, ⟨10.1007/978-3-319-89935-0_3⟩. ⟨hal-01852621⟩

Metrics

Record views : 263
File downloads : 168