Discretized streams: Fault-tolerant streaming computation at scale, Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles, ser. SOSP '13, pp.423-438, 2013.
MapReduce: Simplified data processing on large clusters, Proceedings of the 6th Conference on Symposium on OSDI. USENIX Association, 2004.
Hadoop
Semantics of data streams and operators, Proceedings of the 10th International Conference on Database Theory, ser. ICDT'05, pp.37-52, 2005.
The world beyond batch: Streaming 101
The world beyond batch: Streaming 102
The CQL continuous query language: Semantic foundations and query execution, The VLDB Journal, vol.15, issue.2, pp.121-142, 2006.
DOI : 10.1007/s00778-004-0147-z
Realtime data processing at Facebook, Proceedings of the 2016 International Conference on Management of Data, ser. SIGMOD '16, pp.1087-1098, 2016.
DOI : 10.1145/2882903.2904441
STREAM2016: Streaming Requirements, Experience, Applications and Middleware Workshop, 2016.
DOI : 10.2172/1344785
Final report from the NSF workshop on future directions in wireless networking, USA, Tech. Rep, 2013.
The Hadoop distributed file system, Proceedings of the 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST), ser. MSST '10, pp.1-10, 2010.
High-availability algorithms for distributed stream processing, Proceedings of the 21st International Conference on Data Engineering, ser. ICDE '05, pp.779-790, 2005.
The dataflow model: A practical approach to balancing correctness, latency, and cost in massive-scale, unbounded, out-of-order data processing, Proc. VLDB Endow, vol.8, issue.12, pp.1792-1803, 2015.
Hyperion: High volume stream archival for retrospective querying, 2007 USENIX Annual Technical Conference on Proceedings of the USENIX Annual Technical Conference, ser. ATC'07, vol.4, pp.1-4, 2007.
Gigascope: A stream database for network applications, Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, ser. SIGMOD '03, pp.647-651, 2003.
A framework for clustering evolving data streams, Proceedings of the 29th International Conference on Very Large Data Bases, vol.29, pp.81-92, 2003.
DOI : 10.1016/b978-012722442-8/50016-1
MacroBase: Prioritizing attention in fast data, Proceedings of the 2017 ACM International Conference on Management of Data, ser. SIGMOD '17, pp.541-556, 2017.
Workload analysis of a large-scale key-value store, Proceedings of the 12th ACM SIGMETRICS/PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems, ser. SIGMETRICS '12, pp.53-64, 2012.
Internet of Things (IoT): A vision, architectural elements, and future directions, Future Gener. Comput. Syst, vol.29, issue.7, pp.1645-1660, 2013.
DOI : 10.1016/j.future.2013.01.010
URL : http://arxiv.org/pdf/1207.0203
Leveraging adaptive I/O to optimize collective data shuffling patterns for big data analytics, IEEE Transactions on Parallel and Distributed Systems, vol.28, issue.6, pp.1663-1674, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01531374
Processing flows of information: From data stream to complex event processing, ACM Comput. Surv, vol.44, issue.3, pp.1-15, 2012.
Column-stores vs. row-stores: How different are they really, Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, ser. SIGMOD '08, pp.967-980, 2008.
Dremel: Interactive analysis of web-scale datasets, Proc. VLDB Endow, vol.3, pp.330-339, 2010.
Partitioning functions for stateful data parallelism in stream processing, The VLDB Journal, vol.23, issue.4, pp.517-539, 2014.
A framework for partitioning and execution of data stream applications in mobile cloud computing, Eval. Rev, vol.40, issue.4, pp.23-32, 2013.
X-Stream: Edge-centric graph processing using streaming partitions, Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles, ser. SOSP '13, pp.472-488, 2013.
Scalable data partitioning techniques for parallel sliding window processing over data streams, 8th International Workshop on Data Management for Sensor Networks, 2011.
High performance stream query processing with correlation-aware partitioning, Proc. VLDB Endow, vol.7, issue.4, pp.265-276, 2013.
DOI : 10.14778/2732240.2732245
URL : http://www.vldb.org/pvldb/vol7/p265-cao.pdf
Streaming queries over streaming data, Proceedings of the 28th International Conference on Very Large Data Bases, ser. VLDB '02. VLDB Endowment, pp.203-214, 2002.
DOI : 10.1016/b978-155860869-6/50026-3
URL : http://www.cs.berkeley.edu/~franklin/Papers/psoupVLDB02.pdf
Scalable distributed stream processing, First Biennial Conference on Innovative Data Systems Research, 2003.
Highly available, fault-tolerant, parallel dataflows, Proceedings of the 2004 ACM SIGMOD International Conference on Management of Data, ser. SIGMOD '04, pp.827-838, 2004.
DOI : 10.1145/1007568.1007662
URL : http://db.cs.berkeley.edu/papers/sigmod04-fluxft.pdf
Semantics and evaluation techniques for window aggregates in data streams, Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, ser. SIGMOD '05, pp.311-322, 2005.
Nephele streaming: Stream processing under QoS constraints at scale, Cluster Computing, vol.17, issue.1, pp.61-78, 2014.
DOI : 10.1007/s10586-013-0281-8
URL : http://arxiv.org/pdf/1308.1031
Kafka: A distributed messaging system for log processing, Proceedings of 6th International Workshop on Networking Meets Databases, ser. NetDB'11, 2011.
RabbitMQ
AMQP
ZeroMQ
HornetQ
Gobblin: Unifying data ingestion for Hadoop, Proc. VLDB Endow, vol.8, issue.12, pp.1764-1769, 2015.
Gobblin Documentation
Elasticsearch
Aurora: A new model and architecture for data stream management, The VLDB Journal, vol.12, issue.2, pp.120-139, 2003.
Redis
Consistency in non-transactional distributed storage systems, ACM Comput. Surv, vol.49, issue.1, pp.1-19, 2016.
The RAMCloud storage system, ACM Trans. Comput. Syst, vol.33, issue.3, pp.1-7, 2015.
SLIK: Scalable low-latency indexes for a key-value store, Proceedings of the 2016 USENIX Conference on Usenix Annual Technical Conference, ser. USENIX ATC '16, pp.57-70, 2016.
MICA: A holistic approach to fast in-memory key-value storage, Proceedings of the 11th USENIX Conference on Networked Systems Design and Implementation, ser. NSDI'14, pp.429-444, 2014.
Architecting to achieve a billion requests per second throughput on a single key-value store server platform, Proceedings of the 42nd Annual International Symposium on Computer Architecture, ser. ISCA '15, pp.476-488, 2015.
HyperDex: A distributed, searchable key-value store, Proceedings of the ACM SIGCOMM 2012 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication, ser. SIGCOMM '12, pp.25-36, 2012.
Memcached
Scaling Memcache at Facebook, Proceedings of the 10th USENIX Conference on Networked Systems Design and Implementation, ser. NSDI'13, pp.385-398, 2013.
RocksDB
LMDB
DXRAM: A persistent in-memory storage for billions of small objects, 2013.
Druid: A real-time analytical data store, Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, ser. SIGMOD '14, pp.157-168, 2014.
Druid
Scalable storage support for data stream processing, 26th Symposium on Mass Storage Systems and Technologies, ser. MSST'10, 2010.
Continuous queries over data streams, SIGMOD Rec, vol.30, issue.3, pp.109-120, 2001.
DOI : 10.1145/603867.603884
URL : http://pages.cs.wisc.edu/~jhuang/qual/continuous-query-data-stream-01.pdf
PVFS: A parallel file system for Linux clusters, Proceedings of the 4th Annual Linux Showcase & Conference, vol.4, pp.28-28, 2000.
Flexible and scalable storage management for data-intensive stream processing, Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, ser. EDBT '09, pp.934-945, 2009.
DOI : 10.1145/1516360.1516467
Linear Road: A stream data management benchmark, Proceedings of the Thirtieth International Conference on Very Large Data Bases, vol.30, pp.480-491, 2004.
VOLAP: A scalable distributed system for real-time OLAP with high velocity data, 2016 IEEE International Conference on Cluster Computing, pp.354-363, 2016.
Persistent temporal streams, Proceedings of the 10th ACM/IFIP/USENIX International Conference on Middleware, ser. Middleware '09, vol.17, 2009.
DOI : 10.1007/978-3-642-10445-9_17
URL : https://link.springer.com/content/pdf/10.1007%2F978-3-642-10445-9_17.pdf
Liquid: Unifying nearline and offline big data integration, CIDR 2015, Seventh Biennial Conference on Innovative Data Systems Research, 2015.
The freeze-frame file system, Proceedings of the Seventh ACM Symposium on Cloud Computing, ser. SoCC '16, pp.307-320, 2016.
TimescaleDB
InfluxDB
OpenTSDB
Bigtable: A distributed storage system for structured data, ACM Trans. Comput. Syst, vol.26, issue.2, pp.1-4, 2008.
Analysis of HDFS under HBase: A Facebook Messages case study, Proceedings of the 12th USENIX Conference on File and Storage Technologies, ser. FAST'14, pp.199-212, 2014.
Cassandra: A decentralized structured storage system, SIGOPS Oper. Syst. Rev, vol.44, issue.2, pp.35-40, 2010.
Dynamic storage allocation: A survey and critical review, Proceedings of the International Workshop on Memory Management, ser. IWMM '95, pp.1-116, 1995.
Apache Parquet
In-memory big data management and processing: A survey, IEEE Trans. on Knowl. and Data Eng, vol.27, issue.7, 2015.
Towards a streaming SQL standard, Proc. VLDB Endow, vol.1, pp.1379-1390, 2008.
State access patterns in stream parallel computations, International Journal of High Performance Computing Applications, 2017.
Streaming-data algorithms for high-quality clustering, Proceedings of the 18th International Conference on Data Engineering, ser. ICDE '02, p.685, 2002.
Data stream clustering: A survey, ACM Comput. Surv, vol.46, issue.1, pp.1-13, 2013.
Graph stream algorithms: A survey, SIGMOD Rec, vol.43, issue.1, pp.9-20, 2014.
DistributedLog: A high performance replicated log service, IEEE 33rd International Conference on Data Engineering, ser. ICDE'17. IEEE, 2017.
Data ingestion for the connected world, CIDR, Online Proceedings, 2017.
The Google file system, Proceedings of the Nineteenth ACM Symposium on Operating Systems Principles, ser. SOSP '03, pp.29-43, 2003.
DOI : 10.1145/945445.945450
HopsFS: Scaling hierarchical file system metadata using NewSQL databases, Proceedings of the 15th Usenix Conference on File and Storage Technologies, ser. FAST'17, pp.89-103, 2017.
DOI : 10.1007/978-3-319-77525-8_146
URL : http://arxiv.org/pdf/1606.01588
An optimized approach for storing and accessing small files on cloud storage, J. Netw. Comput. Appl, vol.35, issue.6, pp.1847-1862, 2012.
IoT stream processing and analytics in the fog, 2017.
Fog computing for sustainable smart cities: A survey, ACM Comput. Surv, vol.50, issue.3, 2017.
DOI : 10.1145/3057266
URL : http://eprints.hud.ac.uk/id/eprint/31927/8/__nas01_librhome_librsh3_Desktop_acmsmall-sample.pdf