References
- White, Tom. Hadoop: The Definitive Guide: Storage and Analysis at Internet Scale, O'Reilly Media.
- Aven, Jeffrey. Hadoop in 24 Hours, Sams Teach Yourself, Pearson Education.
Carpenter, Jeff; Hewitt, Eben. Cassandra: The Definitive Guide: Distributed Data at Web Scale, O'Reilly Media.
Chodorow, Kristina. MongoDB: The Definitive Guide: Powerful and Scalable Data Storage, O'Reilly Media
Sundog Education by Frank Kane, The Ultimate Hands-On Hadoop - Tame your Big Data!
https://hadoop.apache.org/docs/r2.4.1/hadoop-project-dist/hadoop-common/FileSystemShell.html
https://www.safaribooksonline.com/library/view/learning-spark/9781449359034/ch01.html
http://searchdatamanagement.techtarget.com/definition/data-engineer
AWS whitepaper, Lambda Architecture for Batch and RealTime Processing on AWS with Spark Streaming and Spark SQL
https://www.oreilly.com/ideas/questioning-the-lambda-architecture