This book is about Data Analytics. In that respect, it is like others. What distinguishes it from the rest is the variety of open-source tool applications. This book incorporates the use of R Studio, Python, SAS Studio (University Edition), and KNIME. This book is also about manipulating Big Data. Apache Hadoop on Hortonworks Sandbox is introduced and we manage, move, handle, and transform data using Apache Hive, Apache Spark, MapReduce and TEZ, with...