Mastering Apache Spark 2.0
by Jacek Laskowski
Publisher: GitBook 2016
Number of pages: 1621
This collection of notes (what some may rashly call a 'book') serves as my ultimate place to collect all the nuts and bolts of using Apache Spark. The notes aim to help me design and develop better products with Apache Spark.
by Chris Eaton, et al. - McGraw-Hill
Big Data represents a new era in data exploration and utilization, and IBM helps clients navigate this transformation. The book reveals how to use Big Data technology to deliver a robust, secure, highly available, enterprise-class Big Data platform.
by Matthew North - Global Text Project
In this book, professor Matt North uses simple examples, clear explanations and free, powerful, easy-to-use software to teach you the basics of data mining: techniques that can help you answer some of your toughest business questions.
by Open Knowledge Foundation - School of Data
The Data Wrangling Handbook is a companion text to the School of Data. Its function is something like a traditional textbook -- it will provide the detail and background theory to support the School of Data courses and challenges.
by Eric Redmond - GitBook
This is a free little book about Riak, a scalable, high-availability NoSQL datastore. Riak is an open-source, distributed key/value database built for high availability and near-linear scalability. Riak has remarkably high uptime and grows with you.