If you're like most R users, you have a deep knowledge of and love for statistics. But as your organization continues to collect huge amounts of data, adding tools such as Apache Spark makes a lot of sense. With this practical book, data scientists and professionals working with large-scale data applications will learn how to use Spark from R to tackle big data and big compute problems.
Authors Javier Luraschi, Kevin Kuo, and Edgar Ruiz show you how to use R with Spark to solve different data analysis problems. This book covers relevant data science topics, cluster computing, and issues that should interest even the most advanced users.
ISBN: | 9781492046370 |
Publication date: | 18th October 2019 |
Author: | Javier Luraschi, Kevin Kuo, Edgar Ruiz |
Publisher: | O'Reilly, an imprint of O'Reilly Media |
Format: | Paperback |
Pagination: | 293 pages |
Genres: | Programming and scripting languages: general; Database design and theory; Data capture and analysis; Data mining; Computer science; Information visualization; Information architecture |