Mastering Spark With R

Mastering Spark With R Synopsis

If you're like most R users, you have a deep knowledge of, and love for, statistics. But as your organization continues to collect huge amounts of data, adding tools such as Apache Spark makes a lot of sense. With this practical book, data scientists and professionals working with large-scale data applications will learn how to use Spark from R to tackle big data and big compute problems.

Authors Javier Luraschi, Kevin Kuo, and Edgar Ruiz show you how to use R with Spark to solve different data analysis problems. This book covers relevant data science topics, cluster computing, and issues that should interest even the most advanced users.

  • Analyze, explore, transform, and visualize data in Apache Spark with R
  • Create statistical models to extract information and predict outcomes; automate the process in production-ready workflows
  • Perform analysis and modeling across many machines using distributed computing techniques
  • Use large-scale data from multiple sources and different formats with ease from within Spark
  • Learn about alternative modeling frameworks for graph processing, geospatial analysis, and genomics at scale
  • Dive into advanced topics including custom transformations, real-time data processing, and creating custom Spark extensions

About This Edition

ISBN: 9781492046370
Publication date: 18th October 2019
Authors: Javier Luraschi, Kevin Kuo, Edgar Ruiz
Publisher: O'Reilly, an imprint of O'Reilly Media
Format: Paperback
Pagination: 293 pages
Genres: Programming and scripting languages: general; Database design and theory; Data capture and analysis; Data mining; Computer science; Information visualization; Information architecture