Simplify Big Data Analytics with Amazon EMR

A beginner's guide to learning and implementing Amazon EMR for building data analytics solutions

Amazon EMR, formerly Amazon Elastic MapReduce, provides a managed Hadoop cluster in Amazon Web Services (AWS) that you can use to implement batch or streaming data pipelines. By gaining expertise in Amazon EMR, you can design and implement data analytics pipelines with persistent or transient EMR clusters in AWS. This book is a practical guide to Amazon EMR for building data pipelines. You'll start by understanding the Amazon EMR architecture, cluster nodes, features, and deployment options, along with their pricing. Next, the book covers the various big data applications that EMR supports. You'll then focus on the advanced configuration of EMR applications, hardware, networking,... alles anzeigen expand_more

Amazon EMR, formerly Amazon Elastic MapReduce, provides a managed Hadoop cluster in Amazon Web Services (AWS) that you can use to implement batch or streaming data pipelines. By gaining expertise in Amazon EMR, you can design and implement data analytics pipelines with persistent or transient EMR clusters in AWS.
This book is a practical guide to Amazon EMR for building data pipelines. You'll start by understanding the Amazon EMR architecture, cluster nodes, features, and deployment options, along with their pricing. Next, the book covers the various big data applications that EMR supports. You'll then focus on the advanced configuration of EMR applications, hardware, networking, security, troubleshooting, logging, and the different SDKs and APIs it provides. Later chapters will show you how to implement common Amazon EMR use cases, including batch ETL with Spark, real-time streaming with Spark Streaming, and handling UPSERT in S3 Data Lake with Apache Hudi. Finally, you'll orchestrate your EMR jobs and strategize on-premises Hadoop cluster migration to EMR. In addition to this, you'll explore best practices and cost optimization techniques while implementing your data analytics pipeline in EMR.
By the end of this book, you'll be able to build and deploy Hadoop- or Spark-based apps on Amazon EMR and also migrate your existing on-premises Hadoop workloads to AWS.





J-Novel Club is a digital publishing company started by translators and fans like you! Our mission is to translate and release the coolest, funnest, and newest light novels from Japan to the world. By focusing on digital releases, and providing a membership service to let people read the books as soon as they are translated, our goal is to build a community of light novel readers and to grow the market, so that more and more releases can be officially licensed and translated. We won't just publish the big hit light novels that get anime adaptations, but also newer titles or books from small publishers and web novels... as long as it's a blast to read, we'll bring it to you! So pull out your tablet or ereader, sit down in a comfy chair, and join the club! weniger anzeigen expand_less
Weiterführende Links zu "Simplify Big Data Analytics with Amazon EMR"

Versandkostenfreie Lieferung! (eBook-Download)

Als Sofort-Download verfügbar

eBook
34,79 €

  • SW9781801077729450914

Ein Blick ins Buch

Book2Look-Leseprobe

Andere kauften auch

Andere sahen sich auch an

info