blog_posts: 275fdb0582

id	createdDate	title	link	postExcerpt	featuredImageUrl	hash	contributors	modifiedDate	displayDate
blog-posts#33-10448	2020-06-08 19:00:21	How Drop used the Amazon EMR runtime for Apache Spark to halve costs and get results 5.4 times faster	https://aws.amazon.com/blogs/big-data/how-drop-used-the-amazon-emr-runtime-for-apache-spark-to-halve-costs-and-get-results-5-4-times-faster/	This post details how we designed and implemented our data lake’s batch ETL pipeline to use Amazon EMR, and the numerous ways we iterated on its architecture to reduce Apache Spark runtimes from hours to minutes and save over 50% on operational costs.	https://d2908q01vomqb2.cloudfront.net/b6692ea5df920cad691c20319a6fffd7a4a766b8/2020/06/03/HowDropUsedEMRSpark1-300x188.png	275fdb0582	Michael Chau, Leonardo Gomez	2022-03-03 22:02:20	08 Jun 2020
