blog_posts: 275fdb0582
This data as json
id | createdDate | title | link | postExcerpt | featuredImageUrl | hash | contributors | modifiedDate | displayDate |
---|---|---|---|---|---|---|---|---|---|
blog-posts#33-10448 | 2020-06-08 19:00:21 | How Drop used the Amazon EMR runtime for Apache Spark to halve costs and get results 5.4 times faster | https://aws.amazon.com/blogs/big-data/how-drop-used-the-amazon-emr-runtime-for-apache-spark-to-halve-costs-and-get-results-5-4-times-faster/ | This post details how we designed and implemented our data lake’s batch ETL pipeline to use Amazon EMR, and the numerous ways we iterated on its architecture to reduce Apache Spark runtimes from hours to minutes and save over 50% on operational costs. | https://d2908q01vomqb2.cloudfront.net/b6692ea5df920cad691c20319a6fffd7a4a766b8/2020/06/03/HowDropUsedEMRSpark1-300x188.png | 275fdb0582 | Michael Chau, Leonardo Gomez | 2022-03-03 22:02:20 | 08 Jun 2020 |
Links from other tables
- 8 rows from blog_post_hash in blog_post_tags