Saturday, June 2, 2018

10 Considerations to Quickly Find Success When Adopting Amazon Redshift Spectrum

Amazon Redshift Spectrum, an interactive query service for Redshift customers, was introduced in April 2017. It lets you skip time-consuming ETL workflows and run queries directly against data stored in Amazon S3. This article covers what you need to know when adopting Amazon Redshift Spectrum for interactive queries, and how to automate certain processes to improve performance and lower query costs.

1. Schema and Table Definitions

Setting up Amazon Redshift Spectrum requires creating an external schema and external tables. The external schema is created in a “metastore”, which can be either the Amazon Athena data catalog or a Hive metastore running on Amazon EMR. Note that external tables are read-only, so you cannot run insert, update, or delete operations against them.
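
As a rough illustration of the setup described above, the following Python sketch builds the two DDL statements involved: one that creates an external schema backed by the data catalog, and one that defines an external table over files in S3. The helper names, the schema, database and table names, the bucket path, and the IAM role ARN are all hypothetical placeholders, and the Parquet file format is an assumption. The sketch only produces SQL text; running it against a cluster is left to whatever SQL client you already use.

```python
def external_schema_ddl(schema, database, iam_role_arn):
    """Build a CREATE EXTERNAL SCHEMA statement that uses the data catalog.

    All arguments are illustrative placeholders, not values from the article.
    """
    return (
        f"CREATE EXTERNAL SCHEMA {schema}\n"
        f"FROM DATA CATALOG\n"
        f"DATABASE '{database}'\n"
        f"IAM_ROLE '{iam_role_arn}'\n"
        f"CREATE EXTERNAL DATABASE IF NOT EXISTS;"
    )


def external_table_ddl(schema, table, columns, s3_location, file_format="PARQUET"):
    """Build a CREATE EXTERNAL TABLE statement over files stored in S3.

    `columns` is a list of (name, type) pairs. The Parquet default is an
    assumption made for this sketch.
    """
    column_list = ",\n".join(f"  {name} {col_type}" for name, col_type in columns)
    return (
        f"CREATE EXTERNAL TABLE {schema}.{table} (\n"
        f"{column_list}\n"
        f")\n"
        f"STORED AS {file_format}\n"
        f"LOCATION '{s3_location}';"
    )


if __name__ == "__main__":
    # Hypothetical names, used only to show the shape of the statements.
    print(external_schema_ddl(
        "spectrum",
        "spectrum_db",
        "arn:aws:iam::123456789012:role/ExampleSpectrumRole",
    ))
    print()
    print(external_table_ddl(
        "spectrum",
        "clicks",
        [("user_id", "bigint"), ("url", "varchar(2048)")],
        "s3://example-bucket/clicks/",
    ))
```

Because external tables are read-only, only the definitions above and SELECT queries apply to them; the underlying data changes when the files in S3 change.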



from DZone.com Feed https://ift.tt/2La8G2P
