Many Teradata customers are interested in integrating Teradata Vantage with Amazon Web Services (AWS) First Party Services. This guide will help you connect Teradata Vantage with the AWS Data Exchange service.
The procedure offered in this guide has been implemented and tested by Teradata. However, it is offered on an as-is basis. Neither AWS nor Teradata provides validation of Teradata Vantage with AWS Data Exchange.
This guide includes content from both AWS and Teradata product documentation.
This post describes how to subscribe to a dataset in AWS Data Exchange, export it to Amazon S3, and then query it with Teradata Vantage, using either the Native Object Store (NOS) capability or the Teradata Parallel Transporter (TPT) utility.
About AWS Data Exchange
AWS Data Exchange makes it easy to find, subscribe to, and use third-party data in the cloud. Today, AWS Data Exchange contains data products from a broad range of domains, including financial services, healthcare and life sciences, geospatial, weather, and mapping. Once subscribed to a data product, you can use the AWS Data Exchange API to export data into Amazon S3, and then use Teradata Vantage to explore and analyze the data.
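The export step described above can be sketched in Python with boto3. This is a minimal sketch, not the procedure Teradata tested: the function names, the default region, and the `${Asset.Name}` key pattern are illustrative assumptions, and the data set ID, revision ID, and bucket are placeholders you would take from your own subscription. It assumes boto3 is installed and AWS credentials are configured.

```python
import time


def build_export_job_details(data_set_id, revision_id, bucket, key_pattern="${Asset.Name}"):
    """Build the Details payload for an EXPORT_REVISIONS_TO_S3 job.

    The key pattern controls how exported objects are named in the bucket;
    "${Asset.Name}" is an illustrative choice, not a requirement.
    """
    return {
        "ExportRevisionsToS3": {
            "DataSetId": data_set_id,
            "RevisionDestinations": [
                {"RevisionId": revision_id, "Bucket": bucket, "KeyPattern": key_pattern}
            ],
        }
    }


def export_revision_to_s3(data_set_id, revision_id, bucket, region="us-east-1", poll_seconds=5):
    """Create and start an export job, then poll until it reaches a final state."""
    import boto3  # imported lazily so the payload builder works without boto3

    client = boto3.client("dataexchange", region_name=region)
    job = client.create_job(
        Type="EXPORT_REVISIONS_TO_S3",
        Details=build_export_job_details(data_set_id, revision_id, bucket),
    )
    client.start_job(JobId=job["Id"])

    # Poll the job until AWS Data Exchange reports a terminal state.
    while True:
        state = client.get_job(JobId=job["Id"])["State"]
        if state in ("COMPLETED", "ERROR", "CANCELLED", "TIMED_OUT"):
            return state
        time.sleep(poll_seconds)
```

Calling `export_revision_to_s3("<data-set-id>", "<revision-id>", "<your-bucket>")` would return the final job state. The payload builder is kept separate so the request can be inspected without making any AWS calls.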
About Teradata Vantage
Teradata Vantage is the connected multi-cloud data platform for enterprise analytics. It enables ecosystem simplification by unifying analytics, data lakes and data warehouses. Leading the way with multi-cloud environments and priced for flexibility, Vantage leverages 100 percent of a business’s data, regardless of scale, volume, or complexity.
Vantage combines descriptive, predictive, and prescriptive analytics, autonomous decision-making, machine learning (ML) functions, and visualization tools into a unified, integrated platform that uncovers real-time business intelligence at scale, no matter where the data resides.
Vantage enables companies to start small and elastically scale compute or storage, paying only for what they use, harnessing low-cost object stores and integrating their analytic workloads. Vantage supports R, Python, Teradata Studio, and any other SQL-based tools. You can deploy Vantage across public clouds, on-premises, on optimized or commodity infrastructure, or as-a-service.
Teradata Vantage Native Object Store (NOS) can be used to explore data in external object stores, like Amazon S3, using standard SQL. No special object storage-side compute infrastructure is required to use NOS. You can explore data located in an Amazon S3 bucket by simply creating a NOS table definition that points to your bucket. With NOS, you can quickly import data from Amazon S3 or even join it with other tables in the database.
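To illustrate what a NOS table definition that points to a bucket might look like, here is a small Python helper that generates the SQL text. It is a sketch under assumptions: the helper names, the table name, and the bucket are hypothetical, and the statement is modeled on the general `CREATE FOREIGN TABLE ... USING (LOCATION(...))` form, so check the exact clauses (especially the optional `EXTERNAL SECURITY` authorization clause) against the documentation for your Vantage release.

```python
def nos_location(bucket, prefix=""):
    """Return a NOS-style location string for an Amazon S3 bucket and optional prefix."""
    prefix = prefix.strip("/")
    path = f"/s3/{bucket}.s3.amazonaws.com/"
    return path + (prefix + "/" if prefix else "")


def create_foreign_table_sql(table, bucket, prefix="", authorization=None):
    """Generate a CREATE FOREIGN TABLE statement that points at the bucket.

    `authorization` is the name of an existing authorization object (assumed);
    leave it as None to omit the security clause.
    """
    security = (
        f",\nEXTERNAL SECURITY DEFINER TRUSTED {authorization}" if authorization else ""
    )
    return (
        f"CREATE FOREIGN TABLE {table}{security}\n"
        f"USING (LOCATION('{nos_location(bucket, prefix)}'));"
    )


if __name__ == "__main__":
    # Hypothetical names: replace with your own table, bucket, and prefix.
    print(create_foreign_table_sql("adx_data", "my-bucket", "exports"))
```

The generated statement can be run from any SQL client connected to Vantage, after which the foreign table can be queried or joined like any other table.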
Alternatively, the Teradata Parallel Transporter (TPT) utility can be used to import data from Amazon S3 to Teradata Vantage in bulk fashion. Once loaded, data can be efficiently queried within Vantage.
Wenjie Tehan is a Technical Consulting Manager, currently working with the Teradata Global Alliances team.
With over 20 years in the IT industry, Wenjie has worked as a developer, tester, business analyst, solution designer, and project manager. This breadth of roles makes her well suited to her current role, which requires understanding how the business needs data and how that data can be managed to meet those needs.
Wenjie has a BS in computer science from the University of California at San Diego and an ME in computer engineering from Cornell University. Wenjie is also certified on both Teradata and AWS.