Amazon DataZone: Connect People and Data

Overview

Increase business teams' efficiency by collaborating seamlessly across teams and providing self-service access to data and analytics tools. Use business terms to search, share, and access cataloged data stored on AWS, on premises, or with third-party providers. Use the business glossary to learn more about the data that you want to use.

Key features

Projects help a group of users collaborate and produce business outcomes. Project members consume assets from the business data catalog and produce new assets using one or more analytical workflows. You can use data projects to manage and monitor data assets across projects through usage auditing capabilities. Only users who are explicitly added to a project can access that project's data and tools.
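The membership rule described above can be pictured with a small conceptual model. This is only an illustrative sketch: the `Project` class and its methods are hypothetical names invented here, not part of any Amazon DataZone API, and the real service enforces access through its own permissions.

```python
from dataclasses import dataclass, field


@dataclass
class Project:
    """Hypothetical stand-in for a data project and its member list."""

    name: str
    members: set = field(default_factory=set)

    def add_member(self, user_id: str) -> None:
        # Membership is explicit: a user must be added by ID.
        self.members.add(user_id)

    def can_access(self, user_id: str) -> bool:
        # Only explicitly added users may reach the project's data and tools.
        return user_id in self.members


project = Project(name="sales-analytics")
project.add_member("alice")
print(project.can_access("alice"))
print(project.can_access("bob"))
```

In this model, a user who was never added is denied by default, which mirrors the "explicitly added" rule without any separate deny list.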

Data publishers bring all the assets into the inventory, where other users can enrich the data with business context before the asset is shared. The handoff between users is managed by the Amazon DataZone portal, a web experience outside the AWS Management Console that facilitates cross-functional collaboration while working with data and analytics tools in a self-service fashion. The portal uses existing credentials from your identity provider.

Leverage your choice of BI and analytics tools to analyze governed data effortlessly. Amazon DataZone supports authentication through the Athena JDBC Driver, enabling seamless access to your subscribed data lake assets. Connect with popular tools like Tableau, Domino, Power BI, Excel, SQL Workbench, DBeaver, and more for enhanced insights.

Use cases

Give analysts and data scientists a unified data management portal to discover, access, prepare, analyze, and report on data across the organization. The single portal helps them collaborate more easily with data engineers and IT admins to get insights from data faster. Provide the right balance between centralized and decentralized data governance, so business teams can be agile and not depend on IT teams while still complying with data governance policies.

Increase data visibility and access so that teams can more easily find and analyze data wherever it is stored. Provide the right level of data governance by capturing metadata from its source so that you can trust the data. With this increased data visibility, you don't need to create copies for every person wanting to work with the data.

Catalog your metadata from Amazon Simple Storage Service (Amazon S3) data lakes, many AWS purpose-built databases such as Amazon Relational Database Service (Amazon RDS), and other data sources using more than 70 AWS Glue crawlers. Because AWS Glue integrates with the 100-plus Amazon AppFlow connectors, you can catalog far more data sources and make data visible and available across the organization, removing siloed and hidden data.
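As a rough illustration of the cataloging step, the following sketch prepares and submits an AWS Glue crawler over an S3 path using the boto3 `glue` client (`create_crawler` and `start_crawler`). The helper names, crawler name, role ARN, database name, and bucket path are all placeholders assumed for this example, and the sketch covers only an S3 target, not the other source types mentioned above.

```python
def build_crawler_request(name, role_arn, database, s3_path):
    """Assemble keyword arguments for the AWS Glue CreateCrawler call.

    All argument values are supplied by the caller; nothing here is
    specific to Amazon DataZone.
    """
    if not s3_path.startswith("s3://"):
        raise ValueError("s3_path must start with s3://")
    return {
        "Name": name,
        "Role": role_arn,
        "DatabaseName": database,
        "Targets": {"S3Targets": [{"Path": s3_path}]},
    }


def create_and_start_crawler(request):
    """Create the crawler and start one run (needs boto3 and AWS credentials)."""
    import boto3  # imported lazily so the builder above works without boto3

    glue = boto3.client("glue")
    glue.create_crawler(**request)
    glue.start_crawler(Name=request["Name"])


# Placeholder values, assumed for illustration only.
request = build_crawler_request(
    name="sales-crawler",
    role_arn="arn:aws:iam::123456789012:role/ExampleGlueCrawlerRole",
    database="sales_catalog",
    s3_path="s3://example-bucket/sales/",
)
print(request["Targets"])
```

Calling `create_and_start_crawler(request)` would submit the request to your account. Keeping the request builder separate makes the parameters easy to review before anything is created.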

FAQ

How can users across my organization gain access to the Amazon DataZone portal?

When initially setting up Amazon DataZone, your administrators use their AWS credentials to deploy the root domain and have the option to enable access to the root domain through AWS IAM Identity Center. Users can then sign in to the Amazon DataZone portal with IAM or SSO credentials, so they can use an existing identity provider. Existing users and groups can be assigned as data stewards or be associated with a specific project.