Free Amazon Web Services Data-Engineer-Associate Practice Exam with Questions & Answers | Set: 9

Name: How to Pass Data-Engineer-Associate Exams
Brand: Examstrack
SKU: data-engineer-associate
Price: 31.5 USD
Availability: InStock

Questions 81

A company wants to build a dimension table in an Amazon S3 bucket. The bucket contains historical data that includes 10 million records. The historical data is 1 TB in size.

A data engineer needs a solution to update changes for up to 10,000 records in the base table every day.

Which solution will meet this requirement with the LOWEST runtime?

Options:

Develop an Apache Spark job in Amazon EMR to read the historical data and the new changes into two Spark DataFrames. Use the Spark update method to update the base table.

Develop an AWS Glue Python job to read the historical data and new changes into two Pandas DataFrames. Use the Pandas update method to update the base table.

Develop an AWS Glue Apache Spark job to read the historical data and new changes into two Spark DataFrames. Use the Spark update method to update the base table.

Develop an Amazon EMR job to read new changes into Apache Spark DataFrames. Use the Apache Hudi framework to create the base table in Amazon S3. Use the Spark update method to update the base table.

Amazon Web Services Data-Engineer-Associate Premium Access

Hannah

12-May-2026

examstrack's 24/7 support team is exceptional. They provide unwavering assistance throughout the certification journey.

Questions 82

A data engineer needs to analyze time-sensitive sales data. The company stores the data in an Amazon S3 bucket. The data engineer uses AWS Glue Data Catalog to access the data.

When performing the analysis, the data engineer notices that some records are missing or out of date.

What is the likely cause of these issues?

Options:

AWS Glue Data Catalog is not up to date with the latest S3 partition changes.

Incorrect IAM roles are assigned to the AWS Glue jobs.

Versioning is not enabled on the S3 bucket.

The AWS Glue job schedules overlap with one another.

Questions 83

A company stores its processed data in an S3 bucket. The company has a strict data access policy. The company uses IAM roles to grant teams within the company different levels of access to the S3 bucket.

The company wants to receive notifications when a user violates the data access policy. Each notification must include the username of the user who violated the policy.

Which solution will meet these requirements?

Options:

Use AWS Config rules to detect violations of the data access policy. Set up compliance alarms.

Use Amazon CloudWatch metrics to gather object-level metrics. Set up CloudWatch alarms.

Use AWS CloudTrail to track object-level events for the S3 bucket. Forward events to Amazon CloudWatch to set up CloudWatch alarms.

Use Amazon S3 server access logs to monitor access to the bucket. Forward the access logs to an Amazon CloudWatch log group. Use metric filters on the log group to set up CloudWatch alarms.

Questions 84

A company creates a new non-production application that runs on an Amazon EC2 instance. The application needs to communicate with an Amazon RDS database instance using Java Database Connectivity (JDBC). The EC2 instances and the RDS database instance are in the same subnet.

Which solution will meet this requirement?

Options:

Modify the IAM role that is assigned to the database instance to allow connections from the EC2 instances.

Modify the ec2_authorized_hosts parameter in the RDS parameter group to include the EC2 instances. Restart the database instance.

Update the database security group to allow connections from the EC2 instances.

Enable the Amazon RDS Data API and specify the Amazon Resource Name (ARN) of the database instance in the JDBC connection string.

Questions 85

A company stores customer data that contains personally identifiable information (PII) in an Amazon Redshift cluster. The company ' s marketing, claims, and analytics teams need to be able to access the customer data.

The marketing team should have access to obfuscated claim information but should have full access to customer contact information.

The claims team should have access to customer information for each claim that the team processes.

The analytics team should have access only to obfuscated PII data.

Which solution will enforce these data access requirements with the LEAST administrative overhead?

Options:

Create a separate Redshift cluster for each team. Load only the required data for each team. Restrict access to clusters based on the teams.

Create views that include required fields for each of the data requirements. Grant the teams access only to the view that each team requires.

Create a separate Amazon Redshift database role for each team. Define masking policies that apply for each team separately. Attach appropriate masking policies to each team role.

Move the customer data to an Amazon S3 bucket. Use AWS Lake Formation to create a data lake. Use fine-grained security capabilities to grant each team appropriate permissions to access the data.

Answer:

Explanation:

Step 1: Understand the Data Access Requirements

The question presents distinct access needs for three teams:

Marketing team: Needs full access to customer contact info but only obfuscated claim information.

Claims team: Needs access to customer information relevant to the claims they process.

Analytics team: Needs only obfuscated PII data.

These teams require different levels of access, and the solution needs to enforce data security while keeping administrative overhead low.

Step 2: Why Option B is Correct

Option B (Creating Views) is a common best practice in Amazon Redshift to restrict access to specific data without duplicating data or managing multiple clusters. By creating views:

You can define customized views of the data with obfuscated fields for the analytics team and marketing team while still providing full access where necessary.

Views provide a logical separation of data and allow Redshift administrators to grant access permissions based on roles or groups, ensuring that each team sees only what they are allowed to.

Obfuscation or masking of PII can be easily applied to the views by transforming or hiding sensitive data fields.

This approach avoids the complexity of managing multiple Redshift clusters or S3-based data lakes, which introduces higher operational and administrative overhead.

Step 3: Why Other Options Are Not Ideal

Option A (Separate Redshift Clusters) introduces unnecessary administrative overhead by managing multiple clusters. Maintaining several clusters for each team is costly, redundant, and inefficient.

Option C (Separate Redshift Roles) involves creating multiple roles and managing complex masking policies, which adds to administrative burden and complexity. While Redshift does support column-level access control, it ' s still more overhead than managing simple views.

Option D (Move to S3 and Lake Formation) is a more complex and heavy-handed solution, especially when the data is already stored in Redshift. Migrating the data to S3 and setting up a data lake with Lake Formation introduces significant operational complexity that isn ' t needed for this specific requirement.

Conclusion:

Creating views in Amazon Redshift allows for flexible, fine-grained access control with minimal overhead, making it the optimal solution to meet the data access requirements of the marketing, claims, and analytics teams.

Questions 86

A data engineer needs to deploy a complex pipeline. The stages of the pipeline must run scripts, but only fully managed and serverless services can be used.

Options:

Deploy AWS Glue jobs and workflows. Use AWS Glue to run the jobs and workflows on a schedule.

Use Amazon MWAA to build and schedule the pipeline.

Deploy the script to EC2. Use EventBridge to schedule it.

Use AWS Glue DataBrew and EventBridge to run on a schedule.

Exam Code: Data-Engineer-Associate

Certification Provider: Amazon Web Services

Exam Name: AWS Certified Data Engineer - Associate (DEA-C01)

Last Update: Jun 17, 2026

Questions: 289

How to Pass Data-Engineer-Associate Exams

PDF + Testing Engine
~~$164.99~~ $49.5 Add to Cart

Testing Engine
~~$124.99~~ $37.5 Add to Cart

PDF (Q&A)
~~$104.99~~ $31.5 Add to Cart

Amazon Web Services Related Exams

AIP-C01 - AWS Certified Generative AI Developer - Professional

AXS-C01 - AWS Certified Alexa Skill Builder-Specialty

SAP-C02 - AWS Certified Solutions Architect - Professional

AIF-C01 - AWS Certified AI Practitioner Exam

SOA-C03 - AWS Certified CloudOps Engineer - Associate

GDP-C01 - AWS Certified Generative AI Developer - Professional

CLF-C02 - AWS Certified Cloud Practitioner

SOA-C01 - AWS Certified SysOps Administrator - Associate

DVA-C02 - AWS Certified Developer - Associate

MLS-C01 - AWS Certified Machine Learning - Specialty

MLA-C01 - AWS Certified Machine Learning Engineer - Associate

SCS-C03 - AWS Certified Security – Specialty

DOP-C02 - AWS Certified DevOps Engineer - Professional

Get Amazon Web Services Full Access

Amazon Web Services Free Exams
Get free access to Amazon Web Services exam prep materials and practice tests at Examstrack. Achieve your Amazon Web Services certification goals by exploring Examstrack.

Summer Sale Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: 70track

Navigation:

examstrack logo

Hot Vendors:

Free Amazon Web Services Data-Engineer-Associate Practice Exam with Questions & Answers | Set: 9

How to Pass Data-Engineer-Associate Exams

Amazon Web Services Related Exams

Amazon Web Services Free Exams