exam questions

Exam AWS Certified Data Analytics - Specialty All Questions

View all questions & answers for the AWS Certified Data Analytics - Specialty exam

Exam AWS Certified Data Analytics - Specialty topic 1 question 130 discussion

A hospital is building a research data lake to ingest data from electronic health records (EHR) systems from multiple hospitals and clinics. The EHR systems are independent of each other and do not have a common patient identifier. The data engineering team is not experienced in machine learning (ML) and has been asked to generate a unique patient identifier for the ingested records.
Which solution will accomplish this task?

  • A. An AWS Glue ETL job with the FindMatches transform
  • B. Amazon Kendra
  • C. Amazon SageMaker Ground Truth
  • D. An AWS Glue ETL job with the ResolveChoice transform
Show Suggested Answer Hide Answer
Suggested Answer: A 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
polooor
Highly Voted 3 years, 7 months ago
You can now use AWS Glue to find matching records across a dataset (including ones without identifiers) by using the new FindMatches ML Transform, a custom machine learning transformation that helps you identify matching records. By adding the FindMatches transformation to your Glue ETL jobs, you can find related products, places, suppliers, customers, and more. You can also use the FindMatches transformation for deduplication, such as to identify customers who have signed up more than once, products that have accidentally been added to your product catalog more than once, and so forth. You can teach the FindMatches ML Transform your definition of a “duplicate” through examples, and it will use machine learning to identify other potential duplicates in your dataset.
upvoted 11 times
...
JiyuKim
Most Recent 1 year, 5 months ago
I also agree with A. But I have a doubt about whether it can "generate" a unique identifier.
upvoted 2 times
...
pk349
2 years, 2 months ago
A: I passed the test
upvoted 3 times
...
cloudlearnerhere
2 years, 8 months ago
Correct answer is A as Glue can be used to perform matching across data stores using the FinMatches API. https://docs.aws.amazon.com/glue/latest/dg/machine-learning.html Option B is wrong as Amazon Kendra is an intelligent search service powered by machine learning (ML). Kendra reimagines enterprise search for your websites and applications so your employees and customers can easily find the content they’re looking for, even when it’s scattered across multiple locations and content repositories within your organization. Option C is wrong as Amazon SageMaker Ground Truth is a data labeling service that makes it easy to label data and gives you the option to use human annotators through Amazon Mechanical Turk. Option D is wrong as ResolveChoice helps resolve a choice type within a DynamicFrame. It is ideal for format changes.
upvoted 2 times
...
maitis
2 years, 11 months ago
Selected Answer: A
A is the answer
upvoted 2 times
...
rocky48
2 years, 11 months ago
Selected Answer: A
A is the answer
upvoted 1 times
...
Thiya
3 years, 7 months ago
A is the answer
upvoted 2 times
...
Chints01
3 years, 7 months ago
Answer should be A as FindMatches completely addresses the use case in question
upvoted 3 times
...
goutes
3 years, 7 months ago
OPTION A: FindMatches ML: identify duplicate or matching records in your dataset, even when the records do not have a common unique identifier and no fields match exactly.
upvoted 1 times
...
Fazil_Cp
3 years, 8 months ago
Option A - https://aws.amazon.com/about-aws/whats-new/2019/08/aws-glue-provides-findmatches-ml-transform-to-deduplicate/
upvoted 4 times
...
srinivasa
3 years, 9 months ago
Answer: B https://aws.amazon.com/kendra/
upvoted 2 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...