
Exam AWS Certified Machine Learning Engineer - Associate MLA-C01 All Questions


Exam AWS Certified Machine Learning Engineer - Associate MLA-C01 topic 1 question 107 discussion

A company needs to extract entities from a PDF document to build a classifier model.

Which solution will extract and store the entities in the LEAST amount of time?

  • A. Use Amazon Comprehend to extract the entities. Store the output in Amazon S3.
  • B. Use an open source AI optical character recognition (OCR) tool on Amazon SageMaker to extract the entities. Store the output in Amazon S3.
  • C. Use Amazon Textract to extract the entities. Use Amazon Comprehend to convert the entities to text. Store the output in Amazon S3.
  • D. Use Amazon Textract integrated with Amazon Augmented AI (Amazon A2I) to extract the entities. Store the output in Amazon S3.
Suggested Answer: A

Comments

snna4
3 days, 16 hours ago
Selected Answer: C
C.
  • Amazon Textract is specifically designed to quickly extract text, forms, and tables from PDF documents.
  • Amazon Comprehend can then process the extracted text to identify entities (such as names, locations, and dates).
upvoted 1 time
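The two-step flow described in this comment (Textract for text, then Comprehend for entities, then S3 for storage) can be sketched roughly as follows. This is a minimal sketch, not a confirmed solution to the exam question: the function names `lines_from_textract` and `extract_and_store` are hypothetical, the language is assumed to be English, and the clients are passed in so that real ones (for example `boto3.client("textract")`, `boto3.client("comprehend")`, `boto3.client("s3")`) or stand-ins can be used.

```python
import json


def lines_from_textract(response):
    """Join the text of all LINE blocks in a DetectDocumentText response."""
    return "\n".join(
        block["Text"]
        for block in response.get("Blocks", [])
        if block.get("BlockType") == "LINE"
    )


def extract_and_store(pdf_bytes, bucket, key, textract, comprehend, s3):
    """Hypothetical helper: OCR a document, detect entities, store them in S3."""
    # Step 1: Textract turns the document bytes into text blocks.
    ocr = textract.detect_document_text(Document={"Bytes": pdf_bytes})
    text = lines_from_textract(ocr)

    # Step 2: Comprehend detects entities in the plain text
    # (LanguageCode "en" is an assumption for this sketch).
    result = comprehend.detect_entities(Text=text, LanguageCode="en")
    entities = [
        {"Text": e["Text"], "Type": e["Type"], "Score": e["Score"]}
        for e in result.get("Entities", [])
    ]

    # Step 3: the entity list is written to S3 as a JSON object.
    s3.put_object(Bucket=bucket, Key=key, Body=json.dumps(entities).encode("utf-8"))
    return entities
```

Note that this flow makes two service calls in sequence before anything is stored, which is the overhead other commenters point to when arguing for option A.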
AgboolaKun
1 week, 4 days ago
Selected Answer: A
The best solution is to use Amazon Comprehend to extract entities and store the output in Amazon S3 because it provides direct entity extraction from text documents using pre-trained models without additional processing steps. Using Amazon Textract followed by Amazon Comprehend (Option C) would create unnecessary processing overhead since it requires two sequential services to run, increasing both the processing time and cost of the solution.
upvoted 1 time
Rams2025
4 weeks ago
Selected Answer: A
https://aws.amazon.com/blogs/aws/now-process-pdfs-word-documents-and-images-with-amazon-comprehend-for-idp/ Amazon Comprehend has a feature for intelligent document processing (IDP). It lets you classify and extract entities from PDF documents, Microsoft Word files, and images directly in Amazon Comprehend, without needing to extract the text first.
upvoted 2 times
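The direct route described in this comment might look roughly like the sketch below, where the PDF bytes go straight to Comprehend's `DetectEntities` call. Several things here are assumptions: the helper names are hypothetical, the endpoint ARN is a placeholder, and, as far as the AWS API documentation describes it, the `Bytes` input on this call applies to custom entity recognition models served from an endpoint, so it is worth checking whether that fits the scenario in the question. The client is passed in (for example `boto3.client("comprehend")`).

```python
def build_direct_request(pdf_bytes, endpoint_arn):
    """Hypothetical helper: request parameters for sending a PDF to DetectEntities."""
    return {
        "Bytes": pdf_bytes,
        # Placeholder ARN of a custom entity recognizer endpoint (assumption).
        "EndpointArn": endpoint_arn,
        # Tells Comprehend how to read text out of the document when needed.
        "DocumentReaderConfig": {
            "DocumentReadAction": "TEXTRACT_DETECT_DOCUMENT_TEXT",
            "DocumentReadMode": "SERVICE_DEFAULT",
        },
    }


def detect_entities_in_pdf(pdf_bytes, endpoint_arn, comprehend):
    """Send the document in one call and return (text, type) pairs."""
    response = comprehend.detect_entities(**build_direct_request(pdf_bytes, endpoint_arn))
    return [(e.get("Text"), e.get("Type")) for e in response.get("Entities", [])]
```

Compared with the Textract-then-Comprehend flow, this makes a single call from the caller's side, which is the basis of the "least amount of time" argument for option A.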
chris_spencer
1 month, 2 weeks ago
Selected Answer: C
Agree with C. Normally Amazon Comprehend is sufficient if the PDF contains only text. Since the question does not mention the exact contents of the PDF files, it would be safer to use Amazon Textract to extract the text and then have Amazon Comprehend do the entity extraction.
upvoted 1 time
Community vote distribution
A (35%, most voted)
C (25%)
B (20%)
Other