Exam AWS Certified Machine Learning - Specialty topic 1 question 117 discussion

An e-commerce company wants to launch a new cloud-based product recommendation feature for its web application. Due to data localization regulations, any sensitive data must not leave its on-premises data center, and the product recommendation model must be trained and tested using nonsensitive data only. Data transfer to the cloud must use IPsec. The web application is hosted on premises with a PostgreSQL database that contains all the data. The company wants the data to be uploaded securely to Amazon S3 each day for model retraining.
How should a machine learning specialist meet these requirements?

  • A. Create an AWS Glue job to connect to the PostgreSQL DB instance. Ingest tables without sensitive data through an AWS Site-to-Site VPN connection directly into Amazon S3.
  • B. Create an AWS Glue job to connect to the PostgreSQL DB instance. Ingest all data through an AWS Site-to-Site VPN connection into Amazon S3 while removing sensitive data using a PySpark job.
  • C. Use AWS Database Migration Service (AWS DMS) with table mapping to select PostgreSQL tables with no sensitive data through an SSL connection. Replicate data directly into Amazon S3.
  • D. Use PostgreSQL logical replication to replicate all data to PostgreSQL in Amazon EC2 through AWS Direct Connect with a VPN connection. Use AWS Glue to move data from Amazon EC2 to Amazon S3.
Suggested Answer: A

Comments

cnethers
Highly Voted 3 years, 6 months ago
Ask: extract data over IPsec, so we need an ETL service plus a Site-to-Site VPN.
Glue is an ETL service, but can it connect to PostgreSQL? Yes: https://docs.aws.amazon.com/glue/latest/dg/aws-glue-programming-etl-connect.html#aws-glue-programming-etl-connect-jdbc
How to connect Glue to an on-premises database: https://aws.amazon.com/blogs/big-data/how-to-access-and-analyze-on-premises-data-stores-using-aws-glue/
My answer would be A. Answer C only makes a 443 (SSL) connection, so it does not meet the IPsec requirement.
upvoted 27 times
ksrivastavaSumit
Highly Voted 3 years, 7 months ago
A? IPsec needs to be covered as well.
upvoted 8 times
StelSen
3 years, 6 months ago
Yes. https://aws.amazon.com/blogs/big-data/how-to-access-and-analyze-on-premises-data-stores-using-aws-glue/. 'A' is the correct answer.
upvoted 3 times
MultiCloudIronMan
Most Recent 6 months, 2 weeks ago
Selected Answer: A
It's 'A' because IPSec is required.
upvoted 1 times
sachin80
12 months ago
A: https://medium.com/awsblackbelt/loading-on-prem-postgres-data-into-amazon-s3-with-server-side-filtering-c13bcee8b769
upvoted 1 times
VR10
1 year, 1 month ago
Selected Answer: A
B: Does not ensure that only nonsensitive data leaves the on-premises data center. C: Uses SSL, not IPsec. D: Like B, transfers all data. Hence the correct answer is A.
upvoted 1 times
AIWave
1 year, 2 months ago
I will go with B.
Site-to-Site VPN -> IPsec requirement.
AWS Glue -> connect to and catalog PostgreSQL.
PySpark -> remove sensitive information. AWS Glue supports PySpark.
upvoted 1 times
kyuhuck
1 year, 2 months ago
Selected Answer: C
The best option is to use AWS Database Migration Service (AWS DMS) with table mapping to select PostgreSQL tables with no sensitive data through an SSL connection. Replicate data directly into Amazon S3. This option meets the following requirements:
It ensures that only nonsensitive data is transferred to the cloud by using table mapping to filter out the tables that contain sensitive data.
It uses IPsec to secure the data transfer by enabling SSL encryption for the AWS DMS endpoint.
It uploads the data to Amazon S3 each day for model retraining by using the ongoing replication feature of AWS DMS.
upvoted 3 times
LeoD
4 months, 1 week ago
IPsec and SSL are two different things. Using SSL does not necessarily mean option C has IPsec implemented, which is required.
upvoted 1 times
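For readers unfamiliar with the "table mapping" mentioned in option C: AWS DMS takes a JSON document of selection rules that name the schemas and tables to include. The snippet below is only a minimal sketch of generating such a document. The schema name `public` and the table names are hypothetical, not taken from the question. It shows how table-level filtering would work in DMS and does nothing about the IPsec requirement raised in the reply above.

```python
import json


def selection_rules(schema, tables):
    """Build a DMS-style table-mapping document that includes only the given tables."""
    rules = []
    for i, table in enumerate(tables, start=1):
        rules.append({
            "rule-type": "selection",
            "rule-id": str(i),
            "rule-name": str(i),
            "object-locator": {"schema-name": schema, "table-name": table},
            "rule-action": "include",
        })
    return {"rules": rules}


# Hypothetical nonsensitive tables; any table not listed is not selected.
mapping = selection_rules("public", ["products", "categories"])
table_mappings_json = json.dumps(mapping, indent=2)
print(table_mappings_json)
```

The resulting JSON string is the kind of value a DMS replication task accepts as its table mappings. Tables holding sensitive data are simply never named in an include rule.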
Rejju
1 year, 7 months ago
But Glue cannot filter out the data during ingestion, so option A wouldn't be the right one! I would go for B.
upvoted 1 times
LeoD
4 months, 1 week ago
I think A is saying only to ingest tables that don't contain sensitive data, meaning while configuring Glue, the specialist will only select the tables that don't contain sensitive data for ingestion.
upvoted 1 times
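The "only select the tables that don't contain sensitive data" reading of option A can be sketched as a simple allow-list. This is a rough illustration, not a definitive Glue job. The table names, JDBC URL, credentials, bucket, and the helper names `build_read_options` and `s3_path` are all hypothetical. The dictionaries follow the JDBC `connection_options` keys (`url`, `dbtable`, `user`, `password`) that a Glue job passes to `create_dynamic_frame.from_options`. The Glue call itself appears only as a comment, so the snippet runs without the Glue libraries.

```python
# Hypothetical allow-list: only tables known to hold nonsensitive data.
NONSENSITIVE_TABLES = ["products", "categories", "product_views"]


def build_read_options(jdbc_url, tables, user, password):
    """Return one JDBC connection_options dict per allow-listed table."""
    return [
        {"url": jdbc_url, "dbtable": table, "user": user, "password": password}
        for table in tables
    ]


def s3_path(bucket, table, run_date):
    """Build a dated S3 prefix for one table's daily extract (hypothetical layout)."""
    return f"s3://{bucket}/retraining/{table}/dt={run_date}/"


# Assumed private address reachable over the Site-to-Site VPN; placeholder credentials.
options = build_read_options(
    "jdbc:postgresql://10.0.0.5:5432/shop",
    NONSENSITIVE_TABLES,
    "glue_reader",
    "example-only",
)

for opts in options:
    # Inside a real Glue job, each dict would be used roughly as:
    #   glue_context.create_dynamic_frame.from_options(
    #       connection_type="postgresql", connection_options=opts)
    print(opts["dbtable"], "->", s3_path("my-bucket", opts["dbtable"], "2024-01-01"))
```

Because tables outside the allow-list are never read, their rows never cross the VPN. That is the difference from option B, where all data is pulled first and filtered afterwards.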
jopaca1216
1 year, 7 months ago
B. Neither A nor C is correct: the question does not mention tables with no sensitive data, and DMS typically acts on the data on the AWS side. The right answer is B. AWS Glue connects to the PostgreSQL database, allowing the removal of sensitive data using a PySpark job BEFORE securely ingesting the data into Amazon S3, thus aligning with the requirements.
upvoted 3 times
Hybrid_Cloud_boy
1 year, 4 months ago
I think the issue with this answer is that the data actually leaves the data center and enters the Glue service before sensitive data is redacted, which makes me lean toward A.
upvoted 3 times
Mickey321
1 year, 8 months ago
Selected Answer: C
Option c
upvoted 1 times
ADVIT
1 year, 10 months ago
A: https://aws.amazon.com/blogs/big-data/doing-data-preparation-using-on-premises-postgresql-databases-with-aws-glue-databrew/
upvoted 2 times
AjoseO
2 years, 2 months ago
Selected Answer: A
A. Create an AWS Glue job to connect to the PostgreSQL DB instance. Ingest tables without sensitive data through an AWS Site-to-Site VPN connection directly into Amazon S3. This solution meets the requirements of data localization regulations and secure data transfer. By creating an AWS Glue job to connect to the PostgreSQL DB instance, the machine learning specialist can extract tables without sensitive data. By using a Site-to-Site VPN connection, the data can be securely transferred from the on-premises data center to Amazon S3, where it can be used for model retraining. This solution ensures that any sensitive data remains in the on-premises data center, and that only non-sensitive data is uploaded to the cloud.
upvoted 2 times
matteocal
2 years, 9 months ago
Selected Answer: A
IPSec means VPN
upvoted 4 times
geekgirl007
3 years, 3 months ago
Answer is A. IPsec is not the same as SSL. Site to site VPN is for IPsec: https://aws.amazon.com/vpn/site-to-site-vpn/ Also Glue can directly connect to Postgres and upload to S3: https://aws.amazon.com/blogs/big-data/how-to-access-and-analyze-on-premises-data-stores-using-aws-glue/
upvoted 3 times
Deepsachin
3 years, 5 months ago
A is the answer
upvoted 1 times
Dr_Kiko
3 years, 6 months ago
Between A and C, I pick A because IPSec requires VPN; otherwise DMS is a better option
upvoted 3 times
Community vote distribution
A (35%), Most Voted
C (25%)
B (20%)
Other