
Exam AI-100 topic 3 question 6 discussion

Actual exam question from Microsoft's AI-100

Question #: 6
Topic #: 3

You are designing an AI solution that will provide feedback to teachers who train students over the Internet. The students will be in classrooms located in remote areas. The solution will capture video and audio data of the students in the classrooms.
You need to recommend Azure Cognitive Services for the AI solution to meet the following requirements:
✑ Alert teachers if a student seems angry or distracted.
✑ Identify each student in the classrooms for attendance purposes.
✑ Allow the teachers to log the text of conversations between themselves and the students.
Which Cognitive Services should you recommend?

A. Computer Vision, Text Analytics, and Face API
B. Video Indexer, Face API, and Text Analytics
C. Computer Vision, Speech to Text, and Text Analytics
D. Text Analytics, QnA Maker, and Computer Vision
E. Video Indexer, Speech to Text, and Face API


Suggested Answer: E
Azure Video Indexer is a cloud application built on Azure Media Analytics, Azure Search, and Cognitive Services (such as the Face API, Microsoft Translator, the Computer Vision API, and Custom Speech Service). It lets you extract insights from your videos using Video Indexer's video and audio models.
Face API enables you to search, identify, and match faces in your private repository of up to 1 million people.
The Face API now integrates emotion recognition, returning the confidence across a set of emotions for each face in the image such as anger, contempt, disgust, fear, happiness, neutral, sadness, and surprise. These emotions are understood to be cross-culturally and universally communicated with particular facial expressions.
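As a rough illustration of the "alert teachers if a student seems angry" requirement, the sketch below works on a plain dictionary of per-emotion confidences shaped like the output described above. It does not call the Face API; the function names, the sample scores, and the 0.5 threshold are all hypothetical and chosen only for illustration.

```python
from typing import Dict

# Hypothetical alert threshold; a real solution would need to tune this.
ANGER_THRESHOLD = 0.5


def dominant_emotion(scores: Dict[str, float]) -> str:
    """Return the emotion label with the highest confidence."""
    return max(scores, key=scores.get)


def should_alert(scores: Dict[str, float],
                 threshold: float = ANGER_THRESHOLD) -> bool:
    """Flag a face when its anger confidence reaches the threshold."""
    return scores.get("anger", 0.0) >= threshold


# Made-up confidences for one detected face (not real service output).
sample = {
    "anger": 0.72, "contempt": 0.05, "disgust": 0.03, "fear": 0.01,
    "happiness": 0.0, "neutral": 0.15, "sadness": 0.03, "surprise": 0.01,
}

if should_alert(sample):
    print(f"Alert teacher: dominant emotion is {dominant_emotion(sample)}")
```

Note that "distracted" is not one of the listed emotions, so this sketch covers only the anger half of that requirement.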
Speech-to-text from Azure Speech Services, also known as speech recognition, enables real-time transcription of audio streams into text that your applications, tools, or devices can consume, display, and act on as command input. This service is powered by the same recognition technology that Microsoft uses for Cortana and Office products, and it works seamlessly with translation and text-to-speech.
Incorrect Answers:
Neither Computer Vision nor QnA Maker is required.
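The elimination reasoning can be sketched as a simple coverage check: each requirement is mapped to the services that could satisfy it, and an option passes only if it covers all three. The mapping below is an assumption that simplifies the explanation above (it is not an official capability matrix), and the requirement keys are hypothetical labels.

```python
# Assumed mapping from each requirement to services that can satisfy it,
# simplified from the explanation in the suggested answer.
REQUIREMENTS = {
    "alert_on_emotion": {"Face API", "Video Indexer"},
    "identify_students": {"Face API"},
    "log_conversation_text": {"Speech to Text"},
}

# Answer options exactly as listed in the question.
OPTIONS = {
    "A": {"Computer Vision", "Text Analytics", "Face API"},
    "B": {"Video Indexer", "Face API", "Text Analytics"},
    "C": {"Computer Vision", "Speech to Text", "Text Analytics"},
    "D": {"Text Analytics", "QnA Maker", "Computer Vision"},
    "E": {"Video Indexer", "Speech to Text", "Face API"},
}


def covers_all(services: set) -> bool:
    """True if at least one chosen service satisfies every requirement."""
    return all(services & capable for capable in REQUIREMENTS.values())


covering = [label for label, services in OPTIONS.items() if covers_all(services)]
print(covering)
```

Under these assumptions, options A and B fail because nothing in them transcribes speech, and C and D fail because nothing in them identifies students.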
References:
https://docs.microsoft.com/en-us/azure/media-services/video-indexer/video-indexer-overview
https://azure.microsoft.com/en-us/services/cognitive-services/face/
https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/speech-to-text

by TDg at Sept. 18, 2020, 9:03 a.m.

Comments


zDavid

Highly Voted 4 years, 6 months ago

The given answer is correct. You need Speech to Text to log the text of conversations; Text Analytics is not really needed here.

upvoted 5 times

...

rveney

Most Recent 2 years ago

The recommended Azure Cognitive Services to meet the requirements are B: Video Indexer, Face API, and Text Analytics.

upvoted 1 times

...

Distinctive

4 years, 6 months ago

The given answer is correct. I want to believe the order is not important.

upvoted 3 times

Cornholioz

4 years, 4 months ago

Not just because of the order, but because Text Analytics is not required. Speech-to-Text (Speech Recognition) is sufficient. I'm glad they didn't make it even harder by adding a choice between Video Indexer and Face API. Video Indexer can do Face Detection while Face API can do Face Detection + Face Recognition. We need both here and hence B or E. And only E because of the above point on the Text part.

upvoted 3 times

...

AcetheTest

4 years, 7 months ago

Am I missing something? The question says nothing about analyzing what people are saying aloud. All it says is to analyze the text. The mood of the students can be detected from their facial expressions. I don't think Speech to Text is needed.

upvoted 1 times

AcetheTest

4 years, 7 months ago

turns out I am missing something. "solution will capture video and audio data"

upvoted 1 times

...

allanm

4 years, 1 month ago

"Allow the teachers to log the text of conversations between themselves and the students." This is where you will need the Speech to Text API.

upvoted 1 times

...

japhlet

4 years, 9 months ago

The correct option is actually B. Video Indexer helps you analyze the students' facial reactions. With Face API, you can identify students based on their gender, age and, depending on the data already provided, even their names. Text Analytics allows the teachers to analyze their conversations with the students.

upvoted 2 times

UpsetUser

4 years, 5 months ago

What do you mean by "Analyze"? The question says "Log", that's it. So the given answer is correct. The same answer is marked in the Whizlabs practice tests.

upvoted 2 times

...

sayak17

4 years, 9 months ago

No. The given answer is correct because we don't need to analyse the conversations, just need to log them. Speech-to-text, also known as speech recognition, enables real-time transcription of audio streams into text.

upvoted 12 times

Srinivas1

4 years, 9 months ago

Do you mean that the order of the services in the given option is not important, or do you mean that Face API can be used for text logging?

upvoted 1 times

sayak17

4 years, 8 months ago

The order is not important.

upvoted 1 times

...

TDg

4 years, 9 months ago

I think the correct option is B.

upvoted 2 times

...