exam questions

Exam AI-100 All Questions

View all questions & answers for the AI-100 exam

Exam AI-100 topic 3 question 6 discussion

Actual exam question from Microsoft's AI-100
Question #: 6
Topic #: 3
[All AI-100 Questions]

You are designing an AI solution that will provide feedback to teachers who train students over the Internet. The students will be in classrooms located in remote areas. The solution will capture video and audio data of the students in the classrooms.
You need to recommend Azure Cognitive Services for the AI solution to meet the following requirements:
✑ Alert teachers if a student seems angry or distracted.
✑ Identify each student in the classrooms for attendance purposes.
✑ Allow the teachers to log the text of conversations between themselves and the students.
Which Cognitive Services should you recommend?

  • A. Computer Vision, Text Analytics, and Face API
  • B. Video Indexer, Face API, and Text Analytics
  • C. Computer Vision, Speech to Text, and Text Analytics
  • D. Text Analytics, QnA Maker, and Computer Vision
  • E. Video Indexer, Speech to Text, and Face API
Show Suggested Answer Hide Answer
Suggested Answer: E 🗳️
Azure Video Indexer is a cloud application built on Azure Media Analytics, Azure Search, Cognitive Services (such as the Face API, Microsoft Translator, the
Computer Vision API, and Custom Speech Service). It enables you to extract the insights from your videos using Video Indexer video and audio models.
Face API enables you to search, identify, and match faces in your private repository of up to 1 million people.
The Face API now integrates emotion recognition, returning the confidence across a set of emotions for each face in the image such as anger, contempt, disgust, fear, happiness, neutral, sadness, and surprise. These emotions are understood to be cross-culturally and universally communicated with particular facial expressions.
Speech-to-text from Azure Speech Services, also known as speech-to-text, enables real-time transcription of audio streams into text that your applications, tools, or devices can consume, display, and take action on as command input. This service is powered by the same recognition technology that Microsoft uses for
Cortana and Office products, and works seamlessly with the translation and text-to-speech.
Incorrect Answers:
Computer Vision or the QnA is not required.
References:
https://docs.microsoft.com/en-us/azure/media-services/video-indexer/video-indexer-overview https://azure.microsoft.com/en-us/services/cognitive-services/face/ https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/speech-to-text

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
zDavid
Highly Voted 4 years, 6 months ago
Given answer is correct You need speech to text to log text of conversations, text analytics is not really needed here.
upvoted 5 times
...
rveney
Most Recent 2 years ago
recommended Azure Cognitive Services to meet the requirements are: B. Video Indexer, Face API, and Text Analytics.
upvoted 1 times
...
Distinctive
4 years, 6 months ago
The given answer is correct, I want to believe the order is not important.
upvoted 3 times
Cornholioz
4 years, 4 months ago
Not just because of the order, but because Text Analytics is not required. Speech-to-Text (Speech Recognition) is sufficient. I'm glad they didn't make it even harder by adding a choice between Video Indexer and Face API. Video Indexer can do Face Detection while Face API can do Face Detection + Face Recognition. We need both here and hence B or E. And only E because of the above point on the Text part.
upvoted 3 times
...
...
AcetheTest
4 years, 7 months ago
am I missing something? the question says nothing about analyzing what people are saying aloud. all it says it to analyze the text. the mood of the students can be detected by their facial expression. I don't think Speech to text is needed.
upvoted 1 times
AcetheTest
4 years, 7 months ago
turns out I am missing something. "solution will capture video and audio data"
upvoted 1 times
...
allanm
4 years, 1 month ago
"Allow the teachers to log the text of conversations between themselves and the students." This is where you will need Speech to Text API
upvoted 1 times
...
...
japhlet
4 years, 9 months ago
The correct option is actually B. Video Indexer helps you analyze the student's facial reaction. With Face API, you can identity students based on their gender, age and depending on the data already provided, even their names. Text Analytics allows the teachers analyze their conversations with the students.
upvoted 2 times
UpsetUser
4 years, 5 months ago
What do you mean by ""Analyze"". Question says ""Log"" thats it. So given answer is correct. Same answer is marked in Whizlabs Practice tests.
upvoted 2 times
...
sayak17
4 years, 9 months ago
No. The given answer is correct because we don't need to analyse the conversations, just need to log them. Speech-to-text, also known as speech recognition, enables real-time transcription of audio streams into text.
upvoted 12 times
Srinivas1
4 years, 9 months ago
Do you mean, the solutions order in the given option is not important or do you mean to say FaceAPI can be used for text logging?
upvoted 1 times
sayak17
4 years, 8 months ago
order is not important
upvoted 1 times
...
...
...
...
TDg
4 years, 9 months ago
I think the correct option is -B
upvoted 2 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...