NLP Class 10 Questions and Answers

NLP Class 10 Questions and Answers – The CBSE has changed the syllabus of Std. X. The Questions and Answers are made based on the new syllabus and based on the CBSE textbook, Sample paper and Board Paper.  All the important Information are taken from the Artificial Intelligence Class X Textbook Based on CBSE Board Pattern.

NLP Class 10 Questions and Answers

1. Mention a few features of natural languages.

Answer: They are governed by set rules that include syntax, lexicon, and semantics, All natural languages are redundant, i.e., the information can be conveyed in multiple ways. All natural languages change over time.

2. What is the significance of NLP?

Answer: Artificial Intelligence nowadays is becoming an integral part of our lives, its applications are very commonly used by the majority of people in their daily lives. Here are some of the applications of Natural Language Processing which are used in the real-life scenario:

  • Voice assistants – Voice assistants take our natural speech, process it, and give us an output. These assistants leverage NLP to understand natural language and execute tasks efficiently.
  • Auto generated captions – Captions are generated by turning natural speech into text in real-time. It is a valuable feature for enhancing the accessibility of video content.
  • Language translation – It incorporates the generation of translation from another language. This involves the conversion of text or speech from one language to another, facilitating cross-linguistic communication and fostering global connectivity.
  • Sentiment analysis – Sentiment Analysis is a tool to express an opinion, whether the underlying sentiment is positive, negative, or neutral. Customer sentiment analysis helps in the automatic detection of emotions when customers interact with the products, services, or brand

3. What do you mean by lexical analysis in NLP?

Answer: NLP starts with identifying the structure of input words. It is the process of dividing a large chunk of words into structural paragraphs, sentences, and words. Lexicon stands for a collection of the various words and phrases used in a language.

4. What do you mean by a chatbot?

Answer: A chatbot is a computer program that’s designed to simulate human conversation through voice commands or text chats or both. It can learn over time how to best interact with humans. It can answer questions and troubleshoot customer problems, evaluate and qualify prospects, generate sales leads and increase sales on an ecommerce site.

5. What does the term “Bag of Words” refer to in Natural Language Processing (NLP)?

Answer: Bag of Words is a Natural Language Processing model which helps in extracting features out of the text which can be helpful in machine learning algorithms. In the bag of words, we get the occurrences of each word and construct the vocabulary for the corpus.

6. Describe two practical uses of Natural Language Processing in real-world scenarios.

Answer: The two practical uses of NLP in real world are chatbots and voice assistant.

  • Chatbots: A chatbot is a computer program that’s designed to simulate human conversation through voice commands or text chats or both. It can learn over time how to best interact with humans. It can answer questions and troubleshoot customer problems, evaluate and qualify prospects, generate sales leads and increase sales on an ecommerce site.
  • Voice assistant: Voice assistants take our natural speech, process it, and give us an output. These assistants leverage NLP to understand natural language and execute tasks efficiently.

7. Explain the process of stemming and lemmatization in text processing, supported by an example.

Answer: Stemming and lemmatization both are alternative processes to each other as the role of both the processes is same – removal of affixes. But the difference between both of them isthat in lemmatization, the word we get after affix removal (also known as lemma) is a meaningful one. Lemmatization makes sure that a lemma is a word with meaning and hence it takes a longer time to execute than stemming. Example,

  • Caring -> Lemmatization -> Care
  • Caring -> Stemming -> Car

8. Describe any four applications of TFIDF.

Answer: TFIDF is commonly used in the Natural Language Processing domain. Some of its applications are:

  • Document Classification – Helps in classifying the type and genre of a document.
  • Topic Modelling – It helps in predicting the topic for a corpus.
  • Information Retrieval System – To extract the important information out of a corpus.
  • Stop word filtering – Helps in removing unnecessary words from a text body.

9. Samiksha, a student of class X was exploring the Natural Language Processing domain. She got stuck while performing the text normalisation. Help her to normalise the text on the segmented sentences given below:

  • Document 1: Akash and Ajay are best friends.
  • Document 2: Akash likesto play football but Ajay prefers to play online games.

Answer: Text normalization helps to convert text into a standard format to improve NLP processing, it include lowercasing, removing extra spaces, correcting spellings, stemming and lemmatization.

Step 1: Input text given

  • Document 1: Akash and Ajay are best friends.
  • Document 2: Akash likesto play football but Ajay prefers to play online games.

Step 2: Convert to lowercase

  • Document 1: akash and ajay are best friends.
  • Document 2: akash likesto play football but ajay prefers to play online games.

Step 3: Remove Extra space

  • Document 1: akash and ajay are best friends.
  • Document 2: akash likes to play football but ajay prefers to play online games.

Step 4: Tokenization

  • Document 1 Tokens: akash, and, ajay, are, best, friends
  • Document 2 Tokens: akash, likes, to, play, football, but, ajay, prefers, to, play, online, games

Step 5: Lemmatization

  • Document 1: akash and ajay are best friend.
  • Document 2: akash like to play football but ajay prefer to play online game.

10. Through a step-by-step process, calculate TFIDF for the given corpus

  • Document 1: Johny Johny Yes Papa,
  • Document 2: Eating sugar? No Papa
  • Document 3: Telling lies? No Papa
  • Document 4: Open your mouth, Ha! Ha! Ha!

Answer: To calculate TF-IDF, follow the following steps:

Step 1: Tokenize the Documents

  • Document 1: Johny, Johny, Yes, Papa
  • Document 2: Eating, sugar, No, Papa
  • Document 3: Telling, lies, No, Papa
  • Document 4: Open, your, mouth, Ha, Ha, Ha

Step 2: Create a Term Frequency (TF) table

Formula for finding Term Frequency

Term Frequency
TermDocument 1Document 2Document 3Document 4
Johny2/4 = 0.5000
Yes1/4 = 0.25000
Papa1/4 = 0.251/4 = 0.251/4 = 0.250
Eating01/4 = 0.2500
sugar01/4 = 0.2500
No01/4 = 0.251/4 = 0.250
Telling001/4 = 0.250
lies001/4 = 0.250
Open0001/6 = 0.1667
your0001/6 = 0.1667
mouth0001/6 = 0.1667
Ha0003/6 = 0.5

Step 3: Calculate Inverse Document Frequency (IDF)

Formula for Inverse Document Frequency-

Inverse Document Frequency
TermAppears in how many docs?IDF CalculationIDF Value
Johny1log(4/1)1.386
Yes1log(4/1)1.386
Papa3log(4/3)0.287
Eating1log(4/1)1.386
sugar1log(4/1)1.386
No2log(4/2)0.693
Telling1log(4/1)1.386
lies1log(4/1)1.386
Open1log(4/1)1.386
your1log(4/1)1.386
mouth1log(4/1)1.386
Ha1log(4/1)1.386

Step 4: Calculate TF-IDF Score

Formula for TF-IDF calculation are –

TFIDF(W) = TF(W) * log( IDF(W) )

TermDocument 1Document 2Document 3Document 4
Johny0.5 x 1.386 = 0.693000
Yes0.25 x 1.386 = 0.3465000
Papa0.25 x 0.287 = 0.071750.25 x 0.287 = 0.071750.25 x 0.287 = 0.071750
Eating00.25 x 1.386 = 0.346500
sugar00.25 x 1.386 = 0.346500
No00.25 x 0.693 = 0.173250.25 x 0.693 = 0.173250
Telling000.25 x 1.386 = 0.34650
lies000.25 x 1.386 = 0.34650
Open0000.1667 x 1.386 = 0.2310
your0000.1667 x 1.386 = 0.2310
mouth0000.1667 x 1.386 = 0.2310
Ha0000.5 x 1.386 = 0.693

11. Identify any two stop words which should not be removed from the given sentence and why?
Get help and support whether you’re shopping now or need help with a past purchase.
Contact us at abc@pwershel.com or on our website www.pwershel.com

Answer: Stopwords in the given sentence which should not be removed are: @, . (fullstop) ,_(underscore) , 123(numbers) These tokens are generally considered as stopwords, but in the above sentence, these tokens are part of email id. removing these tokens may lead to invalid website address and email ID. So these words should not be removed from the above sentence.

12. All of us use smartphones. When we install a new app, it asks us for several permissions to access our phone’s data in different ways. Why do apps collect such data?

Answer:

  1. To provide customized notifications and recommendations.
  2. To improve the efficiency and accuracy of the app.

13. What will be the results of conversion of the term, ‘happily’ in the process of stemming and lemmatization? Which process takes longer time for execution?

Answer: Stemming Lemmatization happily happi happy Process that takes longer time for execution is lemmatization.

14. What do we get from the “bag of words” algorithm?

Answer: Bag of words gives us two things:

  • The frequency of these words
  • A vocabulary of words for the corpus

15. Samiksha, a student of class X was exploring the Natural Language Processing domain. She got stuck while performing the text normalisation. Help her to normalise the text on the segmented sentences given below:

  • Document 1: Akash and Ajay are best friends.
  • Document 2: Akash likes to play football but Ajay prefers to play online games.

Answer:

  1. Tokenisation: Akash, and, Ajay, are, best, friends, Akash, likes, to, play, football, but, Ajay, prefers, to, play, online, games
  2. Removal of stopwords: Akash, Ajay, best, friends, Akash, likes, play, football, Ajay, prefers, play, online, games
  3. converting text to a common case: akash, ajay, best, friends, akash, likes, play, football, ajay, prefers, play, online, games
  4. Stemming/Lemmatisation: akash, ajay, best, friend, akash, like, play, football, ajay, prefer, play, online, game

16. Identify any 2 stopwords in the given sentence:
Pollution is the introduction of contaminants into the natural environment that cause adverse change.The three types of pollution are air pollution, water pollution and land pollution.

Answer: Stopwords in the given sentence are: is, the, of, that, into, are, and

17. “Automatic summarization is used in NLP applications”. Is the given statement correct? Justify your answer with an example.

Answer: Yes, the given statement is correct. Automatic summarization is relevant not only for summarizing the meaning of documents and information, but also to understand the emotional meanings within the information, such as in collecting data from social media. Automatic summarization is especially relevant when used to provide an overview of a news item or blog post, while avoiding redundancy from multiple sources and
maximizing the diversity of content obtained.

18. Write any two applications of TFIDF

Answer:

  1. Document Classification: Helps in classifying the type and genre of a document.
  2. Topic Modelling: It helps in predicting the topic for a corpus.
  3. Information Retrieval System: To extract the important information out of a corpus.
  4. Stop word filtering: Helps in removing the unnecessary words out of a text body.

19. Write down the steps to implement bag of words algorithm.

Answer: The steps to implement bag of words algorithm are as follows:

  1. Text Normalisation: Collect data and pre-process it
  2. Create Dictionary: Make a list of all the unique words occurring in the corpus. (Vocabulary)
  3. Create document vectors: For each document in the corpus, find out how many times the word from the unique list of words has occurred.
  4. Create document vectors for all the documents.

20. Explain the term Text Normalisation in Data Processing.

Answer: The first step in Data processing is Text Normalisation. Text Normalisation helps in cleaning up the textual data in such a way that it comes down to a level where its complexity is lower than the actual data. In this we undergo several steps to normalise the text to a lower level. We work on text from multiple documents and the term used for the whole textual data from all the documents altogether is known as corpus.

21. Name any 2 applications of Natural Language Processing which are used in the real-life scenario.

Answer: The 4 applications of Natural Language processing are –

  • Automatic Summarization: Automatic summarization is relevant not only for summarizing the meaning of documents and information, but also to understand the emotional meanings within the information, such as in collecting data from social media.
  • Sentiment Analysis: The goal of sentiment analysis is to identify sentiment among several posts or even in the same post where emotion is not always explicitly expressed.
  • Text classification: Text classification makes it possible to assign predefined categories to a document and organize it to help you find the information you need or simplify some activities.
  • Virtual Assistants: With the help of speech recognition, these assistants can not only detect our speech but can also make sense out of it.

Disclaimer: We have taken an effort to provide you with the accurate handout of “NLP Class 10 Questions and Answers“. If you feel that there is any error or mistake, please contact me at anuraganand2017@gmail.com. The above CBSE study material present on our websites is for education purpose, not our copyrights. All the above content and Screenshot are taken from Artificial Intelligence Class 10 CBSE Textbook, Sample Paper, Old Sample Paper, Board Paper and Support Material which is present in CBSEACADEMIC website, This Textbook and Support Material are legally copyright by Central Board of Secondary Education. We are only providing a medium and helping the students to improve the performances in the examination. 

Images and content shown above are the property of individual organizations and are used here for reference purposes only.

For more information, refer to the official CBSE textbooks available at cbseacademic.nic.in

cbseskilleducation

Leave a Comment

error: Content is protected !!