NLP Flashcards: Key Concepts and Techniques
Table of Contents
- 1. NLP Flashcards
- 1.1. What does NLP stand for? drill nlp
- 1.2. Tokenization drill nlp
- 1.3. Part-of-Speech (POS) Tagging drill nlp
- 1.4. Named Entity Recognition (NER) drill nlp
- 1.5. Sentiment Analysis drill nlp
- 1.6. Word Embedding drill nlp
- 1.7. LSTM drill nlp
- 1.8. Transformer Architecture drill nlp
- 1.9. TF-IDF drill nlp
- 1.10. Lemmatization drill nlp
1. NLP Flashcards
1.1. What does NLP stand for? drill nlp
1.1.1. Front
What does NLP stand for in the context of computer science?
1.1.2. Back
Natural Language Processing
1.2. Tokenization drill nlp
1.2.1. Front
What is tokenization in NLP?
1.2.2. Back
Tokenization is the process of breaking down text into smaller units called tokens, typically words or subwords.
1.3. Part-of-Speech (POS) Tagging drill nlp
1.3.1. Front
What is Part-of-Speech (POS) tagging?
1.3.2. Back
POS tagging is the process of assigning grammatical categories (e.g., noun, verb, adjective) to each word in a text.
1.4. Named Entity Recognition (NER) drill nlp
1.4.1. Front
What is Named Entity Recognition (NER)?
1.4.2. Back
NER is the task of identifying and classifying named entities (e.g., person names, organizations, locations) in text.
1.5. Sentiment Analysis drill nlp
1.5.1. Front
What is Sentiment Analysis?
1.5.2. Back
Sentiment Analysis is the process of determining the emotional tone or opinion expressed in a piece of text, typically classifying it as positive, negative, or neutral.
1.6. Word Embedding drill nlp
1.6.1. Front
What is a Word Embedding?
1.6.2. Back
A word embedding is a dense vector representation of a word that captures semantic meaning, allowing words with similar meanings to have similar vector representations.
1.7. LSTM drill nlp
1.7.1. Front
What does LSTM stand for, and what is it used for in NLP?
1.7.2. Back
LSTM stands for Long Short-Term Memory. It's a type of recurrent neural network (RNN) architecture used for processing and predicting time series data, particularly useful for tasks involving sequential data like text.
1.8. Transformer Architecture drill nlp
1.8.1. Front
What is the Transformer architecture in NLP?
1.8.2. Back
The Transformer is a neural network architecture that uses self-attention mechanisms to process sequential data. It has become the foundation for many state-of-the-art NLP models, including BERT and GPT.
1.9. TF-IDF drill nlp
1.9.1. Front
What does TF-IDF stand for, and what is it used for?
1.9.2. Back
TF-IDF stands for Term Frequency-Inverse Document Frequency. It's a numerical statistic used to reflect the importance of a word in a document within a collection or corpus, often used in information retrieval and text mining.
1.10. Lemmatization drill nlp
1.10.1. Front
What is lemmatization in NLP?
1.10.2. Back
Lemmatization is the process of reducing words to their base or dictionary form (lemma). For example, "running" and "ran" would both be lemmatized to "run".