NLP Flashcards: Key Concepts and Techniques

Table of Contents

NLP Flashcards

What does NLP stand for?   drill nlp

Front

What does NLP stand for in the context of computer science?

Back

Natural Language Processing

Tokenization   drill nlp

Front

What is tokenization in NLP?

Back

Tokenization is the process of breaking down text into smaller units called tokens, typically words or subwords.

Part-of-Speech (POS) Tagging   drill nlp

Front

What is Part-of-Speech (POS) tagging?

Back

POS tagging is the process of assigning grammatical categories (e.g., noun, verb, adjective) to each word in a text.

Named Entity Recognition (NER)   drill nlp

Front

What is Named Entity Recognition (NER)?

Back

NER is the task of identifying and classifying named entities (e.g., person names, organizations, locations) in text.

Sentiment Analysis   drill nlp

Front

What is Sentiment Analysis?

Back

Sentiment Analysis is the process of determining the emotional tone or opinion expressed in a piece of text, typically classifying it as positive, negative, or neutral.

Word Embedding   drill nlp

Front

What is a Word Embedding?

Back

A word embedding is a dense vector representation of a word that captures semantic meaning, allowing words with similar meanings to have similar vector representations.

LSTM   drill nlp

Front

What does LSTM stand for, and what is it used for in NLP?

Back

LSTM stands for Long Short-Term Memory. It's a type of recurrent neural network (RNN) architecture used for processing and predicting time series data, particularly useful for tasks involving sequential data like text.

Transformer Architecture   drill nlp

Front

What is the Transformer architecture in NLP?

Back

The Transformer is a neural network architecture that uses self-attention mechanisms to process sequential data. It has become the foundation for many state-of-the-art NLP models, including BERT and GPT.

TF-IDF   drill nlp

Front

What does TF-IDF stand for, and what is it used for?

Back

TF-IDF stands for Term Frequency-Inverse Document Frequency. It's a numerical statistic used to reflect the importance of a word in a document within a collection or corpus, often used in information retrieval and text mining.

Lemmatization   drill nlp

Front

What is lemmatization in NLP?

Back

Lemmatization is the process of reducing words to their base or dictionary form (lemma). For example, "running" and "ran" would both be lemmatized to "run".

Author: Jason Walsh

j@wal.sh

Last Updated: 2024-10-30 16:43:54