Post

๐Ÿค— NLP Course - Summary of Concept to Transformer from NLP

 LLM Research Trends

Introduction

This course will teach you about natural language processing (NLP) using libraries from the Hugging Face ecosystem โ€” ๐Ÿค— Transformers, ๐Ÿค— Datasets, ๐Ÿค— Tokenizers, and ๐Ÿค— Accelerate โ€” as well as the Hugging Face Hub. Itโ€™s completely free and without ads.

This course:

  • Requires a good knowledge of Python
  • Is better taken after an introductory deep learning course, such as fast.aiโ€™s Practical Deep Learning for Coders or one of the programs developed by DeepLearning.AI
  • Does not expect prior PyTorch or TensorFlow knowledge, though some familiarity with either of those will help

What is NLP?

NLP is a field of linguistics and machine learning focused on understanding everything related to human language. The aim of NLP tasks is not only to understand single words individually, but to be able to understand the context of those words.

The following is a list of common NLP tasks, with some examples of each:

  • Classifying whole sentences: Getting the sentiment of a review, detecting if an email is spam, determining if a sentence is grammatically correct or whether two sentences are logically related or not
  • Classifying each word in a sentence: Identifying the grammatical components of a sentence (noun, verb, adjective), or the named entities (person, location, organization)
  • Generating text content: Completing a prompt with auto-generated text, filling in the blanks in a text with masked words
  • Extracting an answer from a text: Given a question and a context, extracting the answer to the question based on the information provided in the context
  • Generating a new sentence from an input text: Translating a text into another language, summarizing a text
  • NLP isnโ€™t limited to written text though. It also tackles complex challenges in speech recognition and computer vision, such as generating a transcript of an audio sample or a description of an image.

Why is it challenging?

Computers donโ€™t process information in the same way as humans. For example, when we read the sentence โ€œI am hungry,โ€ we can easily understand its meaning.

Similarly, given two sentences such as โ€œI am hungryโ€ and โ€œI am sad,โ€ weโ€™re able to easily determine how similar they are. For machine learning (ML) models, such tasks are more difficult.

The text needs to be processed in a way that enables the model to learn from it. And because language is complex, we need to think carefully about how this processing must be done. There has been a lot of research done on how to represent text, and we will look at some methods in the next chapter.

โ€ฆ (to be continue)

This post is licensed under CC BY 4.0 by the author.