Natural Language Processing (NLP)


Natural Language Processing (NLP) is a field of Artificial Intelligence that gives machines the ability to read, understand, and derive meaning from human languages.

Overview

NLP Model Selection
A guide that makes it easier to select NLP models. It is prescriptive in nature and should be used with caution.
transfer-learning natural-language-processing neural-networks transformers
The State of Transfer Learning in NLP
This post expands on the NAACL 2019 tutorial on Transfer Learning in NLP. It highlights key insights and takeaways and provides updates based on recent ...
transfer-learning natural-language-processing pretraining tutorial
The Future of (Transfer Learning in) Natural Language Processing
Transfer Learning in Natural Language Processing (NLP): Open questions, current trends, limits, and future directions.
natural-language-processing transfer-learning tutorial video

Tutorials

Tokenizers: How Machines Read
A survey of different tokenization strategies in NLP.
tokenizers natural-language-processing survey tutorial
Natural Language Processing: Pretraining - d2l
An interactive deep learning book with code, math, and discussions, based on the NumPy interface.
pretraining natural-language-processing bert mxnet
A Deep Dive into the Wonderful World of Preprocessing in NLP
A glimpse into the surprisingly deep and interesting world of preprocessing in NLP.
tokenization preprocessing natural-language-processing tutorial
Top Down Introduction to BERT with HuggingFace and PyTorch
Provides some intuition into how BERT works with a top-down approach (from applications to algorithm).
bert top-down huggingface pytorch
PyTorch Transformers Tutorials
A set of annotated Jupyter notebooks that give users a template for fine-tuning transformer models on downstream NLP tasks such as classification and NER.
transformers text-classification text-summarization named-entity-recognition

Libraries

Transformers - Hugging Face
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
transformers huggingface attention bert
BLINK: Better entity LINKing
An entity linking Python library that uses Wikipedia as the target knowledge base.
named-entity-recognition wikification natural-language-processing code
Rasa
An open source machine learning framework to automate text- and voice-based conversations.
conversational-ai natural-language-processing library code
Token2index
A lightweight but powerful library to build token indices for NLP tasks, compatible with major deep learning frameworks like PyTorch and TensorFlow.
tokenization preprocessing natural-language-processing sequence-to-sequence
FlashText
Extract keywords from sentences or replace keywords in sentences.
text-extraction regex natural-language-processing library
Finetune: Scikit-learn Style Model Finetuning for NLP
Finetune is a library that allows users to leverage state-of-the-art pretrained NLP models for a wide variety of downstream tasks.
natural-language-processing finetuning pretraining transformers
Jellyfish
🎐 A Python library for approximate and phonetic matching of strings.
text-matching text-similarity jellyfish levenshtein
TextBrewer
A PyTorch-based model distillation toolkit for natural language processing.
model-distillation natural-language-processing model-compression distillation
Jiant
A software toolkit for research on general-purpose text understanding models.
multi-task-learning natural-language-processing huggingface pytorch
NLP Viewer 🤗
A simple website for browsing popular NLP datasets.
natural-language-processing huggingface datasets streamlit
SentencePiece
Unsupervised text tokenizer for Neural Network-based text generation.
tokenization word-segmentation machine-translation natural-language-processing