Natural Language Processing in Python - with Spacy, NLTK, TextBlob, Sumy, and YAKE

Published: 18 July 2022
on channel: BugBytes
1,917
68

In this video, we look at the application of different NLP methods using the Python programming language.

We analyze UK Tory politicians' resignation letters, with the following text analytics techniques:

1. Sentiment Analysis - to determine whether sentences are positive, neutral or negative
2. Wordclouds - for visualizing common words and tokens
3. Keyword Analysis - finding key words and phrases in the text
4. Document Similarity - comparing different texts and using Jaccard Similarity to quantify how similar two documents are.
5. Text Summarisation - to cut down a text into its key parts
6. Named Entity Recognition - An automated way of finding entities such as people, organisations and places in a text

Files for this task are available on Github:
https://github.com/bugbytes-io/tory-nlp

📌 𝗖𝗵𝗮𝗽𝘁𝗲𝗿𝘀:
00:00 Intro
03:04 Sentiment Analysis with TextBlob and NLTK
11:18 Wordclouds in Python
13:33 Keyword Analysis in Python
16:42 Document Similarity using Jaccard Coefficient
19:24 Stemming Text with PorterStemmer
23:55 Document Summarisation
27:32 Named Entity Recognition with spaCy

☕️ 𝗕𝘂𝘆 𝗺𝗲 𝗮 𝗰𝗼𝗳𝗳𝗲𝗲:
To support the channel and encourage new videos, please consider buying me a coffee here:
https://ko-fi.com/bugbytes

𝗦𝗼𝗰𝗶𝗮𝗹 𝗠𝗲𝗱𝗶𝗮:
📖 Blog: https://www.bugbytes.io/posts/
👾 Github: https://github.com/bugbytes-io/tory-nlp
🐦 Twitter:   / bugbytesio  

📚 𝗙𝘂𝗿𝘁𝗵𝗲𝗿 𝗿𝗲𝗮𝗱𝗶𝗻𝗴 𝗮𝗻𝗱 𝗶𝗻𝗳𝗼𝗿𝗺𝗮𝘁𝗶𝗼𝗻:
TextBlob: https://textblob.readthedocs.io/en/dev/
TextBlob (install): https://anaconda.org/conda-forge/text...
WordCloud: https://anaconda.org/conda-forge/word...
YAKE: https://anaconda.org/conda-forge/yake
Sumy: https://pypi.org/project/sumy/
Sumy (install): https://anaconda.org/conda-forge/sumy
Latent Semantic Analysis: https://en.wikipedia.org/wiki/Latent_...
Spacy (install): https://spacy.io/usage
Spacy Named Entity Recognition: https://spacy.io/usage/linguistic-fea...

#python #nlp #nltk #datascience