Description
Text Processing with Python Essential Training Course. In the world of big data, more and more information is consumed and analyzed in text form. Websites, social media, emails, and chats have become important sources of data and insights. When working with data, it is important to understand how to handle unstructured text data. In this course, instructor Kumaran Ponnambalam helps you build your text mining skills and covers key techniques for extracting, cleaning, and processing text in Python. Kumaran explores key text processing concepts such as tokenization and rooting. He also covers techniques for converting text into a form ready for analysis, including n-grams and TF-IDF. Along the way, he provides examples of these techniques using Python and the NLTK library.
What you will learn in the Python Essential Text Processing training course
- Text mining today
- Document concept
- Corpus concepts
- Introduction to the NLTK Library
- Setting up the environment
- Reading raw files
- Reading files with Corpus Reader
- Exploring the corpus
- Analysis of the corpus
- Cleaning text
1 min 58 sec
Stop removing words - To stem
- lemmatization
- Build N-gram
- Mark parts of speech
- Term frequency-inverse document frequency (TF-IDF)
- Construction of a TF-IDF matrix
- Save text
- Processing of text data
- Scalable processing of text data
Course details
Course headings
Course pictures
Sample video of the course
installation Guide
After extracting, you can watch it with your favorite player.
English subtitles
Quality: 720p
Download link
free download software
Size
73.5MB