For private teaching, tutoring, feel free to reach out: https://www.superprof.com/ivy-league-phd-quantitative-biomedical-sciences-biochemistry-and-applied-mathematics-years-experience-data.html Follow the Code: https://github.com/bricard1/Lstm_textclassifier/blob/master/LSTM_pt1_tokenization_wordembed_similarity.ipynb Get the data: https://www.yelp.com/dataset About the Data: 0:30 Exploring data in terminal/powershell using 'more' (optional): 0:46 Opening JSON data in Python: 2:44 Binarizing output data: 4:23 Overview: 9:37 Tokenization and Stopwords: 10:11 Word Embeddings: 11:17 Cosine similarity for Documents, Words, Sentences: 12:20 Calculating similarity of documents: 13:40 About LSTM: 20:05 LSTM papers: Original: https://dl.acm.org/doi/10.1162/neco.1997.9.8.1735 Good review: https://arxiv.org/abs/1503.04069 Please 🙏 like and subscribe 👍! I would like to make videos full time to allow all people to access them for free instead of teaching privately for a school, and every bit of support helps me be able to reach that goal! Ask me anything on Discord or in the comments: https://discord.gg/tshJMB6Gsk