In this post, I want to show how I use NLTK for preprocessing and tokenization, but then apply machine learning techniques (e.g. building a linear SVM using stochastic gradient descent) using Scikit-Learn.
In this post you will see 5 recipes of supervised classification algorithms applied to small standard datasets that are provided with the scikit-learn library.
K. Xu, Y. Feng, S. Huang, и D. Zhao. Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing EMNLP, стр. 536–540. (2015)cite arxiv:1506.07650.