Skip to content
Related Articles

Related Articles

Improve Article

Python NLTK | nltk.TweetTokenizer()

  • Difficulty Level : Easy
  • Last Updated : 07 Jun, 2019

With the help of NLTK nltk.TweetTokenizer() method, we are able to convert the stream of words into small small tokens so that we can analyse the audio stream with the help of nltk.TweetTokenizer() method.

Syntax : nltk.TweetTokenizer()
Return : Return the stream of token

Example #1 :
In this example when we pass audio stream in the form of string it will converted to small tokens from a long string with the help of nltk.TweetTokenizer() method.




# import TweetTokenizer() method from nltk
from nltk.tokenize import TweetTokenizer
  
# Create a reference variable for Class TweetTokenizer
tk = TweetTokenizer()
  
# Create a string input
gfg = "Geeks for Geeks"
  
# Use tokenize method
geek = tk.tokenize(gfg)
  
print(geek)

Output :

[‘Geeks’, ‘for’, ‘Geeks’]



Example #2 :




# import TweetTokenizer() method from nltk
from nltk.tokenize import TweetTokenizer
  
# Create a reference variable for Class TweetTokenizer
tk = TweetTokenizer()
  
# Create a string input
gfg = ":-) <> () {} [] :-p"
  
# Use tokenize method
geek = tk.tokenize(gfg)
  
print(geek)

Output :

[‘:-)’, ”, ‘(‘, ‘)’, ‘{‘, ‘}’, ‘[‘, ‘]’, ‘:-p’]

 Attention geek! Strengthen your foundations with the Python Programming Foundation Course and learn the basics.  

To begin with, your interview preparations Enhance your Data Structures concepts with the Python DS Course. And to begin with your Machine Learning Journey, join the Machine Learning – Basic Level Course




My Personal Notes arrow_drop_up
Recommended Articles
Page :