Python NLTK | nltk.TweetTokenizer()

Last Updated : 12 Sep, 2023

With the help of NLTK nltk.TweetTokenizer() method, we are able to convert the stream of words into small tokens so that we can analyse the audio stream with the help of nltk.TweetTokenizer() method.

Syntax : nltk.TweetTokenizer() Return : Return the stream of token

Example #1 : In this example when we pass audio stream in the form of string it will converted to small tokens from a long string with the help of nltk.TweetTokenizer() method.

Python3

# import TweetTokenizer() method from nltk 
from nltk.tokenize import TweetTokenizer 
  
# Create a reference variable for Class TweetTokenizer 
tk = TweetTokenizer() 
  
# Create a string input 
gfg = "Geeks for Geeks" 
  
# Use tokenize method 
geek = tk.tokenize(gfg) 
  
print(geek) 

Output :

[‘Geeks’, ‘for’, ‘Geeks’]

Example #2 :

Python3

# import TweetTokenizer() method from nltk 
from nltk.tokenize import TweetTokenizer 
  
# Create a reference variable for Class TweetTokenizer 
tk = TweetTokenizer() 
  
# Create a string input 
gfg = ":-) <> () {} [] :-p"
  
# Use tokenize method 
geek = tk.tokenize(gfg) 
  
print(geek) 

Output :

[‘:-)’, ”, ‘(‘, ‘)’, ‘{‘, ‘}’, ‘[‘, ‘]’, ‘:-p’]

Suggest improvement

Python NLTK | nltk.WhitespaceTokenizer

Share your thoughts in the comments

Python NLTK | nltk.TweetTokenizer()

Python3

Python3

Please Login to comment...

Similar Reads

What kind of Experience do you want to share?