Python NLTK | nltk.TweetTokenizer()

Last Updated : 12 Sep, 2023

With the help of the NLTK nltk.TweetTokenizer() class, we can split a string into small tokens. The tokenizer is designed for tweets and similar casual text, so it keeps emoticons, hashtags and @-handles together as single tokens instead of breaking them apart.

Syntax : nltk.TweetTokenizer().tokenize(text)
Return : Return the list of tokens (strings)

Example #1 : In this example, we pass a string to the tokenize() method of nltk.TweetTokenizer(), which splits the long string into small tokens.

Python3

# Import the TweetTokenizer class from nltk
from nltk.tokenize import TweetTokenizer

# Create a TweetTokenizer instance
tk = TweetTokenizer()

# Create a string input
gfg = "Geeks for Geeks"

# Tokenize the string; the result is a list of strings
geek = tk.tokenize(gfg)

print(geek)


Output :

['Geeks', 'for', 'Geeks']
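
Example #1 uses plain words, so the result looks the same as a simple whitespace split. The difference shows up on tweet-like text. The following is a minimal sketch, assuming NLTK is installed; the sample tweet is made up for illustration and is not from the original example.

```python
from nltk.tokenize import TweetTokenizer

# Hypothetical sample tweet, invented for this illustration
tweet = "@user loving #nltk :-) so much!!"

# Tokenize with TweetTokenizer (default settings)
tk = TweetTokenizer()
tweet_tokens = tk.tokenize(tweet)

# Plain whitespace split, for comparison
split_tokens = tweet.split()

print(tweet_tokens)
print(split_tokens)
```

With the default settings, the handle, the hashtag and the emoticon should each come out as one token, and the trailing exclamation marks should be separated from the word, whereas str.split() leaves "much!!" glued together.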

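Example #2 : In this example, the input string contains emoticons and brackets. The emoticons are kept as single tokens, while the brackets are split into separate tokens.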

Python3

# Import the TweetTokenizer class from nltk
from nltk.tokenize import TweetTokenizer

# Create a TweetTokenizer instance
tk = TweetTokenizer()

# Create a string input with emoticons and brackets
gfg = ":-) <> () {} [] :-p"

# Tokenize the string; the result is a list of strings
geek = tk.tokenize(gfg)

print(geek)


Output :

[':-)', '<', '>', '(', ')', '{', '}', '[', ']', ':-p']
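
Both examples above call TweetTokenizer() with no arguments. The constructor also accepts the keyword arguments preserve_case, reduce_len and strip_handles, which are not covered in the examples above. The sketch below assumes NLTK is installed and reuses a sample sentence in the style of the NLTK documentation to contrast the default behaviour with a customised tokenizer.

```python
from nltk.tokenize import TweetTokenizer

# Sample text with a handle, an elongated word and repeated punctuation
text = "@remy: This is waaaaayyyy too much for you!!!!!!"

# Default settings: case is kept, handles are kept, lengths are untouched
default_tk = TweetTokenizer()
default_tokens = default_tk.tokenize(text)

# Customised settings:
#   preserve_case=False -> lowercase everything except emoticons
#   reduce_len=True     -> shorten runs of 3+ repeated characters to 3
#   strip_handles=True  -> drop @-handles from the text
custom_tk = TweetTokenizer(preserve_case=False, reduce_len=True, strip_handles=True)
custom_tokens = custom_tk.tokenize(text)

print(default_tokens)
print(custom_tokens)
```

These options can be useful for normalising noisy text before counting words, since "waaaaayyyy" and "waaayyy" would otherwise be treated as different tokens.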

