- Define a grammar that parses three phrase types: noun phrases, prepositional phrases, and verb phrases.
- For noun phrases, a ChunkRule looks for an optional determiner followed by one or more nouns.
- A MergeRule is then used to add an adjective to the front of a noun chunk.
- For prepositional phrases, any word tagged IN is simply chunked.
- For verb phrases, an optional modal word (such as "should") followed by a verb is chunked.
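The grammar described above can be sketched with NLTK's RegexpParser. This is a reconstruction from the description and not necessarily the article's original code. The tag patterns assume Penn Treebank tags, and the hand-tagged sentence is a hypothetical example chosen so that no corpus download is needed.

```python
from nltk.chunk import RegexpParser
from nltk.tree import Tree

# Assumed grammar, reconstructed from the description above.
# Note: avoid colons inside grammar comments, because RegexpParser
# treats a colon as the start of a new stage definition.
grammar = r"""
NP:
    {<DT>?<NN.*>+}   # chunk an optional determiner with one or more nouns
    <JJ>{}<NN.*>     # merge a chunk ending in an adjective with a following noun chunk
PP:
    {<IN>}           # chunk any preposition
VP:
    {<MD>?<VB.*>}    # chunk an optional modal with a verb
"""
chunker = RegexpParser(grammar)

# Hypothetical hand-tagged sentence as (word, tag) pairs.
tagged = [("the", "DT"), ("dog", "NN"), ("should", "MD"),
          ("bark", "VB"), ("at", "IN"), ("cats", "NNS")]

tree = chunker.parse(tagged)

# Collect each top-level chunk as (label, words).
chunks = [(sub.label(), [word for word, tag in sub.leaves()])
          for sub in tree if isinstance(sub, Tree)]
print(chunks)
```

Each labelled block in the grammar is applied as a separate stage, so the NP rules run first, followed by the PP and VP rules on the result.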
Code #1 :
Accuracy : 0.6148573545757688
The treebank_chunk corpus is a special version of the treebank corpus that provides a chunked_sents() method. Due to its file format, the regular treebank corpus cannot provide that method.
Code #2 : Using treebank_chunk
Accuracy : 0.49033970276008493
Chunk Score Metrics
The chunk score provides metrics other than accuracy.
Precision: of the chunks the chunker guessed, how many were correct.
Recall: how well the chunker did at finding correct chunks, compared to how many total chunks there were.
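As a minimal sketch of these two definitions, chunks can be compared as sets of spans. The function name precision_recall and the (label, start, end) span format are illustrative assumptions, not part of NLTK.

```python
def precision_recall(guessed, gold):
    """Return (precision, recall) for two sets of chunk spans."""
    true_positives = len(guessed & gold)  # chunks found in both sets
    precision = true_positives / len(guessed) if guessed else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall

# Hypothetical chunks as (label, start token, end token) spans.
gold = {("NP", 0, 2), ("VP", 2, 4), ("PP", 4, 5), ("NP", 5, 6)}
guessed = {("NP", 0, 2), ("VP", 3, 4), ("PP", 4, 5)}

precision, recall = precision_recall(guessed, gold)
print("Precision :", precision)
print("Recall :", recall)
```

Here two of the three guessed chunks match the gold standard, and two of the four gold chunks were found.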
Code #3 : Chunk Score Metrics
Precision : 0.60201948127375
Recall : 0.606072502505847
Length for missed one : 47161
Length for incorrect one : 47967
Length for correct one : 119720
Length for guessed one : 120526
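The counts above can be cross-checked against the reported precision and recall. The sketch below assumes the usual ChunkScore reading of the four lists: "correct" holds all gold-standard chunks, "guessed" holds all chunks the chunker produced, "missed" holds gold chunks that were not guessed, and "incorrect" holds guessed chunks that are not in the gold standard.

```python
# Counts copied from the output above.
missed = 47161      # gold chunks the chunker did not find
incorrect = 47967   # guessed chunks that are not gold chunks
correct = 119720    # all gold-standard chunks
guessed = 120526    # all chunks the chunker produced

# Under the assumption above, both subtractions give the true positives.
true_positives = guessed - incorrect
assert true_positives == correct - missed

precision = true_positives / guessed
recall = true_positives / correct
print("Precision :", precision)
print("Recall :", recall)
```

Under that reading, about 60% of the guessed chunks are right and about 60% of the gold chunks are found.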