Q37.Marks: +2.0UGC NET Paper 2: Computer Science 2nd January 2026 Shift 1
Arrange the following sequence related to FASTUS system:
A. Basic-group handling
B. Structure merging
C. Tokenization
D. Complex phrase handling
E. Complex word handling
Choose the correct answer from the options given below:
1.C, A, D, E, B
2.E, A, C, D, B
3.C, E, A, D, B✓ Correct
4.C, E, D, A, B
Solution
The correct answer is Option 3.
Key Points
The FASTUS system is a framework used for information extraction, particularly in natural language processing (NLP).
The correct sequence for the processes in the FASTUS system is as follows:
Tokenization: This is the first step where the text is broken down into individual tokens or words.
Complex word handling: In this step, words that have special meanings or structures are analyzed and processed.
Basic-group handling: Basic phrases and groups of words are identified and grouped for further processing.
Complex phrase handling: More advanced phrases and structures are processed and interpreted.
Structure merging: Finally, the extracted structures are merged to form a cohesive representation of the information.
This sequence is designed to progressively extract and organize information from unstructured text.
Additional Information
Applications of the FASTUS System:
Used in automated text summarization to extract key information from large documents.
Helps in named entity recognition (NER) tasks to identify entities such as names, locations, and organizations.
Provides a foundation for machine learning and AI systems to understand and process human language.
Used in customer service systems to analyze and respond to user queries effectively.
Key Features of the FASTUS System:
Modular design allows for incremental processing of text.
Capable of handling complex and nested structures in sentences.
Efficient and scalable for large-scale information extraction tasks.
Advantages of Using FASTUS:
Improves the accuracy of information extraction by processing text in a structured manner.
Reduces computational complexity by focusing on relevant parts of the text.
Adaptable to various languages and domains with minimal modifications.
Important Points:
FASTUS was originally developed for specific NLP tasks but has since been adapted for broader applications.
It is particularly effective in domains where structured information needs to be extracted from unstructured text, such as medical records or legal documents.
The system's modular approach allows for easy integration into larger NLP pipelines.