Textual Analysis

  1. Core Analysis Techniques and Tasks
    1. Part-of-Speech Tagging
      1. POS Tag Sets
        1. Universal Dependencies
          1. Penn Treebank Tags
            1. Language-Specific Tagsets
            2. Tagging Approaches
              1. Rule-Based Tagging
                1. Statistical Tagging
                  1. Neural Network Tagging
                  2. Applications of POS Tagging
                    1. Syntactic Analysis
                      1. Information Extraction
                        1. Text Preprocessing
                        2. Evaluation of POS Taggers
                          1. Accuracy Metrics
                            1. Error Analysis
                          2. Named Entity Recognition
                            1. Entity Types
                              1. Standard Entity Types
                                1. Persons
                                  1. Organizations
                                    1. Locations
                                      1. Dates and Times
                                      2. Extended Entity Types
                                        1. Products
                                          1. Events
                                            1. Quantities
                                              1. Miscellaneous Entities
                                            2. NER Approaches
                                              1. Rule-Based Methods
                                                1. Pattern Matching
                                                  1. Gazetteer Lookup
                                                  2. Machine Learning Approaches
                                                    1. Conditional Random Fields
                                                      1. Neural Networks
                                                        1. Transformer Models
                                                      2. Challenges in NER
                                                        1. Entity Ambiguity
                                                          1. Nested Entities
                                                            1. Cross-Domain Generalization
                                                              1. Multilingual NER
                                                              2. NER Evaluation
                                                                1. Exact Match Evaluation
                                                                  1. Partial Match Evaluation
                                                                    1. Entity-Level vs Token-Level Metrics
                                                                  2. Syntactic Parsing
                                                                    1. Dependency Parsing
                                                                      1. Dependency Trees
                                                                        1. Grammatical Relations
                                                                          1. Universal Dependencies Framework
                                                                            1. Parsing Algorithms
                                                                              1. Transition-Based Parsing
                                                                                1. Graph-Based Parsing
                                                                              2. Constituency Parsing
                                                                                1. Phrase Structure Trees
                                                                                  1. Context-Free Grammars
                                                                                    1. Parsing Algorithms
                                                                                      1. CYK Algorithm
                                                                                        1. Earley Parser
                                                                                        2. Probabilistic Context-Free Grammars
                                                                                        3. Applications of Parsing
                                                                                          1. Information Extraction
                                                                                            1. Question Answering
                                                                                              1. Machine Translation
                                                                                            2. Sentiment Analysis
                                                                                              1. Sentiment Polarity
                                                                                                1. Binary Classification
                                                                                                  1. Multi-Class Classification
                                                                                                    1. Fine-Grained Sentiment Scales
                                                                                                    2. Emotion Detection
                                                                                                      1. Basic Emotions
                                                                                                        1. Joy
                                                                                                          1. Anger
                                                                                                            1. Sadness
                                                                                                              1. Fear
                                                                                                                1. Surprise
                                                                                                                  1. Disgust
                                                                                                                  2. Emotion Taxonomies
                                                                                                                    1. Dimensional Models of Emotion
                                                                                                                    2. Sentiment Analysis Approaches
                                                                                                                      1. Lexicon-Based Methods
                                                                                                                        1. Sentiment Lexicons
                                                                                                                          1. Rule-Based Scoring
                                                                                                                            1. Lexicon Construction
                                                                                                                            2. Machine Learning Methods
                                                                                                                              1. Feature Engineering for Sentiment
                                                                                                                                1. Supervised Learning Approaches
                                                                                                                                  1. Deep Learning Models
                                                                                                                                2. Aspect-Based Sentiment Analysis
                                                                                                                                  1. Aspect Extraction
                                                                                                                                    1. Explicit Aspects
                                                                                                                                      1. Implicit Aspects
                                                                                                                                      2. Aspect Sentiment Classification
                                                                                                                                        1. Opinion Target Identification
                                                                                                                                        2. Challenges in Sentiment Analysis
                                                                                                                                          1. Sarcasm Detection
                                                                                                                                            1. Context Dependency
                                                                                                                                              1. Domain Adaptation
                                                                                                                                                1. Negation Handling
                                                                                                                                              2. Topic Modeling
                                                                                                                                                1. Latent Dirichlet Allocation
                                                                                                                                                  1. Generative Process
                                                                                                                                                    1. Dirichlet Distribution
                                                                                                                                                      1. Gibbs Sampling
                                                                                                                                                        1. Variational Inference
                                                                                                                                                          1. Hyperparameter Tuning
                                                                                                                                                            1. Alpha Parameter
                                                                                                                                                              1. Beta Parameter
                                                                                                                                                                1. Number of Topics
                                                                                                                                                              2. Alternative Topic Models
                                                                                                                                                                1. Non-Negative Matrix Factorization
                                                                                                                                                                  1. Latent Semantic Analysis
                                                                                                                                                                    1. Hierarchical Dirichlet Process
                                                                                                                                                                      1. Dynamic Topic Models
                                                                                                                                                                      2. Topic Interpretation
                                                                                                                                                                        1. Topic Coherence Measures
                                                                                                                                                                          1. Manual Topic Labeling
                                                                                                                                                                            1. Topic Visualization
                                                                                                                                                                            2. Topic Model Evaluation
                                                                                                                                                                              1. Perplexity
                                                                                                                                                                                1. Topic Coherence
                                                                                                                                                                                  1. Human Evaluation
                                                                                                                                                                                    1. Stability Measures
                                                                                                                                                                                  2. Text Classification
                                                                                                                                                                                    1. Problem Formulation
                                                                                                                                                                                      1. Binary Classification
                                                                                                                                                                                        1. Multi-Class Classification
                                                                                                                                                                                          1. Multi-Label Classification
                                                                                                                                                                                            1. Hierarchical Classification
                                                                                                                                                                                            2. Feature Engineering
                                                                                                                                                                                              1. Text-Specific Features
                                                                                                                                                                                                1. Feature Selection Methods
                                                                                                                                                                                                  1. Feature Weighting
                                                                                                                                                                                                  2. Classification Algorithms
                                                                                                                                                                                                    1. Naive Bayes
                                                                                                                                                                                                      1. Multinomial Naive Bayes
                                                                                                                                                                                                        1. Bernoulli Naive Bayes
                                                                                                                                                                                                        2. Support Vector Machines
                                                                                                                                                                                                          1. Linear SVM
                                                                                                                                                                                                            1. Kernel Methods
                                                                                                                                                                                                            2. Neural Networks
                                                                                                                                                                                                              1. Feedforward Networks
                                                                                                                                                                                                                1. Convolutional Neural Networks
                                                                                                                                                                                                                  1. Recurrent Neural Networks
                                                                                                                                                                                                                2. Specific Applications
                                                                                                                                                                                                                  1. Spam Detection
                                                                                                                                                                                                                    1. Intent Classification
                                                                                                                                                                                                                      1. Language Identification
                                                                                                                                                                                                                        1. Document Categorization
                                                                                                                                                                                                                        2. Evaluation Methods
                                                                                                                                                                                                                          1. Cross-Validation
                                                                                                                                                                                                                            1. Stratified Sampling
                                                                                                                                                                                                                              1. Performance Metrics
                                                                                                                                                                                                                            2. Text Clustering
                                                                                                                                                                                                                              1. Document Similarity Measures
                                                                                                                                                                                                                                1. Cosine Similarity
                                                                                                                                                                                                                                  1. Jaccard Similarity
                                                                                                                                                                                                                                    1. Euclidean Distance
                                                                                                                                                                                                                                      1. Manhattan Distance
                                                                                                                                                                                                                                      2. Clustering Algorithms
                                                                                                                                                                                                                                        1. K-Means Clustering
                                                                                                                                                                                                                                          1. Algorithm Steps
                                                                                                                                                                                                                                            1. Initialization Methods
                                                                                                                                                                                                                                              1. Choosing K
                                                                                                                                                                                                                                              2. Hierarchical Clustering
                                                                                                                                                                                                                                                1. Agglomerative Clustering
                                                                                                                                                                                                                                                  1. Divisive Clustering
                                                                                                                                                                                                                                                    1. Linkage Criteria
                                                                                                                                                                                                                                                    2. Density-Based Clustering
                                                                                                                                                                                                                                                      1. DBSCAN
                                                                                                                                                                                                                                                        1. OPTICS
                                                                                                                                                                                                                                                        2. Topic-Based Clustering
                                                                                                                                                                                                                                                        3. Cluster Evaluation
                                                                                                                                                                                                                                                          1. Internal Validation
                                                                                                                                                                                                                                                            1. Silhouette Score
                                                                                                                                                                                                                                                              1. Calinski-Harabasz Index
                                                                                                                                                                                                                                                              2. External Validation
                                                                                                                                                                                                                                                                1. Adjusted Rand Index
                                                                                                                                                                                                                                                                  1. Normalized Mutual Information
                                                                                                                                                                                                                                                                2. Applications of Text Clustering
                                                                                                                                                                                                                                                                  1. Document Organization
                                                                                                                                                                                                                                                                    1. Customer Segmentation
                                                                                                                                                                                                                                                                      1. News Article Grouping
                                                                                                                                                                                                                                                                    2. Text Summarization
                                                                                                                                                                                                                                                                      1. Extractive Summarization
                                                                                                                                                                                                                                                                        1. Sentence Scoring Methods
                                                                                                                                                                                                                                                                          1. TF-IDF Based Scoring
                                                                                                                                                                                                                                                                            1. Graph-Based Methods
                                                                                                                                                                                                                                                                              1. Machine Learning Approaches
                                                                                                                                                                                                                                                                              2. Sentence Selection Algorithms
                                                                                                                                                                                                                                                                                1. Redundancy Removal
                                                                                                                                                                                                                                                                                2. Abstractive Summarization
                                                                                                                                                                                                                                                                                  1. Sequence-to-Sequence Models
                                                                                                                                                                                                                                                                                    1. Attention Mechanisms
                                                                                                                                                                                                                                                                                      1. Transformer-Based Models
                                                                                                                                                                                                                                                                                        1. Content Planning
                                                                                                                                                                                                                                                                                        2. Summarization Evaluation
                                                                                                                                                                                                                                                                                          1. ROUGE Metrics
                                                                                                                                                                                                                                                                                            1. BLEU Scores
                                                                                                                                                                                                                                                                                              1. Human Evaluation
                                                                                                                                                                                                                                                                                                1. Semantic Similarity Measures
                                                                                                                                                                                                                                                                                                2. Domain-Specific Summarization
                                                                                                                                                                                                                                                                                                  1. News Summarization
                                                                                                                                                                                                                                                                                                    1. Scientific Paper Summarization
                                                                                                                                                                                                                                                                                                      1. Meeting Summarization
                                                                                                                                                                                                                                                                                                    2. Information Extraction
                                                                                                                                                                                                                                                                                                      1. Relation Extraction
                                                                                                                                                                                                                                                                                                        1. Binary Relations
                                                                                                                                                                                                                                                                                                          1. N-ary Relations
                                                                                                                                                                                                                                                                                                            1. Relation Types
                                                                                                                                                                                                                                                                                                              1. Person-Organization
                                                                                                                                                                                                                                                                                                                1. Event-Location
                                                                                                                                                                                                                                                                                                                  1. Cause-Effect
                                                                                                                                                                                                                                                                                                                  2. Extraction Methods
                                                                                                                                                                                                                                                                                                                    1. Pattern-Based Extraction
                                                                                                                                                                                                                                                                                                                      1. Supervised Learning
                                                                                                                                                                                                                                                                                                                        1. Distant Supervision
                                                                                                                                                                                                                                                                                                                      2. Event Extraction
                                                                                                                                                                                                                                                                                                                        1. Event Detection
                                                                                                                                                                                                                                                                                                                          1. Event Argument Identification
                                                                                                                                                                                                                                                                                                                            1. Event Coreference Resolution
                                                                                                                                                                                                                                                                                                                              1. Temporal Event Processing
                                                                                                                                                                                                                                                                                                                              2. Knowledge Base Population
                                                                                                                                                                                                                                                                                                                                1. Entity Linking
                                                                                                                                                                                                                                                                                                                                  1. Slot Filling
                                                                                                                                                                                                                                                                                                                                    1. Knowledge Graph Construction