Transfer Learning and Lexicon-Based Approaches for Implicit Hate Speech Detection: A Comparative Study of Human and GPT-4 Annotation
Almohaimeed et al., 2024
- Document ID
- 10754623939360250202
- Author
- Almohaimeed S
- Almohaimeed S
- Bölöni L
- Publication year
- 2024
- Publication venue
- 2024 IEEE 18th International Conference on Semantic Computing (ICSC)
Snippet
Detecting harmful speech is the subject of significant research effort in both academia and industry. While good progress has been made on detecting explicit hate speech, detecting implicit hate remains difficult, as it requires a deep understanding of the allusions of the text …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/30675—Query execution
- G06F17/30684—Query execution using natural language analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/2775—Phrasal analysis, e.g. finite state techniques, chunking
- G06F17/278—Named entity recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G06F17/271—Syntactic parsing, e.g. based on context-free grammar [CFG], unification grammars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2785—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/68—Methods or arrangements for recognition using electronic means using sequential comparisons of the image signals with a plurality of references in which the sequence of the image signals or the references is relevant, e.g. addressable memory
- G06K9/6807—Dividing the references in groups prior to recognition, the recognition taking place in steps; Selecting relevant dictionaries
- G06K9/6842—Dividing the references in groups prior to recognition, the recognition taking place in steps; Selecting relevant dictionaries according to the linguistic properties, e.g. English, German
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Thorne et al. | | The fact extraction and VERification (FEVER) shared task |
Gugnani et al. | | Implicit skills extraction using document embedding and its use in job recommendation |
Ansari et al. | | Ensemble hybrid learning methods for automated depression detection |
US10140272B2 (en) | | Dynamic context aware abbreviation detection and annotation |
US9996526B2 (en) | | System and method for supplementing a question answering system with mixed-language source documents |
US10303766B2 (en) | | System and method for supplementing a question answering system with mixed-language source documents |
Kausar et al. | | ProSOUL: a framework to identify propaganda from online Urdu content |
Biradar et al. | | Hate or non-hate: Translation based hate speech identification in code-mixed hinglish data set |
Mahdavi et al. | | Automatic external Persian plagiarism detection using vector space model |
Gupta et al. | | Designing and development of stemmer of Dogri using unsupervised learning |
Lahbari et al. | | A rule-based method for Arabic question classification |
Jaber et al. | | NER in English translation of hadith documents using classifiers combination |
Almohaimeed et al. | | Transfer Learning and Lexicon-Based Approaches for Implicit Hate Speech Detection: A Comparative Study of Human and GPT-4 Annotation |
Shalinda et al. | | Hate words detection among sri lankan social media text messages |
Le et al. | | CRYPTEXT: Database and Interactive Toolkit of Human-Written Text Perturbations in the Wild |
Ariyanto et al. | | A Systematic Review on Semantic Role Labeling for Information Extraction in Low-Resource Data |
Siddiqui | | Sarcasm detection from Twitter database using text mining algorithms |
Ptaszynski et al. | | Detecting emotive sentences with pattern-based language modelling |
Rodriguez et al. | | Machine learning for detecting hate speech in low resource languages |
Chaturvedi et al. | | Predicting word vectors for microtext |
Mansouri et al. | | A new fuzzy support vector machine method for named entity recognition |
Schönle et al. | | Linguistic-Aware WordPiece Tokenization: Semantic Enrichment and OOV Mitigation |
Casula | | Transfer learning for multilingual offensive language detection with bert |
LeBlanc | | Model-driven abusive language detection |
Dsilva | | From sentence embeddings to large language models to detect and understand wordplay |