Analysis and detection of web spam by means of web content
Prieto et al., 2012
- Document ID
- 15750333393579843832
- Author
- Prieto V
- Álvarez M
- López-García R
- Cacheda F
- Publication year
- 2012
- Publication venue
- Multidisciplinary Information Retrieval: 5th Information Retrieval Facility Conference, IRFC 2012, Vienna, Austria, July 2-3, 2012 Proceedings 5
Snippet
Web Spam is one of the main difficulties that crawlers have to overcome. According to Gyöngyi and Garcia-Molina it is defined as “any deliberate human action that is meant to trigger an unjustifiably favourable relevance or importance of some web pages considering …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
- G06F17/30864—Retrieval from the Internet, e.g. browsers by querying, e.g. search engines or meta-search engines, crawling techniques, push systems
- G06F17/30867—Retrieval from the Internet, e.g. browsers by querying, e.g. search engines or meta-search engines, crawling techniques, push systems with filtering and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/50—Monitoring users, programs or devices to maintain the integrity of platforms, e.g. of processors, firmware or operating systems
- G06F21/55—Detecting local intrusion or implementing counter-measures
- G06F21/56—Computer malware detection or handling, e.g. anti-virus arrangements
- G06F21/562—Static detection
- G06F21/563—Static detection by source code analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30705—Clustering or classification
- G06F17/30707—Clustering or classification into predefined classes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30705—Clustering or classification
- G06F17/3071—Clustering or classification including class or cluster creation or modification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Rao et al. | | Jail-Phish: An improved search engine based phishing detection system |
Zhu et al. | | OFS-NN: an effective phishing websites detection model based on optimal feature selection and neural network |
Drost et al. | | Thwarting the nigritude ultramarine: Learning to identify link spam |
Chakraborty et al. | | Recent developments in social spam detection and combating techniques: A survey |
Mishne et al. | | Blocking Blog Spam with Language Model Disagreement. |
Wu et al. | | A phishing detection system based on machine learning |
CA2508060C (en) | | Search engine spam detection using external data |
Prieto et al. | | SAAD, a content based Web Spam Analyzer and Detector |
Buber et al. | | NLP based phishing attack detection from URLs |
Chu et al. | | Protect sensitive sites from phishing attacks using features extractable from inaccessible phishing URLs |
Chiew et al. | | Leverage website favicon to detect phishing websites |
Zhang et al. | | Detecting spam and promoting campaigns in Twitter |
Prieto et al. | | Analysis and detection of web spam by means of web content |
Wardman et al. | | High-performance content-based phishing attack detection |
Glater et al. | | Intent-aware semantic query annotation |
Wahsheh et al. | | A link and content hybrid approach for Arabic web spam detection |
Shyni et al. | | Phishing detection in websites using parse tree validation |
Chandra et al. | | A survey on web spam and spam 2.0 |
Wahsheh et al. | | Detecting Arabic web spam |
Wahsheh et al. | | Using Machine Learning Algorithms to Detect Content-based Arabic Web Spam. |
Alsaleh et al. | | Analysis of web spam for non-english content: toward more effective language-based classifiers |
Algur et al. | | Hybrid spamicity score approach to web spam detection |
Wahsheh et al. | | Analyzing the popular words to evaluate spam in Arabic web pages |
Wahsheh et al. | | Evaluating Arabic spam classifiers using link analysis |
Liu et al. | | Filtering spam in social tagging system with dynamic behavior analysis |