Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJuly 2022
Webformer: Pre-training with Web Pages for Information Retrieval
SIGIR '22: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information RetrievalPages 1502–1512https://doi.org/10.1145/3477495.3532086Pre-trained language models (PLMs) have achieved great success in the area of Information Retrieval. Studies show that applying these models to ad-hoc document ranking can achieve better retrieval effectiveness. However, on the Web, most information is ...
- research-articleJanuary 2021
Modeling an web community discovery method with web page attraction
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology (JIFS), Volume 40, Issue 6Pages 11159–11169https://doi.org/10.3233/JIFS-202366An improved Web community discovery algorithm is proposed in this paper based on the attraction between Web pages to effectively reduce the complexity of Web community discovery. The proposed algorithm treats each Web page in the Web pages collection as ...
- research-articleDecember 2019
DOM-based keyword extraction from web pages
AIIPCC '19: Proceedings of the International Conference on Artificial Intelligence, Information Processing and Cloud ComputingArticle No.: 62, Pages 1–6https://doi.org/10.1145/3371425.3371495We present D-rank, an unsupervised, language and domain independent method for automatically extracting keywords from a single web page. The method does not use any corpus, and relies only on the information and features on the web page including page ...
- research-articleSeptember 2019
Isolating the Effects of Web Page Visual Appearance on the Perceived Credibility of Online News among College Students
HT '19: Proceedings of the 30th ACM Conference on Hypertext and Social MediaPages 191–200https://doi.org/10.1145/3342220.3343663Online news sources have transformed civic discourse, and much has been made of their credibility. Although web page credibility has been investigated generally, most work has focused on the credibility of web page content. In this work, we study the ...
- research-articleJuly 2018
An eye tracking study of viewing behaviour and preferences of Arabic, English and Chinese users
HCI '18: Proceedings of the 32nd International BCS Human Computer Interaction ConferenceArticle No.: 51, Pages 1–13https://doi.org/10.14236/ewic/HCI2018.51The aim of this research is to investigate possible variations in visual behaviour and eye movements on webpages among users from three different cultures (Arab, English and Chinese). The paper reports an experiment using eye tracking technique and ...
-
- research-articleMarch 2017
Syntactic entropy for main content extraction from web pages
BDCA'17: Proceedings of the 2nd international Conference on Big Data, Cloud and ApplicationsArticle No.: 63, Pages 1–4https://doi.org/10.1145/3090354.3090419In this paper, we present a solution for main content identification in web pages. Our solution is language-independent; Web pages may be written in different languages. It is topic-independent; no domain knowledge or dictionary is applied. And it is ...
- ArticleSeptember 2014
A Pure URL-Based Genre Classification of Web Pages
ISLC '14: Proceedings of the 2014 International Semiconductor Laser ConferencePages 233–237https://doi.org/10.1109/DEXA.2014.56In this paper, we propose a new approach for multi-label genre classification of web pages that exploits character n-grams extracted from the URL of the web page rather than its content. Using only the URL reduces the time needed for feature extraction ...
- ArticleDecember 2013
Using Virtual World Environments in Conservation-Restoration
DESE '13: Proceedings of the 2013 Sixth International Conference on Developments in eSystems EngineeringPages 221–224https://doi.org/10.1109/DeSE.2013.47In this paper it is presented a visualization method for the investigation data obtained on the surfaces of the artworks. This method delivers a series of visual data, in digital format, associated with a digital three dimensional replica of the ...
- ArticleDecember 2013
Automatic Extraction of Event Information from Newspaper Articles and Web Pages
ICADL 2013: Proceedings of the 15th International Conference on Digital Libraries: Social Media and Community Networks - Volume 8279Pages 171–175https://doi.org/10.1007/978-3-319-03599-4_21In this paper, we propose a method for extracting travel-related event information, such as an event name or a schedule from automatically identified newspaper articles, in which particular events are mentioned. We analyze news corpora using our method, ...
- ArticleJuly 2013
A novel human-computer interface for browsing web data by leaping up web pages
HCI International'13: Proceedings of the 15th international conference on Human Interface and the Management of Information: information and interaction design - Volume Part IPages 197–202https://doi.org/10.1007/978-3-642-39209-2_23With the rapid growth of network technologies, various web services have been developed for providing information. Therefore, search engines become popular to obtain the useful data. It is critical to efficiently acquire the data from huge data pool in ...
- ArticleDecember 2012
Multiple Methods to Test Usability
ISISE '12: Proceedings of the 2012 Fourth International Symposium on Information Science and EngineeringPages 141–143https://doi.org/10.1109/ISISE.2012.38This paper is mainly talking about the basic method about the usability tests on the web page and the auxiliary methods to assist the usability tests which could contribute to revise the vague result. Nowadays, Internet is one of the most common ...
- ArticleMarch 2012
Fuzzy combinations of criteria: an application to web page representation for clustering
CICLing'12: Proceedings of the 13th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part IIPages 157–168https://doi.org/10.1007/978-3-642-28601-8_14Document representation is an essential step in web page clustering. Web pages are usually written in HTML, offering useful information to select the most important features to represent them. In this paper we investigate the use of nonlinear ...
- research-articleFebruary 2012
IAAS: an integrity assurance service for web page via a fragile watermarking chain module
ICUIMC '12: Proceedings of the 6th International Conference on Ubiquitous Information Management and CommunicationArticle No.: 83, Pages 1–10https://doi.org/10.1145/2184751.2184849As the main facial point of the Web-based e-commerce which is frequently considered as a most important application area of Internet, Web page has been being given more and more duties. Accompanied by this trend, the importance of integrity protection ...
- research-articleDecember 2011
Relationship extraction methods based on co-occurrence in web pages and files
iiWAS '11: Proceedings of the 13th International Conference on Information Integration and Web-based Applications and ServicesPages 82–89https://doi.org/10.1145/2095536.2095552Every day, information on the Web becomes increasingly enriched. Web access is now very useful in many aspects of daily life, particularly for writing documents and programs. In fact, it has become quite usual to edit files while referring to information ...
- ArticleAugust 2011
Implementation Functions of WebGIS Based on Bitmap
ISIE '11: Proceedings of the 2011 International Conference on Intelligence Science and Information EngineeringPages 519–522https://doi.org/10.1109/ISIE.2011.88As a kind of GIS technology based on B/S pattern, Web GIS has been widely applied. However, in certain specific circumstances, such as lack of digital vector maps, or the precision of digital map can not meet the request, the function of GIS can not be ...
- ArticleJune 2011
Information extraction from web pages based on their visual representation
ICWE'11: Proceedings of the 11th international conference on Current Trends in Web EngineeringPages 342–346https://doi.org/10.1007/978-3-642-27997-3_37This research is dedicated to enhancing the efficiency of web information extraction and web accessibility. The motivation behind the research, its aim and objectives are presented, and the performed work on developing web page model for information ...
- research-articleFebruary 2011
Integrity for the In-flight web page based on a fragile watermarking chain scheme
ICUIMC '11: Proceedings of the 5th International Conference on Ubiquitous Information Management and CommunicationArticle No.: 86, Pages 1–5https://doi.org/10.1145/1968613.1968715In recent years, it has been found that middle modifications and attacks widely exist when web pages are transferred from a web server to a user via HTTP. And the reason is that HTTP does not guarantee the integrity of network traffic. This paper ...
- ArticleDecember 2010
Study and Implementation of an Online Visual Design Tool for Web Pages
ISISE '10: Proceedings of the 2010 Third International Symposium on Information Science and EngineeringPages 39–42https://doi.org/10.1109/ISISE.2010.67Design and development of web pages were traditionally conducted by professional staff, which usually can't satisfy various users' individual requirement. Therefore, the study of a highly efficient, personalized, and visual design tool for web pages is ...
- ArticleSeptember 2010
Link proximity analysis: clustering websites by examining link proximity
ECDL'10: Proceedings of the 14th European conference on Research and advanced technology for digital librariesPages 449–452This research-in-progress paper presents a new approach called Link Proximity Analysis (LPA) for identifying related web pages based on link analysis. In contrast to current techniques, which ignore intra-page link analysis, the one put forth here ...
- research-articleFebruary 2010
Personalized reading support for second-language web documents by collective intelligence
IUI '10: Proceedings of the 15th international conference on Intelligent user interfacesPages 51–60https://doi.org/10.1145/1719970.1719978Novel intelligent interface eases the browsing of Web documents written in the second languages of users. It automatically predicts words unfamiliar to the user by collective intelligence and glosses them with their meaning in advance. If the prediction ...