JP2009048380A5 - - Google Patents
Download PDFInfo
- Publication number
- JP2009048380A5 JP2009048380A5 JP2007213169A JP2007213169A JP2009048380A5 JP 2009048380 A5 JP2009048380 A5 JP 2009048380A5 JP 2007213169 A JP2007213169 A JP 2007213169A JP 2007213169 A JP2007213169 A JP 2007213169A JP 2009048380 A5 JP2009048380 A5 JP 2009048380A5
- Authority
- JP
- Japan
- Prior art keywords
- search
- url
- browsing information
- unit
- traffic rank
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Claims (15)
取得した閲覧情報を所定の閲覧管理サーバ装置に電気通信回線を介して送信する閲覧情報送信部と、
を有するクライアント端末に設けられるツールバー装置と、
前記閲覧情報送信部から送信される閲覧情報を収集し、収集した閲覧情報に基づいて
検索エンジンにて新規に検索対象として利用するURLである新規検索対象URLを抽出するとともに、抽出したURLのコンテンツをインデクシングする第一インデクサ部を有する第一閲覧管理サーバ装置を含む検索装置と、からなる検索システム。 A browsing information acquisition unit that acquires browsing information including a browsing URL that is at least a URL of a resource browsed by a user from a browser;
A browsing information transmission unit that transmits the acquired browsing information to a predetermined browsing management server device via an electric communication line ;
A toolbar device provided in a client terminal having
Collecting browsing information transmitted from the browsing information transmitting unit, extracting a new search target URL that is a URL to be newly used as a search target in a search engine based on the collected browsing information, and contents of the extracted URL And a search device including a first browsing management server device having a first indexer unit for indexing.
収集した閲覧情報を蓄積する閲覧情報蓄積部と、
閲覧情報蓄積部に蓄積されている閲覧情報に基づいてURL毎に視聴度指数であるトラフィックランクを算出するトラフィックランクスコアリング部を有する第二閲覧管理サーバ装置を含む
請求項1に記載の検索システム。 The search device includes:
A browsing information storage unit for storing the collected browsing information;
The search system according to claim 1, further comprising a second browsing management server device having a traffic rank scoring unit that calculates a traffic rank that is an audience rating index for each URL based on browsing information stored in the browsing information storage unit. .
クローラ部と、
トラフィックランクスコアリング部で算出されたトラフィックランクに基づいてクローラ部のクローリングスケジュールを決定するスケジュール決定部と、
を有する請求項2に記載の検索システム。 The search device includes:
Crawler part,
A schedule determination unit that determines a crawling schedule of the crawler unit based on the traffic rank calculated by the traffic rank scoring unit;
The search system according to claim 2 .
トラフィックランクスコアリング部で算出されたトラフィックランクに基づいて
検索エンジンにて新規に検索対象として利用するURLである新規検索対象URLを抽出するとともに、抽出したURLのコンテンツをインデクシングする第二インデクサ部を有する
請求項2から4のいずれか一に記載の検索システム。 The search device includes:
A second indexer unit that extracts a new search target URL, which is a URL to be newly used as a search target in the search engine, based on the traffic rank calculated by the traffic rank scoring unit and indexes the content of the extracted URL The search system according to any one of claims 2 to 4 .
収集した閲覧情報に基づいて検索エンジンにて新規に検索対象として利用するURLである新規検索対象URLを抽出するとともに、抽出したURLのコンテンツをインデクシングする第一インデクサ部を有する第一閲覧管理サーバ装置を含む検索装置。 A search device that collects browsing information from a plurality of toolbar devices according to claim 1,
A first browsing management server apparatus having a first indexer unit that extracts a new search target URL, which is a URL that is newly used as a search target in a search engine, based on the collected browsing information, and indexes the content of the extracted URL Search device including
収集した閲覧情報を蓄積する閲覧情報蓄積部と、
閲覧情報蓄積部に蓄積されている閲覧情報に基づいてURL毎に視聴度指数であるトラフィックランクを算出するトラフィックランクスコアリング部を有する第二閲覧管理サーバ装置を含む検索装置。 A search device that collects browsing information from a plurality of toolbar devices according to claim 1,
A browsing information storage unit for storing the collected browsing information;
A search device including a second browsing management server device having a traffic rank scoring unit that calculates a traffic rank that is an audience rating index for each URL based on browsing information stored in a browsing information storage unit.
トラフィックランクスコアリング部で算出されたトラフィックランクに基づいてクローラ部のクローリングスケジュールを決定するスケジュール決定部と、
を有する請求項7または8に記載の検索装置。 Crawler part,
A schedule determination unit that determines a crawling schedule of the crawler unit based on the traffic rank calculated by the traffic rank scoring unit;
The search device according to claim 7 or 8 , comprising:
検索エンジンにて新規に検索対象として利用するURLである新規検索対象URLを抽出するとともに、抽出したURLのコンテンツをインデクシングする第二インデクサ部を有する
請求項7から9のいずれか一に記載の検索装置。 A second indexer unit that extracts a new search target URL, which is a URL to be newly used as a search target in the search engine, based on the traffic rank calculated by the traffic rank scoring unit and indexes the content of the extracted URL The search device according to any one of claims 7 to 9 .
収集した閲覧情報に基づいて検索エンジンにて新規に検索対象として利用するURLである新規検索対象URLを抽出するステップと、
抽出したURLのコンテンツをインデクシングする第一インデクシングステップと、
を計算機に実行させる検索方法。 A browsing information collection step for collecting browsing information including a browsing URL;
Extracting a new search target URL that is a URL to be newly used as a search target in the search engine based on the collected browsing information;
A first indexing step for indexing the content of the extracted URL;
A search method that causes a computer to execute.
収集した閲覧情報を保持するため格納する閲覧情報格納ステップと、
閲覧情報に基づいてURL毎に視聴度指数であるトラフィックランクを算出するトラフィックランクスコアリングステップと、
を計算機に実行させる検索方法。 A browsing information collection step for collecting browsing information including a browsing URL;
A browsing information storage step for storing collected browsing information;
A traffic rank scoring step for calculating a traffic rank that is an audience rating index for each URL based on browsing information;
A search method that causes a computer to execute.
をさらに計算機に実行させる請求項12に記載の検索方法。 13. The search method according to claim 12 , further causing a computer to execute a rank sort output step of sorting search results based on the traffic rank calculated in the traffic rank scoring step and outputting the result to the client.
トラフィックランクスコアリングステップで算出されたトラフィックランクに基づいてクローラステップのクローリングスケジュールを決定するスケジュール決定ステップと、
を計算機に実行させる請求項12または13に記載の検索方法。 Crawler step,
A schedule determination step for determining a crawling schedule for the crawler step based on the traffic rank calculated in the traffic rank scoring step;
The search method according to claim 12 or 13 , wherein the computer is executed.
検索エンジンにて新規に検索対象として利用するURLである新規検索対象URLを抽出するステップと、
抽出したURLのコンテンツをインデクシングする第二インデクシングステップと、
を計算機に実行させる請求項12から14のいずれか一に記載の検索方法。 Extracting a new search target URL that is a URL to be newly used as a search target in the search engine based on the traffic rank calculated in the traffic rank scoring step;
A second indexing step for indexing the content of the extracted URL;
The search method according to any one of claims 12 to 14 , wherein the computer is executed.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2007213169A JP4868245B2 (en) | 2007-08-17 | 2007-08-17 | SEARCH SYSTEM, SEARCH DEVICE, AND SEARCH METHOD |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2007213169A JP4868245B2 (en) | 2007-08-17 | 2007-08-17 | SEARCH SYSTEM, SEARCH DEVICE, AND SEARCH METHOD |
Publications (3)
Publication Number | Publication Date |
---|---|
JP2009048380A JP2009048380A (en) | 2009-03-05 |
JP2009048380A5 true JP2009048380A5 (en) | 2009-05-14 |
JP4868245B2 JP4868245B2 (en) | 2012-02-01 |
Family
ID=40500538
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2007213169A Active JP4868245B2 (en) | 2007-08-17 | 2007-08-17 | SEARCH SYSTEM, SEARCH DEVICE, AND SEARCH METHOD |
Country Status (1)
Country | Link |
---|---|
JP (1) | JP4868245B2 (en) |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5147790B2 (en) * | 2009-07-24 | 2013-02-20 | ヤフー株式会社 | Crawler adjusting device and crawler adjusting method |
JP5440008B2 (en) * | 2009-07-30 | 2014-03-12 | 富士通株式会社 | Information providing apparatus, information providing program, and information providing method |
JP5488031B2 (en) * | 2010-02-19 | 2014-05-14 | 日本電気株式会社 | Search device |
JP5493983B2 (en) * | 2010-02-23 | 2014-05-14 | 日本電気株式会社 | Search device |
JP5494125B2 (en) * | 2010-03-30 | 2014-05-14 | 富士通株式会社 | Trend monitoring method, trend monitoring server program, and trend monitoring apparatus |
JP5416023B2 (en) * | 2010-04-15 | 2014-02-12 | ヤフー株式会社 | Reading terminal and method |
JP5462713B2 (en) * | 2010-05-25 | 2014-04-02 | 株式会社Kddi研究所 | Web page collection apparatus, method, and program |
KR101100782B1 (en) * | 2011-04-05 | 2011-12-29 | 주식회사 로그 | Device and method for detecting new site |
US8782031B2 (en) * | 2011-08-09 | 2014-07-15 | Microsoft Corporation | Optimizing web crawling with user history |
JP5801256B2 (en) * | 2012-06-13 | 2015-10-28 | 日本電信電話株式会社 | Estimation apparatus and estimation method |
EP2857987A4 (en) * | 2012-06-30 | 2015-04-15 | Huawei Tech Co Ltd | Acquiring method, device and system of user behavior |
JP5939579B2 (en) | 2013-03-19 | 2016-06-22 | インターナショナル・ビジネス・マシーンズ・コーポレーションInternational Business Machines Corporation | Apparatus, method and program for creating list |
JP6341031B2 (en) * | 2014-09-22 | 2018-06-13 | 富士通株式会社 | Access control program, access control method, and information processing apparatus |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2001290843A (en) * | 2000-02-04 | 2001-10-19 | Fujitsu Ltd | Device and method for document retrieval, document retrieving program, and recording medium having the same program recorded |
JP2002117206A (en) * | 2000-07-28 | 2002-04-19 | Toshiba Corp | Web viewer analysis method, web viewer analysis program, recording medium and web viewer analysis system |
JP2003178092A (en) * | 2001-12-10 | 2003-06-27 | Mitsubishi Electric Corp | Information retrieval system, information providing device, information retrieving method and program |
JP4396262B2 (en) * | 2003-12-22 | 2010-01-13 | 富士ゼロックス株式会社 | Information processing apparatus, information processing method, and computer program |
JP2005190065A (en) * | 2003-12-25 | 2005-07-14 | Nippon Telegr & Teleph Corp <Ntt> | User terminal for information retrieval and collection, information retrieval and collection system, and information retrieval and collection method |
US7801880B2 (en) * | 2005-03-29 | 2010-09-21 | Microsoft Corporation | Crawling databases for information |
JP2006277288A (en) * | 2005-03-29 | 2006-10-12 | Nec Corp | Display time measuring system, display time measuring method, retrieval system, and retrieval method |
-
2007
- 2007-08-17 JP JP2007213169A patent/JP4868245B2/en active Active
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP2009048380A5 (en) | ||
Zubiaga | Enhancing navigation on wikipedia with social tags | |
AU2009276354B2 (en) | Providing posts to discussion threads in response to a search query | |
CN100520778C (en) | Internet topics file searching method, reptile system and search engine | |
CN102567407B (en) | Method and system for collecting forum reply increment | |
CN103077254B (en) | Webpage acquisition methods and device | |
CN104166683B (en) | A kind of data digging method | |
CN104615627B (en) | A kind of event public feelings information extracting method and system based on microblog | |
CN101261629A (en) | Specific information searching method based on automatic classification technology | |
CN103631794A (en) | Method, device and equipment for sorting search results | |
CN102567494B (en) | Website classification method and device | |
CN106021418B (en) | The clustering method and device of media event | |
CN105159930A (en) | Search keyword pushing method and apparatus | |
CN102254027A (en) | Method for obtaining webpage contents in batch | |
CN102375813A (en) | Duplicate detection system and method for search engines | |
CN102760151A (en) | Implementation method of open source software acquisition and searching system | |
CN104252348A (en) | Webpage access statistics method and device based on browser | |
CN104298780B (en) | A kind of pre-acquiring method and system of browsing device net page information | |
CN104991904A (en) | Page data acquisition method of dynamic webpage | |
RU2014127401A (en) | METHOD FOR SELECTING A TARGET MESSAGE TO INCLUDE SEARCH SYSTEM RESULTS (SERP) AND SERVER IN THE PAGE | |
CN103116635A (en) | Field-oriented method and system for collecting invisible web resources | |
CN103279492B (en) | A kind of method and apparatus capturing webpage | |
CN101354718B (en) | Method and apparatus for determining file bag resource identification information | |
Kumar et al. | Learnable focused meta crawling through Web | |
CN104008213A (en) | Method and device for finding and counting webpage information updating |